What would be a good tool, or set of tools, to download a list of URLs and extract only the text content? Spidering is not required, but control over the download file names, and threading would be a bonus.
The platform is linux.
wget
|
html2ascii
Note: html2ascii can also be called html2a
or html2text
(and I wasn't able to find a proper man page on the net for it).
See also: lynx
.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With