The webpage I want do get has windows-1251 charset encoding. Is there a way to not just simply download this page, but automatically.. um.. convert the text inside to a uft-8?
I have tried to add additional parameters to wget like
--header='Accept-Charset: utf-8'
but no luck.
You can use
iconv -f ISO-8859-1 -t UTF-8 ./index.html > ./utf.html
to convert the encoding. See https://wiki.archlinux.org/index.php/Convert_a_text_file%27s_encoding
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With