I feel stacked here trying to change encodings with Python 2.5 I have XML response, which I encode to UTF-8: <code>response.encode('utf-8')</code>. That is fine, but the program which uses this info doesn't like this encoding and I have to convert it to other code page. Real example is that I use ghostscript python module to embed pdfmark data to a PDF file - end result is with wrong characters in Acrobat. I've done numerous combinations with <code>.encode()</code> and <code>.decode()</code> between 'utf-8' and 'latin-1' and it drives me crazy as I can't output correct result. If I output the string to a file with <code>.encode('utf-8')</code> and then convert this file from UTF-8 to CP1252 (aka latin-1) with i.e. iconv.exe and embed the data everything is fine. Basically can someone help me convert i.e. character á which is UTF-8 encoded as hex: <code>C3 A1</code> to latin-1 as hex: <code>E1</code>? Thanks in advance

Instead of <code>.encode('utf-8')</code>, use <code>.encode('latin-1')</code>.

Python: convert string from UTF-8 to Latin-1

Tags:

I feel stacked here trying to change encodings with Python 2.5

I have XML response, which I encode to UTF-8: response.encode('utf-8'). That is fine, but the program which uses this info doesn't like this encoding and I have to convert it to other code page. Real example is that I use ghostscript python module to embed pdfmark data to a PDF file - end result is with wrong characters in Acrobat.

I've done numerous combinations with .encode() and .decode() between 'utf-8' and 'latin-1' and it drives me crazy as I can't output correct result.

If I output the string to a file with .encode('utf-8') and then convert this file from UTF-8 to CP1252 (aka latin-1) with i.e. iconv.exe and embed the data everything is fine.

Basically can someone help me convert i.e. character á which is UTF-8 encoded as hex: C3 A1 to latin-1 as hex: E1?

Thanks in advance

223

asked Nov 28 '10 23:11

romor

2 Answers

Instead of .encode('utf-8'), use .encode('latin-1').

184

answered Sep 21 '22 09:09

Ignacio Vazquez-Abrams

data="UTF-8 data"
udata=data.decode("utf-8")
data=udata.encode("latin-1","ignore")

Should do it.

answered Sep 19 '22 09:09

Utku Zihnioglu

Related questions
                            
                                How to use TailCalls?
                            
                                Google Protocol Buffers: ZigZag Encoding
                            
                                Android - how to set an alarm to a specific date
                            
                                PHP Security - (int) vs FILTER_VALIDATE_INT
                            
                                C# : How to pause the thread and continue when some event occur?
                            
                                What's wrong with the following Clojure protocol?
                            
                                The amortized complexity of std::next_permutation?
                            
                                f# keyword use and using
                            
                                razor syntax with errors compiles when it should not compile
                            
                                @GeneratedValue(strategy = GenerationType.AUTO) not working as thought
                            
                                Crystal Reports 13 And Asp.Net 3.5
                            
                                BasedOn="{StaticResource {x:Type TextBox}}" in Code Behind for Style

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With