<pre class="prettyprint"><code>{u'Status': u'OK', u'City': u'Ciri\xe8', u'TimezoneName': '', u'ZipPostalCode': '', u'CountryCode': u'IT', u'Dstoffset': u'0', u'Ip': u'x.x.x.x', u'Longitude': u'7.6', u'CountryName': u'Italy', u'RegionCode': u'12', u'Latitude': u'45.2333', u'Isdst': '', u'Gmtoffset': u'0', u'RegionName': u'Piemonte'} </code></pre> This is the output of my object. I would like to access City but It's encoded. How can I read all parameters and decode it <pre class="prettyprint"><code>>>> data['City'] u'Ciri\xe8' >>>data['City'].decode('utf-8') Traceback (most recent call last): File "<stdin>", line 1, in <module> File "/System/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/encodings/utf_8.py", line 16, in decode return codecs.utf_8_decode(input, errors, True) UnicodeEncodeError: 'ascii' codec can't encode character u'\xe8' in position 4: ordinal not in range(128) </code></pre> I want plaintext not unicode string. Thank you!

What you want is not clear. If by 'plaintext' you mean remove accentuation, try this: <pre class="prettyprint"><code>>>> s = u'Ciri\xe8' >>> from unicodedata import normalize >>> normalize('NFKD', s).encode('ASCII', 'ignore') 'Cirie' </code></pre>

Transform unicode string in python

Tags:

python

dictionary

unicode

{u'Status': u'OK', u'City': u'Ciri\xe8', u'TimezoneName': '', u'ZipPostalCode': '', u'CountryCode': u'IT', u'Dstoffset': u'0', u'Ip': u'x.x.x.x', u'Longitude': u'7.6', u'CountryName': u'Italy', u'RegionCode': u'12', u'Latitude': u'45.2333', u'Isdst': '', u'Gmtoffset': u'0', u'RegionName': u'Piemonte'}

This is the output of my object. I would like to access City but It's encoded. How can I read all parameters and decode it

>>> data['City']
u'Ciri\xe8'

>>>data['City'].decode('utf-8')
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/System/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/encodings/utf_8.py", line 16, in decode
    return codecs.utf_8_decode(input, errors, True)
UnicodeEncodeError: 'ascii' codec can't encode character u'\xe8' in position 4: ordinal not in range(128)

I want plaintext not unicode string. Thank you!

241

asked Apr 22 '12 02:04

dani

1 Answers

What you want is not clear. If by 'plaintext' you mean remove accentuation, try this:

>>> s = u'Ciri\xe8'
>>> from unicodedata import normalize
>>> normalize('NFKD', s).encode('ASCII', 'ignore')
'Cirie'

107

answered Sep 17 '22 22:09

Pedro Werneck

Related questions
                            
                                Is There '?' Control Flow in Python? [duplicate]
                            
                                dateutil.parser.parse() gives error "initial_value must be unicode or None, not str" on Windows platform
                            
                                Parse X-Forwarded-For to get ip with werkzeug on Heroku
                            
                                Python optparse, default values, and explicit options
                            
                                Python : How `len()` is executed [duplicate]
                            
                                Remove more than one key from Python dict
                            
                                How to determine if a path is a subdirectory of another?
                            
                                Qt/PyQt: How do I create a drop down widget, such as a QLabel, QTextBrowser, etc.?
                            
                                Possible to do a string replace with a dictionary?
                            
                                psycopg2 cursor.execute() with SQL query parameter causes syntax error
                            
                                Convert one list to set, but if empty use a default one
                            
                                Inline comments for ConfigParser
                            
                                How to use libxml2 with python on macOs?
                            
                                pip search django produces time out error
                            
                                how do i install beautiful soup for python on my mac? see error
                            
                                Indexing NumPy 2D array with another 2D array
                            
                                Instantiating objects in python
                            
                                How to sort list of date object?
                            
                                How do I get the visitor's current timezone then convert timezone.now() to string of the local time in Django 1.4?
                            
                                Django Invalid Block Tag: 'endfor', expected 'endblock'

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With