string.decode() vs. unicode(string)

Tags: python, string, unicode, decode

myString = 'éíěřáé'

I need to decode this string to unicode. Is there any difference between the following two usages, and between these two methods in general?

myString.decode(encoding='UTF-8', errors='ignore')

and

unicode(myString, encoding='UTF-8', errors='ignore')

asked Aug 08 '12 09:08

Meloun

2 Answers

The unicode constructor can take other types apart from strings:

>>> unicode(10)
u'10'

For the bytestring case, however, the two forms are mostly equivalent. Some codecs are not valid for the unicode constructor because they do not produce unicode output, yet they are valid for the .decode method of bytestrings. One example is 'hex':

>>> unicode('10', encoding='hex')
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
TypeError: decoder did not return an unicode object (type=str)
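
To make the comparison concrete, here is a minimal sketch of the points above. It is written to run on both Python 2 and Python 3, so it relies on a few assumptions that are not in the question: `text_type` is a hypothetical alias (`unicode` on Python 2, `str` on Python 3, where the analogous pair is `bytes.decode()` and `str(bytes, encoding, errors)`), and the 'hex' case goes through `codecs.decode` with the `hex_codec` name, because Python 3 bytes objects reject non-text codecs in `.decode()`.

```python
# -*- coding: utf-8 -*-
import codecs
import sys

# Hypothetical alias (not from the question): the text type of the running
# interpreter, i.e. `unicode` on Python 2 and `str` on Python 3.
text_type = unicode if sys.version_info[0] == 2 else str  # noqa: F821

# The question's string as a UTF-8 encoded bytestring.
raw = u'éíěřáé'.encode('utf-8')

# For a bytestring, the method and the constructor give the same text.
via_method = raw.decode('UTF-8', 'ignore')
via_constructor = text_type(raw, 'UTF-8', 'ignore')

# Only the constructor accepts non-string input such as an integer.
from_int = text_type(10)

# A bytes-to-bytes codec such as hex does not return text, so the text
# constructor cannot be used for it; codecs.decode handles it.
hex_decoded = codecs.decode(b'10', 'hex_codec')

print(via_method == via_constructor)
print(repr(from_int))
print(repr(hex_decoded))
```

On Python 2 itself, `'10'.decode('hex')` is the direct spelling of the last step.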

answered Oct 04 '22 04:10

Martijn Pieters

They're essentially the same, but with some minor performance shortcuts in either case; str.decode knows that its argument is a string, so it can shortcut type checking of its argument, while unicode.__new__ has shortcuts for some common encodings including UTF-8.

Both methods call into PyCodec_Decode in the general case.
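
If you want to see whether those shortcuts matter on your own interpreter, a rough timing sketch like the one below may help. The alias `text_type`, the two helper functions, and the iteration count are my own assumptions for illustration; absolute numbers depend heavily on the Python version and machine, so treat the output as indicative only.

```python
# -*- coding: utf-8 -*-
import sys
import timeit

# Hypothetical alias (not from the question): `unicode` on Python 2,
# `str` on Python 3.
text_type = unicode if sys.version_info[0] == 2 else str  # noqa: F821

# The question's string as a UTF-8 encoded bytestring.
raw = u'éíěřáé'.encode('utf-8')


def decode_with_method():
    # Corresponds to myString.decode(...) in the question.
    return raw.decode('utf-8')


def decode_with_constructor():
    # Corresponds to unicode(myString, ...) in the question.
    return text_type(raw, 'utf-8')


# Arbitrary repetition count; raise it for more stable numbers.
runs = 100000
method_seconds = timeit.timeit(decode_with_method, number=runs)
constructor_seconds = timeit.timeit(decode_with_constructor, number=runs)

print('method:      %.4f s' % method_seconds)
print('constructor: %.4f s' % constructor_seconds)
```

Both helpers return the same text, so any gap between the two timings reflects call overhead rather than a difference in the result.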

answered Oct 04 '22 04:10

ecatmur
