I want to concatenate a list of various Python objects into one string. The objects can be literally anything. I thought I could simply do this using the following code:
' '.join([str(x) for x in the_list])
but unfortunately that sometimes gives me a UnicodeEncodeError:
UnicodeEncodeError: 'ascii' codec can't encode character u'\xa0' in position 80: ordinal not in range(128)
in this SO answer I found someone who says that I need to use .encode('utf-8')
, so I changed my code to this:
' '.join([x.encode('utf-8') for x in the_list])
But if the objects are not strings or unicodes but for example int
s I get an AttributeError: 'int' object has no attribute 'encode'
. So this means I need to use some kind of if-statement to check what kind of type it is and how to convert it. But when should I use .encode('utf-8')
and when should I use str()
?
It would be even better if I could also do some kind of oneliner for this, but I wouldn't know how? Does anybody else know? All tips are welcome!
Python 2.x use repr()
. Python 3.x use repr()
if you don't mind non-ASCII Unicode in the result, or ascii()
if you do:
>>> a=1 # integer
>>> class X: pass
...
>>> x=X() # class
>>> y='\u5000' # Unicode string
>>> z=b'\xa0' # non-ASCII byte string
>>> ' '.join(ascii(i) for i in (a,x,y,z))
"1 <__main__.X object at 0x0000000002974B38> '\\u5000' b'\\xa0'"
Example of differences between 2.X and 3.X repr()
, and 3.X ascii()
:
>>> # Python 3
>>> s = 'pingüino' # Unicode string
>>> s
'pingüino'
>>> repr(s)
"'pingüino'"
>>> print(repr(s))
'pingüino'
>>> ascii(s)
"'ping\\xfcino'"
>>> print(ascii(s))
'ping\xfcino'
>>> # Python 2
>>> s = u'pingüino'
>>> s
u'ping\xfcino'
>>> repr(s)
"u'ping\\xfcino'"
>>> print(repr(s))
u'ping\xfcino'
You can try join
ing with a unicode object instead..
u' '.join(unicode(x) for x in thelist)
Or what you had before will work fine in python3. Just be sure to:
For more details see this talk
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With