Python string prints as [u'String']

People also ask

What is U in Python print?

The 'u' in front of a string means the string is a Unicode string. A Unicode is a way for a string to represent more characters than a regular ASCII string can.

How do I get rid of U in Python?

In python, to remove Unicode ” u “ character from string then, we can use the replace() method to remove the Unicode ” u ” from the string. After writing the above code (python remove Unicode ” u ” from a string), Ones you will print “ string_unicode ” then the output will appear as a “ Python is easy. ”.

What does U mean in Python string?

The 'u' in front of the string values means the string is a Unicode string. Unicode is a way to represent more characters than normal ASCII can manage. The fact that you're seeing the u means you're on Python 2 - strings are Unicode by default on Python 3, but on Python 2, the u in front distinguishes Unicode strings.

What is print \r Python?

Put the \r at the beginning or end of your printed string, e.g. '\rthis string will start at the beginning of the line' . or 'the next string will start at the beginning of the line\r' . Conceptually, \r moves the cursor to the beginning of the line and then keeps outputting characters as normal.

[u'ABC'] would be a one-element list of unicode strings. Beautiful Soup always produces Unicode. So you need to convert the list to a single unicode string, and then convert that to ASCII.

I don't know exaxtly how you got the one-element lists; the contents member would be a list of strings and tags, which is apparently not what you have. Assuming that you really always get a list with a single element, and that your test is really only ASCII you would use this:

 soup[0].encode("ascii")

However, please double-check that your data is really ASCII. This is pretty rare. Much more likely it's latin-1 or utf-8.

 soup[0].encode("latin-1")


 soup[0].encode("utf-8")

Or you ask Beautiful Soup what the original encoding was and get it back in this encoding:

 soup[0].encode(soup.originalEncoding)

You probably have a list containing one unicode string. The repr of this is [u'String'].

You can convert this to a list of byte strings using any variation of the following:

# Functional style.
print map(lambda x: x.encode('ascii'), my_list)

# List comprehension.
print [x.encode('ascii') for x in my_list]

# Interesting if my_list may be a tuple or a string.
print type(my_list)(x.encode('ascii') for x in my_list)

# What do I care about the brackets anyway?
print ', '.join(repr(x.encode('ascii')) for x in my_list)

# That's actually not a good way of doing it.
print ' '.join(repr(x).lstrip('u')[1:-1] for x in my_list)

import json, ast
r = {u'name': u'A', u'primary_key': 1}
ast.literal_eval(json.dumps(r))

will print

{'name': 'A', 'primary_key': 1}

If accessing/printing single element lists (e.g., sequentially or filtered):

my_list = [u'String'] # sample element
my_list = [str(my_list[0])]

pass the output to str() function and it will remove the unicode output u''. also by printing the output it will remove the u'' tags from it.

[u'String'] is a text representation of a list that contains a Unicode string on Python 2.

If you run print(some_list) then it is equivalent to
print'[%s]' % ', '.join(map(repr, some_list)) i.e., to create a text representation of a Python object with the type list, repr() function is called for each item.

Don't confuse a Python object and its text representation—repr('a') != 'a' and even the text representation of the text representation differs: repr(repr('a')) != repr('a').

repr(obj) returns a string that contains a printable representation of an object. Its purpose is to be an unambiguous representation of an object that can be useful for debugging, in a REPL. Often eval(repr(obj)) == obj.

To avoid calling repr(), you could print list items directly (if they are all Unicode strings) e.g.: print ",".join(some_list)—it prints a comma separated list of the strings: String

Do not encode a Unicode string to bytes using a hardcoded character encoding, print Unicode directly instead. Otherwise, the code may fail because the encoding can't represent all the characters e.g., if you try to use 'ascii' encoding with non-ascii characters. Or the code silently produces mojibake (corrupted data is passed further in a pipeline) if the environment uses an encoding that is incompatible with the hardcoded encoding.

Use dir or type on the 'string' to find out what it is. I suspect that it's one of BeautifulSoup's tag objects, that prints like a string, but really isn't one. Otherwise, its inside a list and you need to convert each string separately.

In any case, why are you objecting to using Unicode? Any specific reason?

Related questions
                            
                                What are all possible pos tags of NLTK?
                            
                                How to check if string input is a number?
                            
                                How do I detect whether sys.stdout is attached to terminal or not? [duplicate]
                            
                                demystify Flask app.secret_key
                            
                                Execution of Python code with -m option or not
                            
                                Why is using 'eval' a bad practice?
                            
                                How do I pick 2 random items from a Python set? [duplicate]
                            
                                Insert a row to pandas dataframe
                            
                                Assert a function/method was not called using Mock
                            
                                Why are trailing commas allowed in a list?
                            
                                Python: Continuing to next iteration in outer loop
                            
                                Print very long string completely in pandas dataframe
                            
                                Pretty-Print JSON Data to a File using Python
                            
                                CMD opens window store when I type python
                            
                                Non-alphanumeric list order from os.listdir()
                            
                                Python assigning multiple variables to same value? list behavior
                            
                                Why use Python's os module methods instead of executing shell commands directly?
                            
                                Change a Django form field to a hidden field
                            
                                Maven equivalent for python [closed]
                            
                                Python ElementTree module: How to ignore the namespace of XML files to locate matching element when using the method "find", "findall"

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Python string prints as [u'String']

Tags:

python

unicode

ascii

People also ask

Recent Activity

Donate For Us