how strings are stored by python in computers?

Tags:

I believe most of you who are familiar with Python have read Dive Into Python 3. In chapter 4.3, it says this:

In Python 3, all strings are sequences of Unicode characters. There is no such thing as a Python string encoded in UTF-8, or a Python string encoded as CP-1252. “Is this string UTF-8?” is an invalid question.

Somehow I understand what this means: strings = characters in the Unicode set, and Python can help you encode characters according to different encoding methods. However, are characters in Pythons stored as bytes in computers anyway? For example, s = 'strings', and s is surely stored in my computer as a byte strem '0100100101...' or whatever. Then what is this encoding method used here - The "default" encoding method of Python?

Thanks!

963

asked Mar 15 '12 08:03

endless

1 Answers

Python 3 distinguishes between text and binary data. Text is guaranteed to be in Unicode, though no specific encoding is specified, as far as I could see. So it could be UTF-8, or UTF-16, or UTF-32¹ – but you wouldn't even notice.

The main point here is: You shouldn't even care. If you want to deal with text, then use text strings and access them by code point (which is the number of a single Unicode character and independent of the internal UTF – which may organise code points in several smaller code units). If you want bytes, then use b"" and access them by byte. And if you want to have a string in a byte sequence in a specific encoding, you use .encode().

¹ Or even UTF-9, if someone is insane enough to implement Python on a PDP-10.

179

answered Sep 23 '22 15:09

Joey

Related questions
                            
                                set up a MySQLdb connection object for multiple databases
                            
                                Django HTTP Request get vs getlist behavior
                            
                                Can you tell if an array is a view of another?
                            
                                SQLAlchemy Polymorphic Relationship with Concrete Inheritance
                            
                                Behaviour of Mutlple inheritance in python
                            
                                What is the type of os.environ? and Why does it not support viewkeys method
                            
                                Platform-dependent performance issues when selecting a large number of files with gtk.FileChooserDialog
                            
                                python decorator for class OR function
                            
                                Python string as file argument to subprocess
                            
                                Unable to do heroku's Python tutorial within Dropbox folder
                            
                                ImportError: cannot import name linsolve
                            
                                Python CSV module - quotes go missing
                            
                                In python, produce HTML highlighting the differences of two simple strings
                            
                                Stop python from generating pyc files
                            
                                Parsing TCL lists in Python
                            
                                PyQt mouse events for QTabWidget
                            
                                Is python site manual totally generated by pydoc?
                            
                                Python Pandas Pivot Table
                            
                                How to order django-mptt tree by DateTimeField?
                            
                                griddata scipy interpolation not working (giving nan)

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

how strings are stored by python in computers?

Tags:

python

string

encoding

utf

endless

People also ask

1 Answers

Joey

Recent Activity

Donate For Us