I want to use a larger variety of Unicode symbols for variable names in my Python 3 scripts. What characters are acceptable to use in Python 3 variable names? I recently started using Unicode symbols (such as Greek and Asian symbols) for code obfuscation.

According to PEP 3131, the first character of an identifier needs to belong to <code>ID_Start</code>, the rest to <code>ID_Continue</code>, defined as follows: <blockquote> <code>ID_Start</code> is defined as all characters having one of the general categories uppercase letters (Lu), lowercase letters (Ll), titlecase letters (Lt), modifier letters (Lm), other letters (Lo), letter numbers (Nl), the underscore, and characters carrying the Other_ID_Start property. XID_Start then closes this set under normalization, by removing all characters whose NFKC normalization is not of the form <code>ID_Start ID_Continue*</code> anymore. <code>ID_Continue</code> is defined as all characters in <code>ID_Start</code>, plus nonspacing marks (Mn), spacing combining marks (Mc), decimal number (Nd), connector punctuations (Pc), and characters carryig the Other_ID_Continue property. Again, <code>XID_Continue</code> closes this set under NFKC-normalization; it also adds <code>U+00B7</code> to support Catalan. </blockquote> That's a long list (currently around 120.000 characters) - fortunately there is a helpful project on GitHub that contains the list and a script to generate it.

What Unicode symbols are accepted in Python 3 variable names?

1 Answers

According to PEP 3131, the first character of an identifier needs to belong to ID_Start, the rest to ID_Continue, defined as follows:

ID_Start is defined as all characters having one of the general categories uppercase letters (Lu), lowercase letters (Ll), titlecase letters (Lt), modifier letters (Lm), other letters (Lo), letter numbers (Nl), the underscore, and characters carrying the Other_ID_Start property. XID_Start then closes this set under normalization, by removing all characters whose NFKC normalization is not of the form ID_Start ID_Continue* anymore.

ID_Continue is defined as all characters in ID_Start, plus nonspacing marks (Mn), spacing combining marks (Mc), decimal number (Nd), connector punctuations (Pc), and characters carryig the Other_ID_Continue property. Again, XID_Continue closes this set under NFKC-normalization; it also adds U+00B7 to support Catalan.

That's a long list (currently around 120.000 characters) - fortunately there is a helpful project on GitHub that contains the list and a script to generate it.

answered Sep 28 '22 03:09

Tim Pietzcker

Related questions
                            
                                Is JSON syntax a strict subset of Python syntax?
                            
                                Does my code prevent directory traversal?
                            
                                Django attribute error. 'module' object has no attribute 'rindex'
                            
                                Dictionary creation with fromkeys and mutable objects. A surprise [duplicate]
                            
                                Python module to enable ANSI colors for stdout on Windows?
                            
                                What is the best way to get a semi long unique id (non sequential) key for Database objects
                            
                                Passing multiple files with asterisk to python shell in Windows
                            
                                Trouble installing SciPy on windows
                            
                                How to redirect JVM output without tear up output from the application?
                            
                                Is it possible to import flask configuration values in modules without circular import?
                            
                                generalized cumulative functions in NumPy/SciPy?
                            
                                Selecting columns from pandas.HDFStore table
                            
                                Django Serialize Queryset to JSON to construct RESTful response with only field information and id
                            
                                Networkx in Python - draw node attributes as labels outside the node
                            
                                Django: why i can't get the tracebacks (in case of error) when i run LiveServerTestCase tests?
                            
                                Determining "days" in python when the timedelta.days is less than one
                            
                                TypeError: 'int' object does not support item assignment
                            
                                How do you load .ui files onto python classes with PySide?
                            
                                What is the purpose of a backslash at the end of a line?
                            
                                Changing matshow xticklabel position from top to bottom of the figure

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

What Unicode symbols are accepted in Python 3 variable names?

Tags:

python

syntax

variables

python-3.x

unicode

Devyn Collier Johnson

People also ask

1 Answers

Tim Pietzcker

Recent Activity

Donate For Us