Valid characters in a python class name

Tags:

metaprogramming

I'm dynamically creating python classes, and I know not all characters are valid in this context.

Is there a method somewhere in the class library that I can use to sanitize a random text string, so that I can use it as a class name? Either that or a list of the allowed characters would be a good help.

Addition regarding clashes with identifier names: Like @Ignacio pointed out in the answer below, any character that is valid as an identifier is a valid character in a class name. And you can even use a reserved word as a class name without any trouble. But there's a catch. If you do use a reserved word, you won't be able to make the class accessible like other (non-dynamically-created) classes (e.g., by doing globals()[my_class.__name__] = my_class). The reserved word will always take precedence in such case.

902

asked Apr 12 '12 08:04

Filipe Correia

1 Answers

Python 3

Python Language Reference, §2.3, "Identifiers and keywords"

The syntax of identifiers in Python is based on the Unicode standard annex UAX-31, with elaboration and changes as defined below; see also PEP 3131 for further details.

Within the ASCII range (U+0001..U+007F), the valid characters for identifiers are the same as in Python 2.x: the uppercase and lowercase letters A through Z, the underscore _ and, except for the first character, the digits 0 through 9.

Python 3.0 introduces additional characters from outside the ASCII range (see PEP 3131). For these characters, the classification uses the version of the Unicode Character Database as included in the unicodedata module.

Identifiers are unlimited in length. Case is significant.
identifier   ::=  xid_start xid_continue* id_start     ::=  <all characters in general categories Lu, Ll, Lt, Lm, Lo, Nl, the underscore, and characters with the Other_ID_Start property> id_continue  ::=  <all characters in id_start, plus characters in the categories Mn, Mc, Nd, Pc and others with the Other_ID_Continue property> xid_start    ::=  <all characters in id_start whose NFKC normalization is in "id_start xid_continue*"> xid_continue ::=  <all characters in id_continue whose NFKC normalization is in "id_continue*"> 
The Unicode category codes mentioned above stand for:

Lu - uppercase letters

Ll - lowercase letters

Lt - titlecase letters

Lm - modifier letters

Lo - other letters

Nl - letter numbers

Mn - nonspacing marks

Mc - spacing combining marks

Nd - decimal number

Pc - connector punctuations

Other_ID_Start - explicit list of characters in PropList.txt to support backwards compatibility

Other_ID_Continue - likewise

All identifiers are converted into the normal form NFKC while parsing; comparison of identifiers is based on NFKC.

A non-normative HTML file listing all valid identifier characters for Unicode 4.1 can be found at https://www.dcl.hpi.uni-potsdam.de/home/loewis/table-3131.html.

Python 2

Python Language Reference, §2.3, "Identifiers and keywords"

Identifiers (also referred to as names) are described by the following lexical definitions:
identifier ::=  (letter|"_") (letter | digit | "_")* letter     ::=  lowercase | uppercase lowercase  ::=  "a"..."z" uppercase  ::=  "A"..."Z" digit      ::=  "0"..."9" 
Identifiers are unlimited in length. Case is significant.

161

answered Oct 17 '22 06:10

Ignacio Vazquez-Abrams

Related questions
                            
                                Why would shutil.copy() raise a permission exception when cp doesn't?
                            
                                install filter on logging level in python using dictConfig
                            
                                Sending messages with Telegram - APIs or CLI?
                            
                                Opening a .ipynb.txt File
                            
                                parametrize and running a single test in pytest
                            
                                How can you test that two dictionaries are equal with pytest in python
                            
                                Why 1//0.01 == 99 in Python?
                            
                                Can I use a class attribute as a default value for an instance method?
                            
                                How to make a list of n numbers in Python and randomly select any number?
                            
                                Find number of columns in csv file
                            
                                Neural Network training with PyBrain won't converge
                            
                                Can you create a Python list from a string, while keeping characters in specific keywords together?
                            
                                Pandas: append dataframe to another df
                            
                                module 'matplotlib' has no attribute 'verbose'
                            
                                Glade or no glade: What is the best way to use PyGtk?
                            
                                How to retrieve a variable's name in python at runtime?
                            
                                Searching a sorted list? [closed]
                            
                                remove colorbar from figure in matplotlib
                            
                                When to use == and when to use is?
                            
                                Python: avoiding if condition for this code?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With