Given a regex character class/set, how can i get a list of all matchable characters (in python 3). E.g.: <pre class="prettyprint"><code>[\dA-C] </code></pre> should give <pre class="prettyprint"><code>['0','1','2','3','4','5','6','7','8','9','A','B','C'] </code></pre>

I think what you are looking for is <code>string.printable</code> which returns all the printable characters in Python. For example: <pre class="prettyprint"><code>>>> import string >>> string.printable '0123456789abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ!"#$%&\'()*+,-./:;<=>?@[\\]^_`{|}~ \t\n\r\x0b\x0c' </code></pre> Now to check content satisfied by your regex, you may do: <pre class="prettyprint"><code>>>> import re >>> x = string.printable >>> pattern = r'[\dA-C]' >>> print(re.findall(pattern, x)) ['0', '1', '2', '3', '4', '5', '6', '7', '8', '9', 'A', 'B', 'C'] </code></pre> <code>string.printable</code> is a combination of digits, letters, punctuation, and whitespace. Also check String Constants for complete list of constants available with string module. <hr> In case you need the list of all <code>unicode</code> characters, you may do: <pre class="prettyprint"><code>import sys unicode_list = [chr(i) for i in range(sys.maxunicode)] </code></pre> Note: It will be a huge list, and console might get stuck for a while to give the result as value of <code>sys.maxunicode</code> is: <pre class="prettyprint"><code>>>> sys.maxunicode 1114111 </code></pre> In case you are dealing with some specific unicode formats, refer Unicode Character Ranges for limiting the ranges you are interested in.

<pre class="prettyprint"><code>import re x = '123456789ABCDE' pattern = r'[\dA-C]' print(re.findall(pattern,x)) #prints ['1', '2', '3', '4', '5', '6', '7', '8', '9', 'A', 'B', 'C'] </code></pre> Is this what you are looking for? If you don't have <code>x</code> and just want to match ascii characters you can use : <pre class="prettyprint"><code>import re import string x = string.ascii_uppercase + string.digits pattern = r'[\dA-C]' print(re.findall(pattern,x)) </code></pre> If you want to take inputs for the pattern you can simply just do: <pre class="prettyprint"><code> pattern = input() #with either one from above </code></pre>

How to get a list of matchable characters from a regex class

Tags:

python

string

regex

python-3.x

Given a regex character class/set, how can i get a list of all matchable characters (in python 3). E.g.:

[\dA-C]

should give

['0','1','2','3','4','5','6','7','8','9','A','B','C']

881

asked Oct 17 '16 19:10

o17t H1H' S'k

2 Answers

I think what you are looking for is string.printable which returns all the printable characters in Python. For example:

>>> import string
>>> string.printable
'0123456789abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ!"#$%&\'()*+,-./:;<=>?@[\\]^_`{|}~ \t\n\r\x0b\x0c'

Now to check content satisfied by your regex, you may do:

>>> import re
>>> x = string.printable
>>> pattern = r'[\dA-C]'
>>> print(re.findall(pattern, x))
['0', '1', '2', '3', '4', '5', '6', '7', '8', '9', 'A', 'B', 'C']

string.printable is a combination of digits, letters, punctuation, and whitespace. Also check String Constants for complete list of constants available with string module.

In case you need the list of all unicode characters, you may do:

import sys
unicode_list = [chr(i) for i in range(sys.maxunicode)]

Note: It will be a huge list, and console might get stuck for a while to give the result as value of sys.maxunicode is:

>>> sys.maxunicode
1114111

In case you are dealing with some specific unicode formats, refer Unicode Character Ranges for limiting the ranges you are interested in.

151

answered Nov 02 '22 23:11

Moinuddin Quadri

import re

x = '123456789ABCDE'
pattern = r'[\dA-C]'
print(re.findall(pattern,x))    
#prints ['1', '2', '3', '4', '5', '6', '7', '8', '9', 'A', 'B', 'C']

Is this what you are looking for?

If you don't have x and just want to match ascii characters you can use :

import re
import string

x = string.ascii_uppercase + string.digits
pattern = r'[\dA-C]'
print(re.findall(pattern,x))

If you want to take inputs for the pattern you can simply just do:

 pattern = input() #with either one from above

answered Nov 02 '22 23:11

MooingRawr

Related questions
                            
                                How to increment a date using Arrow?
                            
                                Filter values inside Python generator expressions
                            
                                How to skip a single loop iteration in python? [duplicate]
                            
                                Whats the difference between 'rb' and 'rU' in the open() function for csv
                            
                                Unable to get a single linebreak while sending email through Sendgrid
                            
                                Python 2 __missing__ method
                            
                                How convert output tensor to one-hot tensor?
                            
                                A DRY approach to Python try-except blocks?
                            
                                Python open html file, take screenshot, crop and save as image
                            
                                Reading in file block by block using specified delimiter in python
                            
                                python map function with min argument and two lists
                            
                                Django Error: Your URL pattern is invalid. Ensure that urlpatterns is a list of url() instances
                            
                                Function annotation for subclasses of abstract class
                            
                                Convert complex NumPy array into (n, 2)-array of real and imaginary parts
                            
                                pd.Timedelta conversion on a dataframe column
                            
                                Django form. How hide colon from initial_text?
                            
                                lxml xsi:schemaLocation namespace URI validation issue
                            
                                Install Matlab engine in Anaconda Python (Linux)
                            
                                how to trigger function in another object when variable changed. Python
                            
                                "Stratify" parameter from sklearn's train_test_split not working correctly?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With