Regex to extract ONLY alphanumeric words

Tags:

I am looking for a regex to extract the word that ONLY contain alphanumeic characters:

string = 'This is a $dollar sign !!'
matches = re.findall(regex, string)
matches = ['This', 'is', 'sign']

This can be done by tokenizing the string and evaluate each token individually using the following regex:

^[a-zA-Z0-9]+$

Due to performance issues, I want to able to extract the alphanumeric tokens without tokenizing the whole string. The closest I got to was

regex = \b[a-zA-Z0-9]+\b

, but it still extracts substrings containing alphanumeric characters:

string = 'This is a $dollar sign !!'
matches = re.findall(regex, string)
matches = ['This', 'is', 'dollar', 'sign']

Is there a regex able to pull this off? I've tried different things but can't come up with a solution.

311

asked Jan 05 '19 22:01

GRoutar

2 Answers

Instead of word boundaries, lookbehind and lookahead for spaces (or the beginning/end of the string):

(?:^|(?<= ))[a-zA-Z0-9]+(?= |$)

https://regex101.com/r/TZ7q1c/1

Note that "a" is a standalone alphanumeric word, so it's included too.

['This', 'is', 'a', 'sign']

108

answered Sep 24 '22 23:09

CertainPerformance

There is no need to use regexs for this, python has a built in isalnum string method. See below:

string = 'This is a $dollar sign !!'

matches = [word for word in string.split(' ') if word.isalnum()]

answered Sep 22 '22 23:09

hegash

Related questions
                            
                                Error- AttributeError: 'DirectoryIterator' object has no attribute 'ndim in autoencoder design in keras
                            
                                How to connect to Odoo database from an android application
                            
                                Is there a faster alternative to np.diff?
                            
                                Why does Exception proxy __str__ onto the args?
                            
                                How to send python output to telegram CHANNEL not to Group and gmail email group
                            
                                How can i check that a list is in my array in python
                            
                                How to return a list of frequencies for a certain value in a dict
                            
                                In python, how do I invert a 2D dictionary?
                            
                                Error in Google Colaboratory - AttributeError: module 'PIL.Image' has no attribute 'register_decoder'
                            
                                Pandas: Enumerate duplicates in index
                            
                                Python "in" and "==" confusion
                            
                                Log Python Systemd output to log file
                            
                                How to return rows with Null values in pyspark dataframe?
                            
                                Subsetting pandas dataframe and retain original size
                            
                                How to check version 4 UUIDs in python? [closed]
                            
                                How to implement RBF activation function in Keras?
                            
                                Selenium Threads: how to run multi-threaded browser with proxy ( python)
                            
                                What is the recommended way to compute a weighted sum of selected columns of a pandas dataframe?
                            
                                How can I write a function fmap that returns the same type of iterable that was inputted?
                            
                                Django ImageField is not updating when update() method is used

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Regex to extract ONLY alphanumeric words

Tags:

python

regex

alphanumeric

GRoutar

People also ask

2 Answers

CertainPerformance

hegash

Recent Activity

Donate For Us