How to exclude a character from a regex group?

Tags: python, regex

I want to strip all non-alphanumeric characters EXCEPT the hyphen from a string in Python. How can I change this regular expression so that it matches any non-alphanumeric character except the hyphen?

re.compile(r'[\W_]')

Thanks.


asked Nov 05 '10 17:11

atp

1 Answer

You could just use a negated character class instead:

re.compile(r"[^a-zA-Z0-9-]")

This matches any character that is neither in the ASCII alphanumeric ranges nor a hyphen. Like your current regex, it also matches the underscore.
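One difference from the original `[\W_]` may be worth keeping in mind: in Python 3, `\W` is Unicode-aware for `str` patterns, whereas the explicit `a-zA-Z0-9` ranges are ASCII-only, so accented letters are stripped as well. If that matters for your data, a variant along these lines might work. This is only a sketch, and the function name `strip_keep_unicode` is made up for illustration:

```python
import re

# Match any character that is neither a word character nor a hyphen,
# or that is an underscore (which \w would otherwise let through).
NON_WORD_EXCEPT_HYPHEN = re.compile(r"[^\w-]|_")

def strip_keep_unicode(text):
    """Remove everything except Unicode letters/digits and hyphens."""
    return NON_WORD_EXCEPT_HYPHEN.sub("", text)

print(strip_keep_unicode("café_au-lait!"))                 # caféau-lait
print(re.sub(r"[^a-zA-Z0-9-]", "", "café_au-lait!"))      # cafau-lait
```

The second line shows the ASCII-only class dropping the "é", which is fine if your input is plain ASCII.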

>>> import re
>>> r = re.compile(r"[^a-zA-Z0-9-]")
>>> s = "some#%te_xt&with--##%--5 hy-phens  *#"
>>> r.sub("", s)
'sometextwith----5hy-phens'

Note that this also removes spaces (which may well be what you want).
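If you would rather keep whitespace, one option is to add `\s` to the negated class. The following is a small sketch under that assumption; `strip_keep_spaces` is a hypothetical helper name, not something from the question:

```python
import re

# Same negated class as above, with \s added so whitespace survives.
# The hyphen stays last in the class so it is treated literally.
KEEP_ALNUM_SPACE_HYPHEN = re.compile(r"[^a-zA-Z0-9\s-]")

def strip_keep_spaces(text):
    """Remove everything except ASCII alphanumerics, whitespace and hyphens."""
    return KEEP_ALNUM_SPACE_HYPHEN.sub("", text)

s = "some#%te_xt&with--##%--5 hy-phens  *#"
print(repr(strip_keep_spaces(s)))  # 'sometextwith----5 hy-phens  '
```

The two trailing spaces of the input are preserved, so you may want to call `.strip()` on the result afterwards.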

Edit: SilentGhost has suggested that it is likely cheaper for the engine to handle this with a quantifier, in which case you can simply use:

re.compile(r"[^a-zA-Z0-9-]+")

The + makes each run of consecutive unwanted characters match (and be replaced) as a single unit rather than one character at a time.
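To see the effect of the quantifier, `re.subn` can be used, since it returns the number of substitutions along with the new string. This is just an illustrative check (the variable names are arbitrary), and it says nothing about actual timings:

```python
import re

s = "some#%te_xt&with--##%--5 hy-phens  *#"

single = re.compile(r"[^a-zA-Z0-9-]")   # one substitution per character
runs = re.compile(r"[^a-zA-Z0-9-]+")    # one substitution per run

single_result, single_count = single.subn("", s)
runs_result, runs_count = runs.subn("", s)

print(single_result == runs_result)  # True: the output string is identical
print(single_count, runs_count)      # 12 6
```

Both patterns produce the same string; the version with + simply performs fewer replacements.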

answered Sep 29 '22 04:09

eldarerathis
