Remove specific characters from a string in Python

Tags:

I'm trying to remove specific characters from a string using Python. This is the code I'm using right now. Unfortunately it appears to do nothing to the string.

for char in line:     if char in " ?.!/;:":         line.replace(char,'')

How do I do this properly?

307

asked Oct 15 '10 03:10

Matt Phillips

1 Answers

Strings in Python are immutable (can't be changed). Because of this, the effect of line.replace(...) is just to create a new string, rather than changing the old one. You need to rebind (assign) it to line in order to have that variable take the new value, with those characters removed.

Also, the way you are doing it is going to be kind of slow, relatively. It's also likely to be a bit confusing to experienced pythonators, who will see a doubly-nested structure and think for a moment that something more complicated is going on.

Starting in Python 2.6 and newer Python 2.x versions *, you can instead use str.translate, (see Python 3 answer below):

line = line.translate(None, '!@#$')

or regular expression replacement with re.sub

import re line = re.sub('[!@#$]', '', line)

The characters enclosed in brackets constitute a character class. Any characters in line which are in that class are replaced with the second parameter to sub: an empty string.

Python 3 answer

In Python 3, strings are Unicode. You'll have to translate a little differently. kevpie mentions this in a comment on one of the answers, and it's noted in the documentation for str.translate.

When calling the translate method of a Unicode string, you cannot pass the second parameter that we used above. You also can't pass None as the first parameter. Instead, you pass a translation table (usually a dictionary) as the only parameter. This table maps the ordinal values of characters (i.e. the result of calling ord on them) to the ordinal values of the characters which should replace them, or—usefully to us—None to indicate that they should be deleted.

So to do the above dance with a Unicode string you would call something like

translation_table = dict.fromkeys(map(ord, '!@#$'), None) unicode_line = unicode_line.translate(translation_table)

Here dict.fromkeys and map are used to succinctly generate a dictionary containing

{ord('!'): None, ord('@'): None, ...}

Even simpler, as another answer puts it, create the translation table in place:

unicode_line = unicode_line.translate({ord(c): None for c in '!@#$'})

Or, as brought up by Joseph Lee, create the same translation table with str.maketrans:

unicode_line = unicode_line.translate(str.maketrans('', '', '!@#$'))

* for compatibility with earlier Pythons, you can create a "null" translation table to pass in place of None:

import string line = line.translate(string.maketrans('', ''), '!@#$')

Here string.maketrans is used to create a translation table, which is just a string containing the characters with ordinal values 0 to 255.

answered Oct 06 '22 12:10

intuited

Related questions
                            
                                Is there a difference between "==" and "is"?
                            
                                How do you round UP a number?
                            
                                Is it possible to break a long line to multiple lines in Python? [duplicate]
                            
                                How to save/restore a model after training?
                            
                                Dealing with multiple Python versions and PIP?
                            
                                Python and pip, list all versions of a package that's available?
                            
                                Changing the "tick frequency" on x or y axis in matplotlib
                            
                                How to check Django version
                            
                                How to delete the contents of a folder?
                            
                                Get statistics for each group (such as count, mean, etc) using pandas GroupBy?
                            
                                Create list of single item repeated N times
                            
                                Does Python's time.time() return the local or UTC timestamp?
                            
                                Filter dict to contain only certain keys?
                            
                                How to calculate number of days between two given dates
                            
                                Add a new item to a dictionary in Python [duplicate]
                            
                                How to urlencode a querystring in Python?
                            
                                ImportError: Cannot import name X
                            
                                Can I force pip to reinstall the current version?
                            
                                TensorFlow not found using pip
                            
                                Split string with multiple delimiters in Python [duplicate]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Remove specific characters from a string in Python

Tags:

python

string

replace

immutability

Matt Phillips

People also ask

1 Answers

Python 3 answer

intuited

Recent Activity

Donate For Us