I have a data frame similar to the one below: <pre class="prettyprint"><code>Name Volume Value May21 23 21321 James 12 12311 Adi22 11 4435 Hello 34 32454 Girl90 56 654654 </code></pre> I want the output to be in the format: <pre class="prettyprint"><code>Name Volume Value May 23 21321 James 12 12311 Adi 11 4435 Hello 34 32454 Girl 56 654654 </code></pre> Want to remove all the numbers from the Name column. Closest I have come is doing it at a cell level with the following code: <pre class="prettyprint"><code>result = ''.join([i for i in df['Name'][1] if not i.isdigit()]) </code></pre> Any idea how to do it in a better way at the series/dataframe level.

You can apply str.replace to the <code>Name</code> column in combination with regular expressions: <pre class="prettyprint"><code>import pandas as pd # Example DataFrame df = pd.DataFrame.from_dict({'Name' : ['May21', 'James', 'Adi22', 'Hello', 'Girl90'], 'Volume': [23, 12, 11, 34, 56], 'Value' : [21321, 12311, 4435, 32454, 654654]}) df['Name'] = df['Name'].str.replace('\d+', '') print(df) </code></pre> Output: <pre class="prettyprint"><code> Name Value Volume 0 May 21321 23 1 James 12311 12 2 Adi 4435 11 3 Hello 32454 34 4 Girl 654654 56 </code></pre> In the regular expression <code>\d</code> stands for "any digit" and <code>+</code> stands for "one or more". Thus, <code>str.replace('\d+', '')</code> means: "Replace all occurring digits in the strings with nothing".

How to remove numbers from string terms in a pandas dataframe

Name    Volume  Value May21   23      21321 James   12      12311 Adi22   11      4435 Hello   34      32454 Girl90  56      654654

I want the output to be in the format:

Name    Volume  Value May     23      21321 James   12      12311 Adi     11      4435 Hello   34      32454 Girl    56      654654

Want to remove all the numbers from the Name column.

Closest I have come is doing it at a cell level with the following code:

result = ''.join([i for i in df['Name'][1] if not i.isdigit()])

Any idea how to do it in a better way at the series/dataframe level.

820

asked Jan 18 '17 12:01

mank

1 Answers

You can apply str.replace to the Name column in combination with regular expressions:

import pandas as pd  # Example DataFrame df = pd.DataFrame.from_dict({'Name'  : ['May21', 'James', 'Adi22', 'Hello', 'Girl90'],                              'Volume': [23, 12, 11, 34, 56],                              'Value' : [21321, 12311, 4435, 32454, 654654]})  df['Name'] = df['Name'].str.replace('\d+', '')  print(df)

Output:

    Name   Value  Volume 0    May   21321      23 1  James   12311      12 2    Adi    4435      11 3  Hello   32454      34 4   Girl  654654      56

In the regular expression \d stands for "any digit" and + stands for "one or more".

Thus, str.replace('\d+', '') means: "Replace all occurring digits in the strings with nothing".

198

answered Sep 21 '22 19:09

Milo

Related questions
                            
                                What is the most 'pythonic' way to logically combine a list of booleans?
                            
                                apscheduler in Flask executes twice [duplicate]
                            
                                Get the name of a decorated function? [duplicate]
                            
                                How to use avg and sum in SQLAlchemy query
                            
                                How do I generate circular thumbnails with PIL?
                            
                                Using more than one flag in python re.findall
                            
                                Tensorflow Data Adapter Error: ValueError: Failed to find data adapter that can handle input
                            
                                SQLAlchemy boolean value is None
                            
                                pip: Could not find an activated virtualenv (required)
                            
                                KeyError: 'TCL_Library' when I use cx_Freeze
                            
                                Pandas - Strip white space
                            
                                UnicodeDecodeError: 'utf-8' codec can't decode byte 0x96 in position 35: invalid start byte
                            
                                Remove the newline character in a list read from a file [duplicate]
                            
                                Kivy: How to change window size?
                            
                                List of objects to JSON with Python
                            
                                One Hot Encoding using numpy [duplicate]
                            
                                How to reverse order of keys in python dict?
                            
                                Variance Inflation Factor in Python
                            
                                How to find whether a number belongs to a particular range in Python? [duplicate]
                            
                                What does __contains__ do, what can call __contains__ function

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to remove numbers from string terms in a pandas dataframe

Tags:

python

string

pandas

mank

People also ask

1 Answers

Milo

Recent Activity

Donate For Us