I have a list of names in which I have made all uppercase, removed spaces, and non-alphabetic characters to more easily merge with another list -- both are in pandas dataframe. One of the dataframe's names have some names with <code>JR</code> attached to the end while their counterparts in the other dataframe to not contain this suffix. How can I strip all <code>JR</code> from both? I tried something like the following: <code>df['NAME'] = df['NAME'].str.replace('JR','')</code> but I think this would remove all instances of <code>JR</code> and not when it is the last 2 characters. Any help would be appreciated.

You could use replace with a regex: <pre class="prettyprint"><code>import pandas as pd df = pd.DataFrame(data=['Name JR', 'Name JR Middle', 'JR Name'], columns=['name']) df['name'] = df.name.str.replace(r'\bJR$', '', regex=True).str.strip() print(df) </code></pre> Output <pre class="prettyprint"><code> name 0 Name 1 Name JR Middle 2 JR Name </code></pre> The pattern <code>'\bJR$'</code> matches the word JR only at the end of the string.

One option is to remove <code>JR</code> using <code>string.endswith</code>, and remove it from the rows that contain it sclicing the <code>str</code> object: <pre class="prettyprint"><code>m = s.str.endswith('JR') s.loc[m] = s.loc[m].str[:-2] </code></pre> <hr> Example Using @danielmesejo's dataframe: <pre class="prettyprint"><code>df = pd.DataFrame(data=['Name JR', 'Name JR Middle', 'JR Name'], columns=['name']) m = df.name.str.endswith('JR') df.name.loc[m] = df.name.loc[m].str[:-2] name 0 Name 1 Name JR Middle 2 JR Name </code></pre>

Remove certain characters if on end of string in Pandas

3 Answers

You could use replace with a regex:

Click to copy

import pandas as pd

df = pd.DataFrame(data=['Name JR', 'Name JR Middle', 'JR Name'], columns=['name'])
df['name'] = df.name.str.replace(r'\bJR$', '', regex=True).str.strip()

print(df)

Output

Click to copy

             name
0            Name
1  Name JR Middle
2         JR Name

The pattern '\bJR$' matches the word JR only at the end of the string.

186

answered Oct 20 '22 00:10

Dani Mesejo

You need:

Click to copy

def jr_replace(x):
    match = re.sub(r'JR$',"",x)
    return match

df['NAME'] = df['NAME'].apply(jr_replace)

print(df)

answered Oct 19 '22 22:10

Sociopath

One option is to remove JR using string.endswith, and remove it from the rows that contain it sclicing the str object:

Click to copy

m = s.str.endswith('JR')
s.loc[m] = s.loc[m].str[:-2]

Example

Using @danielmesejo's dataframe:

Click to copy

df = pd.DataFrame(data=['Name JR', 'Name JR Middle', 'JR Name'], columns=['name'])
m = df.name.str.endswith('JR')
df.name.loc[m] =  df.name.loc[m].str[:-2]

            name
0           Name 
1  Name JR Middle
2         JR Name

answered Oct 19 '22 23:10

yatu

Related questions
                            
                                How to get rid of white lines in confusion matrix?
                            
                                Call python script from .Net Core using pythonnet
                            
                                Django Tutorial: 'detail' is not a valid view function or pattern name
                            
                                Reshape vertical series to horizontal in Python
                            
                                Tying Autoencoder Weights in a Dense Keras Layer
                            
                                contains pyspark SQL: TypeError: 'Column' object is not callable
                            
                                Finding Similar Document
                            
                                Discord.py Rewrite gathering list of all commands
                            
                                Using default arguments in a function with variable arguments. Is this possible?
                            
                                'NoneType' object has no attribute 'text' in BeautifulSoup
                            
                                Issue clicking Javascript button with python/Selenium
                            
                                PytestWarning: Module already imported so cannot be rewritten: pytest_remotedata
                            
                                pd.DataFrame(data, columns=[]). How to pass a data which is with nested dictionary?
                            
                                conditional fill in pandas dataframe
                            
                                Logical AND of multiple columns in pandas
                            
                                How can I avoid PROJ_LIB error in importing basemap?
                            
                                Showing class attributes in the PyCharm debugger when subclassing str
                            
                                How do you round a string in Python?
                            
                                Wrapping asyncio.gather in a timeout
                            
                                How to add new fields in django user model [closed]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Remove certain characters if on end of string in Pandas

Tags:

python

pandas

a.powell

People also ask

3 Answers

Dani Mesejo

Sociopath

yatu

Recent Activity

Donate For Us