Split pandas column and add last element to a new column

Tags: python, split, pandas

I have a pandas dataframe containing (besides other columns) full names:

 fullname
 martin master
 andreas test

I want to split the fullname column on whitespace and assign the last element of each split to a new column. The result should look like:

 fullname           lastname
 martin master      master
 andreas test       test

I thought it would work like this:

df['lastname'] = df['fullname'].str.split(' ')[-1]

However, I get a KeyError: -1.

I use [-1], the last element of each split, to make sure I get the actual last name. For a name with more than two parts (e.g. andreas martin master), this should still give the last name, master.

So how can I do this?


asked Jul 21 '16 08:07

beta

1 Answer

You need another str accessor to get the last element of the split for every row. str.split returns a Series of lists, so putting [-1] directly on it tries to index the Series itself with the label -1, which does not exist:

In [31]:

df['lastname'] = df['fullname'].str.split().str[-1]
df
Out[31]:
         fullname lastname
0   martin master   master
1    andreas test     test
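
To make the difference between the two kinds of indexing visible, here is a minimal sketch. It rebuilds the example frame by hand and adds the three-part name from the question as a third row; the variable name parts is only for illustration, and the default integer index (0, 1, 2) is assumed.

```python
import pandas as pd

# Example data from the question, plus the three-part name it mentions.
df = pd.DataFrame(
    {"fullname": ["martin master", "andreas test", "andreas martin master"]}
)

# .str.split() returns a Series whose values are lists of tokens.
parts = df["fullname"].str.split()

# Plain [-1] on the Series is a label lookup. The index holds 0, 1, 2,
# so there is no label -1 and pandas raises KeyError.
try:
    parts[-1]
    raised = False
except KeyError:
    raised = True

# .str[-1] indexes into each row's list instead of into the Series.
df["lastname"] = parts.str[-1]
print(df)
```

With the default index, the lastname column should hold master, test and master, including for the three-part name.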

answered Oct 11 '22 15:10

EdChum
