Using split() function on a pandas dataframe

Question

I have the following dataframe:

enter image description here

I'm trying to get rid of the percentage signs. In order to do this I decided to apply a function to the Democrat and Republican column and try to split() by the percentage sign. The following code tries to do that:

gallup_2012[['Democrat/Lean Democratic', 'Republican/Lean 
Republican']].apply(lambda x: x.split('%')[0])

However, when I try to do this, I get the following error:

("'Series' object has no attribute 'split'", u'occurred at index Democrat/Lean > Democratic')

I'm not quite sure why this error occurs, as I can apply other functions to this series. It's just that the split() function doesn't work.

Any help would be appreciated!

ksai · Accepted Answer

df[[ ]] returns a dataframe, so if you use df.apply() then it would be applied on pd.Series. And Series doesn't have split() method, But if you use df[ ] and use df.apply() then you would be able to achieve what you want. The drawback is only that you can apply only on one column.

gallup_2012['Democrat/Lean Democratic'].apply(lambda x: x.split('%')[0])

Henrique Coura · Answer

You could use the str.replace method on the desired columns

df["column"] = df["column"].str.replace("%", "")

Using split() function on a pandas dataframe

Tags:

python

split

pandas

bugsyb

2 Answers

ksai

Henrique Coura

Recent Activity

Donate For Us

Using split() function on a pandas dataframe

Tags:

python

split

pandas

bugsyb

2 Answers

ksai

Henrique Coura

Related questions

Recent Activity

Donate For Us