How to split one column into multiple columns in Pandas using a regular expression?

2 Answers

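You can split on the regex ,\s+ (a comma followed by one or more whitespace characters):

# borrowing the sample from Allen's answer
# raw string so that \s is passed to the regex engine unchanged
df[['street', 'city', 'state']] = df['address'].str.split(r',\s+', expand=True)
print(df)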
                              address id             street          city  \
0  71 Pilgrim Avenue, Chevy Chase, MD  a  71 Pilgrim Avenue   Chevy Chase   
1         72 Main St, Chevy Chase, MD  b         72 Main St   Chevy Chase   

  state  
0    MD  
1    MD
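
One thing to watch: ,\s+ needs at least one whitespace character after the comma, so a value such as 72 Main St,Chevy Chase,MD would not be split at all. A minimal sketch, assuming you want to tolerate missing spaces, is to use ,\s* instead. The sample frame and the name parts are made up for illustration:

```python
import pandas as pd

# Hypothetical sample: the second row has no space after the commas.
df = pd.DataFrame({'address': ['71 Pilgrim Avenue, Chevy Chase, MD',
                               '72 Main St,Chevy Chase,MD']})

# r',\s*' matches a comma followed by zero or more whitespace characters.
parts = df['address'].str.split(r',\s*', expand=True)
parts.columns = ['street', 'city', 'state']
print(parts)
```

Both rows should then end up with the same three pieces, whether or not a space follows the comma.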

If you also need to remove the address column, add drop:

df[['street', 'city', 'state']] = df['address'].str.split(r',\s+', expand=True)
df = df.drop('address', axis=1)
print(df)
  id             street         city state
0  a  71 Pilgrim Avenue  Chevy Chase    MD
1  b         72 Main St  Chevy Chase    MD
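
As an alternative to split, Series.str.extract with named groups may read more clearly when the pattern is fixed, because each group name becomes a column name. This is only a sketch under the assumption that every address has exactly three comma-separated parts; the pattern and the names extracted and result are illustrative:

```python
import pandas as pd

df = pd.DataFrame({'address': ['71 Pilgrim Avenue, Chevy Chase, MD',
                               '72 Main St, Chevy Chase, MD'],
                   'id': ['a', 'b']})

# Each named group becomes a column; rows that do not match get NaN.
pattern = r'^(?P<street>[^,]+),\s*(?P<city>[^,]+),\s*(?P<state>[^,]+)$'
extracted = df['address'].str.extract(pattern)

# Attach the new columns and drop the original one.
result = df.drop(columns='address').join(extracted)
print(result)
```

Unlike split, rows that do not fit the pattern come back as NaN instead of producing a different number of columns.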

answered Sep 21 '22 16:09

jezrael

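import pandas as pd

df = pd.DataFrame({'address': {0: '71 Pilgrim Avenue, Chevy Chase, MD',
                               1: '72 Main St, Chevy Chase, MD'},
                   'id': {0: 'a', 1: 'b'}})
# If your address format is consistent, you can simply split on the comma.
parts = df['address'].str.split(',', expand=True)
parts.columns = ['street', 'city', 'state']
# Strip the whitespace left around each piece, then attach the new columns.
# (DataFrame.applymap is deprecated in recent pandas, so use .str.strip per column.)
df2 = df.join(parts.apply(lambda col: col.str.strip()))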
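
This relies on every address having exactly three comma-separated parts; assigning three column names will typically fail if a row splits into more. A quick way to check that assumption first is to count the commas. The sample row with an apartment number and the name is_consistent are hypothetical:

```python
import pandas as pd

# Hypothetical data: the second row has an extra comma in the street part.
df = pd.DataFrame({'address': ['71 Pilgrim Avenue, Chevy Chase, MD',
                               '72 Main St, Apt 4, Chevy Chase, MD']})

# A row fits the street/city/state layout only if it has exactly two commas.
is_consistent = df['address'].str.count(',') == 2

# Show the rows that would need special handling before splitting.
print(df[~is_consistent])
```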

answered Sep 23 '22 16:09

Allen
