Pandas split column into multiple columns by comma

Tags:

I am trying to split a column into multiple columns based on comma/space separation.

My dataframe currently looks like

     KEYS                                                  1 0   FIT-4270                                          4000.0439 1   FIT-4269                                          4000.0420, 4000.0471 2   FIT-4268                                          4000.0419 3   FIT-4266                                          4000.0499 4   FIT-4265                                          4000.0490, 4000.0499, 4000.0500, 4000.0504,

I would like

   KEYS                                                  1           2            3        4  0   FIT-4270                                          4000.0439 1   FIT-4269                                          4000.0420  4000.0471 2   FIT-4268                                          4000.0419 3   FIT-4266                                          4000.0499 4   FIT-4265                                          4000.0490  4000.0499  4000.0500  4000.0504

My code currently removes The KEYS column and I'm not sure why. Could anyone improve or help fix the issue?

v = dfcleancsv[1]  #splits the columns by spaces into new columns but removes KEYS?  dfcleancsv = dfcleancsv[1].str.split(' ').apply(Series, 1)

733

asked Jun 02 '16 19:06

Anekdotin

2 Answers

In case someone else wants to split a single column (deliminated by a value) into multiple columns - try this:

series.str.split(',', expand=True)

This answered the question I came here looking for.

Credit to EdChum's code that includes adding the split columns back to the dataframe.

pd.concat([df[[0]], df[1].str.split(', ', expand=True)], axis=1)

Note: The first argument df[[0]] is DataFrame.

The second argument df[1].str.split is the series that you want to split.

split Documentation

concat Documentation

190

answered Sep 19 '22 01:09

Anthony R

Using Edchums answer of

pd.concat([df[[0]], df[1].str.split(', ', expand=True)], axis=1)

I was able to solve it by substituting my variables.

dfcleancsv = pd.concat([dfcleancsv['KEYS'], dfcleancsv[1].str.split(', ', expand=True)], axis=1)

answered Sep 17 '22 01:09

Anekdotin

Related questions
                            
                                Error when trying to overload an operator "/"
                            
                                How to send a dictionary to a function that accepts **kwargs?
                            
                                How do I use string formatting to show BOTH leading zeros and precision of 3?
                            
                                Efficient Numpy 2D array construction from 1D array
                            
                                Asyncio two loops for different I/O tasks?
                            
                                Dictionary of lists to dataframe
                            
                                Prevent Flask jsonify from sorting the data
                            
                                Install Tensorflow 2.0 in conda enviroment
                            
                                Django urls straight to html template
                            
                                Undo a file readline() operation so file-pointer is back in original state
                            
                                Detect charset and convert to utf-8 in Python? [duplicate]
                            
                                Matplotlib imshow() stretch to "fit width"
                            
                                How to change Tkinter Button state from disabled to normal?
                            
                                Remove characters before and including _ in python 2.7
                            
                                Enable Python to Connect to MySQL via SSH Tunnelling
                            
                                Django - ImproperlyConfigured: Module "django.contrib.auth.middleware"
                            
                                Use pandas.shift() within a group
                            
                                Testing for reference equality in Python
                            
                                python - can lambda have more than one return
                            
                                Get AWS Account ID from Boto

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Pandas split column into multiple columns by comma

Tags:

python

split

pandas

dataframe

csv

Anekdotin

People also ask

2 Answers

Anthony R

Anekdotin

Recent Activity

Donate For Us