Pandas: split column of lists of unequal length into multiple columns

Tags:

pandas

I have a Pandas dataframe that looks like the below:

                   codes
1                  [71020]
2                  [77085]
3                  [36415]
4                  [99213, 99287]
5                  [99233, 99233, 99233]

I'm trying to split the lists in df['codes'] into columns, like the below:

                   code_1      code_2      code_3   
1                  71020
2                  77085
3                  36415
4                  99213       99287
5                  99233       99233       99233

where columns that don't have a value (because the list was not that long) are filled with blanks or NaNs or something.

I've seen answers like this one and others similar to it, and while they work on lists of equal length, they all throw errors when I try to use the methods on lists of unequal length. Is there a good way do to this?

432

asked Jun 20 '17 22:06

user139014

2 Answers

Try:

pd.DataFrame(df.codes.values.tolist()).add_prefix('code_')     code_0   code_1   code_2 0   71020      NaN      NaN 1   77085      NaN      NaN 2   36415      NaN      NaN 3   99213  99287.0      NaN 4   99233  99233.0  99233.0

Include the index

pd.DataFrame(df.codes.values.tolist(), df.index).add_prefix('code_')     code_0   code_1   code_2 1   71020      NaN      NaN 2   77085      NaN      NaN 3   36415      NaN      NaN 4   99213  99287.0      NaN 5   99233  99233.0  99233.0

We can nail down all the formatting with this:

f = lambda x: 'code_{}'.format(x + 1) pd.DataFrame(     df.codes.values.tolist(),     df.index, dtype=object ).fillna('').rename(columns=f)     code_1 code_2 code_3 1   71020               2   77085               3   36415               4   99213  99287        5   99233  99233  99233

140

answered Oct 14 '22 10:10

piRSquared

Another solution:

In [95]: df.codes.apply(pd.Series).add_prefix('code_') Out[95]:     code_0   code_1   code_2 1  71020.0      NaN      NaN 2  77085.0      NaN      NaN 3  36415.0      NaN      NaN 4  99213.0  99287.0      NaN 5  99233.0  99233.0  99233.0

answered Oct 14 '22 09:10

MaxU - stop WAR against UA

Related questions
                            
                                Installing matplotlib on Mac OSX Mountain Lion
                            
                                How to make an external database query iterable?
                            
                                How to copy a table from excel to word using pythonCOM
                            
                                Returning a Structure using ctypes in Python
                            
                                Why should PyImport_AppendInittab() be called before Py_Initialize()?
                            
                                How to mock a Python class that is two imports deep?
                            
                                How can I query rows with unique values on a joined column?
                            
                                creating inset in matplot lib
                            
                                Using lists and tuples in Python if statements
                            
                                Need a file system metadata layer for applications
                            
                                __getitem__ or square brackets for recursive data structure
                            
                                python popen rsync with rsh option
                            
                                How to fix an error at start after compiling with PyInstaller got pyodbc?
                            
                                Geopy ValueError "Didn't find exactly one match" when geocoding
                            
                                Does os.walk leak memory?
                            
                                Changing the “locale preferred encoding” in Python 3 in Windows
                            
                                SQLAlchemy 'bulk_save_objects' vs 'add_all' underlying logic difference?
                            
                                Python: why is `return` not allowed in a module
                            
                                sudo pip install VS pip install --user
                            
                                How to determine if my Python Requests call to API returns no data

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With