Flatten a list of elements in Pandas DataFrame

Tags: python-3.x, pandas, dataframe

My data structure is:

ds = [{
    "name": "groupA",
    "subGroups": [123,456]
},
{
    "name": "groupB",
    "subGroups": ['aaa', 'bbb' , 'ccc']
}]

This gives the following DataFrame:

import pandas as pd

df = pd.DataFrame(ds)

    name    subGroups
0   groupA  [123, 456]
1   groupB  [aaa, bbb, ccc]

I want:

    name    subGroupsFlattend
0   groupA  123
1   groupA  456
2   groupB  aaa
3   groupB  bbb
4   groupB  ccc

Any ideas?

asked Mar 23 '18 14:03

More Than Five

2 Answers

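Use DataFrame.explode (available since pandas 0.25), which turns each element of a list-like cell into its own row and repeats the other columns: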

df = df.explode('subGroups')
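
Note that explode keeps the original row labels, so the index would read 0, 0, 1, 1, 1 rather than the 0 to 4 shown in the question. The following is a minimal sketch of one way to get the requested layout; the rename and reset_index steps and the variable name flat are additions for illustration, not part of the original answer.

```python
import pandas as pd

ds = [
    {"name": "groupA", "subGroups": [123, 456]},
    {"name": "groupB", "subGroups": ["aaa", "bbb", "ccc"]},
]
df = pd.DataFrame(ds)

flat = (
    df.explode("subGroups")                                   # one row per list element
      .rename(columns={"subGroups": "subGroupsFlattend"})     # column name as spelled in the question
      .reset_index(drop=True)                                 # replace the repeated labels with 0..n-1
)
print(flat)
```

On pandas 1.1 or later, passing ignore_index=True to explode should have the same effect as the reset_index call.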

answered Sep 17 '22 13:09

Jake Reece

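You can build the flattened output by repeating each name once per list element and concatenating the lists:

pd.DataFrame({
    'name': df.name.repeat(df.subGroups.str.len()),  # repeat each name len(list) times
    'subGroup': df.subGroups.sum()                   # summing lists concatenates them
})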
Out[364]: 
     name subGroup
0  groupA      123
0  groupA      456
1  groupB      aaa
1  groupB      bbb
1  groupB      ccc
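
A variant of the same repeat-and-concatenate idea is sketched below, assuming the lists may be long enough that summing them becomes slow (each addition copies the accumulated list). It swaps in itertools.chain.from_iterable for the sum and drops the repeated index labels via to_numpy; both choices, and the names lengths and flat, are illustrative rather than taken from the answer.

```python
from itertools import chain

import pandas as pd

ds = [
    {"name": "groupA", "subGroups": [123, 456]},
    {"name": "groupB", "subGroups": ["aaa", "bbb", "ccc"]},
]
df = pd.DataFrame(ds)

# Number of elements in each row's list.
lengths = df["subGroups"].str.len()

flat = pd.DataFrame({
    # Repeat each name once per list element; to_numpy() discards the
    # repeated labels so the result gets a fresh 0..n-1 index.
    "name": df["name"].repeat(lengths).to_numpy(),
    # Concatenate all lists in a single pass.
    "subGroup": list(chain.from_iterable(df["subGroups"])),
})
print(flat)
```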

answered Sep 17 '22 13:09

BENY
