I have a dataframe like this: <pre class="prettyprint"><code>col1, col2 A 0 A 1 B 2 C 3 </code></pre> I would like to get this: <pre class="prettyprint"><code>{ A: [0,1], B: [2], C: [3] } </code></pre> I tried: <pre class="prettyprint"><code>df.set_index('col1')['col2'].to_dict() </code></pre> but that is not quite correct. The first issue I have is 'A' is repeated, I end up getting A:1 only (0 gets overwritten). How to fix?

You can use a dictionary comprehension on a groupby. <pre class="prettyprint"><code>>>> {idx: group['col2'].tolist() for idx, group in df.groupby('col1')} {'A': [0, 1], 'B': [2], 'C': [3]} </code></pre>

Pandas: Convert dataframe to dict of lists

Tags:

python

pandas

dataframe

I have a dataframe like this:

col1, col2
A      0
A      1
B      2
C      3

I would like to get this:

{ A: [0,1], B: [2], C: [3] }

I tried:

df.set_index('col1')['col2'].to_dict()

but that is not quite correct. The first issue I have is 'A' is repeated, I end up getting A:1 only (0 gets overwritten). How to fix?

308

asked May 11 '16 01:05

user4979733

1 Answers

You can use a dictionary comprehension on a groupby.

>>> {idx: group['col2'].tolist() 
     for idx, group in df.groupby('col1')}
{'A': [0, 1], 'B': [2], 'C': [3]}

178

answered Nov 12 '22 12:11

Alexander

Related questions
                            
                                How can I return an empty (null?) item back from a map method in PySpark?
                            
                                Python AttributeError: 'str' object has no attribute 'DataFrame'
                            
                                Reversing lists of numbers in python
                            
                                Weights in Convolutional network?
                            
                                Trace Bug which happends only sometimes in CI
                            
                                Use of relationship and ForeignKey modules in SQLAlchemy
                            
                                How to use pyinstaller with hidden imports for scipy.optimize leastsq
                            
                                How does a for loop evaluate its argument
                            
                                difference between two regular expressions: [abc]+ and ([abc])+
                            
                                Fast way to split an int into bytes
                            
                                How to accurately minus X month on a date in Python?
                            
                                Delete specific cache in Flask-Cache or Flask-Caching
                            
                                How can I import Pandas with Jython
                            
                                Why is Gaussian Filter different between cv2 and skimage?
                            
                                Softmax derivative in NumPy approaches 0 (implementation)
                            
                                Read .nc (netcdf) files using python
                            
                                Jupyter shows plot without plt.show()
                            
                                Place a chart in plotly popup
                            
                                How to serialize Python dict to JSON
                            
                                Multi threading in Tkinter GUI, threads in different classes

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With