Create a dataframe from a dict where values are variable-length lists

Tags: python, pandas, dataframe

I have a dict where each value is a list, for example:

my_dict = {1: [964725688, 6928857],
           ...

           22: [1667906, 35207807, 685530997, 35207807],
           ...
           }

In this example, the longest list has 4 items, but lists could be longer than that.

I would like to convert it to a dataframe like:

1  964725688
1  6928857
...
22 1667906
22 35207807
22 685530997
22 35207807


asked May 11 '17 18:05

spitfiredd

2 Answers

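import pandas as pd

my_dict = {1: [964725688, 6928857], 22: [1667906, 35207807, 685530997, 35207807]}

# One [key, element] row per list element (dict.items() replaces Python 2's iteritems())
df = pd.DataFrame([[k, ele] for k, v in my_dict.items() for ele in v])

print(df)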

   0   1        
0   1  964725688
1   1    6928857
2  22    1667906
3  22   35207807
4  22  685530997
5  22   35207807
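If named columns are wanted instead of the default 0 and 1, the same comprehension can be passed along with a columns argument. This is only a sketch; the names "key" and "value" are illustrative and do not come from the question.

```python
import pandas as pd

my_dict = {1: [964725688, 6928857],
           22: [1667906, 35207807, 685530997, 35207807]}

# One (key, element) row per list element.
rows = [(k, ele) for k, v in my_dict.items() for ele in v]

# "key" and "value" are assumed column names, chosen for illustration.
df = pd.DataFrame(rows, columns=["key", "value"])
print(df)
```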


answered Nov 10 '22 07:11

galaxyan

First Idea
pandas

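import numpy as np
import pandas as pd

s = pd.Series(my_dict)
# Flatten the lists, repeating each key once per element of its list
pd.Series(
    np.concatenate(s.values),
    s.index.repeat(s.str.len())
)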

1     964725688
1       6928857
22      1667906
22     35207807
22    685530997
22     35207807
dtype: int64
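The result above is a Series indexed by the dict keys, while the question asks for a dataframe. One possible way to get there, assuming a two-column layout is what is wanted, is to name the index and reset it. The column names "key" and "value" are made up for this sketch.

```python
import numpy as np
import pandas as pd

my_dict = {1: [964725688, 6928857],
           22: [1667906, 35207807, 685530997, 35207807]}

s = pd.Series(my_dict)

# Same flattening as above: concatenated values, keys repeated per list length.
flat = pd.Series(np.concatenate(s.values), s.index.repeat(s.str.len()))

# Move the repeated keys out of the index and into a regular column.
df = flat.rename_axis("key").reset_index(name="value")
print(df)
```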

Faster!
numpy

values = list(my_dict.values())
lens = [len(value) for value in values]
keys = list(my_dict.keys())
pd.Series(np.concatenate(values), np.repeat(keys, lens))

1     964725688
1       6928857
22      1667906
22     35207807
22    685530997
22     35207807
dtype: int64

Interesting
pd.concat

pd.concat({k: pd.Series(v) for k, v in my_dict.items()}).reset_index(level=1, drop=True)

1     964725688
1       6928857
22      1667906
22     35207807
22    685530997
22     35207807
dtype: int64
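Not one of the ideas above, but on pandas 0.25 or later Series.explode may be a shorter route to the same shape. A minimal sketch, assuming integer values and using the illustrative column names "key" and "value":

```python
import pandas as pd

my_dict = {1: [964725688, 6928857],
           22: [1667906, 35207807, 685530997, 35207807]}

df = (
    pd.Series(my_dict)        # one row per key, each value a list
    .explode()                # one row per list element, key repeated in the index
    .astype("int64")          # explode returns object dtype; cast back to integers
    .rename_axis("key")       # assumed name for the key column
    .reset_index(name="value")
)
print(df)
```

The cast to int64 only makes sense when every element is an integer; for mixed or missing values it would need to be dropped or adjusted.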

answered Nov 10 '22 05:11

piRSquared
