Here's a sample dataframe:
label  data
a      1.09
b      2.1
a      5.0
b      2.0
c      1.9
What I want is

arr = [[1.09, 5.0], [2.1, 2.0], [1.9]]

preferably as a list of numpy arrays. I know that df.groupby('label').groups.keys() gives me the list ['a', 'b', 'c'], and df.groupby('label').groups.values() gives me something like arr, but as Int64Index objects of row indices rather than the data values. However, I tried df.loc[df.groupby('label').groups.values()]['label'] and it isn't giving the desired result. How do I accomplish this? Thanks!
groupby() to group rows into lists: by using the DataFrame.groupby() function you can group rows on a column, select the column you want from the grouped result, and finally convert the values of each group to a list using apply(list), as in the sketch below.
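A minimal sketch of that approach, assuming the sample dataframe from the question (the df construction and variable names here are just for illustration):

import pandas as pd

df = pd.DataFrame({'label': ['a', 'b', 'a', 'b', 'c'],
                   'data': [1.09, 2.1, 5.0, 2.0, 1.9]})

# group on 'label', take the 'data' column, and collect each group into a list
grouped = df.groupby('label')['data'].apply(list)
print(grouped.tolist())
[[1.09, 5.0], [2.1, 2.0], [1.9]]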
So a groupby() operation can downcast a DataFrame to a Series or, if given a Series as input, upcast the result to a DataFrame, depending on what is returned per group. For your dataframe, the unequal group sizes (unequal index lengths) coerce a Series return, because the "combine" step of the operation cannot assemble the pieces into a rectangular DataFrame.
Groupby preserves the order of rows within each group. When calling apply, the group keys are added to the index to identify the pieces, and the dimensionality of the return type is reduced where possible (otherwise a consistent type is returned), as the short illustration below shows.
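A rough illustration of those two points, assuming the same sample dataframe (variable names are illustrative only):

import pandas as pd

df = pd.DataFrame({'label': ['a', 'b', 'a', 'b', 'c'],
                   'data': [1.09, 2.1, 5.0, 2.0, 1.9]})

# one scalar per group -> the result is reduced to a Series,
# with the group keys 'a', 'b', 'c' as its index
sums = df.groupby('label')['data'].apply(lambda s: s.sum())
print(type(sums).__name__, sums.index.tolist())
Series ['a', 'b', 'c']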
preferably as a list of numpy arrays.
Preferably not, because you're asking for ragged arrays, meaning the inner arrays (i.e., the rows) are not all of the same length. That is inconvenient for numpy: it cannot store such arrays efficiently as contiguous C arrays internally, so it falls back to slow Python objects.
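You can see this by building the ragged structure directly in numpy (recent numpy versions require dtype=object explicitly here, otherwise they raise an error):

import numpy as np

# the rows have different lengths, so numpy stores them as plain Python
# objects instead of a contiguous 2-D block of floats
ragged = np.array([[1.09, 5.0], [2.1, 2.0], [1.9]], dtype=object)
print(ragged.dtype)
object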
In this situation, I'd recommend nested Python lists. That's achievable through a groupby + apply.
lst = df.groupby('label')['data'].apply(pd.Series.tolist).tolist()
print(lst)
[[1.09, 5.0], [2.1, 2.0], [1.9]]
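If you really do want numpy arrays rather than lists, one possible variant (not from the original answer, and it assumes a pandas version that has Series.to_numpy()) is to iterate over the groups of the same df yourself:

arrs = [g.to_numpy() for _, g in df.groupby('label')['data']]
# arrs is a list of three 1-D float arrays, one per label, in the order 'a', 'b', 'c'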