I have a <code>DataFrame</code> and I want to get both group names and corresponding group counts as a list or numpy array. However when I convert the output to matrix I only get group counts I dont get the names. Like in the example below: <pre class="prettyprint"><code> df = pd.DataFrame({'a':[0.5, 0.4, 5 , 0.4, 0.5, 0.6 ]}) b = df['a'].value_counts() print(b) </code></pre> output: <pre class="prettyprint"><code>[0.4 2 0.5 2 0.6 1 5.0 1 Name: a, dtype: int64] </code></pre> what I tried is <code>print[b.as_matrix()]</code>. Output: <pre class="prettyprint"><code>[array([2, 2, 1, 1])] </code></pre> In this case I do not have the information of corresponding group names which also I need. Thank you.

Convert it to a <code>dict</code>: <pre class="prettyprint"><code>bd = dict(b) print(bd) # {0.40000000000000002: 2, 0.5: 2, 0.59999999999999998: 1, 5.0: 1} </code></pre> Don't worry about the long decimals. They're just a result of floating point representation; you still get what you expect from the dict. <pre class="prettyprint"><code>bd[0.4] # 2 </code></pre>

Python Pandas: Get dataframe.value_counts() result as list

Tags:

python

pandas

dataframe

numpy

I have a DataFrame and I want to get both group names and corresponding group counts as a list or numpy array. However when I convert the output to matrix I only get group counts I dont get the names. Like in the example below:

  df = pd.DataFrame({'a':[0.5, 0.4, 5 , 0.4, 0.5, 0.6 ]})
  b = df['a'].value_counts()
  print(b)

output:

[0.4    2
0.5    2
0.6    1
5.0    1
Name: a, dtype: int64]

what I tried is print[b.as_matrix()]. Output:

[array([2, 2, 1, 1])]

In this case I do not have the information of corresponding group names which also I need. Thank you.

495

asked May 28 '17 20:05

s900n

1 Answers

Convert it to a dict:

bd = dict(b)
print(bd)
# {0.40000000000000002: 2, 0.5: 2, 0.59999999999999998: 1, 5.0: 1}

Don't worry about the long decimals. They're just a result of floating point representation; you still get what you expect from the dict.

bd[0.4]
# 2

133

answered Nov 14 '22 22:11

Arya McCarthy

Related questions
                            
                                Getting the target of a symbolic link with pathlib
                            
                                Speed up Pandas cummin/cummax
                            
                                Edit element in browser with python selenium
                            
                                Can I read multiple files into a Spark Dataframe from S3, passing over nonexistent ones?
                            
                                How to query with raw SQL using Session or engine
                            
                                Python unittest framework: Test description
                            
                                How to use string as input for csv reader without storing it to file
                            
                                Extracting nearest lat-lon and time value from netcdf using xarray
                            
                                How can I get pytest to ignore Test* classes that don't subclass unittest?
                            
                                pandas logical and operator with and without brackets produces different results [duplicate]
                            
                                ModuleNotFoundError: No module named 'forms'
                            
                                Omit joining lines in matplotlib plot e.g. y = tan(x)
                            
                                how to get the current row index with Openpyxl
                            
                                Tensorflow: Using neural network to classify positive or negative phrases
                            
                                Previous month datetime pandas
                            
                                Python Seaborn - How are outliers determined in boxplots
                            
                                Using fillna method on multiple columns of a Pandas DataFrame failed
                            
                                How to download pdf files using Python?
                            
                                Matrix norm in TensorFlow
                            
                                Call built-in function if overwritten by a variable of the same name

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With