What's a quick way to produce the inverse output of the value_counts function? For example, if I have the following series: <pre class="prettyprint"><code>1 24 2 2 3 1 4 2 5 3 6 12 7 21 8 204 9 400 10 71 11 160 Name: foo, dtype: float64 </code></pre> How can I concisely produce the following array? <pre class="prettyprint"><code>numpy.array([1, 1, 1, ... , 2, 2, 3, 4, 4, 5, 5, 5, 6, ... ]) </code></pre>

You can use <code>np.repeat</code>. If your Series is named <code>s</code>, it's possible to write: <pre class="prettyprint"><code>np.repeat(s.index.values, s.values) </code></pre> Here <code>s.index.values</code> are the values to repeat, and <code>s.values</code> specifies the number of times that each value should be repeated. The output is a 1D array.

Pandas: inverse of value_counts function

Tags:

python

pandas

numpy

What's a quick way to produce the inverse output of the value_counts function?

For example, if I have the following series:

1      24
2       2
3       1
4       2
5       3
6      12
7      21
8     204
9     400
10     71
11    160
Name: foo, dtype: float64

How can I concisely produce the following array?

numpy.array([1, 1, 1, ... , 2, 2, 3, 4, 4, 5, 5, 5, 6, ... ])

990

asked Feb 18 '16 21:02

Andrew Mao

1 Answers

You can use np.repeat. If your Series is named s, it's possible to write:

np.repeat(s.index.values, s.values)

Here s.index.values are the values to repeat, and s.values specifies the number of times that each value should be repeated. The output is a 1D array.

188

answered Oct 06 '22 09:10

Alex Riley

Related questions
                            
                                Long to wide data. Pandas
                            
                                re.split with spaces in python
                            
                                Why is numpy list access slower than vanilla python?
                            
                                Environmental path to Python not working?
                            
                                OCaml map a string to a list of strings
                            
                                Decoding Ebcdic
                            
                                Drop multi-indexed rows of a DataFrame based on 'AND' condition between levels
                            
                                PILKit was unable to import the Python Imaging Library
                            
                                Removing columns which has only "nan" values from a NumPy array
                            
                                how to copy an array into a bigger array(partial copy)
                            
                                Using StatsModels to plot quantile regression for 2nd order polynomial
                            
                                Vagrant not installing pip during provision
                            
                                Custom iteration behavior in dict subclass
                            
                                Pylint complains "no value for argument 'cls'"
                            
                                How do I call the Google Vision API with an image stored in Google Cloud Storage?
                            
                                How to extract a Google link's href from search results with Selenium?
                            
                                How to have different results for 'list' (players/) and 'detail' (players/{id})?
                            
                                Matplotlib multiprocessing fonts corruption using savefig
                            
                                Understanding difference between Double Quote and Single Quote with __repr__()
                            
                                Python: Counting cumulative occurrences of values in a pandas series

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With