Suppose I have pandas data frame with 2 columns: <pre class="prettyprint"><code>df: Col1 Col2 1 1 1 2 1 2 1 2 3 4 3 4 </code></pre> Then I want to keep only the unique couple values (col1, col2) of these two columns and give their frequncy: <pre class="prettyprint"><code>df2: Col1 Col2 Freq 1 1 1 1 2 3 3 4 2 </code></pre> I think to use <code>df['Col1', 'Col2'].value_counts()</code> but it works only for one column. Does it exist a function to deal with many columns?

You need <code>groupby</code> + <code>size</code> + <code>Series.reset_index</code>: <pre class="prettyprint"><code>df = df.groupby(['Col1', 'Col2']).size().reset_index(name='Freq') print (df) Col1 Col2 Freq 0 1 1 1 1 1 2 3 2 3 4 2 </code></pre>

Unique values of two columns for pandas dataframe [duplicate]

Tags:

python

pandas

dataframe

unique

Suppose I have pandas data frame with 2 columns:

df: Col1  Col2       1     1       1     2       1     2       1     2       3     4       3     4

Then I want to keep only the unique couple values (col1, col2) of these two columns and give their frequncy:

df2: Col1  Col2  Freq       1     1     1       1     2     3       3     4     2

I think to use df['Col1', 'Col2'].value_counts() but it works only for one column. Does it exist a function to deal with many columns?

728

asked Jul 04 '17 13:07

curious_one

1 Answers

You need groupby + size + Series.reset_index:

df = df.groupby(['Col1', 'Col2']).size().reset_index(name='Freq') print (df)    Col1  Col2  Freq 0     1     1     1 1     1     2     3 2     3     4     2

answered Oct 07 '22 22:10

jezrael

Related questions
                            
                                Append item to MongoDB document array in PyMongo without re-insertion
                            
                                Faithfully Preserve Comments in Parsed XML
                            
                                How to check if a key-value pair is present in a dictionary?
                            
                                Difference between "raise" and "raise e"?
                            
                                how to install tensorflow on anaconda python 3.6
                            
                                installing urllib in Python3.6
                            
                                2D arrays in Python
                            
                                How can I place a table on a plot in Matplotlib?
                            
                                Why is matplotlib plotting my circles as ovals?
                            
                                Check if list items contains substrings from another list
                            
                                Celery task schedule (Ensuring a task is only executed one at a time)
                            
                                How do I convert an array to string using the jinja template engine?
                            
                                Scrapy - Silently drop an item
                            
                                Get array elements from index to end
                            
                                Python - Download File Using Requests, Directly to Memory
                            
                                Add n tasks to celery queue and wait for the results
                            
                                How to pass all Python's traffics through a http proxy?
                            
                                Response' object is not subscriptable Python http post request
                            
                                Python subprocess .check_call vs .check_output
                            
                                How to encrypt text with a password in python?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With