Pandas 'describe' is not returning summary of all columns

Tags: python, pandas

1 Answer

As of pandas v0.15.0, pass include='all' to DataFrame.describe() to get a summary of every column when the dataframe has mixed column types. By default, describe() summarizes only the numerical columns.

Example:

In[1]:
    import numpy as np
    import pandas as pd

    df = pd.DataFrame({'$a': ['a', 'b', 'c', 'd', 'a'], '$b': np.arange(5)})
    df.describe(include='all')

Out[1]:
             $a        $b
    count     5  5.000000
    unique    4       NaN
    top       a       NaN
    freq      2       NaN
    mean    NaN  2.000000
    std     NaN  1.581139
    min     NaN  0.000000
    25%     NaN  1.000000
    50%     NaN  2.000000
    75%     NaN  3.000000
    max     NaN  4.000000

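In the combined summary, numerical columns show NaN for the statistics that apply only to object (string) columns (unique, top, freq), and object columns show NaN for the numerical statistics (mean, std, min, the percentiles, max).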
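As a quick check of the behavior described above, the following minimal sketch compares the default call with include='all' on the same toy dataframe. The variable names default_summary and full_summary are only illustrative.

```python
import numpy as np
import pandas as pd

# Same toy frame as in the example: one string column, one numeric column.
df = pd.DataFrame({'$a': ['a', 'b', 'c', 'd', 'a'], '$b': np.arange(5)})

default_summary = df.describe()            # default: numeric columns only
full_summary = df.describe(include='all')  # every column, regardless of type

print(list(default_summary.columns))
print(list(full_summary.columns))

# Statistics that do not apply to a column's type come back as NaN.
print(pd.isna(full_summary.loc['mean', '$a']))    # no mean for strings
print(pd.isna(full_summary.loc['unique', '$b']))  # no unique count for numbers
```

If the default summary is missing a column you expected, checking df.dtypes is usually the fastest way to see whether that column was read in as object rather than as a number.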

Summarizing only numerical or object columns

To call describe() on just the numerical columns, use describe(include=[np.number]).

To call describe() on just the object (string) columns, use describe(include=['O']).

In[2]:
    df.describe(include=[np.number])

Out[2]:
                 $b
    count  5.000000
    mean   2.000000
    std    1.581139
    min    0.000000
    25%    1.000000
    50%    2.000000
    75%    3.000000
    max    4.000000

In[3]:
    df.describe(include=['O'])

Out[3]:
            $a
    count    5
    unique   4
    top      a
    freq     2
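The answer only uses include, but describe() also accepts an exclude parameter, and DataFrame.select_dtypes() can do the filtering before the call. The sketch below shows both as alternatives rather than as the answer's own method. The variable names are illustrative.

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({'$a': ['a', 'b', 'c', 'd', 'a'], '$b': np.arange(5)})

# Summarize everything except the numeric columns.
non_numeric = df.describe(exclude=[np.number])
print(non_numeric)

# Filter by dtype first, then describe what is left.
numeric_via_select = df.select_dtypes(include=[np.number]).describe()
numeric_via_include = df.describe(include=[np.number])
print(numeric_via_select.equals(numeric_via_include))
```

Filtering with select_dtypes() first can be handy when the same subset of columns is reused for other operations besides describe().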

answered Sep 24 '22 04:09

ilyas patanam
