pandas 'as_index' function doesn't work as expected

Tags:

pandas

This is a minimal reproducible example of my original dataframe, called 'calls':

       phone_number    call_outcome   agent  call_number
0      83473306392   NOT INTERESTED  orange            0
1     762850680150  CALL BACK LATER  orange            1
2     476309275079   NOT INTERESTED  orange            2
3     899921761538  CALL BACK LATER     red            3
4     906739234066  CALL BACK LATER  orange            4

Writing this pandas command...

most_calls = calls.groupby('agent') \
.count().sort('call_number', ascending=False)

Returns this...

           phone_number  call_outcome  call_number
agent                                          
orange          2234          2234         2234
red             1478          1478         1478
black            750           750          750
green            339           339          339
blue             199           199          199

This is correct, except that I want 'agent' to be a regular column rather than the index.

I've used the as_index=False parameter on numerous occasions and am familiar with specifying axis=1. However, in this instance it doesn't matter where or how I incorporate these parameters: every permutation returns an error.

These are some examples I've tried and the corresponding errors:

most_calls = calls.groupby('agent', as_index=False) \
.count().sort('call_number', ascending=False)

ValueError: invalid literal for long() with base 10: 'black'

And

most_calls = calls.groupby('agent', as_index=False, axis=1) \
.count().sort('call_number', ascending=False)

ValueError: as_index=False only valid for axis=0

asked Jun 25 '15 12:06

RDJ

1 Answer

I believe that, irrespective of the groupby operation you've done, you just need to call reset_index() on the result, which turns the index ('agent' here) back into a regular column.

Starting with a mockup of your data:

import pandas as pd
calls = pd.DataFrame({
    'agent': ['orange', 'red'],
    'phone_number': [2234, 1478],
    'call_outcome': [2234, 1478],
})
>> calls
    agent  call_outcome  phone_number
0  orange          2234          2234
1     red          1478          1478

Here is the operation you did, with reset_index() appended:

>> calls.groupby('agent').count().sort('phone_number', ascending=False).reset_index()
    agent  call_outcome  phone_number
0  orange             1             1
1     red             1             1
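
For readers on newer pandas releases, where DataFrame.sort has been replaced by sort_values, the same idea can be sketched as follows. The sample rows are hypothetical, modelled on the five rows shown in the question, and the second variant assumes a pandas version in which as_index=False works with count(), which was not the case for the asker.

```python
import pandas as pd

# Hypothetical sample data, modelled on the 'calls' frame in the question.
calls = pd.DataFrame({
    'phone_number': [83473306392, 762850680150, 476309275079,
                     899921761538, 906739234066],
    'call_outcome': ['NOT INTERESTED', 'CALL BACK LATER', 'NOT INTERESTED',
                     'CALL BACK LATER', 'CALL BACK LATER'],
    'agent': ['orange', 'orange', 'orange', 'red', 'orange'],
    'call_number': [0, 1, 2, 3, 4],
})

# Option 1: group, count, sort, then move 'agent' out of the index
# and back into an ordinary column with reset_index().
most_calls = (
    calls.groupby('agent')
    .count()
    .sort_values('call_number', ascending=False)
    .reset_index()
)

# Option 2: ask groupby not to use 'agent' as the index at all.
# reset_index(drop=True) only renumbers the rows after sorting.
most_calls_alt = (
    calls.groupby('agent', as_index=False)
    .count()
    .sort_values('call_number', ascending=False)
    .reset_index(drop=True)
)

print(most_calls)
```

Both variants should leave 'agent' as a regular column; reset_index() is the safer choice when as_index=False misbehaves, as it did in the question.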

answered Oct 21 '22 17:10

Ami Tavory
