Can I use pandas' pivot_table to aggregate over a column with missing values?

Tags:

Can I use pandas pivot_table to aggregate over a column with missing values and have those missing values included as separate category?

In:
df = pd.DataFrame({'a': pd.Series(['X', 'X', 'Y', 'Y', 'N', 'N'], dtype='category'), 
                   'b': pd.Series([None, None, 'd', 'd', 'd', 'd'], dtype='category')})

Out:
    a   b
0   X   NaN
1   X   NaN
2   Y   d
3   Y   d
4   N   d
5   N   d

In:
df.groupby('a')['b'].apply(lambda x: x.value_counts(dropna=False)).unstack(1)

Out:
    NaN d
a       
N   NaN 2.0
X   2.0 0.0
Y   NaN 2.0

Can I achieve the same result using pandas pivot_table? If yes than how? Thanks.

216

asked Nov 23 '20 20:11

Roman Shevtsiv

1 Answers

For some unknown reason, dtype="category" does not work with pivot_table() when counting NaN values. Casting them to regular strings enables regular pivot_table(aggfunc="size").

df.astype(str).pivot_table(index="a", columns="b", aggfunc="size")

Result

b    d  nan
a          
N  2.0  NaN
X  NaN  2.0
Y  2.0  NaN

One can optionally do .fillna(0) to replace nans with 0s

112

answered Sep 30 '22 19:09

Bill Huang

Related questions
                            
                                I don't understand why I get this warning about conversion between signed and unsigned, is my compiler wrong? [duplicate]
                            
                                Run Invoke-Command in remote computer as administrator
                            
                                Facebook Sdk Login with new Activity Result API
                            
                                Rust diesel conditionally filter a query
                            
                                Error MSB3644: The reference assemblies for framework ".NETFramework,Version=v5.0" were not found
                            
                                Realtime Multiplayer Game with firebase
                            
                                is it possible to dispatch function when react component is unmount ? (React functional componet)
                            
                                What is @types/node package in NodeJs?
                            
                                scraping headlines from news website with infinite loading
                            
                                Python Dictionary with generic keys and Callable[T] values
                            
                                VSCode - tab icon color reflect git status
                            
                                Replace IronPython with Python.Net?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With