Pandas crosstab, but with values from aggregation of third column

Question

Here is my problem:

df = pd.DataFrame({'A': ['one', 'one', 'two', 'two', 'one'] ,
                   'B': ['Ar', 'Br', 'Cr', 'Ar','Ar'] ,
                   'C': [1, 0, 0, 1,0 ]})

I would like to generate something like output of pd.crosstab function, but values on the intersection of column and row should come from aggregation of third column:

    Ar,  Br, Cr
one 0.5 0  0
two 1  0  0

For example, there are two cases of 'one' and 'Ar' corresponding values in column 'C' are 1,0 we sum up values in column 'C' (0+1) and divide by number of values in column 'C', so we get (0+1)/2 =0.5. Whenever combination is not present we (like 'Cr' and 'one') we set it to zero. Any thoughts?

MaxU - stop WAR against UA · Accepted Answer

you can use pivot_table() method, which uses aggfunc='mean' per-default:

In [46]: df.pivot_table(index='A', columns='B', values='C', fill_value=0)
Out[46]:
B     Ar  Br  Cr
A
one  0.5   0   0
two  1.0   0   0

piRSquared · Answer

I like groupby and unstack

df.groupby(['A', 'B']).C.mean().unstack(fill_value=0)

enter image description here

Pandas crosstab, but with values from aggregation of third column

Tags:

python

pandas

aggregate

user1700890

2 Answers

MaxU - stop WAR against UA

piRSquared

Recent Activity

Donate For Us

Pandas crosstab, but with values from aggregation of third column

Tags:

python

pandas

aggregate

user1700890

2 Answers

MaxU - stop WAR against UA

piRSquared

Related questions

Recent Activity

Donate For Us