Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Pivoting tables in a pandas dataframe

I have a requirement where in I am trying to count the values and Put them in the pivot table.

This is my dataframe,

  Cola        Colb          
 Apple    Rippened 
Orange    Rippened
 Apple  UnRippened
 Mango  UnRippened

I want the Output to be like this,

        Rippened  UnRippened
Apple          1           1
Mango          0           1
Orange         1           0

Kindly share your thoughts.

like image 212
Siddharth Thanga Mariappan Avatar asked Nov 28 '22 20:11

Siddharth Thanga Mariappan


2 Answers

I love this question....

Option 1

pd.get_dummies(df.Cola).T.dot(pd.get_dummies(df.Colb))

        Rippened  UnRippened
Apple          1           1
Mango          0           1
Orange         1           0

Option 2

i, r = pd.factorize(df.Cola.values)
j, c = pd.factorize(df.Colb.values)
n, m = r.size, c.size
b = np.bincount(i * m + j, minlength=n * m).reshape(n, m)

pd.DataFrame(b, r, c)

        Rippened  UnRippened
Apple          1           1
Orange         1           0
Mango          0           1

Option 3

df.groupby(['Cola', 'Colb']).size().unstack(fill_value=0)

Colb    Rippened  UnRippened
Cola                        
Apple          1           1
Mango          0           1
Orange         1           0

Option 4

df.groupby('Cola').Colb.value_counts().unstack(fill_value=0)

Colb    Rippened  UnRippened
Cola                        
Apple          1           1
Mango          0           1
Orange         1           0
like image 89
piRSquared Avatar answered Dec 20 '22 03:12

piRSquared


Using my favourite: pd.crosstab

df = pd.crosstab(df.Cola, df.Colb)
print(df)

Colb    Rippened  UnRippened
Cola                        
Apple          1           1
Mango          0           1
Orange         1           0
like image 26
cs95 Avatar answered Dec 20 '22 02:12

cs95