Plotting errorbar with mean and std after grouping

I have the following dataframe:

                    mean       std
insert quality                    
0.0    good     0.009905  0.003662
0.1    good     0.450190  0.281895
       poor     0.376818  0.306806
0.2    good     0.801856  0.243288
       poor     0.643859  0.322378
0.3    good     0.833235  0.172025
       poor     0.698972  0.263266
0.4    good     0.842288  0.141925
       poor     0.706708  0.241269
0.5    good     0.853634  0.118604
       poor     0.685716  0.208073
0.6    good     0.845496  0.118609
       poor     0.675907  0.207755
0.7    good     0.826335  0.133820
       poor     0.656934  0.222823
0.8    good     0.829707  0.130154
       poor     0.627111  0.213046
0.9    good     0.816636  0.137371
       poor     0.589331  0.232756
1.0    good     0.801211  0.147864
       poor     0.554589  0.245867

What should I do if I want to plot two curves (points + error bars), using the index level "insert" as the x-axis and distinguishing the two curves by "quality" (good, poor)? They should be different colors too.

I'm kind of stuck: I've produced every kind of plot apart from the one I need.

415

asked Jan 13 '16 13:01

Marco Pietrosanto

1 Answer

You could loop through the groups in df.groupby('quality') and call group.plot on each group.

import pandas as pd
import matplotlib.pyplot as plt

df = pd.DataFrame({
    'insert': [0.0, 0.1, 0.1, 0.2, 0.2, 0.3, 0.3, 0.4, 0.4, 0.5, 0.5, 0.6, 0.6,
               0.7, 0.7, 0.8, 0.8, 0.9, 0.9, 1.0, 1.0],
    'mean': [0.009905, 0.45019, 0.376818, 0.801856, 0.643859, 0.833235,
             0.698972, 0.842288, 0.706708, 0.853634, 0.685716, 0.845496, 0.675907,
             0.826335, 0.656934, 0.829707, 0.627111, 0.816636, 0.589331, 0.801211,
             0.554589],
    'quality': ['good', 'good', 'poor', 'good', 'poor', 'good', 'poor', 'good',
                'poor', 'good', 'poor', 'good', 'poor', 'good', 'poor', 'good', 'poor',
                'good', 'poor', 'good', 'poor'],
    'std': [0.003662, 0.281895, 0.306806, 0.243288, 0.322378, 0.172025,
            0.263266, 0.141925, 0.241269, 0.118604, 0.208073, 0.118609, 0.207755,
            0.13382, 0.222823, 0.130154, 0.213046, 0.137371, 0.232756, 0.147864,
            0.245867]})

fig, ax = plt.subplots()    # 1

for key, group in df.groupby('quality'):
    group.plot('insert', 'mean', yerr='std', label=key, ax=ax)   # 2

plt.show()

[Image: line plot of mean against insert with error bars, one curve per quality group]

To make both plots appear on the same axes (the numbers match the # 1 and # 2 comments in the code):

1. Create your own axes object, ax.
2. Set the ax parameter to that axes object in each call to group.plot.
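If your frame still has the (insert, quality) MultiIndex shown in the question, you may not need to rebuild it as flat columns. The following is only a sketch of one possible variant, not the method used above: it groups on the index level and calls matplotlib's ax.errorbar directly. The names stats and colors are made up for illustration, only the first few rows of the question's table are reproduced, and the color choice is an assumption.

```python
import matplotlib.pyplot as plt
import pandas as pd

# First rows of the question's table, rebuilt with its (insert, quality) MultiIndex.
index = pd.MultiIndex.from_tuples(
    [(0.0, "good"), (0.1, "good"), (0.1, "poor"), (0.2, "good"), (0.2, "poor")],
    names=["insert", "quality"],
)
stats = pd.DataFrame(  # hypothetical name for the grouped mean/std frame
    {
        "mean": [0.009905, 0.450190, 0.376818, 0.801856, 0.643859],
        "std": [0.003662, 0.281895, 0.306806, 0.243288, 0.322378],
    },
    index=index,
)

colors = {"good": "green", "poor": "red"}  # illustrative color choice

fig, ax = plt.subplots()
# Group on the "quality" index level; each group keeps its "insert" values in the index.
for quality, group in stats.groupby(level="quality"):
    x = group.index.get_level_values("insert").to_numpy()
    ax.errorbar(x, group["mean"].to_numpy(), yerr=group["std"].to_numpy(),
                marker="o", capsize=3, color=colors[quality], label=quality)

ax.set_xlabel("insert")
ax.set_ylabel("mean")
ax.legend()
```

Call plt.show() afterwards to display the figure. Calling stats.reset_index() first and then using the groupby('quality') loop from the answer should give an equivalent plot.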

It might look better as a bar plot:

# start a fresh figure so the bars are not drawn on top of the line plot
fig, ax = plt.subplots()

# fill in missing data with 0, so the bar plots are aligned
df = df.pivot(index='insert', columns='quality').fillna(0).stack().reset_index()

colors = ['green', 'red']
positions = [0, 1]

# draw one set of bars per quality group, offset via the position argument
for (key, group), color, pos in zip(df.groupby('quality'), colors, positions):
    group.plot('insert', 'mean', yerr='std', kind='bar', width=0.4, label=key,
               position=pos, color=color, alpha=0.5, ax=ax)

ax.set_xlim(-1, 11)
plt.show()

[Image: bar plot of mean against insert with error bars, good in green and poor in red]
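As a possible alternative to positioning each group's bars by hand, you could reshape the data so that each quality becomes a column and pass a matching frame of errors as yerr; pandas then places the bars side by side. This is a sketch under assumptions rather than the approach above: stats, means and stds are illustrative names, only a few rows of the question's data are used, and the missing (0.0, poor) entry is filled with 0 as in the answer.

```python
import matplotlib.pyplot as plt
import pandas as pd

# First rows of the question's table, rebuilt with its (insert, quality) MultiIndex.
index = pd.MultiIndex.from_tuples(
    [(0.0, "good"), (0.1, "good"), (0.1, "poor"), (0.2, "good"), (0.2, "poor")],
    names=["insert", "quality"],
)
stats = pd.DataFrame(  # hypothetical name for the grouped mean/std frame
    {
        "mean": [0.009905, 0.450190, 0.376818, 0.801856, 0.643859],
        "std": [0.003662, 0.281895, 0.306806, 0.243288, 0.322378],
    },
    index=index,
)

# One column per quality; the missing (0.0, poor) cell becomes 0 so it plots as an empty bar.
means = stats["mean"].unstack("quality").fillna(0)
stds = stats["std"].unstack("quality").fillna(0)

fig, ax = plt.subplots()
# yerr is matched to the plotted columns by name (good, poor).
means.plot(kind="bar", yerr=stds, color=["green", "red"], alpha=0.5,
           capsize=3, ax=ax)
ax.set_ylabel("mean")
```

Call plt.show() afterwards to display the figure. The same two frames can also be passed to a line plot (means.plot(yerr=stds)), though you would then probably want to skip the fillna(0) step so the missing point is not drawn at zero.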

181

answered Sep 27 '22 22:09

unutbu

Tags: python, pandas, matplotlib, plot
