Seaborn stacked histogram/barplot

I have a pandas.DataFrame and I want to plot a graph based on two columns: Age (int), Survived (int - 0 or 1). Now I have something like this:

[image: current plot, one Age histogram per Survived value on separate subplots]

This is the code I use:

from typing import List, Optional

import matplotlib.pyplot as plt
import seaborn as sns

class DataAnalyzer:

    # self.train_data is assumed to be a pandas.DataFrame set elsewhere in the class
    def _facet_grid(self, func, x: List[str], col: Optional[str] = None, row: Optional[str] = None) -> None:
        g = sns.FacetGrid(self.train_data, col=col, row=row)
        if func == sns.barplot:
            g.map(func, *x, ci=None)
        else:
            g.map(func, *x)
        g.add_legend()
        plt.show()

    def analyze(self) -> None:
        # Check if survival rate is connected with Age
        self._facet_grid(plt.hist, col='Survived', x=['Age'])

So this is shown on two subplots. This is good, but it's harder to see the difference between the number of records that have 0 vs 1 in the Survived column for a particular age range.

So I want to have something like this:

[image: desired plot, a single stacked histogram]

In this scenario you could see the difference directly. Is there some way to do it in seaborn (because there I can easily operate on a pandas.DataFrame)? I don't want to use vanilla matplotlib if that's possible.

408

asked Dec 22 '18 20:12

dabljues

2 Answers

Starting with seaborn 0.11.0, you can do this with sns.histplot and multiple="stack":

# stacked histogram
import matplotlib.pyplot as plt
import numpy as np
import pandas as pd
import seaborn as sns

f = plt.figure(figsize=(7, 5))
ax = f.add_subplot(1, 1, 1)

# mock your data frame (random values, only for illustration)
_df = pd.DataFrame({
    "age": np.random.normal(30, 30, 1000),
    "survived": np.random.randint(0, 2, 1000)
})

# plot: multiple="stack" stacks the bars of each hue level within a bin
sns.histplot(data=_df, ax=ax, stat="count", multiple="stack",
             x="age", kde=False,
             palette="pastel", hue="survived",
             element="bars", legend=True)
ax.set_title("Seaborn Stacked Histogram")
ax.set_xlabel("Age")
ax.set_ylabel("Count")
plt.show()

[image: resulting stacked histogram]

166

answered Nov 09 '22 14:11

Gena Kukartsev

Just overlay the histogram of the Survived == 0 records on top of the histogram of all records. It's hard to give the exact code without the precise form of the dataframe, but here's a basic example using one of seaborn's example datasets.

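import matplotlib.pyplot as plt
import numpy as np
import seaborn as sns

tips = sns.load_dataset("tips")
# Shared bin edges so the two histograms line up bar for bar
bins = np.linspace(tips.total_bill.min(), tips.total_bill.max(), 21)
# All records first, then the subset drawn on top of them
sns.distplot(tips.total_bill, bins=bins, color="gold", kde=False, hist_kws={"alpha": 1})
sns.distplot(tips[tips.sex == "Female"].total_bill, bins=bins, color="blue", kde=False, hist_kws={"alpha": 1})
plt.show()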

answered Nov 09 '22 14:11

bombadilhom

Tags: python, pandas, matplotlib, seaborn
