heatmap-like plot, but for categorical variables in seaborn

Tags:

Same question as heatmap-like plot, but for categorical variables but using python and seaborn instead of R:

Imagine I have the following dataframe:

df = pd.DataFrame({"John":"No Yes Maybe".split(),
                   "Elly":"Yes Yes Yes".split(),
                   "George":"No Maybe No".split()},
                   index="Mon Tue Wed".split())

Now I would like to plot a heatmap and color each cell by its corresponding value. That is "Yes", "No", "Maybe", for instance becomes "Green", "Gray", "Yellow". The legend should have those three colors and the corresponding values.

I solved this problem myself in the following manner. I can't seem to pass a categorical color map to seaborn's heatmap, so instead I replace all text by numbers and reconstruct the color map used by seaborn internally afterwards i.e.:

import pandas as pd
import seaborn as sns
import matplotlib.pyplot as plt
import numpy as np
import matplotlib.patches as mpatches

# create dictionary with value to integer mappings
value_to_int = {value: i for i, value in enumerate(sorted(pd.unique(df.values.ravel())))}

f, ax = plt.subplots()
hm = sns.heatmap(df.replace(value_to_int).T, cmap="Pastel2", ax=ax, cbar=False)
# add legend
box = ax.get_position()
ax.set_position([box.x0, box.y0, box.width * 0.7, box.height])
legend_ax = f.add_axes([.7, .5, 1, .1])
legend_ax.axis('off')
# reconstruct color map
colors = plt.cm.Pastel2(np.linspace(0, 1, len(value_to_int)))
# add color map to legend
patches = [mpatches.Patch(facecolor=c, edgecolor=c) for c in colors]
legend = legend_ax.legend(patches,
    sorted(value_to_int.keys()),
    handlelength=0.8, loc='lower left')
for t in legend.get_texts():
    t.set_ha("left")

Categorical heatmap in seaborn

My question: is there a more succinct way of making this heatmap? If not, this might be a feature worth implementing in which case I'll post it on the seaborn issue tracker.

674

asked Mar 25 '16 20:03

inodb

2 Answers

You can use a discrete colormap and modify the colorbar, instead of using a legend.

value_to_int = {j:i for i,j in enumerate(pd.unique(df.values.ravel()))} # like you did
n = len(value_to_int)     
# discrete colormap (n samples from a given cmap)
cmap = sns.color_palette("Pastel2", n) 
ax = sns.heatmap(df.replace(value_to_int), cmap=cmap) 
# modify colorbar:
colorbar = ax.collections[0].colorbar 
r = colorbar.vmax - colorbar.vmin 
colorbar.set_ticks([colorbar.vmin + r / n * (0.5 + i) for i in range(n)])
colorbar.set_ticklabels(list(value_to_int.keys()))                                          
plt.show()

categorical seaborn heatmap

The colorbar part is adapted from this answer

HTH

127

answered Oct 18 '22 17:10

jrjc

I would probably use bokeh for this purpose as it has categorical heatmaps built in. Y-axis labels are written horizontally too, which is more readable.

http://docs.bokeh.org/en/0.11.1/docs/gallery/heatmap_chart.html

answered Oct 18 '22 19:10

Sohrab T

Related questions
                            
                                Simple get/post request blocked in python 3 but not in python 2
                            
                                Does Python have something as robust as Ruby's rvm?
                            
                                PHP - Equivalent for Python PDB(Debugger) in PHP? [closed]
                            
                                Python - SSL - wrong version number
                            
                                Weird IPython ipdb behaviour
                            
                                How do I upgrade the SQLite version used by Python's SQLite3 module on Mac?
                            
                                Combine two Pandas dataframes, resample on one time column, interpolate
                            
                                Cannot catch KeyboardInterrupt in command prompt twice?
                            
                                Understanding Gradient Policy Deriving
                            
                                Install Opencv from source to conda environment
                            
                                How can I speed up update/replace operations in PostgreSQL?
                            
                                Ruby HAML with Django?
                            
                                Best practices for Python deployment -- multiple versions, standard install locations, packaging tools etc
                            
                                How to escape $ on Python string Template class?
                            
                                ImportError: No module named vtkCommonPython
                            
                                How do I make Django signal handlers not fail silently when an exception is encountered in the signal handler?
                            
                                Refactor with pyCharm from "user" to "self.user"
                            
                                Why doesn't the MySQLdb Connection context manager close the cursor?
                            
                                How do I patch an object so that all methods are mocked except one?
                            
                                Setting column types while reading csv with pandas

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

heatmap-like plot, but for categorical variables in seaborn

Tags:

python

matplotlib

seaborn

inodb

People also ask

2 Answers

jrjc

Sohrab T

Recent Activity

Donate For Us