My objective is to draw a horizontal red line on y = 0 on a plot made using seaborn: <code>sns.lmplot</code>splitted by <code>col=</code> or <code>row=</code>. <pre class="prettyprint"><code>import numpy as np, seaborn as sns, pandas as pd np.random.seed(5) myData = pd.DataFrame({'x' : np.arange(1, 101), 'y': np.random.normal(0, 4, 100),\ 'z' : ['a','b']*50, 'w':np.random.poisson(0.15,100)}) sns.lmplot("x", "y", col="z", row="w", data=myData, fit_reg=False) plt.plot(np.linspace(-20,120,1000), [0]*1000, 'r-') </code></pre> We can see that only the last plot, of the array of plots, is marked by the red line: <img src="https://i.stack.imgur.com/v1YnW.png" alt="enter image description here"> Thanks for your help, EDIT: reworded the question to account for the case where we generate an array of plots using <code>col=</code> and/or <code>row=</code> and we want the line to be traced on each plot.

Since I came across this looking for an answer, here is a more general answer that I eventually discovered: <code>map_dataframe</code> will also accept a user defined function (and passes the data frame to this function) which is quite powerful because you can plot anything onto the facetgrid. In the OP case: <pre class="prettyprint"><code>def plot_hline(y,**kwargs): data = kwargs.pop("data") #get the data frame from the kwargs plt.axhline(y=y, c='red',linestyle='dashed',zorder=-1) #zorder places the line underneath the other points myPlot = sns.FacetGrid(col="z", row='w', hue='hueMe', data=myData, size=5) myPlot.map(plt.scatter, "x", "y").set(xlim=(-20,120) , ylim=(-15,15)) myPlot.map_dataframe(plot_hline,y=0) plt.show() </code></pre> My problem was slightly more complex because I wanted a different horizontal line on each facet. To replicate my case, assume the 'z' variable has two samples (a and b) and each with an observed value 'obs' (which I've added to myData below). 'hueMe' represents modeled values for each sample. <pre class="prettyprint"><code>myData = pd.DataFrame({'x' : np.arange(1, 101), 'y': np.random.normal(0, 4, 100), 'z' : ['a','b']*50, 'w':np.random.poisson(0.15,100), 'hueMe':['q','w','e','r','t']*20, 'obs':[3,2]*50}) </code></pre> When you pass the data frame to <code>plot_hline</code>, you need to drop the duplicate values of 'obs' for each 'z' sample because axhline can only take a single value for <code>y</code>. (remember in our case each sample has 1 observed value 'obs' but multiple modeled 'hueMe' values). further, <code>y</code> must be a scalar (rather than a series) so you need to index into the data frame to extract the value itself. <pre class="prettyprint"><code>def plot_hline(y,z, **kwargs): data = kwargs.pop("data") #the data passed in through kwargs is a subset of the original data - only the subset for the row and col being plotted. it's a for loop in disguise. data = data.drop_duplicates([z]) #drop the duplicate rows yval = data[y].iloc[0] #extract the value for your hline. plt.axhline(y=yval, c='red',linestyle='dashed',zorder=-1) myPlot = sns.FacetGrid(col="z", row='w', hue='hueMe', data=myData, size=5) myPlot.map(plt.scatter, "x", "y").set(xlim=(-20,120) , ylim=(-15,15)) myPlot.map_dataframe(plot_hline,y='obs',z='z') plt.show() </code></pre> resulting plot Now seaborn maps the output from your function onto each facet of <code>FacetGrid</code>. Note, if you are using a different plotting function than axhline, you might not necessarily need to extract the value from the series. Hope this helps someone!

Drawing lines on scatter with Seaborn

Tags:

draw

seaborn

My objective is to draw a horizontal red line on y = 0 on a plot made using seaborn: sns.lmplotsplitted by col= or row=.

import numpy as np, seaborn as sns, pandas as pd
np.random.seed(5)

myData = pd.DataFrame({'x' :  np.arange(1, 101), 'y': np.random.normal(0, 4, 100),\
'z' : ['a','b']*50, 'w':np.random.poisson(0.15,100)})


sns.lmplot("x", "y", col="z", row="w", data=myData, fit_reg=False)
plt.plot(np.linspace(-20,120,1000), [0]*1000, 'r-')

We can see that only the last plot, of the array of plots, is marked by the red line:

enter image description here

Thanks for your help,

EDIT: reworded the question to account for the case where we generate an array of plots using col= and/or row= and we want the line to be traced on each plot.

782

asked Mar 23 '16 23:03

Alex Fortin

3 Answers

So this chunk of code works for the general case where we use col=, row=, and hue=.

import numpy as np, seaborn as sns, pandas as pd
np.random.seed(5)

myData = pd.DataFrame({'x' :  np.arange(1, 101), 'y': np.random.normal(0, 4, 100),\
'z' : ['a','b']*50, 'w':np.random.poisson(0.15,100), 'hueMe':['q','w','e','r','t']*20})

myPlot = sns.FacetGrid(col="z", row='w', hue='hueMe', data=myData, size=5)
myPlot = myPlot.map(plt.scatter, "x", "y").set(xlim=(-20,120) , ylim=(-15,15))
myPlot = myPlot.map_dataframe(plt.plot, [-20,120], [0,0], 'r-').add_legend().set_axis_labels("x", "y")
plt.show()

enter image description here

Not sure why the color of the horizontal line comes out as the last color used on each individual plot, but I give up on this for now :)

answered Oct 16 '22 03:10

Alex Fortin

Since I came across this looking for an answer, here is a more general answer that I eventually discovered:

map_dataframe will also accept a user defined function (and passes the data frame to this function) which is quite powerful because you can plot anything onto the facetgrid. In the OP case:

def plot_hline(y,**kwargs):
    data = kwargs.pop("data") #get the data frame from the kwargs
    plt.axhline(y=y, c='red',linestyle='dashed',zorder=-1) #zorder places the line underneath the other points

myPlot = sns.FacetGrid(col="z", row='w', hue='hueMe', data=myData, size=5)
myPlot.map(plt.scatter, "x", "y").set(xlim=(-20,120) , ylim=(-15,15))
myPlot.map_dataframe(plot_hline,y=0)
plt.show()

My problem was slightly more complex because I wanted a different horizontal line on each facet.

To replicate my case, assume the 'z' variable has two samples (a and b) and each with an observed value 'obs' (which I've added to myData below). 'hueMe' represents modeled values for each sample.

myData = pd.DataFrame({'x' :  np.arange(1, 101), 
                       'y': np.random.normal(0, 4, 100),
                       'z' : ['a','b']*50,
                       'w':np.random.poisson(0.15,100),
                       'hueMe':['q','w','e','r','t']*20,
                       'obs':[3,2]*50})

When you pass the data frame to plot_hline, you need to drop the duplicate values of 'obs' for each 'z' sample because axhline can only take a single value for y. (remember in our case each sample has 1 observed value 'obs' but multiple modeled 'hueMe' values). further, y must be a scalar (rather than a series) so you need to index into the data frame to extract the value itself.

def plot_hline(y,z, **kwargs):
    data = kwargs.pop("data") #the data passed in through kwargs is a subset of the original data - only the subset for the row and col being plotted. it's a for loop in disguise.
    data = data.drop_duplicates([z]) #drop the duplicate rows
    yval = data[y].iloc[0] #extract the value for your hline.
    plt.axhline(y=yval, c='red',linestyle='dashed',zorder=-1)


myPlot = sns.FacetGrid(col="z", row='w', hue='hueMe', data=myData, size=5)
myPlot.map(plt.scatter, "x", "y").set(xlim=(-20,120) , ylim=(-15,15))
myPlot.map_dataframe(plot_hline,y='obs',z='z')
plt.show()

resulting plot

Now seaborn maps the output from your function onto each facet of FacetGrid. Note, if you are using a different plotting function than axhline, you might not necessarily need to extract the value from the series.

Hope this helps someone!

answered Oct 16 '22 03:10

JRS

Seaborn is really just an interface for matplotlib, so you can use all of your standard matplotlib functions as well. Importing pyplot and plotting a red horizontal line after your seaborn plot works for me.

import numpy as np, seaborn as sns, pandas as pd
import matplotlib.pyplt as plt
np.random.seed(5)

myData = pd.DataFrame({'x' :  np.arange(1, 101), 'y': np.random.normal(0, 4, 100)})

sns.lmplot("x", "y", data=myData, line_kws={'xdata': '0,1','ydata': '0,0','color': 'k', 'linestyle':'-', 'linewidth':'5'}, fit_reg=False)
plt.plot(np.linspace(-20,120,1000), [0]*1000, 'r')

My image is here - http://i.imgur.com/J7Lvt52.png

answered Oct 16 '22 01:10

Tim

Related questions
                            
                                Android: Redrawing a specific view inside a layout
                            
                                Interact with complex figure in iOS
                            
                                How often to call SpriteBatch.Begin()/.End()?
                            
                                Jtree to JPanel
                            
                                R: Generate coordinate data from user-drawn points?
                            
                                Explaining DrawArc method?
                            
                                Canvas drawImage scale height and width in CustomPainter
                            
                                Java Graphics repaint behavior
                            
                                Draw text and add to a UIImage iOS 5/6
                            
                                Does the draw order affects objects position in depth? (images included)
                            
                                iOS - Draw gradient - Swift
                            
                                Rendering text on a circular circumference in react native
                            
                                How do I style divicons in leaflet.draw edit mode?
                            
                                How to speed up python's 'turtle' function and stop it freezing at the end

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Drawing lines on scatter with Seaborn

Tags:

draw

seaborn

Alex Fortin

People also ask

3 Answers

Alex Fortin

JRS

Tim

Recent Activity

Donate For Us