
Plot multiple columns of pandas DataFrame using Seaborn

Tags: python, pandas, dataframe, matplotlib, seaborn

Suppose I have a DataFrame with columns ['X_Axis', 'col_2', 'col_3', ..., 'col_n'].

I need to plot the first column on the x-axis and the rest on the y-axis. FYI: all the values have been grouped according to X_Axis, the X_Axis values range from 0 to 25, and all other column values have been normalized to the range 0 to 1. I want everything on the same plot, not in subplots.

Preferred: factorplot or a normal line graph.

asked Jul 06 '17 06:07

Rakmo

2 Answers

- Some seaborn plots accept a wide DataFrame when a single column is passed to y, as in sns.pointplot(data=df, x='X_Axis', y='col_2'), but not a list of columns, as in sns.pointplot(data=df, x='X_Axis', y=['col_2', 'col_3']), so it is better to reshape the DataFrame.
- Reshape the DataFrame from wide to long with pandas.DataFrame.melt.
  - Converting a DataFrame from wide to long form is standard practice for seaborn plots in general, not just the examples shown.
- Tested in python 3.8.12, pandas 1.3.4, matplotlib 3.4.3, seaborn 0.11.2

Sample DataFrame

import pandas as pd
import seaborn as sns

df = pd.DataFrame({'X_Axis':[1,3,5,7,10,20],
                   'col_2':[.4,.5,.4,.5,.5,.4],
                   'col_3':[.7,.8,.9,.4,.2,.3],
                   'col_4':[.1,.3,.5,.7,.1,.0],
                   'col_5':[.5,.3,.6,.9,.2,.4]})

# display(df)
#    X_Axis  col_2  col_3  col_4  col_5
# 0       1    0.4    0.7    0.1    0.5
# 1       3    0.5    0.8    0.3    0.3
# 2       5    0.4    0.9    0.5    0.6
# 3       7    0.5    0.4    0.7    0.9
# 4      10    0.5    0.2    0.1    0.2
# 5      20    0.4    0.3    0.0    0.4

# convert to long (tidy) form
dfm = df.melt('X_Axis', var_name='cols', value_name='vals')

# display(dfm.head())
#    X_Axis   cols  vals
# 0       1  col_2   0.4
# 1       3  col_2   0.5
# 2       5  col_2   0.4
# 3       7  col_2   0.5
# 4      10  col_2   0.5
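
As a quick sanity check on the reshape, the sketch below melts a two-column subset of the sample data and pivots it back to wide form. It is only an illustration: the name wide_again is made up here, and the round trip assumes each (X_Axis, cols) pair occurs exactly once, which holds for data already grouped by X_Axis.

```python
import pandas as pd

df = pd.DataFrame({'X_Axis': [1, 3, 5, 7, 10, 20],
                   'col_2': [.4, .5, .4, .5, .5, .4],
                   'col_3': [.7, .8, .9, .4, .2, .3]})

# wide -> long: one row per (X_Axis, column) pair
dfm = df.melt('X_Axis', var_name='cols', value_name='vals')
print(dfm.shape)  # (12, 3): 6 rows x 2 value columns

# long -> wide again; 'wide_again' is a hypothetical name for this sketch
wide_again = (dfm.pivot(index='X_Axis', columns='cols', values='vals')
                 .reset_index()
                 .rename_axis(columns=None))  # drop the leftover 'cols' label
```

If wide_again matches df, nothing was lost in the melt, and the long frame is safe to hand to seaborn.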

Current Plot Methods

`catplot`: figure-level

Use seaborn.catplot and set kind= (e.g. kind='point' reproduces the old factorplot default):

g = sns.catplot(x="X_Axis", y="vals", hue='cols', data=dfm, kind='point')
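
Because catplot is figure-level, it returns a FacetGrid rather than a matplotlib Axes, so adjustments go through the grid object. The following is a minimal sketch under that assumption; the y-axis label text and the 0 to 1 limits are illustrative choices based on the question's normalized data, not part of the original answer.

```python
import pandas as pd
import seaborn as sns

df = pd.DataFrame({'X_Axis': [1, 3, 5, 7, 10, 20],
                   'col_2': [.4, .5, .4, .5, .5, .4],
                   'col_3': [.7, .8, .9, .4, .2, .3]})
dfm = df.melt('X_Axis', var_name='cols', value_name='vals')

# catplot returns a seaborn FacetGrid
g = sns.catplot(data=dfm, x='X_Axis', y='vals', hue='cols', kind='point')

# grid-level helpers; the label text is an assumption for illustration
g.set_axis_labels('X_Axis', 'normalized value')
g.set(ylim=(0, 1))

# with no row/col faceting, the single underlying Axes is available as g.ax
ax = g.ax
```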


`pointplot`: axes-level

sns.pointplot(x="X_Axis", y="vals", hue='cols', data=dfm)


Original

`factorplot`: renamed to `catplot` in v0.9.0 (July 2018)

Newer versions of seaborn emit this warning:

The factorplot function has been renamed to catplot. The original name will be removed in a future release. Please update your code. Note that the default kind in factorplot ('point') has changed 'strip' in catplot.

g = sns.factorplot(x="X_Axis", y="vals", hue='cols', data=dfm)

# using pd.melt instead of pd.DataFrame.melt for pandas < 0.20.0
# dfm = pd.melt(df, 'X_Axis', var_name='cols',  value_name='vals')
# g = sns.factorplot(x="X_Axis", y="vals", hue='cols', data=dfm)

answered Oct 22 '22 03:10

jezrael

In addition to @jezrael's answer: for those arriving from Google, if you want to plot the lines against the index of the original DataFrame, do the following:

import pandas as pd
import seaborn as sns

df = pd.DataFrame({'col_2':[.4,.5,.4,.5,.5,.4],
                   'col_3':[.7,.8,.9,.4,.2,.3],
                   'col_4':[.1,.3,.5,.7,.1,.0],
                   'col_5':[.5,.3,.6,.9,.2,.4]})

# reset the index before melting so the current index is kept in an 'index' column
df = df.reset_index().melt('index', var_name='cols', value_name='vals')
g = sns.catplot(x="index", y="vals", hue='cols', data=df, kind='point')
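
The column is called 'index' only because the default index has no name. If the index is named, reset_index uses that name instead, so the id column passed to melt has to match. A small sketch of that case, where 'step' and long_df are hypothetical names chosen for illustration:

```python
import pandas as pd

df = pd.DataFrame({'col_2': [.4, .5, .4, .5, .5, .4],
                   'col_3': [.7, .8, .9, .4, .2, .3]})

# 'step' is a made-up index name; reset_index() turns it into a 'step' column
long_df = (df.rename_axis('step')
             .reset_index()
             .melt('step', var_name='cols', value_name='vals'))

print(long_df.columns.tolist())  # ['step', 'cols', 'vals']
```

Assigning to a new name such as long_df also keeps the original wide df available, unlike the in-place reassignment above.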

answered Oct 22 '22 03:10

adir abargil
