Assign line colors in pandas

Tags:

python

pandas

I am trying to plot some data in pandas and the inbuilt plot function conveniently plots one line per column. What I want to do is to manually assign each line a color based on a classification I make.

The following works:

df = pd.DataFrame({'1': [1, 2, 3, 4], '2': [1, 2, 1, 2]})
s = pd.Series(['c','y'], index=['1','2'])
df.plot(color = s)

But when my indices are integers it no longer works and throws a KeyError:

df = pd.DataFrame({1: [1, 2, 3, 4], 2: [1, 2, 1, 2]})
s = pd.Series(['c','y'], index=[1,2])
df.plot(color = s)

The way I understand it is that when an integer index is used it somehow has to start from 0. That is my guess since the following works as well:

df = pd.DataFrame({0: [1, 2, 3, 4], 1: [1, 2, 1, 2]})
s = pd.Series(['c','y'], index=[1,0])
df.plot(color = s)

My question is:

What is happening here?
Assuming I have an integer index that does not start from 0 or is not formed of successive numbers, how can I make this work without having to convert the index to string or reindex starting from 0?

EDIT:

I realised that even in the first case, the code doesn't do what I expected it to do. It seems like pandas matches the index of the DataFrame and the Series only if both are integer indices starting from 0. If that isn't the case, either a KeyError is thrown or, if the index consists of strings, the order of the elements is used.

Is this correct? And is there a way to match the Series and DataFrame indices? Or do I have to make sure I pass a list of colours in the right order?

350

asked Sep 11 '15 14:09

GebitsGerbils

2 Answers

To set color for each line you can use the parameter style. For example:

df = pd.DataFrame({'A': [1, 2, 4], 'B': [1, 3, 9]})
df.plot(style={'A': 'r', 'B': 'g'})


Using the shortcut string notation in the form marker|line|color you can also set marker and line types:

df = pd.DataFrame({'A': [1, 2, 4], 'B': [1, 3, 9]})
df.plot(style={'A': '*:r', 'B': '+--g'})


153

answered Sep 17 '22 22:09

Mykola Zotko

What is happening here?

The keyword argument color is inherited from matplotlib.pyplot.plot(). The details in the documentation don't make it clear that you can put in a list of colors when plotting. Given that color is a keyword argument from matplotlib, I'd recommend not using a Pandas Series to hold the color values.

How can I make this work?

Use a list instead of a Series. A list is matched to the columns by position, so if your Series has an index meant to map the columns of your DataFrame to specific colors, you will need to sort the Series by its index first. If the DataFrame's columns are not in sorted order, you will need to sort the columns as well so that the two orders agree.

# Option 1
s = s.sort_index()
df.plot(color = s.values) # as per Fiabetto's answer

# Option 2
df.plot(color = ['c', 'y']) # other method
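
If sorting both sides is inconvenient, another way to get a list in the right order is to look the colors up by column label. The following is only a sketch: the column labels 7 and 3 and the name `colors` are made up for illustration, and it assumes every column has an entry in the Series (a missing label would come out as NaN).

```python
import pandas as pd

# Integer column labels that neither start at 0 nor are consecutive
# (hypothetical values, chosen only for this example).
df = pd.DataFrame({7: [1, 2, 3, 4], 3: [1, 2, 1, 2]})

# Colors keyed by column label, deliberately in a different order
# than the DataFrame's columns.
s = pd.Series(['c', 'y'], index=[3, 7])

# Look the colors up by label, in the order of the DataFrame's columns,
# and convert to a plain list so no index is involved afterwards.
colors = s.reindex(df.columns).tolist()
print(colors)
```

The resulting list can then be passed as df.plot(color=colors), which should assign the colors to the columns by position without changing the order of either the Series or the DataFrame.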

answered Sep 19 '22 22:09

thecircus
