Legend only shows one label when plotting with pandas

I have two Pandas DataFrames that I'm hoping to plot in a single figure. I'm using the IPython notebook.

I would like the legend to show the labels for both DataFrames, but so far I've only been able to get the latter one to show. Also, any suggestions on how to write the code in a more sensible way would be appreciated. I'm new to all this and don't really understand object-oriented plotting.

    %pylab inline
    import pandas as pd

    # creating data
    prng = pd.period_range('1/1/2011', '1/1/2012', freq='M')
    var = pd.DataFrame(randn(len(prng)), index=prng, columns=['total'])
    shares = pd.DataFrame(randn(len(prng)), index=prng, columns=['average'])

    # plotting
    ax = var.total.plot(label='Variance')
    ax = shares.average.plot(secondary_y=True, label='Average Age')
    ax.left_ax.set_ylabel('Variance of log wages')
    ax.right_ax.set_ylabel('Average age')
    plt.legend(loc='upper center')
    plt.title('Wage Variance and Mean Age')
    plt.show()

Legend is missing one of the labels

asked Feb 24 '14 12:02

Artturi Björk

1 Answer

This is indeed a bit confusing. I think it boils down to how Matplotlib handles secondary axes. Pandas probably calls ax.twinx() somewhere, which superimposes a secondary axes on the first one, but this is actually a separate axes object with its own lines, its own labels and its own legend. Calling plt.legend() only applies to one of the axes (the active one), which in your example is the second axes.
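To see this behaviour without Pandas in the way, here is a minimal sketch in plain Matplotlib. The data values and the names `left` and `right` are made up for illustration, and the Agg backend is selected only so the snippet runs without a display:

```python
import matplotlib
matplotlib.use("Agg")  # headless backend, assumption for running as a script
import matplotlib.pyplot as plt

fig, left = plt.subplots()
left.plot([0, 1, 2], [1.0, 0.5, 2.0], label="Variance")

# twinx() creates a second Axes that shares the x-axis with the first
right = left.twinx()
right.plot([0, 1, 2], [40, 42, 41], "g", label="Average Age")

# Each Axes only knows about the lines drawn on it
left_labels = [line.get_label() for line in left.get_lines()]
right_labels = [line.get_label() for line in right.get_lines()]

# pyplot functions act on the current Axes, here the twin created last
current_is_right = plt.gca() is right

# so plt.legend() only lists the line on the right-hand Axes
legend = plt.legend(loc="upper center")
legend_labels = [text.get_text() for text in legend.get_texts()]

print(left_labels, right_labels, current_is_right, legend_labels)
```

The legend built by plt.legend() contains only the label from the twin axes, which matches what you are seeing.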

Pandas fortunately does store both axes, so you can grab all line objects from both of them and pass them to the .legend() command yourself. Given your example data, you can plot much as you did:

    ax = var.total.plot(label='Variance')
    # keep `ax` bound to the left axes; this call adds the right axes as ax.right_ax
    shares.average.plot(secondary_y=True, label='Average Age')

    ax.set_ylabel('Variance of log wages')
    ax.right_ax.set_ylabel('Average age')

Both axes objects are available as ax (the left axes) and ax.right_ax, so you can grab the line objects from them. Matplotlib's .get_lines() returns a list, so you can merge them by simple addition.

    lines = ax.get_lines() + ax.right_ax.get_lines()

The line objects have a label property which can be used to read and pass the label to the .legend() command.

    ax.legend(lines, [l.get_label() for l in lines], loc='upper center')

And the rest of the plotting:

    ax.set_title('Wage Variance and Mean Age')
    plt.show()
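As a variation on the steps above, Matplotlib's Axes.get_legend_handles_labels() returns the handles and labels together, and it also picks up labelled artists other than lines (bars, for example). The following is only a sketch with made-up data and hypothetical variable names, not something taken from your notebook:

```python
import matplotlib
matplotlib.use("Agg")  # headless backend, assumption for running as a script
import matplotlib.pyplot as plt

fig, left = plt.subplots()
left.plot([0, 1, 2], [1.0, 0.5, 2.0], label="Variance")
right = left.twinx()
right.plot([0, 1, 2], [40, 42, 41], "g", label="Average Age")

# Collect (handles, labels) from each Axes separately
handles_left, labels_left = left.get_legend_handles_labels()
handles_right, labels_right = right.get_legend_handles_labels()

# Concatenate them and draw a single legend on one of the Axes
legend = left.legend(
    handles_left + handles_right,
    labels_left + labels_right,
    loc="upper center",
)
combined_labels = [text.get_text() for text in legend.get_texts()]
print(combined_labels)
```

With the Pandas plot from above, `left` and `right` would correspond to ax and ax.right_ax.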

edit:

It might be less confusing if you separate the Pandas (data) and the Matplotlib (plotting) parts more strictly, and avoid using the Pandas built-in plotting (which only wraps Matplotlib anyway):

    fig, ax = plt.subplots()

    # in current pandas, PeriodIndex.to_timestamp() takes the place of to_datetime()
    ax.plot(var.index.to_timestamp(), var.total, 'b', label='Variance')
    ax.set_ylabel('Variance of log wages')

    ax2 = ax.twinx()
    ax2.plot(shares.index.to_timestamp(), shares.average, 'g', label='Average Age')
    ax2.set_ylabel('Average age')

    # merge the lines of both axes into a single legend
    lines = ax.get_lines() + ax2.get_lines()
    ax.legend(lines, [line.get_label() for line in lines], loc='upper center')

    ax.set_title('Wage Variance and Mean Age')
    plt.show()
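Once you work with the figure and axes objects directly, another option that may be worth trying is a figure-level legend: Figure.legend() called without handles collects the labelled artists from every axes in the figure. This is a sketch with made-up data; the bbox_to_anchor and bbox_transform arguments are my own addition to place the legend relative to the axes instead of the whole figure:

```python
import matplotlib
matplotlib.use("Agg")  # headless backend, assumption for running as a script
import matplotlib.pyplot as plt

fig, ax = plt.subplots()
ax.plot([0, 1, 2], [1.0, 0.5, 2.0], "b", label="Variance")
ax2 = ax.twinx()
ax2.plot([0, 1, 2], [40, 42, 41], "g", label="Average Age")

# Figure.legend() gathers handles and labels from all Axes in the figure.
# By default it is positioned in figure coordinates, so anchor it to the
# Axes area to get the same placement as an Axes-level 'upper center'.
legend = fig.legend(
    loc="upper center",
    bbox_to_anchor=(0.5, 1.0),
    bbox_transform=ax.transAxes,
)
figure_labels = [text.get_text() for text in legend.get_texts()]
print(figure_labels)
```

This avoids merging the lists by hand, at the cost of having to think about where the figure-level legend ends up.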

answered Sep 24 '22 20:09

Rutger Kassies

Tags: python, pandas, matplotlib, plot
