I am dealing with the following data frame (only for illustration, actual df is quite large): <pre class="prettyprint"><code> seq x1 y1 0 2 0.7725 0.2105 1 2 0.8098 0.3456 2 2 0.7457 0.5436 3 2 0.4168 0.7610 4 2 0.3181 0.8790 5 3 0.2092 0.5498 6 3 0.0591 0.6357 7 5 0.9937 0.5364 8 5 0.3756 0.7635 9 5 0.1661 0.8364 </code></pre> Trying to plot multiple line graph for the above coordinates (x as "x1 against y as "y1"). Rows with the same "seq" is one path, and has to be plotted as one separate line, like all the x, y coordinates corresponding the seq = 2 belongs to one line, and so on. I am able to plot them, but on a separate graphs, I want all the lines on the same graph, Using subplots, but not getting it right. <pre class="prettyprint"><code>import matplotlib as mpl import matplotlib.pyplot as plt %matplotlib notebook df.groupby("seq").plot(kind = "line", x = "x1", y = "y1") </code></pre> This creates 100's of graphs (which is equal to the number of unique seq). Suggest me a way to obtain all the lines on the same graph. **UPDATE* To resolve the above problem, I implemented the following code: <pre class="prettyprint"><code> fig, ax = plt.subplots(figsize=(12,8)) df.groupby('seq').plot(kind='line', x = "x1", y = "y1", ax = ax) plt.title("abc") plt.show() </code></pre> Now, I want a way to plot the lines with specific colors. I am clustering path from seq = 2 and 5 in cluster 1; and path from seq = 3 in another cluster. So, there are two lines under cluster 1 which I want in red and 1 line under cluster 2 which can be green. How should I proceed with this?

Consider the dataframe <code>df</code> <pre class="prettyprint"><code>df = pd.DataFrame(dict( ProjID=np.repeat(range(10), 10), Xcoord=np.random.rand(100), Ycoord=np.random.rand(100), )) </code></pre> Then we create abstract art like this <pre class="prettyprint"><code>df.set_index('Xcoord').groupby('ProjID').Ycoord.plot() </code></pre> <img src="https://i.stack.imgur.com/OKNJL.png" alt="enter image description here">

Pandas groupby results on the same plot

Tags:

python

pandas

matplotlib

I am dealing with the following data frame (only for illustration, actual df is quite large):

   seq          x1         y1
0  2           0.7725      0.2105
1  2           0.8098      0.3456
2  2           0.7457      0.5436
3  2           0.4168      0.7610
4  2           0.3181      0.8790
5  3           0.2092      0.5498
6  3           0.0591      0.6357
7  5           0.9937      0.5364
8  5           0.3756      0.7635
9  5           0.1661      0.8364

Trying to plot multiple line graph for the above coordinates (x as "x1 against y as "y1").

Rows with the same "seq" is one path, and has to be plotted as one separate line, like all the x, y coordinates corresponding the seq = 2 belongs to one line, and so on.

I am able to plot them, but on a separate graphs, I want all the lines on the same graph, Using subplots, but not getting it right.

import matplotlib as mpl
import matplotlib.pyplot as plt

%matplotlib notebook

df.groupby("seq").plot(kind = "line", x = "x1", y = "y1")

This creates 100's of graphs (which is equal to the number of unique seq). Suggest me a way to obtain all the lines on the same graph.

**UPDATE*

To resolve the above problem, I implemented the following code:

     fig, ax = plt.subplots(figsize=(12,8))
     df.groupby('seq').plot(kind='line', x = "x1", y = "y1", ax = ax)
     plt.title("abc")
     plt.show()

Now, I want a way to plot the lines with specific colors. I am clustering path from seq = 2 and 5 in cluster 1; and path from seq = 3 in another cluster.

So, there are two lines under cluster 1 which I want in red and 1 line under cluster 2 which can be green.

How should I proceed with this?

855

asked Mar 23 '17 22:03

Liza

2 Answers

You need to init axis before plot like in this example

import pandas as pd
import matplotlib.pylab as plt
import numpy as np

# random df
df = pd.DataFrame(np.random.randint(0,10,size=(25, 3)), columns=['ProjID','Xcoord','Ycoord'])

# plot groupby results on the same canvas 
fig, ax = plt.subplots(figsize=(8,6))
df.groupby('ProjID').plot(kind='line', x = "Xcoord", y = "Ycoord", ax=ax)
plt.show()

enter image description here

198

answered Oct 03 '22 15:10

Serenity

Consider the dataframe df

df = pd.DataFrame(dict(
        ProjID=np.repeat(range(10), 10),
        Xcoord=np.random.rand(100),
        Ycoord=np.random.rand(100),
    ))

Then we create abstract art like this

df.set_index('Xcoord').groupby('ProjID').Ycoord.plot()

enter image description here

answered Oct 03 '22 15:10

piRSquared

Related questions
                            
                                ChoiceField: Select a valid choice. That is not one of the available choices
                            
                                Why does functools.lru_cache break this function?
                            
                                Circular dependency when squashing Django migrations
                            
                                Flask-Uploads IOError: [Errno 2] No such file or directory
                            
                                How to search for specific value in Json array using Python
                            
                                How to count objects in image using python?
                            
                                Hurst Exponent in python
                            
                                Convert python abbreviated month name to full name
                            
                                Django get related objects ManyToMany relationships
                            
                                sorting list of lists and getting indices in unsorted list
                            
                                How Can I Use Python FileNotFoundError to print name of missing file?
                            
                                Gensim installation problems
                            
                                How to extract only the month and day from a datetime object?
                            
                                Split dictionary depending on key lists
                            
                                Pandas - Find longest stretch without Nan values
                            
                                how to preserve links when scraping a table with beautiful soup and pandas
                            
                                Appending a range to a list
                            
                                How to access a column whose name I cannot access in chained operations
                            
                                get the i-th slice of the k-th dimension in a numpy array
                            
                                Python: how to remove a value from a dict if it exactly matches the key?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With