Difference in plotting with different matplotlib versions

Tags:

python

matplotlib

A colleague of mine handed me a script that collects data from a database and plots it. When I ran the script myself, the plots did not look the same, and the difference depends on the version of Matplotlib.

The script that does the plotting of the data is quite short:

import matplotlib.pyplot as plt
import csv
import os
from dateutil import parser

def plot(outputDir,plotsDir,FS):
    allfiles = os.listdir(outputDir)
    flist = []
    for f in allfiles:
        if 'csv' in f.lower(): flist.append(f)
    for f in flist:
        with open(outputDir + '/' + f, 'rt') as ff:
            data = list(csv.reader(ff,delimiter=FS))
        values = [i[2] for i in data[1::]]
        values = ['NaN' if v is '' else v for v in values]
        time = [parser.parse(i[1]) for i in data[1::]]
        plt.xlabel('Time_[UTC]')
        plt.plot(time, values)
        plt.xticks(rotation=40)
        if os.path.isdir(plotsDir) != 1:
            os.mkdir(plotsDir, 777)
        plt.savefig('{}/{}_Data.png'.format(plotsDir, f[:-4]), bbox_inches='tight', dpi=160)
        plt.clf()


outputdir = 'C:/Users/matthijsk/Documents/Test'
plotsdir = outputdir + '/plots'
fs = ','
plot(outputdir, plotsdir, fs)

When I run it using Matplotlib version 2.1.0, my image looks like this:

[Image: Matplotlib version 2.1.0]

When I run it using Matplotlib version 2.0.2, it looks the way it is supposed to:

[Image: Matplotlib version 2.0.2]

The file the script is reading looks like this:

stationNo,dtg(UTC),TT_[°C],source_TT,quality_TT
10381,2017-01-01 00:00:00,3.0,ob,na
10381,2017-01-01 01:00:00,3.0,ob,na
10381,2017-01-01 02:00:00,2.4,ob,na
10381,2017-01-01 03:00:00,2.5,ob,na
10381,2017-01-01 04:00:00,2.5,ob,na
10381,2017-01-01 05:00:00,2.3,ob,na
10381,2017-01-01 06:00:00,1.9,ob,na
10381,2017-01-01 07:00:00,1.0,ob,na
10381,2017-01-01 08:00:00,0.1,ob,na
10381,2017-01-01 09:00:00,0.9,ob,na

Can anyone explain to me what changed in Matplotlib to cause this? Apparently I'm also doing something wrong in the plotting code that triggers it. Can anyone spot the mistake? I've already tried using

values = [float(value) if value.isnumeric() else None for value in values]

But that didn't solve it. Note: I'd rather not use any non-standard packages (like Pandas), since it's quite a hassle to get approval to install such packages.

asked Nov 07 '17 10:11

Matthijs Kramer

1 Answer

The data is read in as strings. In matplotlib 2.0 those were automatically converted to floating point numbers so that they could be plotted.

Matplotlib 2.1 introduced categorical plots. This now allows for something like

plt.plot(["apple", "banana", "cherry"], [2,1,3])

While this is of course great for certain applications, it breaks the previous option of plotting strings that are convertible to floats. I guess this is fine; it just gives the user the responsibility to do the conversion themselves.

In this case you would do the conversion like this:

values = [None if v == '' else float(v) for v in values]
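As a side note on the attempt in the question: `'3.0'.isnumeric()` is `False` because of the decimal point, so that list comprehension turns every value in this file into `None`. A minimal sketch of a more forgiving conversion, assuming a hypothetical helper named `to_float` (not part of the original script) and that missing values should become NaN:

```python
import math

def to_float(text):
    """Convert a CSV field to float; return NaN for empty or unparsable text.

    Hypothetical helper for illustration, not part of the original script.
    """
    try:
        return float(text)
    except ValueError:
        # float('') and float('abc') both raise ValueError
        return math.nan

raw = ['3.0', '2.4', '', '0.9']          # strings as csv.reader returns them
values = [to_float(v) for v in raw]      # floats, with NaN where data is missing
```

Using NaN rather than `None` keeps the list purely numeric, which should leave a gap in the plotted line at the missing points.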

In case you already have a numpy array: np.array(values).astype(float)

In general, one can use numpy.loadtxt to read files into float arrays. If the file contains dates, a converter can be used, as described in the question "Reading a comma-delimited file with a date object and a float with Python".

Another option to read in text files would be pandas.read_csv.
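Since the question asks to avoid extra packages, the reading and conversion can also be done with the standard library alone. The following is only a sketch under some assumptions: `read_series` is a hypothetical helper name, the timestamp format is taken from the sample file in the question, and the sample rows are copied from the question with the middle value blanked to show the missing-data case.

```python
import csv
import io
from datetime import datetime

# Rows copied from the question; the second value is deliberately blanked
# here (an assumption for illustration) to exercise the missing-data path.
SAMPLE = """stationNo,dtg(UTC),TT_[°C],source_TT,quality_TT
10381,2017-01-01 00:00:00,3.0,ob,na
10381,2017-01-01 01:00:00,,ob,na
10381,2017-01-01 02:00:00,2.4,ob,na
"""

def read_series(handle, delimiter=','):
    """Return (times, values) as datetime and float lists (hypothetical helper)."""
    reader = csv.reader(handle, delimiter=delimiter)
    next(reader)  # skip the header row
    times, values = [], []
    for row in reader:
        times.append(datetime.strptime(row[1], '%Y-%m-%d %H:%M:%S'))
        # Empty fields become NaN so the list stays numeric
        values.append(float(row[2]) if row[2] else float('nan'))
    return times, values

# A real script would pass an open file object instead of io.StringIO
times, values = read_series(io.StringIO(SAMPLE))
```

The resulting `times` and `values` can then be passed to `plt.plot(times, values)` as in the original script; because the values are floats rather than strings, they should no longer be treated as categories.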

answered Sep 30 '22 22:09

ImportanceOfBeingErnest
