I am trying to speed up the process of saving my charts to images. Right now I am creating a cString Object where I save the chart to by using savefig; but I would really, really appreciate any help to improve this method of saving the image. I have to do this operation dozens of times, and the savefig command is very very slow; there must be a better way of doing it. I read something about saving it as uncompressed raw image, but I have no clue of how to do it. I don't really care about agg if I can switch to another faster backend too.
ie:
RAM = cStringIO.StringIO() CHART = plt.figure(.... **code for creating my chart** CHART.savefig(RAM, format='png')
I have been using matplotlib with FigureCanvasAgg backend.
Thanks!
Saving a plot on your disk as an image file Now if you want to save matplotlib figures as image files programmatically, then all you need is matplotlib. pyplot. savefig() function. Simply pass the desired filename (and even location) and the figure will be stored on your disk.
savefig() As the name suggests savefig() method is used to save the figure created after plotting data. The figure created can be saved to our local machines by using this method.
Matplotlib is not thread-safe: in fact, there are known race conditions that affect certain artists. Hence, if you work with threads, it is your responsibility to set up the proper locks to serialize access to Matplotlib artists.
If you just want a raw buffer, try fig.canvas.print_rgb
, fig.canvas.print_raw
, etc (the difference between the two is that raw
is rgba, whereas rgb
is rgb. There's also print_png
, print_ps
, etc)
This will use fig.dpi
instead of the default dpi value for savefig
(100 dpi). Still, even comparing fig.canvas.print_raw(f)
and fig.savefig(f, format='raw', dpi=fig.dpi)
the print_canvas
version is marginally faster insignificantly faster, since it doesn't bother resetting the color of the axis patch, etc, that savefig
does by default.
Regardless, though, most of the time spent saving a figure in a raw format is just drawing the figure, which there's no way to get around.
At any rate, as a pointless-but-fun example, consider the following:
import matplotlib.pyplot as plt import numpy as np import cStringIO plt.ion() fig = plt.figure() ax = fig.add_subplot(111) num = 50 max_dim = 10 x = max_dim / 2 * np.ones(num) s, c = 100 * np.random.random(num), np.random.random(num) scat = ax.scatter(x,x,s,c) ax.axis([0,max_dim,0,max_dim]) ax.set_autoscale_on(False) for i in xrange(1000): xy = np.random.random(2*num).reshape(num,2) - 0.5 offsets = scat.get_offsets() + 0.3 * xy offsets.clip(0, max_dim, offsets) scat.set_offsets(offsets) scat._sizes += 30 * (np.random.random(num) - 0.5) scat._sizes.clip(1, 300, scat._sizes) fig.canvas.draw()
If we look at the raw draw time:
import matplotlib.pyplot as plt import numpy as np import cStringIO fig = plt.figure() ax = fig.add_subplot(111) num = 50 max_dim = 10 x = max_dim / 2 * np.ones(num) s, c = 100 * np.random.random(num), np.random.random(num) scat = ax.scatter(x,x,s,c) ax.axis([0,max_dim,0,max_dim]) ax.set_autoscale_on(False) for i in xrange(1000): xy = np.random.random(2*num).reshape(num,2) - 0.5 offsets = scat.get_offsets() + 0.3 * xy offsets.clip(0, max_dim, offsets) scat.set_offsets(offsets) scat._sizes += 30 * (np.random.random(num) - 0.5) scat._sizes.clip(1, 300, scat._sizes) fig.canvas.draw()
This takes ~25 seconds on my machine.
If we instead dump a raw RGBA buffer to a cStringIO buffer, it's actually marginally faster at ~22 seconds (This is only true because I'm using an interactive backend! Otherwise it would be equivalent.):
import matplotlib.pyplot as plt import numpy as np import cStringIO fig = plt.figure() ax = fig.add_subplot(111) num = 50 max_dim = 10 x = max_dim / 2 * np.ones(num) s, c = 100 * np.random.random(num), np.random.random(num) scat = ax.scatter(x,x,s,c) ax.axis([0,max_dim,0,max_dim]) ax.set_autoscale_on(False) for i in xrange(1000): xy = np.random.random(2*num).reshape(num,2) - 0.5 offsets = scat.get_offsets() + 0.3 * xy offsets.clip(0, max_dim, offsets) scat.set_offsets(offsets) scat._sizes += 30 * (np.random.random(num) - 0.5) scat._sizes.clip(1, 300, scat._sizes) ram = cStringIO.StringIO() fig.canvas.print_raw(ram) ram.close()
If we compare this to using savefig
, with a comparably set dpi:
import matplotlib.pyplot as plt import numpy as np import cStringIO fig = plt.figure() ax = fig.add_subplot(111) num = 50 max_dim = 10 x = max_dim / 2 * np.ones(num) s, c = 100 * np.random.random(num), np.random.random(num) scat = ax.scatter(x,x,s,c) ax.axis([0,max_dim,0,max_dim]) ax.set_autoscale_on(False) for i in xrange(1000): xy = np.random.random(2*num).reshape(num,2) - 0.5 offsets = scat.get_offsets() + 0.3 * xy offsets.clip(0, max_dim, offsets) scat.set_offsets(offsets) scat._sizes += 30 * (np.random.random(num) - 0.5) scat._sizes.clip(1, 300, scat._sizes) ram = cStringIO.StringIO() fig.savefig(ram, format='raw', dpi=fig.dpi) ram.close()
This takes ~23.5 seconds. Basically, savefig
just sets some default parameters and calls print_raw
, in this case, so there's very little difference.
Now, if we compare a raw image format with a compressed image format (png), we see a much more significant difference:
import matplotlib.pyplot as plt import numpy as np import cStringIO fig = plt.figure() ax = fig.add_subplot(111) num = 50 max_dim = 10 x = max_dim / 2 * np.ones(num) s, c = 100 * np.random.random(num), np.random.random(num) scat = ax.scatter(x,x,s,c) ax.axis([0,max_dim,0,max_dim]) ax.set_autoscale_on(False) for i in xrange(1000): xy = np.random.random(2*num).reshape(num,2) - 0.5 offsets = scat.get_offsets() + 0.3 * xy offsets.clip(0, max_dim, offsets) scat.set_offsets(offsets) scat._sizes += 30 * (np.random.random(num) - 0.5) scat._sizes.clip(1, 300, scat._sizes) ram = cStringIO.StringIO() fig.canvas.print_png(ram) ram.close()
This takes ~52 seconds! Obviously, there's a lot of overhead in compressing an image.
At any rate, this is probably a needlessly complex example... I think I just wanted to avoid actual work...
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With