I've been working with HDF5 files in C and MATLAB, both using the same approach for reading from and writing to datasets: h5f, h5d, h5s, and so on.

Now I'm working in Python, and with its h5py library I see that it offers two ways to manage HDF5: a high-level and a low-level interface. With the former, it takes fewer lines of code to get the information from a single variable in the file.

Is there any noticeable loss of performance when using the high-level interface? For example, when dealing with a file that contains many variables, and we need to read just one of them.
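To illustrate what I mean, here is roughly how reading one dataset looks in each interface. This is just a sketch; the file name data.h5 and the dataset name variable are made up:

```python
import h5py
import numpy as np

# High-level: read one whole dataset (names are hypothetical).
with h5py.File("data.h5", "r") as f:
    arr = f["variable"][()]

# Low-level: the same read via the h5f/h5d/h5s-style API.
fid = h5py.h5f.open(b"data.h5", h5py.h5f.ACC_RDONLY)
did = h5py.h5d.open(fid, b"variable")
shape = did.get_space().get_simple_extent_dims()
out = np.empty(shape, dtype=did.dtype)
did.read(h5py.h5s.ALL, h5py.h5s.ALL, out)
fid.close()  # remaining low-level IDs are released when they go out of scope
```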
Performance (and file size) can also depend on your chunk layout: the smaller the chunks, the more your HDF5 file will be bloated by per-chunk overhead. Try to find a balance between chunk sizes that serve your access pattern and the overhead (size-wise) that they introduce in the HDF5 file.
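Chunking is chosen when a dataset is created. A minimal sketch, where the shapes and the gzip compression choice are arbitrary placeholders, not a recommendation:

```python
import h5py
import numpy as np

data = np.zeros((10_000, 1_000), dtype=np.float32)

with h5py.File("chunked.h5", "w") as f:
    # Chunks of 1000 full rows each: reasonable for row-block reads,
    # while keeping the number of chunks (and their overhead) modest.
    f.create_dataset("big", data=data,
                     chunks=(1_000, 1_000),
                     compression="gzip")
```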
An HDF5 file is a container for two kinds of objects: datasets, which are array-like collections of data, and groups, which are folder-like containers that hold datasets and other groups. The most fundamental thing to remember when using h5py is: groups work like dictionaries, and datasets work like NumPy arrays.
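A small, self-contained example of that dictionary/array behavior; the group and dataset names here are invented for illustration:

```python
import h5py
import numpy as np

with h5py.File("example.h5", "w") as f:
    grp = f.create_group("measurements")               # groups nest like folders
    grp.create_dataset("temperature", data=np.arange(100.0))

with h5py.File("example.h5", "r") as f:
    print(list(f.keys()))                 # ['measurements'] -- dictionary-like
    dset = f["measurements/temperature"]  # path-style lookup
    print(dset.shape, dset.dtype)         # (100,) float64 -- NumPy-like
    print(dset[:5])                       # [0. 1. 2. 3. 4.]
```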
Supports Large, Complex Data: HDF5 supports compression and is designed to handle large, heterogeneous, and complex datasets. Supports Data Slicing: "data slicing", or extracting portions of a dataset as needed for analysis, means large files don't need to be read completely into the computer's memory (RAM).
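Slicing a dataset in h5py reads only the requested region from disk. A brief sketch, reusing the hypothetical file from the example above:

```python
import h5py

with h5py.File("example.h5", "r") as f:
    dset = f["measurements/temperature"]
    window = dset[10:20]  # only these 10 values are read into RAM,
                          # not the whole dataset
print(window)
```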
Encodings: HDF5 supports two string encodings, ASCII and UTF-8.
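In h5py, the encoding is selected through h5py.string_dtype. A short sketch; the file and dataset names are made up:

```python
import h5py

with h5py.File("strings.h5", "w") as f:
    # UTF-8 variable-length strings (suits Python str data).
    f.create_dataset("names", data=["héllo", "wörld"],
                     dtype=h5py.string_dtype(encoding="utf-8"))
    # ASCII strings, written as bytes.
    f.create_dataset("codes", data=[b"A1", b"B2"],
                     dtype=h5py.string_dtype(encoding="ascii"))
```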
High-level interfaces generally come with some performance cost. Whether that cost is noticeable (and worth investigating) depends on what exactly you are doing in your code.
Just start with the high-level interface. If the code is too slow overall, profile it, move the bottlenecks down to the low-level interface, and see if that helps.
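For the profiling step, something as simple as timeit can tell you whether the interface overhead matters for your access pattern. A sketch, where data.h5 and variable are again assumed names:

```python
import timeit
import h5py

def read_one_variable():
    # High-level read of a single dataset from a many-variable file.
    with h5py.File("data.h5", "r") as f:
        return f["variable"][()]

# If this is fast enough for your workload, there is no need
# to drop down to the h5f/h5d/h5s layer.
print(timeit.timeit(read_one_variable, number=100))
```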