I have data coming from a csv which has a few thousand columns and ten thousand (or so) rows. Within each column the data is of the same type, but different columns have data of different type*. Previously I have been pickling the data from numpy and storing on disk, but it's quite slow, especially because usually I want to load some subset of the columns rather than all of them. I want to put the data into hdf5 using pytables, and my first approach was to put the data in a single table, with one hdf5 column per csv column. Unfortunately this didn't work, I assume because of the 512 (soft) column limit. What is a sensible way to store this data? * I mean, the type of the data after it has been converted from text.

How to store wide tables in pytables / hdf5

Tags:

I have data coming from a csv which has a few thousand columns and ten thousand (or so) rows. Within each column the data is of the same type, but different columns have data of different type*. Previously I have been pickling the data from numpy and storing on disk, but it's quite slow, especially because usually I want to load some subset of the columns rather than all of them.

I want to put the data into hdf5 using pytables, and my first approach was to put the data in a single table, with one hdf5 column per csv column. Unfortunately this didn't work, I assume because of the 512 (soft) column limit.

What is a sensible way to store this data?

* I mean, the type of the data after it has been converted from text.

Related questions
                            
                                Is there a way to tell if a classpath resource is a file or a directory?
                            
                                Computer Blocking CORS OPTIONS Request
                            
                                Elasticsearch cache clear doesn't seems to do what I expected
                            
                                PHP traits - change value of static property in inherited class
                            
                                What is the counterpart of iOS' QLPreviewController or UIDocumentInteractionController on Android?
                            
                                Python - How to Concatenate Strings in a Successive Way?
                            
                                Difference between Angular Dart and Polymer Dart
                            
                                JAX-RS: OPTIONS for every Resource
                            
                                Naming two kinds of scope in Java
                            
                                How can I invoke an unknown Rust function with some arguments using reflection?
                            
                                Best Practice for setting unique Chef Node Attributes
                            
                                ASM call conventions

How to store wide tables in pytables / hdf5

Tags:

Recent Activity

Donate For Us