Is there any way to train a sklearn model on data stored on disk, such as HDF5?

1 Answer

What you are asking for is called out-of-core or streaming learning. It is only possible with the subset of scikit-learn models that implement the partial_fit method for incremental fitting.
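As a rough illustration of incremental fitting, here is a minimal sketch that feeds chunks to partial_fit one at a time. SGDClassifier is just one estimator that implements partial_fit; the synthetic chunks, the chunk size and the labelling rule are assumptions made up for the example and stand in for data read from disk.

```python
import numpy as np
from sklearn.linear_model import SGDClassifier

rng = np.random.default_rng(0)
model = SGDClassifier(random_state=0)

# For classifiers, the full set of labels has to be given on the first
# partial_fit call, because a single chunk may not contain every class.
classes = np.array([0, 1])

for _ in range(20):
    # Stand-in for "load the next chunk from disk": synthetic data (assumption).
    X_chunk = rng.normal(size=(200, 5))
    y_chunk = (X_chunk[:, 0] > 0).astype(int)
    # Each call updates the model in place using only this chunk.
    model.partial_fit(X_chunk, y_chunk, classes=classes)

# Evaluate on a fresh synthetic chunk drawn the same way.
X_test = rng.normal(size=(500, 5))
y_test = (X_test[:, 0] > 0).astype(int)
accuracy = model.score(X_test, y_test)
```

Only one chunk is held in memory at a time, which is what makes the approach usable for datasets larger than RAM.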

There is an example in the documentation. There is no specific utility for fitting models on HDF5 data in particular, but you can adapt this example to fetch the data from any external data source (e.g. HDF5 data on the local disk, or a database over the network, for instance using the pandas SQL adapter).
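One possible way to adapt this to HDF5 is sketched below using h5py. This is not the documentation example itself: the helper names (iter_batches, fit_out_of_core, fit_from_hdf5), the dataset names "X" and "y" inside the file, and the batch size are all hypothetical. It relies on the fact that slicing an h5py dataset reads only that slice from disk and returns a NumPy array.

```python
import numpy as np
from sklearn.linear_model import SGDClassifier


def iter_batches(X, y, batch_size):
    """Yield consecutive (X, y) slices.

    Works for anything that supports .shape and slicing,
    e.g. NumPy arrays or h5py datasets.
    """
    n_samples = X.shape[0]
    for start in range(0, n_samples, batch_size):
        stop = min(start + batch_size, n_samples)
        yield X[start:stop], y[start:stop]


def fit_out_of_core(model, X, y, classes, batch_size=1000, n_epochs=1):
    """Fit a partial_fit-capable model one batch at a time."""
    for _ in range(n_epochs):
        for X_batch, y_batch in iter_batches(X, y, batch_size):
            model.partial_fit(X_batch, y_batch, classes=classes)
    return model


def fit_from_hdf5(path, classes, batch_size=1000):
    """Stream batches from an HDF5 file (dataset names "X" and "y" are assumed)."""
    import h5py  # imported here so the helpers above do not require h5py

    with h5py.File(path, "r") as f:
        model = SGDClassifier(random_state=0)
        # f["X"] and f["y"] stay on disk; only one slice is loaded per step.
        return fit_out_of_core(model, f["X"], f["y"], classes, batch_size)
```

A call could look like `fit_from_hdf5("train.h5", classes=np.array([0, 1]))`, where the file name is a placeholder. Note that batches are read in the order they are stored; if the file is sorted by label, it is probably worth shuffling it beforehand or visiting the batches in random order, since SGD-style models tend to learn poorly from ordered data.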

answered Nov 03 '22 09:11

ogrisel

Tags:

machine-learning

scikit-learn

erogol
