Is there a keras method to split data?

Tags:

I think the title is self explanatory but to ask it in details, there's sklearn's method train_test_split() which works like: X_train, X_test, Y_train, Y_test = train_test_split(X, Y, test_size = 0.3, stratify = Y) It means: the method will split data with 0.3 : 0.7 ratio and will try to make percentage of labels in both data equal. Is there a keras equivalent of this?

979

asked Feb 01 '18 15:02

CerushDope

1 Answers

Now there is using the keras Dataset class. I'm running keras-2.2.4-tf along with the new tensorflow release.

Basically, load all the data into a Dataset using something like tf.data.Dataset.from_tensor_slices. Then split the data into new datasets for training and validation. For example, shuffle all the records in the dataset. Then use all but the first 400 as training and the first 400 as validation.

ds = ds_in.shuffle(buffer_size=rec_count)
ds_train = ds.skip(400)
ds_validate = ds.take(400)

An instance of the Dataset class is a natural container to pass around for the Keras models. I copied the concept from a tensorflow or keras training example but can't seem to find it again.

The canned datasets using the load_data method create numpy.ndarray classes so they are a little different but can be easily converted to a keras Dataset. I suspect this hasn't been done because so much existing code would break.

169

answered Sep 27 '22 21:09

dturvene

Related questions
                            
                                Imports behave differently when in __init__.py that is imported
                            
                                Python and OpenCV - Improving my lane detection algorithm
                            
                                Python - Loop parallelisation with joblib
                            
                                HOW TO LABEL the FEATURE IMPORTANCE with forests of trees?
                            
                                How to free memory of python deleted object?
                            
                                A surprise with 1**math.nan and 0j**math.nan
                            
                                How to Bind and Send from Google Cloud Forwarding Rule IP Address?
                            
                                Upload a CSV file and read it in Bokeh Web app
                            
                                Test Driven Development (TDD) for Web Scraping
                            
                                Bokeh: DataTable - how to set selected rows
                            
                                python docx.opc.exceptions.PackageNotFoundError: Package not found when opening Document
                            
                                pidbox received method enable_events() [reply_to:None ticket:None] in Django-Celery
                            
                                Check view method parameter name in Django class based views
                            
                                Segmentation with Single Point Class Annotations via Graph Cuts?
                            
                                SharePlum error : "Can't get User Info List"
                            
                                Getting glob to follow symlinks in Python
                            
                                Pythonocc/Opencascade | Create pipe along straight lines through points, profile wont change normal
                            
                                Nothing is being detected in Tensorflow Object detection API
                            
                                Which operator (+ vs +=) should be used for performance? (In-place Vs not-in-place)
                            
                                difference between pandas read sql query and read sql table

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Is there a keras method to split data?

Tags:

python

machine-learning

keras

scikit-learn

CerushDope

People also ask

1 Answers

dturvene

Recent Activity

Donate For Us