How to perform k-fold cross validation with tensorflow?

Tags:

I am following the IRIS example of tensorflow.

My case now is I have all data in a single CSV file, not separated, and I want to apply k-fold cross validation on that data.

I have

data_set = tf.contrib.learn.datasets.base.load_csv(filename="mydata.csv",                                                    target_dtype=np.int)

How can I perform k-fold cross validation on this dataset with multi-layer neural network as same as IRIS example?

212

asked Sep 28 '16 13:09

mommomonthewind

1 Answers

I know this question is old but in case someone is looking to do something similar, expanding on ahmedhosny's answer:

The new tensorflow datasets API has the ability to create dataset objects using python generators, so along with scikit-learn's KFold one option can be to create a dataset from the KFold.split() generator:

import numpy as np  from sklearn.model_selection import LeaveOneOut,KFold  import tensorflow as tf import tensorflow.contrib.eager as tfe tf.enable_eager_execution()  from sklearn.datasets import load_iris data = load_iris() X=data['data'] y=data['target']  def make_dataset(X_data,y_data,n_splits):      def gen():         for train_index, test_index in KFold(n_splits).split(X_data):             X_train, X_test = X_data[train_index], X_data[test_index]             y_train, y_test = y_data[train_index], y_data[test_index]             yield X_train,y_train,X_test,y_test      return tf.data.Dataset.from_generator(gen, (tf.float64,tf.float64,tf.float64,tf.float64))  dataset=make_dataset(X,y,10)

Then one can iterate through the dataset either in the graph based tensorflow or using eager execution. Using eager execution:

for X_train,y_train,X_test,y_test in tfe.Iterator(dataset):     ....

156

answered Sep 21 '22 21:09

Dan Reia

Related questions
                            
                                what does axes.flat in matplotlib do?
                            
                                Is there a difference between capital and lowercase string prefixes?
                            
                                Matplotlib: TypeError: 'AxesSubplot' object is not subscriptable [duplicate]
                            
                                How to sort one list based on another? [duplicate]
                            
                                Django model class methods for predefined values
                            
                                ctypes loading a c shared library that has dependencies
                            
                                Exploitable Python Functions [closed]
                            
                                Overflow in exp in scipy/numpy in Python?
                            
                                Remove Max and Min values from python list of integers
                            
                                Python: How to get group ids of one username (like id -Gn )
                            
                                How to convert an image from np.uint16 to np.uint8?
                            
                                Why does json.dumps(list(np.arange(5))) fail while json.dumps(np.arange(5).tolist()) works
                            
                                How to set and get a parent class attribute from an inherited class in Python?
                            
                                Animate a rotating 3D graph in matplotlib
                            
                                Android Market API - Python ImportError: No module named google.protobuf
                            
                                Is "norm" equivalent to "Euclidean distance"?
                            
                                Impute entire DataFrame (all columns) using Scikit-learn (sklearn) without iterating over columns
                            
                                Read all but last line of CSV file in pandas
                            
                                Python dictionary doesn't have all the keys assigned, or items
                            
                                PhantomJS with Selenium error: Message: 'phantomjs' executable needs to be in PATH

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to perform k-fold cross validation with tensorflow?

Tags:

python

tensorflow

train-test-split

cross-validation

mommomonthewind

People also ask

1 Answers

Dan Reia

Recent Activity

Donate For Us