Split a data directory into training and test directories with the subdirectory structure preserved

I am interested in using ImageDataGenerator in Keras for data augmentation. However, it requires that the training and validation directories, each containing one subdirectory per class, be fed in separately, as in the example below from the Keras documentation. I have a single directory with two subdirectories for the two classes (Data/Class1 and Data/Class2). How do I randomly split this into training and validation directories?

    train_datagen = ImageDataGenerator(
        rescale=1./255,
        shear_range=0.2,
        zoom_range=0.2,
        horizontal_flip=True)

    test_datagen = ImageDataGenerator(rescale=1./255)

    train_generator = train_datagen.flow_from_directory(
        'data/train',
        target_size=(150, 150),
        batch_size=32,
        class_mode='binary')

    validation_generator = test_datagen.flow_from_directory(
        'data/validation',
        target_size=(150, 150),
        batch_size=32,
        class_mode='binary')

    model.fit_generator(
        train_generator,
        steps_per_epoch=2000,
        epochs=50,
        validation_data=validation_generator,
        validation_steps=800)

I am interested in re-running my algorithm multiple times with random training and validation data splits.

asked Oct 12 '17 by Sharanya Arcot Desai



1 Answer

Thank you guys! I was able to write my own function to create training and test data sets. Here's the code for anyone who's looking.

import os
import shutil
import numpy as np

source1 = "/source_dir"
dest11 = "/dest_dir"

# Move roughly 20% of the files, chosen at random, into the test directory
files = os.listdir(source1)
for f in files:
    if np.random.rand(1) < 0.2:
        shutil.move(source1 + '/' + f, dest11 + '/' + f)
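
The snippet above moves about 20% of the files out of a single flat directory. For the layout in the question (Data/Class1, Data/Class2), a minimal sketch of the same idea that preserves the class subdirectories and can be re-run for fresh splits (the function name, paths, split fraction, copy-instead-of-move choice, and seed handling are illustrative assumptions, not part of the accepted code):

    import os
    import random
    import shutil

    def split_dataset(data_dir, train_dir, val_dir, val_fraction=0.2, seed=None):
        """Randomly copy files from data_dir/<class>/ into train_dir/<class>/ and
        val_dir/<class>/, preserving the class subdirectory structure."""
        rng = random.Random(seed)  # fixed seed -> same split; None -> new split each run
        for class_name in os.listdir(data_dir):
            class_path = os.path.join(data_dir, class_name)
            if not os.path.isdir(class_path):
                continue
            os.makedirs(os.path.join(train_dir, class_name), exist_ok=True)
            os.makedirs(os.path.join(val_dir, class_name), exist_ok=True)
            for fname in os.listdir(class_path):
                target = val_dir if rng.random() < val_fraction else train_dir
                shutil.copy2(os.path.join(class_path, fname),
                             os.path.join(target, class_name, fname))

    # For example:
    # split_dataset('Data', 'data/train', 'data/validation', val_fraction=0.2, seed=42)

Copying rather than moving leaves the original Data directory intact, so a new random split can be generated for each run of the algorithm by changing (or omitting) the seed.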
answered Oct 02 '22 by Sharanya Arcot Desai