I have a list of files and I want to split it into three parts: training, validation, and testing. I have tried this code, but I don't know whether it is correct.
files = glob.glob("/dataset/%s/*" % emotion)
training = files[:int(len(files)*0.8)] #get first 80% of file list
validation = files[-int(len(files)*0.1):] #get middle 10% of file list
testing = files[-int(len(files)*0.1):] #get last 10% of file list
I am not sure whether the testing list duplicates the validation list or is the correct last 10% of the file list.
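The testing list is the correct last 10%, but the validation list duplicates it: both use the slice files[-int(len(files)*0.1):]. The middle 10% would be files[int(len(files)*0.8):int(len(files)*0.9)]. Alternatively, you can take advantage of numpy split (with numpy imported as np), which cuts the list at the 80% and 90% marks: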
train, validate, test = np.split(files, [int(len(files)*0.8), int(len(files)*0.9)])
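If you would rather keep plain Python lists (np.split returns NumPy arrays), the same 80/10/10 split can be sketched with ordinary slicing. This is only a minimal sketch: the helper name split_files, its parameters, and the optional seeded shuffle are assumptions added for illustration and are not part of the original code.

```python
import random


def split_files(files, train_frac=0.8, val_frac=0.1, seed=None):
    """Split a list into training, validation, and testing parts.

    The three parts never overlap; whatever is left after the
    training and validation slices goes to testing.
    """
    files = list(files)  # copy, so the caller's list is not modified
    if seed is not None:
        # Optional (assumption): shuffle reproducibly before splitting
        random.Random(seed).shuffle(files)
    train_end = int(len(files) * train_frac)
    val_end = train_end + int(len(files) * val_frac)
    training = files[:train_end]           # first 80%
    validation = files[train_end:val_end]  # middle 10%
    testing = files[val_end:]              # remaining files
    return training, validation, testing


# Hypothetical file names, used only to demonstrate the split
names = ["file_%d.wav" % i for i in range(100)]
training, validation, testing = split_files(names)
```

Because the slice boundaries are shared, every file lands in exactly one of the three lists, which avoids the duplication in the original attempt.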