I'm doing transfer learning with Inception on TensorFlow, following this training code: https://raw.githubusercontent.com/tensorflow/hub/master/examples/image_retraining/retrain.py
At the bottom of the script we can set the parameters for our dataset (the training/validation/test percentages and the training/validation/test batch sizes).
Let's say I have a very large dataset (1 million images) and I have already set the training:validation:testing percentages to 75:15:10.
But I have no idea how to set the batch parameters correctly. For now I set train_batch_size to 64. Do I need to use the same value for validation_batch_size, or should it be bigger or smaller than train_batch_size?
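Just as a sanity check on my own numbers (this arithmetic is mine, not something retrain.py prints), this is what that split and batch size would mean:

```python
import math

# My own back-of-the-envelope check (assuming 1,000,000 images, a 75:15:10 split,
# and the train_batch_size of 64 that I'm unsure about).
total_images = 1_000_000
train_n = int(total_images * 0.75)   # 750,000 training images
val_n   = int(total_images * 0.15)   # 150,000 validation images
test_n  = int(total_images * 0.10)   # 100,000 test images

train_batch_size = 64
steps_per_epoch = math.ceil(train_n / train_batch_size)
print(train_n, val_n, test_n, steps_per_epoch)  # 750000 150000 100000 11719
```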
The batch size affects indicators such as the overall training time, the training time per epoch, and the quality of the model. Usually we choose the batch size as a power of two, in the range between 16 and 512, and 32 is a common rule of thumb and a good initial choice.
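For illustration only (the dataset size below is made up), here is how the batch size changes the number of weight updates per epoch:

```python
import math

num_train_examples = 100_000  # hypothetical training set size, for illustration only

# The usual power-of-two candidates between 16 and 512.
for batch_size in (16, 32, 64, 128, 256, 512):
    steps_per_epoch = math.ceil(num_train_examples / batch_size)
    print(f"batch_size={batch_size:4d} -> {steps_per_epoch:5d} steps per epoch")
```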
A commonly used ratio is 80:20, which means 80% of the data is for training and 20% for testing. Other ratios such as 70:30, 60:40, and even 50:50 are also used in practice.
In general, putting 80% of the data in the training set, 10% in the validation set, and 10% in the test set is a good split to start with.
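As a rough sketch (the dataset size and the use of NumPy here are just assumptions for illustration), an 80/10/10 split can be produced like this:

```python
import numpy as np

# Hypothetical example: shuffle indices for 50,000 examples, then cut 80/10/10.
rng = np.random.default_rng(seed=0)
indices = rng.permutation(50_000)

n_train = int(0.8 * len(indices))
n_val   = int(0.1 * len(indices))

train_idx = indices[:n_train]
val_idx   = indices[n_train:n_train + n_val]
test_idx  = indices[n_train + n_val:]   # the remaining ~10%

print(len(train_idx), len(val_idx), len(test_idx))  # 40000 5000 5000
```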
You can follow the advice from the other answers for the dataset split ratio. However, the batch size has absolutely nothing to do with how you've split your datasets.
The batch size determines how many training examples are processed in parallel for training/inference. The batch size at training time can affect both how fast and how well your training converges; you can find a discussion of this effect here. Thus, for train_batch_size it's worth picking a value that is neither too small nor too large (as in the discussion linked above). For some applications, using the largest possible training batches can actually be desirable, but in general you select it through experiments and validation.
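A minimal sketch of what "select it through experiments and validation" can look like; the train_and_validate function below is a placeholder for your actual retraining run, not part of retrain.py:

```python
def train_and_validate(train_batch_size):
    # Placeholder: run your retraining with this batch size and return the
    # validation accuracy. The numbers below are fake, only so the sketch runs.
    fake_scores = {32: 0.91, 64: 0.93, 128: 0.92, 256: 0.90}
    return fake_scores[train_batch_size]

candidate_batch_sizes = [32, 64, 128, 256]
results = {bs: train_and_validate(bs) for bs in candidate_batch_sizes}
best_bs = max(results, key=results.get)
print("best train_batch_size:", best_bs)  # picks whichever validated best
```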
However, for validation_batch_size and test_batch_size, you should pick the largest batch size that your hardware can handle without running out of memory and crashing. Finding this is usually a simple trial-and-error process. The larger your batch size at inference time, the faster inference will be, since more inputs can be processed in parallel.
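As one possible sketch of that trial and error, assuming a Keras-style model object (model) and the 299x299x3 input size that Inception expects, neither of which comes from retrain.py itself, you could keep doubling the batch size until prediction runs out of memory:

```python
import numpy as np
import tensorflow as tf

def largest_inference_batch(model, input_shape=(299, 299, 3), start=64, limit=8192):
    # Double the batch size until prediction fails with an out-of-memory error,
    # then return the last size that worked. `model` is an already-loaded Keras model.
    batch_size, largest_ok = start, None
    while batch_size <= limit:
        try:
            dummy = np.zeros((batch_size, *input_shape), dtype=np.float32)
            model.predict(dummy, batch_size=batch_size, verbose=0)
            largest_ok = batch_size
            batch_size *= 2
        except tf.errors.ResourceExhaustedError:
            break
    return largest_ok
```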
EDIT: Here's an additional useful link (p. 276) for the training batch size trade-off, from Goodfellow et al.'s Deep Learning book.