What's the difference between data time major and batch major?

1 Answer

Trying to put it in simplest terms: these are different representations (or arrangements) of the same data.

2D example

For example, imagine you have data like this (just for the sake of illustration, not real data):

1 11 21 31
2 12 22 32
3 13 23 33
...
100 111 121 131

... where each row corresponds to a training input and each column corresponds to a different feature. The matrix has size (batch_size, features), where batch_size=100 and features=4.

Next, in some cases, you may get a transposed matrix as input (for instance, it's an output from the previous step):

1 2 3 ... 100
11 12 13 ... 111
21 22 23 ... 121
31 32 33 ... 131

In this case, the matrix shape is (features, batch_size). Note: the data itself doesn't change. Only the arrangement of the array dimensions has changed: batch is axis 0 in the first example and axis 1 in the second. Also note that one can swap between the two representations very easily and efficiently. In TensorFlow, it can be done with tf.transpose.
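As a minimal sketch of the 2D case, here is the same idea in NumPy (used here only to keep the example small; the toy values from np.arange are an assumption and differ from the numbers above). tf.transpose applied to a 2D tensor swaps the axes in the same way.

```python
import numpy as np

batch_size, features = 100, 4

# Toy batch-major matrix: one row per training input, one column per feature.
batch_major = np.arange(batch_size * features).reshape(batch_size, features)

# Swap the two axes; the values are the same, only the arrangement changes.
feature_major = batch_major.T

print(batch_major.shape)    # (100, 4)
print(feature_major.shape)  # (4, 100)

# Element (i, j) of the original is element (j, i) of the transposed matrix.
assert batch_major[2, 3] == feature_major[3, 2]
```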

Time major vs Batch major

When it comes to RNNs, the tensors usually have rank 3 or higher, but the idea stays the same. If the input is (batch_size, sequence_num, features), it's called batch major, because axis 0 is the batch dimension. If the input is (sequence_num, batch_size, features), it's called time major, because axis 0 is the sequence (time) dimension. The features are always the last dimension (at least I don't know of real cases where they are not), so there's no further variety in naming.
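The two rank-3 layouts can be sketched with NumPy as follows. The sizes (2, 3, 4) are arbitrary assumptions chosen for readability. In TensorFlow the equivalent call would be tf.transpose(x, perm=[1, 0, 2]).

```python
import numpy as np

batch_size, sequence_num, features = 2, 3, 4

# Batch major: axis 0 indexes the sequences in the batch.
batch_major = np.arange(batch_size * sequence_num * features).reshape(
    batch_size, sequence_num, features
)

# Time major: swap axes 0 and 1, leave the feature axis last.
time_major = np.transpose(batch_major, (1, 0, 2))

print(batch_major.shape)  # (2, 3, 4)
print(time_major.shape)   # (3, 2, 4)

# The feature vector of sequence b at step t is the same in both layouts.
b, t = 1, 2
assert np.array_equal(batch_major[b, t], time_major[t, b])
```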

Depending on the network structure, it might expect specifically the batch or the time as axis 0, so the format of the input data matters. And depending on the previous layers, one can get either of those representations to be fed into an RNN. So a conversion from one arrangement to the other might be required, either by the library function or by the caller. As far as I can remember, batch major is the default in TensorFlow and Keras, so it simply boils down to what shape is produced by the layer just before the RNN.

Once again: there is a one-to-one correspondence between batch major and time major representations. Any tensor can be represented as both. But for a particular implementation, one of those can be expected or required.
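When the caller has to do the conversion, it could look roughly like the sketch below. The helper to_layout and its layout names are hypothetical, made up for this illustration and not part of any library. The round trip at the end shows the one-to-one correspondence: converting there and back returns the original tensor.

```python
import numpy as np

def to_layout(x, current, expected):
    """Return x arranged as `expected` ("batch_major" or "time_major").

    Hypothetical helper for illustration only.
    """
    layouts = ("batch_major", "time_major")
    if current not in layouts or expected not in layouts:
        raise ValueError(f"layout must be one of {layouts}")
    if current == expected:
        return x
    # Both conversions are the same operation: swap axes 0 and 1.
    return np.swapaxes(x, 0, 1)

# (batch_size, sequence_num, features) with arbitrary example sizes
x = np.arange(32 * 10 * 8).reshape(32, 10, 8)

y = to_layout(x, "batch_major", "time_major")
print(y.shape)  # (10, 32, 8)

# Converting back recovers the original arrangement exactly.
round_trip = to_layout(y, "time_major", "batch_major")
assert np.array_equal(round_trip, x)
```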

answered Oct 22 '22 17:10

Maxim

Tags:

python

machine-learning

tensorflow

lstm

rnn

Chemss-Eddine BenHassine
