Matlab: How can I split my data matrix into two random subsets of column vectors while keeping the label information?

Tags:

I have a data matrix X (60x208) and a matrix of labels Y (1x208). I want to split my data matrix X into two random subsets of column vectors: training (which will be 70% of the data) and testing (which will be 30% of the data), but I need to still be able to identify which label from Y corresponds to each column vector. I couldn't find any function to do this, any ideas?

EDIT: Thought I should add, there are only two labels in Y: 1 and 2 (not sure if this makes a difference)

286

asked Oct 30 '14 19:10

user3457834

1 Answers

That's pretty easy to do. Use randperm to generate a random permutation of indices from 1 up to as many points as you have... which is 208 in your case.

Once you generate this sequence, simply use this and subset into your X and Y to extract the training and test data and labels. As such, do something like this:

num_points = size(X,2);
split_point = round(num_points*0.7);
seq = randperm(num_points);
X_train = X(:,seq(1:split_point));
Y_train = Y(seq(1:split_point));
X_test = X(:,seq(split_point+1:end));
Y_test = Y(seq(split_point+1:end));

The split_point determines how many points we need to place into our training set, and we will need to round it in case this calculation yields any decimal points. I also didn't hard code 208 in there because your data set might grow and so this will work with any size data set you choose. X_train and Y_train will contain your data and labels for your training set while X_test and Y_test will contain your data and labels for your test set.

As such, the first column of X_train is your data point for the first element of your training set, with the first element of Y_train serving as the label for that particular point... and so on and so forth!

160

answered Nov 15 '22 05:11

rayryeng

Related questions
                            
                                How can I draw multiple 3d cubes in matlab
                            
                                Read multiple images on a folder in Matlab
                            
                                Dynamical access to nested fields in Matlab
                            
                                Datetick not showing enough tick marks in plot
                            
                                How to calculate word co-occurence
                            
                                Is it possible return cell array that contains one instance in several cells?
                            
                                Removing deadspace in subplots while retaining title & labels
                            
                                MATLAB Quiver - Tiny arrows
                            
                                apply bsxfun or arrayfun to every row of a matrix
                            
                                How to do circular crop using matlab?
                            
                                Comparing fsolve results in python and matlab
                            
                                MATLAB operators as functions
                            
                                Interpolation between two curves (matlab)
                            
                                Finding first samples greater than a threshold value efficiently in Python (and MATLAB comparison)
                            
                                Matlab: Why is '1' + 1 == 50?
                            
                                Close Variable Editor by commands
                            
                                Bring axes to front without redrawing the figure?
                            
                                Why are isscalar, isvector and ismatrix all true for A = 1?
                            
                                Matlab element-wise division by zero
                            
                                Fortran minimization of a function with additional arguments

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Matlab: How can I split my data matrix into two random subsets of column vectors while keeping the label information?

Tags:

label

machine-learning

matlab

sample

user3457834

People also ask

1 Answers

rayryeng

Recent Activity

Donate For Us