Passing tensorDataset or Dataloader to skorch

Tags: machine-learning, deep-learning, computer-vision, pytorch, skorch

I want to apply cross-validation in PyTorch using skorch. I prepared my model and a TensorDataset that returns (image, caption, captions_length). Since the dataset already contains both X and y, I cannot pass y separately in the call:

net.fit(dataset)

But when I tried that, I got an error:

ValueError: Stratified CV requires explicitly passing a suitable y

Here's part of my code:

start = time.time()
net = NeuralNetClassifier(
    decoder,
    criterion=nn.CrossEntropyLoss,
    max_epochs=args.epochs,
    lr=args.lr,
    optimizer=optim.SGD,
    device='cuda',  # train on the GPU
)
net.fit(dataset, y=None)  # raises the ValueError shown above
end = time.time()

asked Jun 07 '19 08:06

Omar Abdelaziz

1 Answer

You are (implicitly) using skorch's internal CV split. For NeuralNetClassifier this split is stratified, so it needs to know the labels beforehand.

When X and y are passed to fit separately, this works fine because y is accessible at all times. The problem is that you are using a torch.utils.data.Dataset, which is lazy and does not give direct access to y, hence the error.

Your options are the following.

Set `train_split=None` to disable the internal CV split

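from skorch import NeuralNetClassifier

net = NeuralNetClassifier(
    decoder,  # the module from the question; other arguments as before
    train_split=None,  # disable the internal train/validation split
)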

You will lose internal validation and, as such, features like early stopping.

Split your data beforehand

Split your dataset into two datasets, dataset_train and dataset_valid, then use skorch.helper.predefined_split:

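from skorch.helper import predefined_split

net = NeuralNetClassifier(
    decoder,  # the module from the question; other arguments as before
    train_split=predefined_split(dataset_valid),
)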

You lose nothing, but depending on your data, producing the split might be complicated.
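One possible way to produce the two datasets is `torch.utils.data.random_split`. This is a minimal sketch on made-up toy data; the 80/20 ratio and the seed are arbitrary assumptions, and the split is random rather than stratified.

```python
import torch
from torch.utils.data import TensorDataset, random_split

# Hypothetical toy data standing in for the real dataset.
X = torch.randn(100, 4)
y = torch.randint(0, 3, (100,))
dataset = TensorDataset(X, y)

# Seeded generator so the split is reproducible; 80/20 is an arbitrary choice.
generator = torch.Generator().manual_seed(0)
dataset_train, dataset_valid = random_split(dataset, [80, 20], generator=generator)

print(len(dataset_train), len(dataset_valid))
```

`dataset_valid` can then be handed to `predefined_split`, and `dataset_train` to `net.fit(dataset_train, y=None)`. If class balance matters, the indices would have to be split in a stratified way instead, which this sketch does not do.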

Extract your `y` and pass it to fit

import numpy as np

y_train = np.array([y for X, y in my_dataset])
net.fit(my_dataset, y=y_train)

This only works if your y fits into memory. Since you are using TensorDataset, you can also read y directly from the tensors the dataset wraps:

y_train = my_dataset.tensors[1]  # position of the label tensor in the TensorDataset constructor
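To make the two extraction routes concrete, here is a small sketch on made-up data. It assumes a two-tensor `TensorDataset` built as `TensorDataset(X, y)`, so the labels sit at index 1 of its `tensors` attribute; a dataset with a different layout would need a different index.

```python
import numpy as np
import torch
from torch.utils.data import TensorDataset

# Hypothetical toy data: 10 samples with integer class labels.
X = torch.randn(10, 4)
y = torch.randint(0, 3, (10,))
my_dataset = TensorDataset(X, y)

# Generic route: index every sample and collect its label.
# Works for any map-style dataset, but touches every item once.
y_loop = np.array([int(my_dataset[i][1]) for i in range(len(my_dataset))])

# TensorDataset route: the wrapped tensors are stored in `.tensors`,
# in the order they were passed to the constructor.
y_direct = my_dataset.tensors[1].numpy()

print(np.array_equal(y_loop, y_direct))
```

Either array can then be passed as `y` in `net.fit(my_dataset, y=...)` so that the stratified split has the labels it needs.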

answered Oct 03 '22 00:10

nemo
