In Lesson 3 - planet, I saw these 2 lines of code:
lr = 0.01
learn.fit_one_cycle(5, slice(lr))
If it were slice(min_lr, max_lr), then I understand that fit_one_cycle() would use learning rates spread out between min_lr and max_lr. (Hopefully my understanding of this is correct.)
But in this case slice(lr) only has one argument.
What are the differences between fit_one_cycle(5, lr) and fit_one_cycle(5, slice(lr))? And what are the benefits of using slice(lr) instead of lr directly?
The slice inside fit_one_cycle() is used to apply discriminative learning rates. With two arguments, for example slice(1e-5, 1e-4), it tells the model to train the earliest layer group with a learning rate of 1e-5, the final group with 1e-4, and the groups in between with values spread between those two. With a single argument, slice(lr), the final layer group is trained at lr and the earlier groups at a smaller rate, whereas passing a plain lr trains every group at the same rate.
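A minimal sketch of the three ways of passing learning rates, assuming a fastai v1 Learner named learn as in the course notebooks; the exact divisor used for the earlier groups with slice(lr) is my recollection of fastai v1's behaviour, so treat it as an assumption:

lr = 0.01

# Plain number: every layer group is trained with the same learning rate.
learn.fit_one_cycle(5, lr)

# slice(lr): the final layer group gets lr, earlier groups get a smaller rate
# (lr/10 in fastai v1, if I remember correctly), so the pretrained early
# layers are changed more gently.
learn.fit_one_cycle(5, slice(lr))

# slice(min_lr, max_lr): learning rates are spread across the layer groups,
# from min_lr for the first group up to max_lr for the last group.
learn.fit_one_cycle(5, slice(1e-5, 1e-4))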
TL;DR: fit_one_cycle() uses large, cyclical learning rates to train models significantly faster and to higher accuracy. When training deep learning models with fastai, it is recommended to use fit_one_cycle() over fit() because of its better speed and accuracy.
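For reference, this is roughly how the two calls compare on a fastai v1 Learner (again assuming a Learner named learn already exists):

lr = 0.01

# Constant learning rate for all 5 epochs.
learn.fit(5, lr)

# One-cycle policy: the learning rate ramps up towards lr and then anneals
# back down over the 5 epochs (with momentum varied inversely).
learn.fit_one_cycle(5, lr)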
Jeremy took a while to explain what slice does in Lesson 5.
What I understood is that fastai.vision divides the architecture into 3 layer groups and trains them with different learning rates depending on what you pass in. (The earliest layers usually don't need large changes to their parameters.)
Additionally, if you use fit_one_cycle, every group gets learning rate annealing around its own learning rate.
Check Lesson 5 https://course.fast.ai/videos/?lesson=5 (use the transcript finder to quickly go to the 'slice' part)
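If it helps to see the spreading concretely, here is a small stand-alone sketch (my own hypothetical helper, not fastai code) of how slice(1e-5, 1e-4) could be turned into per-group rates for 3 layer groups using geometric spacing, which is how I understand fastai spreads them:

import numpy as np

def spread_lrs(min_lr, max_lr, n_groups=3):
    # Hypothetical helper: geometrically spaced learning rates from min_lr
    # (first layer group) to max_lr (last layer group).
    mult = (max_lr / min_lr) ** (1.0 / (n_groups - 1))
    return np.array([min_lr * mult ** i for i in range(n_groups)])

print(spread_lrs(1e-5, 1e-4))  # -> [1.0e-05 3.16e-05 1.0e-04] (approximately)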