I want to train GPT-2 from scratch, but the articles I've found only cover fine-tuning pre-trained models. I've used https://github.com/nshepperd/gpt-2 to train with an existing model. Should I edit these Python scripts to train from scratch?
There is a project called Teachable NLP that lets you train a GPT-2 model on your own text. It is very easy to use: you simply upload your text file and it trains the GPT-2 model automatically.
Both GPT-2 and GPT-3 are unsupervised transformer models trained to generate text by predicting the next word in a sequence of tokens. The GPT-2 model has 1.5 billion parameters and was trained on a dataset of 8 million web pages.
GPT-2 was known to perform poorly on tasks in specialized areas such as music and storytelling. GPT-3 goes further, handling tasks such as answering questions, writing essays, text summarization, language translation, and generating computer code.
I found the answer in the 'Issues' section of this repo: https://github.com/nshepperd/gpt-2
If you want to not use the released model at all, for instance because you want to train a model with incompatible hyperparameters, it should be sufficient to just skip the restore from the released model checkpoint (around train.py:164-177) on your first run so the parameters will all be randomly initialized.
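For reference, here is a minimal sketch of that idea, not the actual train.py code: the `train_from_scratch` flag and the `dummy_param` variable are made up for illustration, and in the real script you would simply skip or guard the `saver.restore(...)` call around train.py:164-177 so that `global_variables_initializer()` leaves every parameter randomly initialized.

```python
import tensorflow as tf

# nshepperd/gpt-2 targets TF 1.x; tf.compat.v1 gives the same graph/session API.
tf.compat.v1.disable_eager_execution()

# Hypothetical flag standing in for the edit; the real script hard-codes the restore.
train_from_scratch = True

# Stand-in for the GPT-2 graph the real script builds.
dummy_param = tf.compat.v1.get_variable(
    'dummy_param', shape=[4, 4],
    initializer=tf.compat.v1.random_normal_initializer())

saver = tf.compat.v1.train.Saver()

with tf.compat.v1.Session() as sess:
    # Random initialization for every parameter -- this is all you need
    # when training from scratch.
    sess.run(tf.compat.v1.global_variables_initializer())

    if not train_from_scratch:
        # Fine-tuning path: restore the released checkpoint instead.
        ckpt = tf.train.latest_checkpoint('models/124M')
        saver.restore(sess, ckpt)

    print(sess.run(dummy_param))
```

Note that if your hyperparameters (vocabulary size, context length, layer sizes) differ from the released model's, skipping the restore is required anyway, since the checkpoint's variables would not match your graph.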