Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in language-model

How is transformers loss calculated for blank token predictions?

Spacy download en_core_web_lg manually

Negative results using kenlm

python language-model

Difference between spaCy models sm, md, lg

spacy language-model

Correct way to calculate probabilities using ARPA LM data

nlp n-gram language-model

OpenAI Fine-tuning API: Why would I use LlamaIndex or LangChain instead of fine-tuning a model?

Difference between Instruction Tuning vs Non Instruction Tuning Large Language Models

How to tune a Machine Translation model with huge language model?

Understanding Character Level Embedding in Keras LSTM

Pretraining a language model on a small custom corpus

calculate perplexity in pytorch

How to compute perplexity using KenLM?

Using custom beam scorer in TensorFlow CTC (language model)

tensorflow language-model

Which model (GPT2, BERT, XLNet and etc) would you use for a text classification task? Why?

Python interface to ARPA files

NLTK package to estimate the (unigram) perplexity

TensorFlow Embedding Lookup

Input shape for Keras LSTM/GRU language model