Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

New posts in large-language-model

Streaming ChatGPT's results with Flask and LangChain

Ollama - How to inject context or get model to answer based on context? [closed]

Langchain not returning full response expected from OpenAI model

How to compute sentence level perplexity from hugging face language models?

translation invariance of Rotary Embedding

What is the right way to do system prompting with Ollama in Langchain using Python?

How to activate verbosity in Langchain

Figuring out general specs for running LLM models [closed]

Input type into Linear4bit is torch.float16, but bnb_4bit_compute_type=torch.float32 (default). This will lead to slow inference or training speed

ImportError: libcudart.so.12: cannot open shared object file: No such file or directory

LM Studio - Failed to load model

How do I slim down SBERT's sentencer-transformer library?