LLM
9 pages tagged "LLM"
What are large language models?
How can progress in non-agentic LLMs lead to capable AI agents?
What is reinforcement learning from human feedback (RLHF)?
What is "jailbreaking" a large language model (LLM)?
What is "wireheading"?
How can LLMs be understood as "simulators"?
What does it mean when an AI is "hallucinating"?
What alignment techniques are used on LLMs?
What is scaffolding?