AI/ML

RLHF (Reinforcement Learning from Human Feedback)

Também chamado de:RLHF

📖O que é

A training technique that aligns LLM outputs with human preferences. Process: (1) train a reward model from human comparisons of outputs, (2) use reinforcement learning (PPO) to optimize the LLM against the reward model. RLHF makes models more helpful, harmless, and honest. Used by Claude, ChatGPT, and other assistants. Alternatives include DPO (Direct Preference Optimization) and Constitutional AI.

Sua exploração

0 termos visitados no total

Termos relacionados explorados0/2

Termos Relacionados

LLM (Modelo de Linguagem Grande)AI/ML

A neural network trained on vast text corpora to understand and generate human language. L…

Ver termo →

Training (ML)AI/ML

The process of optimizing a model's parameters by exposing it to data and adjusting weight…

Ver termo →

Voltar ao glossário