AI/ML
Synthetic Data (AI Training)
Também chamado de:AI-Generated Training Data
📖O que é
Artificially generated training data produced by LLMs or other AI models, used to augment or replace human-annotated datasets. Techniques include prompt-based generation, retrieval-augmented pipelines, and iterative self-refinement. Synthetic data slashes costs from $5-20 per human preference point to under $0.01 per sample and became central to post-training pipelines in 2024-2025.
Sua exploração
0 termos visitados no totalTermos relacionados explorados0/3
Termos Relacionados
Knowledge DistillationAI/ML
A technique for transferring capabilities from a large 'teacher' model to a smaller 'stude…
Ver termo →DPO (Direct Preference Optimization)AI/ML
A simplified alternative to RLHF that aligns LLM outputs with human preferences without tr…
Ver termo →Fine-TuningAI/ML
The process of further training a pre-trained model on a specialized dataset to improve pe…
Ver termo →