AI/ML
DeepSeek
Also called: DeepSeek-R1, DeepSeek-V3
📖 What it is
DeepSeek is a Chinese AI lab that released DeepSeek-R1 in January 2025, a 671B-parameter open-weight reasoning model that performs comparably to OpenAI's o1 at significantly lower cost. DeepSeek-R1 emits its chain-of-thought reasoning visibly and is trained with GRPO (Group Relative Policy Optimization), a reinforcement learning method; the work demonstrated that pure RL with verifiable rewards can produce emergent reasoning. DeepSeek-V3 uses a Mixture-of-Experts (MoE) architecture in which only ~37B parameters are active per token.
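To make the "RL with verifiable rewards" idea concrete, here is a minimal sketch of the group-relative advantage at the core of GRPO: several answers are sampled for one prompt, each is scored by a rule-based check, and each reward is normalized against the mean and standard deviation of its own group instead of a learned value network. The exact-match reward, the function names, the sample data, and the use of the population standard deviation are illustrative assumptions, not DeepSeek's actual training code.

```python
from statistics import mean, pstdev


def verifiable_reward(answer: str, reference: str) -> float:
    """Hypothetical rule-based reward: 1.0 for an exact match, else 0.0."""
    return 1.0 if answer.strip() == reference.strip() else 0.0


def group_relative_advantages(rewards: list[float], eps: float = 1e-8) -> list[float]:
    """Normalize each reward against its own sampling group.

    Assumption: population standard deviation plus a small eps so that a
    group with identical rewards yields zero advantages instead of dividing by 0.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]


# Four sampled answers to the same prompt; the reference answer is "42".
samples = ["42", "41", " 42 ", "7"]
rewards = [verifiable_reward(s, "42") for s in samples]
advantages = group_relative_advantages(rewards)
```

Answers that beat the group average receive a positive advantage and the rest a negative one, so the policy update favors the better samples without needing a separate critic model. A full GRPO objective also includes a clipped policy ratio and a KL penalty, which this sketch omits.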
Related Terms
Reasoning Model (AI/ML)
A class of LLMs trained with reinforcement learning to generate step-by-step internal chai…
Mixture of Experts (MoE) (AI/ML)
A neural network architecture that routes each input to a subset of specialized 'expert' s…
Open-Source AI Models (AI/ML)
AI models with publicly released weights that can be downloaded, modified, and self-hosted…