AI/ML

DeepSeek

Also known as: DeepSeek-R1, DeepSeek-V3

📖 What it is

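DeepSeek is a Chinese AI lab. In January 2025 it released DeepSeek-R1, a 671B-parameter open-weight reasoning model whose performance is comparable to OpenAI's o1 at significantly lower cost. DeepSeek-R1 writes out its chain-of-thought reasoning visibly before giving a final answer. It was trained with GRPO (Group Relative Policy Optimization), a reinforcement learning method, and the work demonstrated that pure RL with verifiable rewards can produce emergent reasoning. DeepSeek-V3, the base model for R1, uses a Mixture of Experts (MoE) architecture in which only ~37B of its 671B total parameters are active for each token.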
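To make "RL with verifiable rewards" and the "group relative" part of GRPO more concrete, here is a minimal sketch of the advantage calculation only, not DeepSeek's training code. It assumes an exact-match reward and population standard deviation for normalization; the function names, the `eps` constant, and the sample answers are hypothetical and chosen purely for illustration.

```python
from statistics import mean, pstdev


def verifiable_reward(answer: str, reference: str) -> float:
    """Hypothetical rule-based reward: 1.0 on an exact match, else 0.0."""
    return 1.0 if answer.strip() == reference.strip() else 0.0


def group_relative_advantages(rewards, eps=1e-8):
    """Normalize each reward against the other samples for the same prompt.

    GRPO scores a sampled answer relative to its group (mean and standard
    deviation of the group's rewards) instead of using a learned value model.
    eps is an assumed small constant that avoids division by zero when
    every sample in the group receives the same reward.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]


# Four sampled answers to one prompt whose checkable answer is "42".
samples = ["42", "41", "42 ", "7"]
rewards = [verifiable_reward(s, "42") for s in samples]
advantages = group_relative_advantages(rewards)

for s, r, a in zip(samples, rewards, advantages):
    print(f"{s!r}: reward={r}, advantage={a:+.3f}")
```

Correct answers end up with positive advantages and incorrect ones with negative advantages, so a policy update would push probability toward the reasoning traces that led to verified answers. If every sample in a group gets the same reward, all advantages are zero and that group contributes no learning signal.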

Related Terms

Reasoning Model (AI/ML)

A class of LLMs trained with reinforcement learning to generate step-by-step internal chai…

Mixture of Experts (MoE) (AI/ML)

A neural network architecture that routes each input to a subset of specialized 'expert' s…

Open-Source AI Models (AI/ML)

AI models with publicly released weights that can be downloaded, modified, and self-hosted…