AI/ML

Transformer

📖O que é

The neural network architecture underlying modern LLMs, introduced in 'Attention Is All You Need' (2017). Transformers use self-attention mechanisms to process input sequences in parallel (unlike recurrent networks). Key components: multi-head attention, positional encoding, feedforward layers, and layer normalization. Variants include encoder-only (BERT), decoder-only (GPT), and encoder-decoder (T5).

Sua exploração

0 termos visitados no total

Termos relacionados explorados0/2

Termos Relacionados

LLM (Modelo de Linguagem Grande)AI/ML

A neural network trained on vast text corpora to understand and generate human language. L…

Ver termo →

Attention MechanismAI/ML

A neural network component that allows models to weigh the relevance of different parts of…

Ver termo →

Voltar ao glossário