AI/ML

Inference

📖 What it is

Inference is the process of running a trained model on new inputs to generate predictions or outputs. It is the "using" phase of a model's life cycle, as opposed to training. Inference cost depends on model size, the number of input and output tokens, and the hardware used (GPUs or TPUs). API providers such as Anthropic and OpenAI charge per token for inference. On-device inference (for example, llama.cpp running models stored in the GGUF file format) runs locally without API calls.
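As a rough illustration of per-token billing, the sketch below estimates the cost of a single API request from its input and output token counts. The function name, its parameters, and the prices in the usage example are hypothetical placeholders, not any provider's actual rates; it assumes input and output tokens are priced separately per million tokens.

```python
def estimate_inference_cost(
    input_tokens: int,
    output_tokens: int,
    input_price_per_million: float,
    output_price_per_million: float,
) -> float:
    """Estimate the cost of one request under per-token pricing.

    Prices are expressed per one million tokens. All values passed in
    are assumptions supplied by the caller, not real provider rates.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")

    # Each side of the request is billed at its own per-token rate.
    input_cost = input_tokens / 1_000_000 * input_price_per_million
    output_cost = output_tokens / 1_000_000 * output_price_per_million
    return input_cost + output_cost


if __name__ == "__main__":
    # Hypothetical prices: 3.00 per million input tokens,
    # 15.00 per million output tokens.
    cost = estimate_inference_cost(1_000, 500, 3.0, 15.0)
    print(f"Estimated cost: {cost:.4f}")
```

On-device inference has no per-token charge of this kind; its cost shows up as hardware and energy use instead.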


Related Terms

LLM (Large Language Model) | AI/ML

A neural network trained on vast text corpora to understand and generate human language. L…


Token (AI/NLP) | AI/ML

The basic unit of text processed by language models—typically a word, subword, or characte…
