Glossary
- Activation function
-
A mathematical function applied to a neuron’s input to determine the output, such as ReLU (rectified linear unit), Sigmoid, or Softmax.
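As an illustration of the three functions named in this entry, here is a minimal sketch in plain Python. The function names and the list-based `softmax` signature are assumptions made for this example, not taken from any particular library.

```python
import math


def relu(x: float) -> float:
    """Rectified linear unit: passes positive inputs, zeroes out the rest."""
    return max(0.0, x)


def sigmoid(x: float) -> float:
    """Squashes any real input into the open interval (0, 1)."""
    # Branch on the sign so math.exp never receives a large positive argument.
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def softmax(xs: list[float]) -> list[float]:
    """Turns a list of scores into probabilities that sum to 1."""
    # Subtracting the maximum keeps the exponentials numerically stable.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]
```

Note that ReLU and sigmoid act on a single value, while softmax acts on a whole vector of scores, which is why it usually appears at the output layer of a classifier.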
- Agent
-
An autonomous system that can perform tasks, make decisions, and interact with users or environments.
- Artificial intelligence (AI)
-
The simulation of human intelligence in machines, including reasoning, learning, and problem-solving.
- Attention mechanism
-
A technique in neural networks that lets a model weight the most relevant parts of an input sequence when producing each output; it is the core building block of transformers.
- Backpropagation
-
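The algorithm used to train neural networks: it computes the gradient of the error with respect to each weight by applying the chain rule backward through the layers, so that the weights can be adjusted to reduce the error.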
- Beam search
-
A decoding algorithm for sequence generation models that keeps a fixed number of the most probable partial sequences (the beam) at each step, rather than only the single best one.
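A minimal sketch of beam search in Python follows. The names `beam_search`, `next_probs`, `TOY_MODEL`, and `toy_next_probs` are hypothetical, and the toy probability table stands in for a real language model purely for illustration.

```python
import math
from typing import Callable

# Hypothetical toy "model": next-token probabilities keyed by the last token.
TOY_MODEL = {
    "<s>": {"a": 0.6, "b": 0.4},
    "a": {"a": 0.1, "b": 0.9},
    "b": {"a": 0.5, "b": 0.5},
}


def toy_next_probs(seq: list[str]) -> dict[str, float]:
    """Return the next-token distribution given the sequence so far."""
    return TOY_MODEL[seq[-1]]


def beam_search(
    next_probs: Callable[[list[str]], dict[str, float]],
    start: str,
    beam_width: int,
    steps: int,
) -> list[tuple[list[str], float]]:
    """Return up to beam_width (sequence, log-probability) pairs, best first."""
    beams = [([start], 0.0)]
    for _ in range(steps):
        candidates = []
        # Extend every sequence in the beam by every possible next token.
        for seq, score in beams:
            for token, p in next_probs(seq).items():
                if p > 0:
                    candidates.append((seq + [token], score + math.log(p)))
        if not candidates:
            break
        # Keep only the beam_width highest-scoring extensions.
        candidates.sort(key=lambda c: c[1], reverse=True)
        beams = candidates[:beam_width]
    return beams


best_seq, best_logp = beam_search(toy_next_probs, "<s>", beam_width=2, steps=2)[0]
```

Scores are summed log-probabilities rather than multiplied probabilities, which avoids numerical underflow on long sequences. With `beam_width=1` the same function reduces to greedy decoding.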
- Bias
-
Systematic errors in machine learning models that can lead to unfair outcomes.
- Chatbot
-
An AI system that interacts with users via natural language.
- Cold start problem
-
The challenge of making predictions when little or no historical data is available.
- Context window
-
The amount of text (measured in tokens) an LLM can process at once.
- Corpus
-
A collection of text data used to train or fine-tune language models.
- Decoder
-
The component in transformer-based models that generates output sequences from encoded information.
- Dense vector
-
A numerical representation of text as a fixed-length vector in which most values are nonzero, often used for embeddings.
- Domain adaptation
-
The process of fine-tuning a model to perform better on a specific domain.
- Dropout ...