An autonomous AI system that can perceive its environment, make decisions, and take actions to achieve specific goals. AI agents combine language models with tool use, planning, and memory to perform...
Artificial Intelligence
Terms related to artificial intelligence, machine learning, and deep learning.
The challenge of ensuring that AI systems pursue goals and behaviors that are consistent with human intentions and values. Alignment research seeks to solve problems like reward hacking, goal misspeci...
The study and implementation of moral principles guiding the development and deployment of AI systems. AI ethics addresses issues such as fairness, accountability, transparency, privacy, bias, and the...
A research field focused on ensuring AI systems operate safely, reliably, and in alignment with human values. AI safety encompasses alignment research, robustness testing, adversarial defense, interpr...
A mathematical function applied to a neuron's output in a neural network that introduces non-linearity, enabling the network to learn complex patterns. Common activation functions include ReLU, sigmoi...
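As a rough illustration of the activation function entry above, the sketch below applies ReLU and sigmoid to the weighted sum of a single neuron. The `neuron` helper and its parameter names are hypothetical, chosen only for this example.

```python
import math


def relu(x: float) -> float:
    """ReLU: pass positive values through, clamp negatives to zero."""
    return max(0.0, x)


def sigmoid(x: float) -> float:
    """Sigmoid: squash any real number into the interval (0, 1)."""
    # Branch on the sign so math.exp never receives a large positive argument.
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def neuron(inputs, weights, bias, activation):
    """Hypothetical single neuron: weighted sum plus bias, then activation."""
    z = sum(w * x for w, x in zip(weights, inputs)) + bias
    return activation(z)


if __name__ == "__main__":
    print(neuron([1.0, 2.0], [0.5, -1.0], 0.0, relu))
    print(neuron([1.0, 2.0], [0.5, -1.0], 0.0, sigmoid))
```

Without the activation step the neuron would compute only a linear function of its inputs, which is why the non-linearity matters.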
An AI design pattern where a language model autonomously plans, executes, and iterates on multi-step tasks using tools, memory, and reasoning. Agentic workflows combine planning, action execution, obs...
An AI technique that identifies patterns in data that deviate significantly from expected behavior. Anomaly detection is used in fraud prevention, cybersecurity, predictive maintenance, and quality co...
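One simple way to make "deviates significantly from expected behavior" concrete is a z-score threshold. The sketch below is only one of many anomaly detection approaches; the function name and the threshold values are assumptions made for this example.

```python
import statistics


def zscore_anomalies(values, threshold=3.0):
    """Return indices of values whose z-score magnitude exceeds `threshold`.

    Assumption: the data is roughly unimodal, so distance from the mean
    in units of standard deviation is a reasonable notion of "unusual".
    """
    mean = statistics.fmean(values)
    stdev = statistics.pstdev(values)  # population standard deviation
    if stdev == 0:
        return []  # all values identical, nothing stands out
    return [i for i, v in enumerate(values) if abs(v - mean) / stdev > threshold]


if __name__ == "__main__":
    readings = [10, 11, 9, 10, 12, 10, 11, 9, 10, 100]
    print(zscore_anomalies(readings, threshold=2.0))
```

On very small samples a lower threshold such as 2.0 is often needed, because a single outlier also inflates the standard deviation it is measured against.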
The simulation of human intelligence processes by computer systems, including learning, reasoning, and self-correction. AI encompasses a broad range of techniques from rule-based systems to deep neura...
A technique in neural networks that allows the model to focus on relevant parts of the input when producing output. Self-attention enables each element in a sequence to attend to all other elements, c...
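The self-attention idea above can be sketched as scaled dot-product attention. This minimal version assumes queries, keys, and values are all the raw input vectors; real models apply learned projection matrices first, which are omitted here.

```python
import math


def softmax(scores):
    """Turn raw scores into weights that are positive and sum to 1."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(x):
    """Scaled dot-product self-attention with Q = K = V = x (an assumption).

    `x` is a list of equal-length vectors, one per sequence element.
    """
    d = len(x[0])
    output = []
    for query in x:
        # Similarity of this element to every element, scaled by sqrt(d).
        scores = [
            sum(q * k for q, k in zip(query, key)) / math.sqrt(d) for key in x
        ]
        weights = softmax(scores)
        # Each output is a weighted average of all input vectors.
        output.append(
            [sum(w * value[j] for w, value in zip(weights, x)) for j in range(d)]
        )
    return output


if __name__ == "__main__":
    for row in self_attention([[1.0, 0.0], [0.0, 1.0]]):
        print(row)
```

Every output row mixes information from the whole sequence, with the largest weight going to the elements most similar to the query.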
A type of neural network that learns to compress input data into a compact latent representation and then reconstruct it. Autoencoders are used for dimensionality reduction, denoising, anomaly detecti...
A vehicle capable of sensing its environment and navigating without human input, using a combination of AI technologies including computer vision, sensor fusion, path planning, and deep learning for r...
A fundamental algorithm in neural network training that calculates the gradient of the loss function with respect to each weight by propagating the error backward through the network layers, enabling...
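To illustrate how the error is propagated backward, the sketch below works through the chain rule by hand for a tiny two-weight network. The network shape (one tanh hidden unit, one linear output, squared-error loss) is an assumption chosen to keep the example small.

```python
import math


def forward(w1, w2, x):
    """Forward pass: hidden activation h = tanh(w1 * x), output y = w2 * h."""
    h = math.tanh(w1 * x)
    y = w2 * h
    return h, y


def loss(w1, w2, x, target):
    """Squared-error loss, halved so the derivative is simply (y - target)."""
    _, y = forward(w1, w2, x)
    return 0.5 * (y - target) ** 2


def backward(w1, w2, x, target):
    """Backward pass: apply the chain rule from the loss back to each weight."""
    h, y = forward(w1, w2, x)
    dL_dy = y - target                # derivative of the loss w.r.t. the output
    dL_dw2 = dL_dy * h                # output layer weight
    dL_dh = dL_dy * w2                # error passed back to the hidden unit
    dL_dz = dL_dh * (1.0 - h * h)     # through tanh, whose derivative is 1 - tanh^2
    dL_dw1 = dL_dz * x                # hidden layer weight
    return dL_dw1, dL_dw2


if __name__ == "__main__":
    print(backward(0.5, -1.5, 2.0, 1.0))
```

A training loop would then nudge each weight in the direction opposite its gradient, for example `w1 -= learning_rate * dL_dw1`, where `learning_rate` is a hypothetical step size.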
A technique that normalizes the inputs of each layer in a neural network by adjusting and scaling the activations. Batch normalization stabilizes training, allows higher learning rates, and reduces th...
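The normalization step can be sketched as follows: each feature is shifted to zero mean and scaled to unit variance across the batch, then rescaled by `gamma` and shifted by `beta`. This covers training-time behavior only; the running statistics used at inference are left out, and scalar `gamma` and `beta` are a simplifying assumption (in practice they are learned per feature).

```python
import math


def batch_norm(batch, gamma=1.0, beta=0.0, eps=1e-5):
    """Normalize each feature column of `batch` (a list of equal-length rows)."""
    n = len(batch)
    d = len(batch[0])
    out = [[0.0] * d for _ in range(n)]
    for j in range(d):
        column = [row[j] for row in batch]
        mean = sum(column) / n
        var = sum((v - mean) ** 2 for v in column) / n  # biased batch variance
        for i in range(n):
            # eps guards against division by zero when a feature is constant.
            out[i][j] = gamma * (column[i] - mean) / math.sqrt(var + eps) + beta
    return out


if __name__ == "__main__":
    for row in batch_norm([[1.0, 10.0], [2.0, 20.0], [3.0, 30.0]]):
        print(row)
```

Note that the two input features differ in scale by a factor of ten, yet end up with the same normalized values.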
A standardized test or dataset used to evaluate and compare the performance of AI models on specific tasks. Common AI benchmarks include MMLU for knowledge, HumanEval for code generation, and ImageNet...
Systematic errors in AI systems that produce unfair or discriminatory outcomes, often stemming from biased training data, flawed model design, or societal prejudices encoded in the data. Addressing AI...
A prompting technique that encourages language models to break down complex reasoning into intermediate steps before arriving at a final answer. Chain-of-thought prompting significantly improves perfo...
An AI-powered software application designed to simulate conversation with human users through text or voice interfaces. Modern chatbots leverage large language models to understand context, maintain c...
An unsupervised learning technique that groups similar data points together based on their features without predefined labels. Common algorithms include K-Means, DBSCAN, and hierarchical clustering, u...
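Of the algorithms named above, K-Means is the easiest to sketch. The version below alternates between assigning points to their nearest centroid and moving each centroid to the mean of its points. Passing the initial centroids explicitly is an assumption made to keep the example deterministic; library implementations usually pick them with a randomized scheme.

```python
def squared_distance(a, b):
    """Squared Euclidean distance between two points of equal dimension."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points, initial_centroids, iterations=10):
    """Minimal K-Means sketch: returns (labels, centroids)."""
    centroids = [tuple(c) for c in initial_centroids]
    labels = []
    for _ in range(iterations):
        # Assignment step: each point joins its nearest centroid.
        labels = [
            min(range(len(centroids)), key=lambda k: squared_distance(p, centroids[k]))
            for p in points
        ]
        # Update step: each centroid moves to the mean of its members.
        updated = []
        for k, old in enumerate(centroids):
            members = [p for p, label in zip(points, labels) if label == k]
            if members:
                updated.append(tuple(sum(dim) / len(members) for dim in zip(*members)))
            else:
                updated.append(old)  # keep an empty cluster's centroid in place
        centroids = updated
    return labels, centroids


if __name__ == "__main__":
    data = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
    print(kmeans(data, [(0, 0), (10, 10)]))
```

No labels are supplied anywhere: the grouping emerges purely from distances between the points.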
A field of artificial intelligence that enables computers to interpret and understand visual information from the world, such as images and videos, by extracting meaningful data using deep learning al...
An AI training approach developed by Anthropic where the model is guided by a set of principles (a "constitution") to self-critique and revise its outputs. Constitutional AI reduces the need for human...