A

An autonomous AI system that can perceive its environment, make decisions, and take actions to achieve specific goals. AI agents combine language models with tool use, planning, and memory to perform...

Definition
A

The challenge of ensuring that AI systems pursue goals and behaviors that are consistent with human intentions and values. Alignment research seeks to solve problems like reward hacking, goal misspeci...

Definition
A

The study and implementation of moral principles guiding the development and deployment of AI systems. AI ethics addresses issues such as fairness, accountability, transparency, privacy, bias, and the...

Definition
A

A research field focused on ensuring AI systems operate safely, reliably, and in alignment with human values. AI safety encompasses alignment research, robustness testing, adversarial defense, interpr...

Definition
A

A type of neural network that learns to compress input data into a compact latent representation and then reconstruct it. Autoencoders are used for dimensionality reduction, denoising, anomaly detecti...

Definition
B

A standardized test or dataset used to evaluate and compare the performance of AI models on specific tasks. Common AI benchmarks include MMLU for knowledge, HumanEval for code generation, and ImageNet...

Definition
B

Systematic errors in AI systems that produce unfair or discriminatory outcomes, often stemming from biased training data, flawed model design, or societal prejudices encoded in the data. Addressing AI...

Definition
C

An AI-powered software application designed to simulate conversation with human users through text or voice interfaces. Modern chatbots leverage large language models to understand context, maintain c...

Definition
C

An unsupervised learning technique that groups similar data points together based on their features without predefined labels. Common algorithms include K-Means, DBSCAN, and hierarchical clustering, u...

Definition
C

A field of artificial intelligence that enables computers to interpret and understand visual information from the world, such as images and videos, by extracting meaningful data using deep learning al...

Definition