Definition

The process of assigning meaningful tags or annotations to raw data such as text, images, or audio to create training datasets for supervised learning. Data labeling quality directly impacts model accuracy and is often the most time-consuming part of ML projects.

Defined Term