Definition

AI systems capable of processing and generating multiple types of data, such as text, images, audio, and video, within a single model. Multimodal models can understand relationships across different modalities, enabling tasks like image captioning and visual question answering.

Defined Term