AI Glossary

Multimodal AI

Multimodal AI refers to AI systems that can process and understand multiple forms of information such as text, images, audio, and video.

Overview

Unlike traditional AI systems that work with one type of data, multimodal AI combines several forms of information simultaneously.

Humans naturally process information from multiple senses. Multimodal AI moves closer to this capability.

An AI assistant analyzes an uploaded image and answers questions about what it contains.