Advertisement

Multimodal

A model that handles more than one data type, such as text plus images.

Multimodality expands use cases like document understanding and captioning.

Advertisement

Related terms

Back to Fundamentals of Generative AI

Advertisement