Multimodal Model
Architecture / Model
Definition
An AI model capable of processing and generating multiple data types simultaneously: text, image, audio, video. Multimodal models like GPT-4V, Gemini, and Claude 3 represent the convergence of AI capabilities.
In French
Modèle multimodal — Modèle IA capable de traiter et générer plusieurs types de données simultanément : texte, image, audio, vidéo. Les modèles multimodaux comme GPT-4V, Gemini et Claude 3 représentent la convergence des capacités IA.
Related terms
Autoencoder
Diffusion Model
Foundation Model
GAN (Generative Adversarial Network)
Large Language Model (LLM)
Neural Network
Explore the full glossary
Discover all artificial intelligence terms in our glossary.