Examples of Multimodal

Microsoft introduces open-source multimodal Phi-4 reasoning model

Microsoft has released a new multimodal reasoning model: Phi-4-reasoning-vision-15B. The model combines two existing algorithms using a mid-fusion approach and can analyze images, scientific graphs, ...

Microsoft open-sources multimodal reasoning model with 15B parameters

The company mainly trained Phi-4-reasoning-vision-15B on open-source data. The data included images and text-based descriptions of the objects depicted in those images. Before it started training the ...

DeepSeek V4 Adds Blackwell SM100 and FP4 Support for Lower-Cost Scaling

DeepSeek V4 ships native multimodal input with lower latency, plus support for Blackwell SM100 and FP4 compute scaling.

Searchenginejournal.com

Google Introduces Gemini And Updates Bard With Gemini Pro

Google introduces Gemini, their largest and most capable AI model, marking a significant advance in AI technology. Gemini offers unprecedented multimodal capabilities, excelling in understanding and ...

VentureBeat

Meta introduces Chameleon, a state-of-the-art multimodal model

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now As competition in the generative AI field ...

Time

Multimodal AI

This article is published by AllBusiness.com, a partner of TIME. What is “Multimodal AI”? MultiModal AI is a type of artificial intelligence that can integrate and process information from multiple ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results