Multimodal Video Examples

Microsoft’s Phi-4-multimodal AI model handles speech, text, and video

Microsoft has introduced a new AI model that, it says, can process speech, vision, and text locally on-device using less compute capacity than previous models. Innovation in generative artificial ...

Forbes

The New Foundation Model Ecosystem And The Video Data Gold Rush

Forbes contributors publish independent expert analyses and insights. Anjana Susarla is a professor of Responsible AI at the Eli Broad College of Business at Michigan State University. This voice ...

Virtualization Review

AI in 2025: Multimodal, Small and Agentic

If your organization hasn't started an AI adoption journey, it might already be falling behind. 2024 may have been a banner year for AI in the enterprise, but 2025 is promising even more improvements ...
