Apple researchers have developed an adapted version of the SlowFast-LLaVA model that beats larger models at long-form video analysis and understanding. Here’s what that means. Very basically, when an ...
The next step in the evolution of generative AI technology will rely on ‘world models’ to improve physical outcomes in the real world. Tesla’s viral videos show its Optimus humanoid robot serving ...
In a major step toward more adaptable and intuitive machines, Kempner Institute Investigator Yilun Du and his collaborators ...
Over the last few months, many AI boosters have been increasingly interested in generative video models and their seeming ability to show at least limited emergent knowledge of the physical properties ...
Meta AI describes a system that predicts fMRI-measured brain responses during naturalistic film viewing by jointly modeling ...
Last week, Google introduced Veo 3, its newest video generation model that can create 8-second clips with synchronized sound effects and audio dialog—a first for the company’s AI tools. The model, ...
Artificial intelligence startup Luma AI Inc. today announced the launch of Ray3, a powerful text-to-video AI model with built-in reasoning, designed for high-quality cinematic visual production for ...
Alibaba Group has released the new generation of its large language model that can understand text, audio, images and video. But this time, the Chinese tech giant is releasing the model, Qwen3.5-Omni, ...
GLM-5V-Turbo is Z.ai's first native multimodal agent foundation model, built for vision-based coding and agentic task ...
A new AI video model from China has flooded the internet with copyrighted content — causing so much backlash that its owner, ByteDance, has promised to “strengthen current safeguards.” Over the past ...