Visual Reasoning - Search News

Causal reasoning meets visual representation learning: A prospective study

With the emergence of huge amounts of heterogeneous multi-modal data, including images, videos, text, audio, and multi-sensor data, deep learning-based methods have shown promising ...

How DeepSeek AI Uses 90% Fewer Tokens to Match Billion-Dollar Models

Explore how DeepSeek AI's new visual pointing method reduces computational costs by 90 percent while matching the performance ...

TechCrunch

‘Visual’ AI models might not see anything at all

The latest round of language models, like GPT-4o and Gemini 1.5 Pro, are touted as “multimodal,” able to understand images and audio as well as text. But a new study makes clear that they don’t really ...

The Droid Guy

Grok 4 Shows Early Strengths in Coding, Reasoning, and Visual Tasks While Struggling With Images and Memory

Grok 4 and its reasoning-focused counterpart, Grok 4 Heavy, arrived with an immediate sense of ambition, offering multimodal AI designed to handle coding, logic, and perception tasks. In the initial ...

Forbes

The Dawn Of Superhuman AI Reasoning

In the ever-evolving saga of AI, 2024 will mark another watershed moment akin to the debut of ChatGPT. Yet, this new chapter isn’t penned in words; it’s envisioned through the lens of visual reasoning ...

Neowin

Alibaba releases new visual reasoning model that can see, understand, and think

Alibaba has released QVQ-Max, a new visual reasoning model that it says can see, understand, and think about the world. Alibaba, the Chinese tech giant, has announced a new Qwen AI bot called QVQ-Max, ...

VentureBeat

LMSYS launches ‘Multimodal Arena’: GPT-4 tops leaderboard, but AI still can’t out-see humans

LMSYS organization launched its “Multimodal Arena” today, a new ...

NextBigFuture

Google Nano Banana Pro Visual Reasoning Model

Nano Banana Pro can use Google Search to research topics based on your query, and reason on how to present factual and grounded information. Nano Banana Pro excels in visual design, world knowledge, ...

Bloomberg L.P.

OpenAI Releases New Reasoning Models for Coding and Visual Tasks

OpenAI is rolling out a pair of new artificial intelligence models that mimic the process of human reasoning to field more complicated coding questions and visual tasks, the latest in a flurry of ...
