Test 2 Models - Search News

ChatGPT Images 2.0 vs Nano Banana 2: Which AI Image Tool Wins for Real-World Tasks?

ChatGPT Images 2.0 and Google Nano Banana 2 go head-to-head in six AI image tests, ranging from product photos to headshots ...

Ars Technica

Has Gemini surpassed ChatGPT? We put the AI models to the test.

The last time we did comparative tests of AI models from OpenAI and Google at Ars was in late 2023, when Google’s offering was still called Bard. In the roughly two years since, a lot has happened in ...

Forbes

Gemini 3 Just Scored 100% On A Critical Test All Other AI Models Fail

Google’s new Gemini 3 has become the first major AI model to get a perfect score on a new self-harm safety benchmark, the CARE test. That milestone comes as hundreds of millions of people have come to ...

ZDNet

I put GPT-5.2 through a 14-round test, and the AI model raised some serious questions

I wore the world's first HDR10 smart glasses TCL's new E Ink tablet beats the Remarkable and Kindle Anker's new charger is one of the most unique I've ever seen Best laptop cooling pads Best flip ...

Time

AI Models Are Getting Smarter. New Tests Are Racing to Catch Up

Pillay is an editorial fellow at TIME. Pillay is an editorial fellow at TIME. Despite their expertise, AI developers don't always know what their most advanced systems are capable of—at least, not at ...

TechCrunch

OpenAI launches two ‘open’ AI reasoning models

OpenAI announced Tuesday the launch of two open-weight AI reasoning models with similar capabilities to its o-series. Both are freely available to download from the online developer platform Hugging ...

9to5google

Gemini app adding 2.0 Pro and 2.0 Flash Thinking Experimental

Google is following the consumer launch of 2.0 Flash with new preview models that will be available to test in the Gemini app: 2.0 Pro Experimental and 2.0 Flash Thinking Experimental. In December, ...

VentureBeat

Google launches Gemini 2.0 Pro, Flash-Lite and connects reasoning model Flash Thinking to YouTube, Maps and Search

Google's Gemini series of AI large language models (LLMs) started off rough nearly a year ago with some embarrassing incidents of image generation gone awry, but it has steadily improved since then, ...

TechCrunch

Google unveils a next-gen family of AI reasoning models

On Tuesday, Google unveiled Gemini 2.5, a new family of AI reasoning models that pauses to “think” before answering a question. To kick off the new family of models, Google is launching Gemini 2.5 Pro ...

ZDNet

AI models know when they're being tested - and change their behavior, research shows

Several frontier AI models show signs of scheming. Anti-scheming training reduced misbehavior in some models. Models know they're being tested, which complicates results. New joint safety testing from ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results