Performance Coding - Search News

DeepCoder delivers top coding performance in efficient 14B open model

Researchers at Together AI and Agentica have released DeepCoder-14B, a new coding model that delivers impressive performance comparable to leading proprietary models like OpenAI's o3-mini. Built on ...

15d

Minset Announces Its Autonomous Medical Coding AI, mCoder, Achieves State-of-the-Art Performance on MDACE, a Leading Public Benchmark for Automated Medical Coding

Minset today announced that mCoder, its autonomous medical coding system, has achieved state-of-the-art performance on MDACE, a leading public benchmark for explainable and verifiable automated ...

Hosted on MSN

Claude Opus 4 achieves record performance in AI coding capabilities

Anthropic’s latest AI model, Claude Opus 4, has surpassed OpenAI’s GPT-4.1 in coding abilities, marking a significant shift in how AI systems can assist with software development tasks. The new model ...

Geeky Gadgets

Llama 3 reasoning and coding performance tested

Following on from the launch of the new Llama 3 large language model by Meta and Mark Zuckerberg. WorldofAI has been testing out the performance and capabilities of Llama 3 when reasoning and coding.

NextBigFuture

OpenAI Releases O3 Model With High Performance and High Cost

OpenaI o3 sets new records in several key areas, particularly in reasoning, coding and mathematical problem-solving. It scores 75.7% on the semi-private eval in low-compute mode (for $20 per task in ...

Geeky Gadgets

ChatGPT-o1 vs Claude 3.5 coding performance compared

If you would like to learn more about how the latest AI models from OpenAI perform when compared with Claude 3.5 when used with the Cursor AI platform. You will be pleased to know that All About AI ...

InfoWorld

OpenAI GPT-4.1 models promise improved coding and instruction following

The GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano models, available only via the API, will provide better performance than GPT-4o and GPT-4o mini at a lower price, OpenAI said. OpenAI has announced a new ...

Hosted on MSN

Anthropic launches Claude Opus 4.6 with performance upgrades and coding accuracy

AI startup Anthropic continues to make headlines across the tech industry with a series of launches, from Cowork and AI agents to its latest model, Claude Opus 4.6. Just two months after releasing ...

VentureBeat

Is Anthropic 'nerfing' Claude? Users increasingly report performance degradation as leaders push back

Credit: VentureBeat made with GPT-Image-1.5 and Google Gemini 3.1 Pro Image A growing number of developers and AI power users are taking to social media to accuse Anthropic of degrading the ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results