Top artificial intelligence systems now ace many textbook-style math questions, yet they still fall apart on genuinely new problems. The gap between polished performance on familiar benchmarks and ...
Google DeepMind’s AlphaProof system scored at a silver-medal level when tested against the 2024 International Mathematical Olympiad, solving problems that have historically separated elite human ...