The second batch of “First Proof” problems is meant to evaluate AI’s usefulness for research-level math. The best model got ...
VUB's Data Analytics Lab has published new results showing that it is possible to develop original mathematical proofs using commercial language models. In a paper posted to the arXiv preprint server, ...
Computers are extremely good with numbers, but they haven’t gotten many human mathematicians fired. Until recently, they could barely hold their own in high school-level math competitions. But now ...
Marijn Heule turns mathematical statements into something like Sudoku puzzles, then has computers go to work on them. His proofs have been called “disgusting,” but they go beyond what any human can do ...
At a secret meeting in 2025, some of the world's leading mathematicians gathered to test OpenAI's newest large language model, o4-mini. Experts at the meeting were amazed by how much the model's ...
Forbes contributors publish independent expert analyses and insights. Dr. Lance B. Eliot is a world-renowned AI scientist and consultant. In today’s column, I examine an insightful AI research study ...
AI math proof verification reached a new frontier as DeepMind’s AlphaProof Nexus solved nine open Erdős research problems with Lean-verified proofs, some unsolved for 56 years. The May 2026 Science Ne ...
Artificial intelligence has formally verified the prizewinning proof that solved the sphere packing problem in eight dimensions, a result closely tied to Maryna Viazovska’s Fields Medal. The ...
A mathematician will turn a groundbreaking 100-page proof into computer code. The proof tool, Lean, lets users turn proofs written in prose into rules and logic for testing. Kevin Buzzard already uses ...
Kendra Pierre-Louis: For Scientific American’s Science Quickly, I’m Kendra Pierre-Louis, in for Rachel Feltman. In 1997, Deep Blue, a supercomputer built by IBM, did the unexpected: it defeated chess ...
A series of recent research papers have shown that ChatGPT and related large language models can produce original, verifiable mathematical proofs, including solutions to problems that had not been ...