ChatGPTâs 17-minute proof turns math into a verification problem
ChatGPT 5.5 Pro solved an open math problem in 17 minutesđˇ Scraped: May 9, 2026
- â Proof in 17 minutes
- â Open number-theory problem
- â New bar for human work
When Fields Medalist Timothy Gowers handed ChatGPT 5.5 Pro a set of open problems in number theory, he wasnât testing whether the model could do mathâhe was testing whether it could invent it. The results, published in a preprint generated entirely by the AI, suggest it can. In under two hours, the model produced complete research papers, including a breakthrough that improved an exponential bound to a polynomial one in just 31 minutes and 40 seconds.
For context, thatâs faster than most human mathematicians can even read a proof, let alone generate one.
The experiment, detailed in The Decoderâs report, wasnât a cherry-picked demo. Gowers, who won his Fields Medal for work in functional analysis and combinatorics, gave the AI no mathematical guidanceâjust raw problems from a paper by number theorist Mel Nathanson.
The modelâs output, evaluated by Gowers and MIT student Isaac Rajagopal, included a key idea Rajagopal described as 'completely original.' Thatâs not just a compliment; itâs a warning shot for human researchers. If an LLM can conjure original mathematical insights in minutes, whatâs left for the rest of us to prove?
The model produced a proof fast enough to reset the benchmark for human contribution
Article imageđˇ Scraped: May 9, 2026
The source material also shows that the implications stretch beyond academia. Number theory isnât just an abstract playgroundâit underpins cryptography, algorithms, and even blockchain security. If ChatGPT 5.5 Pro can autonomously improve bounds in these areas, itâs not hard to imagine AI-generated proofs being deployed in real-world systems, for better or worse. The MIT researcherâs praise for the modelâs 'originality' is particularly striking.
Originality, after all, has been the last bastion of human intellectual superiority in math. If thatâs now in play, the competitive landscape shifts dramatically.
Gowersâ takeaway is blunt: the new bar for mathematical contributions is 'proving something LLMs canât.' Thatâs a high barâand one that may only get higher as models like ChatGPT 5.5 Pro continue to evolve. The experiment also raises uncomfortable questions about verification. How do we trust a proof we didnât write, especially when the AIâs reasoning isnât fully transparent? The answer, for now, seems to be: we donât. At least, not without a lot more scrutiny.
For the tech industry, this is a marketing milestone. For mathematicians, itâs an existential one. And for anyone who assumed AI would remain a tool rather than a colleague, itâs a wake-up call. The era of AI as a research partner isnât comingâitâs already here.

