Measuring AI's ability to autonomously generate mathematical proofs
Mission
AI has the potential to accelerate scientific discovery, but we lack reliable ways to measure research-level reasoning because it is difficult to separate true reasoning from memorization. Mathematics is the ideal testbed: correctness is unambiguous, verification standards are exceptionally high, and there is no reproducibility crisis. The First Proof Project will provide independent assessments of the reasoning abilities of AI systems in the context of research mathematics. Progress in rigorous evaluation here can inform how AI should be developed and deployed across science.
Team and Editors
Editorial Board
Board of Directors
Executive Director
Get Involved
If you are a mathematician interested in contributing to First Proof, please reach out to us at contact@1stproof.org.