


This work represents a breakthrough in AI’s mathematical reasoning capabilities and is an important milestone in the development of general AI systems.
This time, the artificial intelligence algorithm achieved a major breakthrough in the Mathematics Olympiad (IMO).
In the latest issue of the internationally authoritative journal "Nature", a paper was published introducing an artificial intelligence system called AlphaGeometry. The system is able to solve Olympic geometry problems without human demonstration. Experts believe that this is an important milestone in artificial intelligence's progress towards possessing human reasoning capabilities. The publication of this research result is of great significance for promoting the further development of artificial intelligence.

Paper link: https://www.nature.com/articles/s41586-023-06747-5
DeepMind was also published in the paper The code and model were open sourced for a while, GitHub: https://github.com/google-deepmind/alphageometry
This is an artificial intelligence system, from the hands of Google DeepMind researchers , which can solve complex geometric problems at a level close to that of a human Olympiad gold medalist.
In a benchmark test of 30 Mathematical Olympiad geometry questions, AlphaGeometry solved 25 of the geometry questions within the standard Mathematical Olympiad time limit, while the previous state-of-the-art system only solved 10 of the geometry problems. By comparison, human gold medalists solved an average of 25.9 problems.

# Theorem proving is a challenging task for learning-based AI models. The main reason is that human proofs in most mathematical fields are difficult to translate into machine-verifiable language, thus limiting the amount of data used to train AI models. To overcome this problem, DeepMind proposes an alternative method of using synthetic data for theorem proving. They developed a general guidance framework called AlphaGeometry that has applicability in many fields. By leveraging synthetic data, AlphaGeometry is able to train AI models for theorem proving and produce high-quality results. This method provides an effective solution to the difficulty of theorem proving.
Research Introduction
AlphaGeometry combines language models with "symbolic engines" to perform mathematical inferences with the help of symbols and logical rules. Among them, language models are good at identifying and predicting the subsequent steps of the process, but lack the rigor required for mathematical reasoning; on the other hand, symbolic engines are purely based on formal logic and strict rules, which This enables it to guide the language model toward rational decision-making.
In the research of AlphaGeometry, DeepMind conducted tests from the benchmark test set of 30 Olympic geometry problems (IMO-AG-30) spanning 2000 to 2022. The results showed that AlphaGeometry performed well in competition time Able to solve 25 problems within the limit. The previous state-of-the-art method (Wu’s method) could only solve 10.
It is well known that AI systems often struggle to solve complex problems in geometry and mathematics due to a lack of reasoning skills and training data. The AlphaGeometry system combines the predictive power of neural language models with a rule-constrained inference engine, which work together to find new solutions.
In addition, to solve the data challenge, the study generated a large amount of synthetic training data, namely 100 million examples, in which many theorems have more than 200 steps to prove, which is 4 longer than the average proof length of Mathematical Olympiad theorems. times.
AlphaGeometry demonstrates AI’s growing ability to reason logically and discover and verify new knowledge. Solving Olympic-level geometry problems is an important milestone for AI on the road to more advanced and general artificial intelligence systems.
Ngô Bảo Châu, Fields Medal winner and IMO gold medalist, said: "Now I completely understand why AI researchers first try to solve the geometry problems of the International Mathematical Olympiad (IMO) , because finding their solutions is a bit like playing chess, where we have relatively few reasonable moves at each move. But I'm still shocked that they were able to achieve this. It's an impressive achievement."
Wu Baozhu, winner of the 2010 Fields Medal, is currently a professor at the University of Chicago.
AlphaGeometry is a neuro-symbolic system consisting of a neural language model and a symbolic deduction engine that work together to find proofs of complex geometric theorems. One system provides quick, intuitive ideas, while the other provides more thoughtful, rational decisions.
Because language models are good at identifying general patterns and relationships in data, they can quickly predict potentially useful structures, but often lack rigorous reasoning or interpretation. The symbolic deduction engine, on the other hand, is based on formal logic and uses explicit rules to reach conclusions, which together make up AlphaGeometry.
AlphaGeometry's language model guides its symbolic deduction engine to find possible solutions to geometric problems. General Olympiad geometry problems are based on diagrams and require the addition of new geometric structures to solve, such as points, lines or circles. AlphaGeometry's language model can predict which new structures would be most useful to add out of countless possibilities. These clues help fill in the gaps and allow the symbolic engine to make further inferences about the diagram and get closer to a solution.
For example, the picture below (top) shows the process of AlphaGeometry solving a simple question. The question is "Let ABC be any triangle with AB = AC. Prove that ∠ABC = ∠BCA."
The AlphaGeometry proof process goes like this: AlphaGeometry starts the proof search by running a symbolic deduction engine. This engine starts from the premises of the theorem and exhaustively derives new statements until the theorem is proved or the new statements are exhausted. If the symbolic engine fails to find a proof, the language model constructs an auxiliary point that adds provable conditions before the symbolic engine restarts. This cycle continues until a solution is found. For the simple example, the loop terminates after the first auxiliary structure "add point D to the midpoint of BC".
The picture below (below) shows AlphaGeometry’s solution to the IMO problem. "Prove that the circumcircles (O1) and (O2) of triangles FKM and KQH are tangent to each other..." AlphaGeometry can also prove such a complex problem, and the proof process also provides auxiliary points, etc. The proof has been greatly shortened and edited for illustrative purposes.

Generate 100 million mathematical reasoning training data
Humans can learn geometry by sketching on paper, examine diagrams, and use modern Have the knowledge to discover new, more complex geometric properties and relationships. This study's approach to generating synthetic data simulates this knowledge-building process at scale. The method for generating synthetic data is shown in Figure 3.
Using highly parallel computing, the system first generates random graphs of 500 million geometric objects and exhaustively derives all relationships between points and lines in each graph. AlphaGeometry finds all the proofs contained in each graph and then works backwards to find out what additional structure, if any, is needed to obtain those proofs. This process is "symbol deduction and retrospection."
Visual representation of synthetic data generated by AlphaGeometry
After that, this huge data pool was Filtering to exclude similar examples resulted in a training dataset of 100 million.
Pioneering Artificial Intelligence Reasoning Capabilities
Every solution provided by AlphaGeometry has been computer checked and verified. The researchers also compared their results with previous artificial intelligence methods and human performance in Olympic competitions. Additionally, math coach and former Olympiad gold medalist Evan Chen evaluates a range of AlphaGeometry solutions for us.
Chen Yiting, a PhD candidate in mathematics at MIT, won the IMO 2014 gold medal.
Evan Chen said: "AlphaGeometry's output is impressive because it is both verifiable and clean. Past artificial intelligence solutions to proof-based competitive problems have sometimes been hit-and-miss ( The output is sometimes correct and requires human checking), while AlphaGeometry does not have this weakness: its solutions have a machine-verifiable structure. On the other hand, its output is still human-readable. One can imagine a coordinate system through brute force A computer program that solves geometry problems: think of pages and pages of tedious algebraic calculations, AlphaGeometry doesn’t do that, it uses the classical rules of geometry with angles and similar triangles like a human student would.”
Recently, financial technology company XTX Markets established the Artificial Intelligence Olympiad Mathematical Prize (AI-MO Prize) to encourage the development of artificial intelligence models that can perform mathematical reasoning. Since each Olympiad has six problems, only two of which typically focus on geometry, AlphaGeometry can only be applied to one-third of the problems in a given Olympiad.
Despite this, AlphaGeometry relied solely on its geometric problem-solving abilities to become the first artificial intelligence model in the world to pass the IMO bronze medal threshold in 2000 and 2015.
DeepMind is already working to advance inference for the next generation of artificial intelligence systems. The researchers believe that given the broad potential of using large-scale synthetic data to train AI systems from scratch, this approach could influence the direction in which future AI systems discover new knowledge in mathematics and other fields.
AlphaGeometry pioneers mathematical reasoning in artificial intelligence—from exploring the beauty of pure mathematics to using language models to solve mathematical and scientific problems. It is hoped that this technology will continue to improve to solve more advanced, abstract mathematical problems.
In addition to mathematics, the impact of AlphaGeometry may also cover more fields including geometric problems, such as computer vision, architecture, and even theoretical physics.
Reference content:
https://deepmind.google/discover/blog/alphageometry-an-olympiad-level-ai-system -for-geometry/
The above is the detailed content of Deep learning achieves genius-level performance in geometric reasoning. Nature publishes DeepMind's model and receives praise in the Fields Medal winner news. For more information, please follow other related articles on the PHP Chinese website!

Running large language models at home with ease: LM Studio User Guide In recent years, advances in software and hardware have made it possible to run large language models (LLMs) on personal computers. LM Studio is an excellent tool to make this process easy and convenient. This article will dive into how to run LLM locally using LM Studio, covering key steps, potential challenges, and the benefits of having LLM locally. Whether you are a tech enthusiast or are curious about the latest AI technologies, this guide will provide valuable insights and practical tips. Let's get started! Overview Understand the basic requirements for running LLM locally. Set up LM Studi on your computer

Guy Peri is McCormick’s Chief Information and Digital Officer. Though only seven months into his role, Peri is rapidly advancing a comprehensive transformation of the company’s digital capabilities. His career-long focus on data and analytics informs

Introduction Artificial intelligence (AI) is evolving to understand not just words, but also emotions, responding with a human touch. This sophisticated interaction is crucial in the rapidly advancing field of AI and natural language processing. Th

Introduction In today's data-centric world, leveraging advanced AI technologies is crucial for businesses seeking a competitive edge and enhanced efficiency. A range of powerful tools empowers data scientists, analysts, and developers to build, depl

This week's AI landscape exploded with groundbreaking releases from industry giants like OpenAI, Mistral AI, NVIDIA, DeepSeek, and Hugging Face. These new models promise increased power, affordability, and accessibility, fueled by advancements in tr

But the company’s Android app, which offers not only search capabilities but also acts as an AI assistant, is riddled with a host of security issues that could expose its users to data theft, account takeovers and impersonation attacks from malicious

You can look at what’s happening in conferences and at trade shows. You can ask engineers what they’re doing, or consult with a CEO. Everywhere you look, things are changing at breakneck speed. Engineers, and Non-Engineers What’s the difference be

Simulate Rocket Launches with RocketPy: A Comprehensive Guide This article guides you through simulating high-power rocket launches using RocketPy, a powerful Python library. We'll cover everything from defining rocket components to analyzing simula


Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

AI Hentai Generator
Generate AI Hentai for free.

Hot Article

Hot Tools

SecLists
SecLists is the ultimate security tester's companion. It is a collection of various types of lists that are frequently used during security assessments, all in one place. SecLists helps make security testing more efficient and productive by conveniently providing all the lists a security tester might need. List types include usernames, passwords, URLs, fuzzing payloads, sensitive data patterns, web shells, and more. The tester can simply pull this repository onto a new test machine and he will have access to every type of list he needs.

WebStorm Mac version
Useful JavaScript development tools

ZendStudio 13.5.1 Mac
Powerful PHP integrated development environment

Safe Exam Browser
Safe Exam Browser is a secure browser environment for taking online exams securely. This software turns any computer into a secure workstation. It controls access to any utility and prevents students from using unauthorized resources.

MinGW - Minimalist GNU for Windows
This project is in the process of being migrated to osdn.net/projects/mingw, you can continue to follow us there. MinGW: A native Windows port of the GNU Compiler Collection (GCC), freely distributable import libraries and header files for building native Windows applications; includes extensions to the MSVC runtime to support C99 functionality. All MinGW software can run on 64-bit Windows platforms.