Google Mathematical AI publishes an article in Nature: Proving beyond Wu Wenjun's 1978 law theorem, demonstrating world-class geometric level-AI-php.cn

Google Mathematical AI publishes an article in Nature: Proving beyond Wu Wenjun's 1978 law theorem, demonstrating world-class geometric level

王林

Jan 18, 2024 pm 06:18 PM

AIGooglemath ai

Google DeepMind releases Nature again, the Alpha series of AI returns, and the level of mathematics improves by leaps and bounds.

AlphaGeometry, no need for human demonstration to reach the geometric level of IMO gold medal players.

Google Mathematical AI publishes an article in Nature: Proving beyond Wu Wenjuns 1978 law theorem, demonstrating world-class geometric level

It feels like AlphaZero learned the game "Mastering the game of Go without human knowledge" back then.

AlphaGeometry got 25 of the 30 IMO difficulty geometric theorem proving questions right, while the average human gold medal player got 25.9 right. In addition, the previous SOTA method (Wu Wenjun method in 1978) could only get 10 correct.

Google Mathematical AI publishes an article in Nature: Proving beyond Wu Wenjuns 1978 law theorem, demonstrating world-class geometric level

IMO gold medalist陈伊婷（Evan Chen） is responsible for evaluating the answers generated by the AI. He commented:

AlphaGeometry's output is impressive, both reliable and clean. Past AI solutions have been hit-and-miss, resulting in output that sometimes required manual review.

AlphaGeometry's solutions have a verifiable structure that can be verified by both machines and understood by humans. It uses classic geometry rules like angles and similar triangles just like students.

In addition to the outstanding results, there are three key points in this study that have attracted the attention of the industry:

No need for human demonstration, that is, only AI synthetic data training is used, continuing AlphaZero’s self-learning Go method.
Large model combined with other AI methods, similar to AlphaGo and OpenAI Q* rumors.
Unlike many previous methods, AlphaGeometry can generate human-readable proofs, and both the model and code are open source.

Google Mathematical AI publishes an article in Nature: Proving beyond Wu Wenjuns 1978 law theorem, demonstrating world-class geometric level

#The team believes that AlphaGeometry provides a potential framework for achieving advanced reasoning capabilities and discovering new knowledge.

This may help advance theorem proving in artificial intelligence - seen as a key step in building AGI.

Google Mathematical AI publishes an article in Nature: Proving beyond Wu Wenjuns 1978 law theorem, demonstrating world-class geometric level

In addition, during the communication process with the author's team, Qubits inquired whether AlphaGeometry would really be allowed to participate in an IMO competition, just like the AlphaGo challenge back then. Like the human Go champion.

They said they are working hard to improve the system's capabilities and also need to enable AI to solve a wider range of mathematical problems beyond geometry.

AI proves that geometry also draws auxiliary lines

Previously, the AI system could not solve geometric problems well, and it was stuck due to the lack of high-quality training data.

Humans learning geometry can use paper and pen to use existing knowledge on images to discover new and more complex geometric properties and relationships.

The Google team generated 1 billion random geometric object graphs for this purpose, as well as all the relationships between their points and lines, and finally screened out 100 million unique theorems and proofs of different difficulties. AlphaGeometry completely performed on these data Train from scratch.

Google Mathematical AI publishes an article in Nature: Proving beyond Wu Wenjuns 1978 law theorem, demonstrating world-class geometric level

The system consists of two modules that cooperate with each other to find complex geometric proofs.

Language model, predicts the geometric structure that can be used to solve the problem (that is, adding auxiliary lines) .
Symbolic reasoning engine, uses logical rules to derive conclusions.

Google Mathematical AI publishes an article in Nature: Proving beyond Wu Wenjuns 1978 law theorem, demonstrating world-class geometric level

First author Trieu Trinh introduced that the operation process of AlphaGeometry is similar to that the human brain is divided into two types: fast and slow.

This is the concept of "System 1, System 2" popularized in the best-selling book "Thinking Fast and Slow" by Nobel Prize winner Daniel Kahneman.

System 1 provides quick, intuitive ideas, and System 2 provides more thoughtful, rational decisions.

On the one hand, language models are good at identifying patterns and relationships in data and can quickly predict potentially useful auxiliary structures, but often lack the ability to rigorously reason or explain their decisions.

On the other hand, symbolic reasoning engines are based on formal logic and use explicit rules to draw conclusions. They are rational and explainable, but they are slow and inflexible, especially when dealing with large, complex problems alone.

For example, when solving an IMO 2015 competition question, the blue part is the auxiliary structure added by AlphaGeometry's language model, and the green part is the simplified version of the final proof, with a total of 109 steps.

Google Mathematical AI publishes an article in Nature: Proving beyond Wu Wenjuns 1978 law theorem, demonstrating world-class geometric level

During the process of solving the problem, AlphaGeometry also discovered an unused prerequisite in the 2004 IMO competition question, and thus discovered a broader version of the theorem.

It can be proved that P, B, and C are collinear without the condition that O is the midpoint of BC.

Google Mathematical AI publishes an article in Nature: Proving beyond Wu Wenjuns 1978 law theorem, demonstrating world-class geometric level

In addition, the study also found that for the three problems with the lowest human scores, AlphaGeometry also requires a very long proving process and the addition of a lot of auxiliary structures to solve.

But on the relatively easy questions, there was no significant correlation between the average human score and the length of the proof generated by the AI (p = −0.06).

Google Mathematical AI publishes an article in Nature: Proving beyond Wu Wenjuns 1978 law theorem, demonstrating world-class geometric level

One More Thing

Regarding the connection and difference between AlphaGeometry and AlphaGo, during the communication process with the team, Google scientistQuoc Le Introduction:

They all search in a very complex decision space, but AlphaGo’s method is more traditional(Note: Neural network is responsible for pattern recognition), The neural network in AlphaGeometry is responsible for suggesting the next action to be taken and guiding the search algorithm to move in the right direction in the decision space.

Although this result is named after the Alpha series, and the first unit is also Google DeepMind, the author is actually a former Google Brain member.

Quoc Le master needs no introduction. Trieu Trinh, the first author, and Thang Luong, the corresponding author, have both worked at Google for six or seven years. Thang Luong himself was also an IMO player in high school.

Among the two Chinese authors, He He is an assistant professor at New York University. Wu Yuhuai previously participated in the research of Google's large mathematical model Minerva, and has now left Google to join Musk's team and become one of the co-founders of xAI.

Paper address: https://www.nature.com/articles/s41586-023-06747-5.

Reference link:
[1]https://www.nature.com/articles/d4186-024-00141 -5.

[2]https://deepmind.google/discover/blog/alphageometry-an-olympiad-level-ai-system-for-geometry.

The above is the detailed content of Google Mathematical AI publishes an article in Nature: Proving beyond Wu Wenjun's 1978 law theorem, demonstrating world-class geometric level. For more information, please follow other related articles on the PHP Chinese website!

Statement

This article is reproduced at:51CTO.COM. If there is any infringement, please contact admin@php.cn delete

How to Build Your Personal AI Assistant with Huggingface SmolLMApr 18, 2025 am 11:52 AM

Harness the Power of On-Device AI: Building a Personal Chatbot CLI In the recent past, the concept of a personal AI assistant seemed like science fiction. Imagine Alex, a tech enthusiast, dreaming of a smart, local AI companion—one that doesn't rely

AI For Mental Health Gets Attentively Analyzed Via Exciting New Initiative At Stanford UniversityApr 18, 2025 am 11:49 AM

Their inaugural launch of AI4MH took place on April 15, 2025, and luminary Dr. Tom Insel, M.D., famed psychiatrist and neuroscientist, served as the kick-off speaker. Dr. Insel is renowned for his outstanding work in mental health research and techno

The 2025 WNBA Draft Class Enters A League Growing And Fighting Online HarassmentApr 18, 2025 am 11:44 AM

"We want to ensure that the WNBA remains a space where everyone, players, fans and corporate partners, feel safe, valued and empowered," Engelbert stated, addressing what has become one of women's sports' most damaging challenges. The anno

Comprehensive Guide to Python Built-in Data Structures - Analytics VidhyaApr 18, 2025 am 11:43 AM

Introduction Python excels as a programming language, particularly in data science and generative AI. Efficient data manipulation (storage, management, and access) is crucial when dealing with large datasets. We've previously covered numbers and st

First Impressions From OpenAI's New Models Compared To AlternativesApr 18, 2025 am 11:41 AM

Before diving in, an important caveat: AI performance is non-deterministic and highly use-case specific. In simpler terms, Your Mileage May Vary. Don't take this (or any other) article as the final word—instead, test these models on your own scenario

AI Portfolio | How to Build a Portfolio for an AI Career?Apr 18, 2025 am 11:40 AM

Building a Standout AI/ML Portfolio: A Guide for Beginners and Professionals Creating a compelling portfolio is crucial for securing roles in artificial intelligence (AI) and machine learning (ML). This guide provides advice for building a portfolio

What Agentic AI Could Mean For Security OperationsApr 18, 2025 am 11:36 AM

The result? Burnout, inefficiency, and a widening gap between detection and action. None of this should come as a shock to anyone who works in cybersecurity. The promise of agentic AI has emerged as a potential turning point, though. This new class

Google Versus OpenAI: The AI Fight For StudentsApr 18, 2025 am 11:31 AM

Immediate Impact versus Long-Term Partnership? Two weeks ago OpenAI stepped forward with a powerful short-term offer, granting U.S. and Canadian college students free access to ChatGPT Plus through the end of May 2025. This tool includes GPT‑4o, an a

See all articles