Home  >  Article  >  Technology peripherals  >  Google Mathematical AI publishes an article in Nature: Proving beyond Wu Wenjun’s 1978 law theorem, demonstrating world-class geometric level

Google Mathematical AI publishes an article in Nature: Proving beyond Wu Wenjun’s 1978 law theorem, demonstrating world-class geometric level

王林
王林forward
2024-01-18 18:18:20847browse

Google DeepMind releases Nature again, the Alpha series of AI returns, and the level of mathematics improves by leaps and bounds.

AlphaGeometry, no need for human demonstration to reach the geometric level of IMO gold medal players.

Google Mathematical AI publishes an article in Nature: Proving beyond Wu Wenjun’s 1978 law theorem, demonstrating world-class geometric level

It feels like AlphaZero learned the game "Mastering the game of Go without human knowledge" back then.

AlphaGeometry got 25 of the 30 IMO difficulty geometric theorem proving questions right, while the average human gold medal player got 25.9 right. In addition, the previous SOTA method (Wu Wenjun method in 1978) could only get 10 correct.

Google Mathematical AI publishes an article in Nature: Proving beyond Wu Wenjun’s 1978 law theorem, demonstrating world-class geometric level

IMO gold medalist陈伊婷(Evan Chen) is responsible for evaluating the answers generated by the AI. He commented:

AlphaGeometry's output is impressive, both reliable and clean. Past AI solutions have been hit-and-miss, resulting in output that sometimes required manual review.

AlphaGeometry's solutions have a verifiable structure that can be verified by both machines and understood by humans. It uses classic geometry rules like angles and similar triangles just like students.

In addition to the outstanding results, there are three key points in this study that have attracted the attention of the industry:

  • No need for human demonstration, that is, only AI synthetic data training is used, continuing AlphaZero’s self-learning Go method.
  • Large model combined with other AI methods, similar to AlphaGo and OpenAI Q* rumors.
  • Unlike many previous methods, AlphaGeometry can generate human-readable proofs, and both the model and code are open source.

Google Mathematical AI publishes an article in Nature: Proving beyond Wu Wenjun’s 1978 law theorem, demonstrating world-class geometric level

#The team believes that AlphaGeometry provides a potential framework for achieving advanced reasoning capabilities and discovering new knowledge.

This may help advance theorem proving in artificial intelligence - seen as a key step in building AGI.

Google Mathematical AI publishes an article in Nature: Proving beyond Wu Wenjun’s 1978 law theorem, demonstrating world-class geometric level

In addition, during the communication process with the author's team, Qubits inquired whether AlphaGeometry would really be allowed to participate in an IMO competition, just like the AlphaGo challenge back then. Like the human Go champion.

They said they are working hard to improve the system's capabilities and also need to enable AI to solve a wider range of mathematical problems beyond geometry.

AI proves that geometry also draws auxiliary lines

Previously, the AI ​​system could not solve geometric problems well, and it was stuck due to the lack of high-quality training data.

Humans learning geometry can use paper and pen to use existing knowledge on images to discover new and more complex geometric properties and relationships.

The Google team generated 1 billion random geometric object graphs for this purpose, as well as all the relationships between their points and lines, and finally screened out 100 million unique theorems and proofs of different difficulties. AlphaGeometry completely performed on these data Train from scratch.

Google Mathematical AI publishes an article in Nature: Proving beyond Wu Wenjun’s 1978 law theorem, demonstrating world-class geometric level

The system consists of two modules that cooperate with each other to find complex geometric proofs.

  • Language model, predicts the geometric structure that can be used to solve the problem (that is, adding auxiliary lines) .
  • Symbolic reasoning engine, uses logical rules to derive conclusions.

Google Mathematical AI publishes an article in Nature: Proving beyond Wu Wenjun’s 1978 law theorem, demonstrating world-class geometric level

First author Trieu Trinh introduced that the operation process of AlphaGeometry is similar to that the human brain is divided into two types: fast and slow.

This is the concept of "System 1, System 2" popularized in the best-selling book "Thinking Fast and Slow" by Nobel Prize winner Daniel Kahneman.

System 1 provides quick, intuitive ideas, and System 2 provides more thoughtful, rational decisions.

On the one hand, language models are good at identifying patterns and relationships in data and can quickly predict potentially useful auxiliary structures, but often lack the ability to rigorously reason or explain their decisions.

On the other hand, symbolic reasoning engines are based on formal logic and use explicit rules to draw conclusions. They are rational and explainable, but they are slow and inflexible, especially when dealing with large, complex problems alone.

For example, when solving an IMO 2015 competition question, the blue part is the auxiliary structure added by AlphaGeometry's language model, and the green part is the simplified version of the final proof, with a total of 109 steps.

Google Mathematical AI publishes an article in Nature: Proving beyond Wu Wenjun’s 1978 law theorem, demonstrating world-class geometric level

During the process of solving the problem, AlphaGeometry also discovered an unused prerequisite in the 2004 IMO competition question, and thus discovered a broader version of the theorem.

It can be proved that P, B, and C are collinear without the condition that O is the midpoint of BC.

Google Mathematical AI publishes an article in Nature: Proving beyond Wu Wenjun’s 1978 law theorem, demonstrating world-class geometric level

In addition, the study also found that for the three problems with the lowest human scores, AlphaGeometry also requires a very long proving process and the addition of a lot of auxiliary structures to solve.

But on the relatively easy questions, there was no significant correlation between the average human score and the length of the proof generated by the AI ​​(p = −0.06).

Google Mathematical AI publishes an article in Nature: Proving beyond Wu Wenjun’s 1978 law theorem, demonstrating world-class geometric level

One More Thing

Regarding the connection and difference between AlphaGeometry and AlphaGo, during the communication process with the team, Google scientistQuoc Le Introduction:

They all search in a very complex decision space, but AlphaGo’s method is more traditional(Note: Neural network is responsible for pattern recognition), The neural network in AlphaGeometry is responsible for suggesting the next action to be taken and guiding the search algorithm to move in the right direction in the decision space.

Although this result is named after the Alpha series, and the first unit is also Google DeepMind, the author is actually a former Google Brain member.

Quoc Le master needs no introduction. Trieu Trinh, the first author, and Thang Luong, the corresponding author, have both worked at Google for six or seven years. Thang Luong himself was also an IMO player in high school.

Among the two Chinese authors, He He is an assistant professor at New York University. Wu Yuhuai previously participated in the research of Google's large mathematical model Minerva, and has now left Google to join Musk's team and become one of the co-founders of xAI.

Paper address: https://www.nature.com/articles/s41586-023-06747-5.

Reference link:
[1]https://www.nature.com/articles/d4186-024-00141 -5.

[2]https://deepmind.google/discover/blog/alphageometry-an-olympiad-level-ai-system-for-geometry.

The above is the detailed content of Google Mathematical AI publishes an article in Nature: Proving beyond Wu Wenjun’s 1978 law theorem, demonstrating world-class geometric level. For more information, please follow other related articles on the PHP Chinese website!

Statement:
This article is reproduced at:51cto.com. If there is any infringement, please contact admin@php.cn delete