Deep learning achieves genius-level performance in geometric reasoning. Nature publishes DeepMind's model and receives praise in the Fields Medal winner news-AI-php.cn

Deep learning achieves genius-level performance in geometric reasoning. Nature publishes DeepMind's model and receives praise in the Fields Medal winner news

WBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWB

Jan 18, 2024 pm 12:45 PM

industry

This work represents a breakthrough in AI’s mathematical reasoning capabilities and is an important milestone in the development of general AI systems.

This time, the artificial intelligence algorithm achieved a major breakthrough in the Mathematics Olympiad (IMO).

Deep learning achieves genius-level performance in geometric reasoning. Nature publishes DeepMinds model and receives praise in the Fields Medal winner news

In the latest issue of the internationally authoritative journal "Nature", a paper was published introducing an artificial intelligence system called AlphaGeometry. The system is able to solve Olympic geometry problems without human demonstration. Experts believe that this is an important milestone in artificial intelligence's progress towards possessing human reasoning capabilities. The publication of this research result is of great significance for promoting the further development of artificial intelligence.

Paper link: https://www.nature.com/articles/s41586-023-06747-5

DeepMind was also published in the paper The code and model were open sourced for a while, GitHub: https://github.com/google-deepmind/alphageometry

This is an artificial intelligence system, from the hands of Google DeepMind researchers , which can solve complex geometric problems at a level close to that of a human Olympiad gold medalist.

In a benchmark test of 30 Mathematical Olympiad geometry questions, AlphaGeometry solved 25 of the geometry questions within the standard Mathematical Olympiad time limit, while the previous state-of-the-art system only solved 10 of the geometry problems. By comparison, human gold medalists solved an average of 25.9 problems.

# Theorem proving is a challenging task for learning-based AI models. The main reason is that human proofs in most mathematical fields are difficult to translate into machine-verifiable language, thus limiting the amount of data used to train AI models. To overcome this problem, DeepMind proposes an alternative method of using synthetic data for theorem proving. They developed a general guidance framework called AlphaGeometry that has applicability in many fields. By leveraging synthetic data, AlphaGeometry is able to train AI models for theorem proving and produce high-quality results. This method provides an effective solution to the difficulty of theorem proving.

Research Introduction

AlphaGeometry combines language models with "symbolic engines" to perform mathematical inferences with the help of symbols and logical rules. Among them, language models are good at identifying and predicting the subsequent steps of the process, but lack the rigor required for mathematical reasoning; on the other hand, symbolic engines are purely based on formal logic and strict rules, which This enables it to guide the language model toward rational decision-making.

In the research of AlphaGeometry, DeepMind conducted tests from the benchmark test set of 30 Olympic geometry problems (IMO-AG-30) spanning 2000 to 2022. The results showed that AlphaGeometry performed well in competition time Able to solve 25 problems within the limit. The previous state-of-the-art method (Wu’s method) could only solve 10.

It is well known that AI systems often struggle to solve complex problems in geometry and mathematics due to a lack of reasoning skills and training data. The AlphaGeometry system combines the predictive power of neural language models with a rule-constrained inference engine, which work together to find new solutions.

In addition, to solve the data challenge, the study generated a large amount of synthetic training data, namely 100 million examples, in which many theorems have more than 200 steps to prove, which is 4 longer than the average proof length of Mathematical Olympiad theorems. times.

AlphaGeometry demonstrates AI’s growing ability to reason logically and discover and verify new knowledge. Solving Olympic-level geometry problems is an important milestone for AI on the road to more advanced and general artificial intelligence systems.

Ngô Bảo Châu, Fields Medal winner and IMO gold medalist, said: "Now I completely understand why AI researchers first try to solve the geometry problems of the International Mathematical Olympiad (IMO) , because finding their solutions is a bit like playing chess, where we have relatively few reasonable moves at each move. But I'm still shocked that they were able to achieve this. It's an impressive achievement."

Deep learning achieves genius-level performance in geometric reasoning. Nature publishes DeepMinds model and receives praise in the Fields Medal winner news

^{Wu Baozhu, winner of the 2010 Fields Medal, is currently a professor at the University of Chicago.}

AlphaGeometry is a neuro-symbolic system consisting of a neural language model and a symbolic deduction engine that work together to find proofs of complex geometric theorems. One system provides quick, intuitive ideas, while the other provides more thoughtful, rational decisions.

Because language models are good at identifying general patterns and relationships in data, they can quickly predict potentially useful structures, but often lack rigorous reasoning or interpretation. The symbolic deduction engine, on the other hand, is based on formal logic and uses explicit rules to reach conclusions, which together make up AlphaGeometry.

AlphaGeometry's language model guides its symbolic deduction engine to find possible solutions to geometric problems. General Olympiad geometry problems are based on diagrams and require the addition of new geometric structures to solve, such as points, lines or circles. AlphaGeometry's language model can predict which new structures would be most useful to add out of countless possibilities. These clues help fill in the gaps and allow the symbolic engine to make further inferences about the diagram and get closer to a solution.

For example, the picture below (top) shows the process of AlphaGeometry solving a simple question. The question is "Let ABC be any triangle with AB = AC. Prove that ∠ABC = ∠BCA."

The AlphaGeometry proof process goes like this: AlphaGeometry starts the proof search by running a symbolic deduction engine. This engine starts from the premises of the theorem and exhaustively derives new statements until the theorem is proved or the new statements are exhausted. If the symbolic engine fails to find a proof, the language model constructs an auxiliary point that adds provable conditions before the symbolic engine restarts. This cycle continues until a solution is found. For the simple example, the loop terminates after the first auxiliary structure "add point D to the midpoint of BC".

The picture below (below) shows AlphaGeometry’s solution to the IMO problem. "Prove that the circumcircles (O1) and (O2) of triangles FKM and KQH are tangent to each other..." AlphaGeometry can also prove such a complex problem, and the proof process also provides auxiliary points, etc. The proof has been greatly shortened and edited for illustrative purposes.

Generate 100 million mathematical reasoning training data

Humans can learn geometry by sketching on paper, examine diagrams, and use modern Have the knowledge to discover new, more complex geometric properties and relationships. This study's approach to generating synthetic data simulates this knowledge-building process at scale. The method for generating synthetic data is shown in Figure 3.

Deep learning achieves genius-level performance in geometric reasoning. Nature publishes DeepMinds model and receives praise in the Fields Medal winner news

Using highly parallel computing, the system first generates random graphs of 500 million geometric objects and exhaustively derives all relationships between points and lines in each graph. AlphaGeometry finds all the proofs contained in each graph and then works backwards to find out what additional structure, if any, is needed to obtain those proofs. This process is "symbol deduction and retrospection."

Deep learning achieves genius-level performance in geometric reasoning. Nature publishes DeepMinds model and receives praise in the Fields Medal winner news

^{Visual representation of synthetic data generated by AlphaGeometry}

After that, this huge data pool was Filtering to exclude similar examples resulted in a training dataset of 100 million.

Pioneering Artificial Intelligence Reasoning Capabilities

Every solution provided by AlphaGeometry has been computer checked and verified. The researchers also compared their results with previous artificial intelligence methods and human performance in Olympic competitions. Additionally, math coach and former Olympiad gold medalist Evan Chen evaluates a range of AlphaGeometry solutions for us.

Deep learning achieves genius-level performance in geometric reasoning. Nature publishes DeepMinds model and receives praise in the Fields Medal winner news

^{Chen Yiting, a PhD candidate in mathematics at MIT, won the IMO 2014 gold medal.}

Evan Chen said: "AlphaGeometry's output is impressive because it is both verifiable and clean. Past artificial intelligence solutions to proof-based competitive problems have sometimes been hit-and-miss ( The output is sometimes correct and requires human checking), while AlphaGeometry does not have this weakness: its solutions have a machine-verifiable structure. On the other hand, its output is still human-readable. One can imagine a coordinate system through brute force A computer program that solves geometry problems: think of pages and pages of tedious algebraic calculations, AlphaGeometry doesn’t do that, it uses the classical rules of geometry with angles and similar triangles like a human student would.”

Recently, financial technology company XTX Markets established the Artificial Intelligence Olympiad Mathematical Prize (AI-MO Prize) to encourage the development of artificial intelligence models that can perform mathematical reasoning. Since each Olympiad has six problems, only two of which typically focus on geometry, AlphaGeometry can only be applied to one-third of the problems in a given Olympiad.

Despite this, AlphaGeometry relied solely on its geometric problem-solving abilities to become the first artificial intelligence model in the world to pass the IMO bronze medal threshold in 2000 and 2015.

DeepMind is already working to advance inference for the next generation of artificial intelligence systems. The researchers believe that given the broad potential of using large-scale synthetic data to train AI systems from scratch, this approach could influence the direction in which future AI systems discover new knowledge in mathematics and other fields.

AlphaGeometry pioneers mathematical reasoning in artificial intelligence—from exploring the beauty of pure mathematics to using language models to solve mathematical and scientific problems. It is hoped that this technology will continue to improve to solve more advanced, abstract mathematical problems.

In addition to mathematics, the impact of AlphaGeometry may also cover more fields including geometric problems, such as computer vision, architecture, and even theoretical physics.

^{Reference content:}

^{https://deepmind.google/discover/blog/alphageometry-an-olympiad-level-ai-system -for-geometry/}

The above is the detailed content of Deep learning achieves genius-level performance in geometric reasoning. Nature publishes DeepMind's model and receives praise in the Fields Medal winner news. For more information, please follow other related articles on the PHP Chinese website!

Statement

This article is reproduced at:机器之心. If there is any infringement, please contact admin@php.cn delete

From Friction To Flow: How AI Is Reshaping Legal WorkMay 09, 2025 am 11:29 AM

The legal tech revolution is gaining momentum, pushing legal professionals to actively embrace AI solutions. Passive resistance is no longer a viable option for those aiming to stay competitive. Why is Technology Adoption Crucial? Legal professional

This Is What AI Thinks Of You And Knows About YouMay 09, 2025 am 11:24 AM

Many assume interactions with AI are anonymous, a stark contrast to human communication. However, AI actively profiles users during every chat. Every prompt, every word, is analyzed and categorized. Let's explore this critical aspect of the AI revo

7 Steps To Building A Thriving, AI-Ready Corporate CultureMay 09, 2025 am 11:23 AM

A successful artificial intelligence strategy cannot be separated from strong corporate culture support. As Peter Drucker said, business operations depend on people, and so does the success of artificial intelligence. For organizations that actively embrace artificial intelligence, building a corporate culture that adapts to AI is crucial, and it even determines the success or failure of AI strategies. West Monroe recently released a practical guide to building a thriving AI-friendly corporate culture, and here are some key points: 1. Clarify the success model of AI: First of all, we must have a clear vision of how AI can empower business. An ideal AI operation culture can achieve a natural integration of work processes between humans and AI systems. AI is good at certain tasks, while humans are good at creativity and judgment

Netflix New Scroll, Meta AI's Game Changers, Neuralink Valued At $8.5 BillionMay 09, 2025 am 11:22 AM

Meta upgrades AI assistant application, and the era of wearable AI is coming! The app, designed to compete with ChatGPT, offers standard AI features such as text, voice interaction, image generation and web search, but has now added geolocation capabilities for the first time. This means that Meta AI knows where you are and what you are viewing when answering your question. It uses your interests, location, profile and activity information to provide the latest situational information that was not possible before. The app also supports real-time translation, which completely changed the AI experience on Ray-Ban glasses and greatly improved its usefulness. The imposition of tariffs on foreign films is a naked exercise of power over the media and culture. If implemented, this will accelerate toward AI and virtual production

Take These Steps Today To Protect Yourself Against AI CybercrimeMay 09, 2025 am 11:19 AM

Artificial intelligence is revolutionizing the field of cybercrime, which forces us to learn new defensive skills. Cyber criminals are increasingly using powerful artificial intelligence technologies such as deep forgery and intelligent cyberattacks to fraud and destruction at an unprecedented scale. It is reported that 87% of global businesses have been targeted for AI cybercrime over the past year. So, how can we avoid becoming victims of this wave of smart crimes? Let’s explore how to identify risks and take protective measures at the individual and organizational level. How cybercriminals use artificial intelligence As technology advances, criminals are constantly looking for new ways to attack individuals, businesses and governments. The widespread use of artificial intelligence may be the latest aspect, but its potential harm is unprecedented. In particular, artificial intelligence

A Symbiotic Dance: Navigating Loops Of Artificial And Natural PerceptionMay 09, 2025 am 11:13 AM

The intricate relationship between artificial intelligence (AI) and human intelligence (NI) is best understood as a feedback loop. Humans create AI, training it on data generated by human activity to enhance or replicate human capabilities. This AI

AI's Biggest Secret — Creators Don't Understand It, Experts SplitMay 09, 2025 am 11:09 AM

Anthropic's recent statement, highlighting the lack of understanding surrounding cutting-edge AI models, has sparked a heated debate among experts. Is this opacity a genuine technological crisis, or simply a temporary hurdle on the path to more soph

Bulbul-V2 by Sarvam AI: India's Best TTS ModelMay 09, 2025 am 10:52 AM

India is a diverse country with a rich tapestry of languages, making seamless communication across regions a persistent challenge. However, Sarvam’s Bulbul-V2 is helping to bridge this gap with its advanced text-to-speech (TTS) t

See all articles

Hot AI Tools

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress images for free

Clothoff.io

AI clothes remover

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Roblox: Grow A Garden - Complete Mutation Guide

3 weeks agoByDDD

Roblox: Bubble Gum Simulator Infinity - How To Get And Use Royal Keys

3 weeks agoBy尊渡假赌尊渡假赌尊渡假赌

How to fix KB5055612 fails to install in Windows 10?

3 weeks agoByDDD

Blue Prince: How To Get To The Basement

1 months agoByDDD

Nordhold: Fusion System, Explained

3 weeks agoBy尊渡假赌尊渡假赌尊渡假赌

Hot Tools

DVWA

Damn Vulnerable Web App (DVWA) is a PHP/MySQL web application that is very vulnerable. Its main goals are to be an aid for security professionals to test their skills and tools in a legal environment, to help web developers better understand the process of securing web applications, and to help teachers/students teach/learn in a classroom environment Web application security. The goal of DVWA is to practice some of the most common web vulnerabilities through a simple and straightforward interface, with varying degrees of difficulty. Please note that this software