


Maybe the math test questions you took were machine-generated.
MIT students can readily solve problems in topics such as multivariable calculus, differential equations, and linear algebra, but these problems have stumped machine learning models. Until now, such models could answer only elementary or high school level math questions, and they did not always find the right answer.
Now, researchers from MIT, Columbia University, Harvard University, and the University of Waterloo have used few-shot learning and OpenAI's Codex to automatically synthesize programs that solve college-level mathematics problems in a few seconds, reaching human-level performance. The research was published in the Proceedings of the National Academy of Sciences (PNAS).
In addition, the model can explain the generated solutions and quickly generate new college mathematics problems. When the researchers showed these machine-generated questions to students, the students couldn't even tell whether the questions were generated by an algorithm or a human.
This research can also be used to streamline course content generation, which is especially useful for schools with thousands of students and massive open online courses (MOOCs). The system can also act as an online tutor, showing students the steps to solve math problems.
Paper address: https://www.pnas.org/doi/epdf/10.1073/pnas.2123433119
The method of this study combines three innovations:
- Unlike models pre-trained only on text, the model in this study is pre-trained on text and also fine-tuned on code;
- Few-shot learning is used to synthesize programs that correctly solve mathematical problems;
- The method can solve problems, explain solutions, and generate new questions.
Examples of new questions generated by this research are as follows.
## A model that can answer, solve, and pose questions
The study randomly selected 25 problems from each of seven courses: MIT's 18.01 Single Variable Calculus, 18.02 Multivariable Calculus, 18.03 Differential Equations, 18.05 Introduction to Probability and Statistics, 18.06 Linear Algebra, and 6.042 Mathematics for Computer Science, plus Columbia University's COMS3251 Computational Linear Algebra.
For the MATH dataset, the study randomly sampled 15 questions from each of six topics in the dataset (Prealgebra, Algebra, Counting and Probability, Intermediate Algebra, Number Theory, and Precalculus).
This approach differs from networks such as GPT-3, which are pre-trained only on text. The researchers transformed the problems into programming tasks and applied program synthesis and few-shot learning techniques. Turning a mathematical problem into a programming task can be as simple as rewriting "find the distance between two points" as "write a program that finds the difference between two points."
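As a minimal sketch of that rewriting, the question about the distance between two points can be answered by a short program. The code below is a hand-written illustration, not output from Codex; the function name `distance` and the sample points are assumptions made for this example.

```python
import math


def distance(p, q):
    """Return the Euclidean distance between two points of equal dimension."""
    if len(p) != len(q):
        raise ValueError("points must have the same number of coordinates")
    # Take the coordinate-wise difference, then the length of that difference.
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))


print(distance((0, 0), (3, 4)))  # 5.0
```

Phrasing the question as a program means the answer comes from running code rather than from predicting the next words of a text answer.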
It is worth mentioning that the study uses Codex, which is not only pre-trained on text but also fine-tuned on code, so that it can generate programs that solve mathematical problems at scale.
During pre-training, the model is shown millions of code examples from online repositories. Because its training data includes millions of natural language words as well as millions of lines of code, it can learn relationships between snippets of text and snippets of code.
The study uses zero-shot and few-shot learning to automatically generate programs that solve 81% of the mathematical problems. The researchers then use Codex to explain the resulting programs. The generated programs can output answers in many forms. For example, a program that computes and plots the geometry of a singular value decomposition (SVD) gives not only the correct answer but also the corresponding explanation.
Drori, one of the authors of the paper, explained that many mathematical problems can be solved using graphs or trees, but problems written in text are difficult to convert into such a representation. However, because the model has learned the relationship between text and code, it can convert a text question into code when given just a few question-code examples, and then run the code to answer the question.
"When you ask questions using only text, it is difficult for machine learning models to give answers, even though the answer may be in the text. This work fills in that missing piece of using code and program synthesis," Drori said.
Drori added that this work is the first to solve undergraduate mathematics problems and that it improves accuracy from 8% to more than 80%.
In fact, it is not always easy to convert mathematical problems into programming tasks. Some problems require the researchers to add context so that the neural network can handle the question correctly. A student picks up this background knowledge while taking the course, but the neural network does not have it unless the researchers state it explicitly. For example, they may need to explain that "network" in a question refers to a neural network and not a communication network.
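The few-shot step described above, in which the model sees a few question-code pairs and the code it produces is then run, might be sketched as follows. This is an illustration under stated assumptions rather than the authors' pipeline: the helper names `build_few_shot_prompt` and `run_program`, the docstring-style prompt layout, and the convention that a program stores its result in a variable called `answer` are all hypothetical, and the model call itself is replaced by a hard-coded string.

```python
def build_few_shot_prompt(examples, question):
    """Concatenate (question, program) pairs, then append the new question."""
    parts = []
    for example_question, example_program in examples:
        parts.append(f'"""\n{example_question}\n"""\n{example_program}\n')
    # The new question ends the prompt; the model is expected to continue with code.
    parts.append(f'"""\n{question}\n"""\n')
    return "\n".join(parts)


def run_program(source):
    """Execute a generated program and return the value it stores in `answer`."""
    namespace = {}
    exec(source, namespace)  # Only for trusted code; sandbox anything model-generated.
    return namespace["answer"]


examples = [
    ("Write a program that computes the derivative of x**2 at x = 3.",
     "answer = 2 * 3"),
]
prompt = build_few_shot_prompt(examples, "Write a program that computes 7 choose 2.")

# Stand-in for the model's completion; a real system would obtain this from the model.
generated = "import math\nanswer = math.comb(7, 2)"
print(run_program(generated))  # 21
```

Zero-shot prompting corresponds to calling `build_few_shot_prompt` with an empty list of examples.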
They may also need to tell the model which programming package to use, or to provide certain definitions. For example, in a question about playing cards, they may need to tell the model that each deck contains 52 cards.
The study automatically feeds these programming tasks, along with the added context and examples, into the pre-trained and fine-tuned neural network, which outputs a program that typically produces the correct answer. The answers were correct for more than 80% of the questions.
The researchers also used their model to generate questions by giving the neural network a series of mathematical questions on a topic and then asking it to create a new one. For example, given questions on the quantum detection of horizontal and vertical lines, it generated a new question on the quantum detection of diagonal lines. So the model does more than create new problems by replacing values and variables in existing ones.
The researchers tested the machine-generated questions by showing them to college students. Each student was given 10 randomly selected questions from an undergraduate mathematics course; five were written by humans and five were generated by the machine. Students were unable to tell whether the machine-generated questions came from an algorithm or a human, and they gave both kinds similar ratings for difficulty and appropriateness for the course.
However, Drori noted that this work is not intended to replace human professors.
"The accuracy has now reached 80%, but it will not reach 100%. Every time you solve a problem, someone will ask a harder one. But this work opens up the field for people to start using machine learning to solve increasingly difficult problems. We think this will have a huge impact on higher education," Drori said.
The research team is excited about the success of their approach and is extending the work to handle mathematical proofs. They also plan to address some limitations.
Currently, the model cannot answer questions with a visual component, and it cannot solve problems that are computationally intractable. In addition to overcoming these obstacles, the research aims to scale the model to hundreds of courses. With those courses, the team will generate more data to increase automation and provide insights into course design and curricula.
[Figure: Applying a neural network, OpenAI Codex, to solve, explain, and generate mathematical problems]
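The context step mentioned earlier in the article (stating that a deck has 52 cards, or naming the package to use) could look roughly like this. It is a sketch under assumptions: the helper `add_context`, the sample question, and the extra statement that four of the cards are aces are invented for illustration, and the program at the end is hand-written rather than generated by the model.

```python
from fractions import Fraction


def add_context(question, context):
    """Prepend explicit background statements to a question before it is sent to the model."""
    return " ".join([*context, question])


question = ("Two cards are drawn without replacement. "
            "What is the probability that both are aces?")
context = [
    "A standard deck contains 52 cards, 4 of which are aces.",
    "Use Python's fractions module.",
]
task = add_context(question, context)
print(task)

# A program of the kind the model could return for this task (hand-written here).
deck_size, aces = 52, 4
answer = Fraction(aces, deck_size) * Fraction(aces - 1, deck_size - 1)
print(answer)  # 1/221
```

Without the first context sentence, the deck size would have to be guessed, which is exactly the background knowledge a student would bring from the course.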
[Figure: Adding context]
[Figure: Human-asked questions vs. machine-generated questions]