search
HomeTechnology peripheralsAILeCun predicts AGI: Large models and reinforcement learning are both rampant! My 'world model' is the new way

Yann LeCun, one of the most famous contemporary giants in the AI ​​industry and the soul of Meta's AI laboratory, has long been committed to giving machines a basic understanding of the world's operating concepts, that is, allowing AI to acquire common sense. What LeCun did in the past was to use video excerpts to train neural networks and let the AI ​​predict, pixel by pixel, what will appear in the next frame of daily activity videos. Unsurprisingly, he admitted that this approach hit a brick wall. After thinking about it for several months to a year and a half, LeCun had new ideas for the next generation of AI.

LeCun predicts AGI: Large models and reinforcement learning are both rampant! My world model is the new way

New Path of AI

In an interview with "MIT Technology Review", LeCun outlined his new research path, saying This will give the machine a common sense basis for exploring the world. For LeCun, this is the first step in building AGI (Artificial General Intelligence). Machines that can think like humans have been the guiding vision since the birth of the AI ​​industry, and it is also one of the most controversial concepts.

But LeCun’s new path may still be incomplete, raising more questions than answers. The biggest question is that LeCun himself admits that he doesn’t yet know how to build the kind of AI he describes. At the core of this approach is a neural network that can look at and learn from the real world in a different way than before. LeCun finally gave up on letting AI guess the next frame of video pixel by pixel, and only let the new neural network learn the key knowledge necessary to complete the task.

LeCun predicts AGI: Large models and reinforcement learning are both rampant! My world model is the new way

LeCun then plans to pair this neural network with another neural network called a "configurator." The "configurator" is responsible for deciding which details the main neural network must learn and automatically adjust the main system accordingly. For LeCun, AGI is an integral part of human interaction with future technologies. Of course, this outlook coincides with his employer Meta Company, which has invested all his wealth in developing the metaverse.

LeCun said that in 10-15 years, AR glasses will replace the current status of smartphones. AR glasses must have a virtual intelligent assistant that can assist human daily activities. If these assistants are to be most effective, they must more or less keep up with the intelligence of the human brain.

"World model" is the core of AGI

LeCun's recent passion for "world model", according to him, is the basic operating mode of most animal brains: for the real world Run a simulation. From infancy, animals use prediction-trial-and-error methods to develop intelligence. Young children develop the foundations of intelligence in the first few months of life by observing real-world movements and setbacks.

Observing a small ball falling hundreds of times, ordinary babies have a basic understanding of the existence and operation of gravity even if they have never taken a basic physics class or learned Newton's three laws. Therefore, this kind of intuitive/tacit reasoning is called "common sense" by ordinary people. Human beings use common sense to understand most possible futures and impossible fantasies in the real world, to foresee the consequences of their actions and make decisions accordingly. Such human intelligence requires neither pixel-accurate details nor a comprehensive library of physical parameters. Even if someone has no vision or is illiterate, they can still use their intelligence normally.

LeCun predicts AGI: Large models and reinforcement learning are both rampant! My world model is the new way

But it is difficult to teach a machine to learn common sense. Today's neural networks need to be shown thousands of examples before they begin to vaguely discover underlying patterns. LeCun said that the basis of intelligence is the common sense ability to predict the immediate future. However, after giving up on letting AI predict pixel by pixel, LeCun said he wanted to change his mind. LeCun gave an analogy: Imagine you hold a pen in the air and let it go. Common sense tells you that the pen will definitely fall, but the precise location of the fall is beyond the scope of human intelligence prediction. According to the past AI development model, AI has to run complex physics models to predict whether the pen will fall and to obtain the precise location of the fall.

Now LeCun is trying hard to let AI only predict the common sense conclusion that the pen will fall. As for the precise position, it is not within the scope of solution. LeCun said this is the basic pattern of the "world model".

LeCun predicts AGI: Large models and reinforcement learning are both rampant! My world model is the new way

LeCun said that he has built an early version of the "world model" that can complete basic object recognition, and is now working on training it to learn the above-mentioned common sense predictions.

However, LeCun said that he has not yet understood the function of the "configurator". The "configurator" AI in LeCun's imagination is the control component of the entire AGI system. It will determine what common-sense predictions the World Model needs to make at any moment, and adjust the details of the data the World Model should handle to do so. LeCun now firmly believes that a "configurator" is essential, but he doesn't know how to train a neural network to achieve this effect.

"We need to explore a list of feasible technologies, and this list does not exist yet." In LeCun's vision, "configurator" and "world model" are the future AGI The two core parts of the basic cognitive architecture are based on which the cognitive model for perceiving the world, the incentive model that drives AI to adjust behavior, etc. can be developed. LeCun said that this way the neural network can successfully simulate every part of the human brain. For example, the "configurator" and "world model" play the role of the prefrontal lobe, the motivation model is the amygdala of AI, and so on. LeCun predicts AGI: Large models and reinforcement learning are both rampant! My world model is the new way

Cognitive architecture and prediction models at different levels of detail are all views that have been established in the industry for many years. However, when deep learning becomes the mainstream of the AI ​​industry, many of these old ideas become outdated. Now LeCun is returning to traditional wisdom: "The AI ​​research community has forgotten these things a lot."

Large models and reinforcement learning are dead ends

The reason why we go back to the old ways road because LeCun firmly believes that the current mainstream path in the industry has reached a dead end. Regarding how to build AGI, there are currently two mainstream views in the AI ​​industry.

First, many researchers firmly believe in the path that leads to their own mistakes: just like OpenAI’s GPT series and DALL-E series, the bigger the model, the better, so large that it exceeds the critical point. , AI has awakened human intelligence.

The second is reinforcement learning: continuous trial and error, and rewarding and punishing the AI ​​according to the trial and error results. This is DeepMind’s method for making various chess and card AI and game AI. Believers of this path believe that as long as the reward incentives are set correctly, reinforcement learning will eventually create a real AGI.

Lecun said that the two types of people here are rubbish: "Infinitely expanding the magnitude of existing large language models, and finally being able to create human-level AI? This absurd argument, I didn't believe it for a second. These models can only process various text and image data, without any direct experience of the real world." "Reinforcement learning requires a huge amount of data to train the model to perform the simplest tasks. I I don’t think this method has a chance of making AGI.”

LeCun predicts AGI: Large models and reinforcement learning are both rampant! My world model is the new way

People in the industry have both support and opposition to LeCun’s views. If LeCun's vision is realized, AI will become the next generation of basic high-performance technology no less than the Internet. But his announcement did not include the performance, incentive mechanism, control mechanism, etc. of his own model. However, these shortcomings are minor matters, because regardless of praise or criticism, industry insiders agree that facing these shortcomings will be a long time coming. Because even LeCun can't make AGI right now.

Lecun himself also acknowledged this situation. He said that he only hoped to sow seeds for new theoretical paths and let latecomers build results on this basis. "Achieving this goal requires too many people and too much effort. I am bringing this up now just because I think this path is the final right path." Even if this is not possible, LeCun hopes to persuade his colleagues not to just focus on it. With large models and reinforcement learning, it’s best to open your mind. "I hate to see people wasting time."

Industry reaction: both praise and criticism

Yoshua Bengio, another leader in the AI ​​industry and a good friend of LeCun, expressed his joy Seeing old friends come true. "Yann has been talking about this for a while, but I'm quite happy to see him comprehensively summarizing all his remarks in one place. However, these are just applications for research directions rather than report of results. We usually only discuss them privately. Sharing this below, the risk of talking publicly is quite high."

LeCun predicts AGI: Large models and reinforcement learning are both rampant! My world model is the new way

## David Silver, who leads the development of the game AI AlphaZero at DeepMind, disapproves of LeCun's comments on his project Criticism, but welcome to his vision.

"The world model described by LeCun is indeed an exciting new idea." Melanie Mitchell of the Santa Fe Institute in California agreed with LeCun: "The industry really doesn't see this kind of thing in the deep learning community very often. point of view. But the big language model really lacks both memory and the backbone of the internal world model that can play a role."

Natasha Jaques of Google Brain disagrees: "Everyone has seen that big language The model is extremely efficient and incorporates a lot of human knowledge. Without a language model, how can I upgrade the world model proposed by LeCun? Even if humans learn, the way is not only personal experience, but also word of mouth."

The above is the detailed content of LeCun predicts AGI: Large models and reinforcement learning are both rampant! My 'world model' is the new way. For more information, please follow other related articles on the PHP Chinese website!

Statement
This article is reproduced at:51CTO.COM. If there is any infringement, please contact admin@php.cn delete
You Must Build Workplace AI Behind A Veil Of IgnoranceYou Must Build Workplace AI Behind A Veil Of IgnoranceApr 29, 2025 am 11:15 AM

In John Rawls' seminal 1971 book The Theory of Justice, he proposed a thought experiment that we should take as the core of today's AI design and use decision-making: the veil of ignorance. This philosophy provides a simple tool for understanding equity and also provides a blueprint for leaders to use this understanding to design and implement AI equitably. Imagine that you are making rules for a new society. But there is a premise: you don’t know in advance what role you will play in this society. You may end up being rich or poor, healthy or disabled, belonging to a majority or marginal minority. Operating under this "veil of ignorance" prevents rule makers from making decisions that benefit themselves. On the contrary, people will be more motivated to formulate public

Decisions, Decisions… Next Steps For Practical Applied AIDecisions, Decisions… Next Steps For Practical Applied AIApr 29, 2025 am 11:14 AM

Numerous companies specialize in robotic process automation (RPA), offering bots to automate repetitive tasks—UiPath, Automation Anywhere, Blue Prism, and others. Meanwhile, process mining, orchestration, and intelligent document processing speciali

The Agents Are Coming – More On What We Will Do Next To AI PartnersThe Agents Are Coming – More On What We Will Do Next To AI PartnersApr 29, 2025 am 11:13 AM

The future of AI is moving beyond simple word prediction and conversational simulation; AI agents are emerging, capable of independent action and task completion. This shift is already evident in tools like Anthropic's Claude. AI Agents: Research a

Why Empathy Is More Important Than Control For Leaders In An AI-Driven FutureWhy Empathy Is More Important Than Control For Leaders In An AI-Driven FutureApr 29, 2025 am 11:12 AM

Rapid technological advancements necessitate a forward-looking perspective on the future of work. What happens when AI transcends mere productivity enhancement and begins shaping our societal structures? Topher McDougal's upcoming book, Gaia Wakes:

AI For Product Classification: Can Machines Master Tax Law?AI For Product Classification: Can Machines Master Tax Law?Apr 29, 2025 am 11:11 AM

Product classification, often involving complex codes like "HS 8471.30" from systems such as the Harmonized System (HS), is crucial for international trade and domestic sales. These codes ensure correct tax application, impacting every inv

Could Data Center Demand Spark A Climate Tech Rebound?Could Data Center Demand Spark A Climate Tech Rebound?Apr 29, 2025 am 11:10 AM

The future of energy consumption in data centers and climate technology investment This article explores the surge in energy consumption in AI-driven data centers and its impact on climate change, and analyzes innovative solutions and policy recommendations to address this challenge. Challenges of energy demand: Large and ultra-large-scale data centers consume huge power, comparable to the sum of hundreds of thousands of ordinary North American families, and emerging AI ultra-large-scale centers consume dozens of times more power than this. In the first eight months of 2024, Microsoft, Meta, Google and Amazon have invested approximately US$125 billion in the construction and operation of AI data centers (JP Morgan, 2024) (Table 1). Growing energy demand is both a challenge and an opportunity. According to Canary Media, the looming electricity

AI And Hollywood's Next Golden AgeAI And Hollywood's Next Golden AgeApr 29, 2025 am 11:09 AM

Generative AI is revolutionizing film and television production. Luma's Ray 2 model, as well as Runway's Gen-4, OpenAI's Sora, Google's Veo and other new models, are improving the quality of generated videos at an unprecedented speed. These models can easily create complex special effects and realistic scenes, even short video clips and camera-perceived motion effects have been achieved. While the manipulation and consistency of these tools still need to be improved, the speed of progress is amazing. Generative video is becoming an independent medium. Some models are good at animation production, while others are good at live-action images. It is worth noting that Adobe's Firefly and Moonvalley's Ma

Is ChatGPT Slowly Becoming AI's Biggest Yes-Man?Is ChatGPT Slowly Becoming AI's Biggest Yes-Man?Apr 29, 2025 am 11:08 AM

ChatGPT user experience declines: is it a model degradation or user expectations? Recently, a large number of ChatGPT paid users have complained about their performance degradation, which has attracted widespread attention. Users reported slower responses to models, shorter answers, lack of help, and even more hallucinations. Some users expressed dissatisfaction on social media, pointing out that ChatGPT has become “too flattering” and tends to verify user views rather than provide critical feedback. This not only affects the user experience, but also brings actual losses to corporate customers, such as reduced productivity and waste of computing resources. Evidence of performance degradation Many users have reported significant degradation in ChatGPT performance, especially in older models such as GPT-4 (which will soon be discontinued from service at the end of this month). this

See all articles

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

Video Face Swap

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Tools

Atom editor mac version download

Atom editor mac version download

The most popular open source editor

mPDF

mPDF

mPDF is a PHP library that can generate PDF files from UTF-8 encoded HTML. The original author, Ian Back, wrote mPDF to output PDF files "on the fly" from his website and handle different languages. It is slower than original scripts like HTML2FPDF and produces larger files when using Unicode fonts, but supports CSS styles etc. and has a lot of enhancements. Supports almost all languages, including RTL (Arabic and Hebrew) and CJK (Chinese, Japanese and Korean). Supports nested block-level elements (such as P, DIV),

Dreamweaver Mac version

Dreamweaver Mac version

Visual web development tools

SublimeText3 Linux new version

SublimeText3 Linux new version

SublimeText3 Linux latest version

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools