search
HomeTechnology peripheralsAIHaoMo Zhixing & Tsinghua AIR self-driving open class Dr. Pan Xing revealed HaoMo's self-driving AI practice on the spot

As AI large models emerge from deep learning algorithms, they are becoming the hottest new technology paradigm in the current AI field. Autonomous driving technology also has the possibility to evolve from the modular stage to end-to-end autonomous driving due to the introduction of large model technology. AI large models are reshaping the technical route of autonomous driving.

On September 27, the high-quality open course on autonomous driving jointly organized by HaoMo Zhixing and Tsinghua University Intelligent Industry Research Institute (AIR) concluded successfully. This open class focuses on the current leading AI algorithms for autonomous driving, combined with Haomo’s specific practices, bringing an end-to-end autonomous driving technology feast to autonomous driving practitioners, industry partners and media friends.

This course is the third in a series of open courses on autonomous driving. The first and second courses previously provided a basic introduction to the autonomous driving knowledge system from the perspective of macro industry and technical principles. In the third issue, Dr. Zhan Xianyuan, assistant researcher/assistant professor at Tsinghua AIR, explained the characteristics and current progress of end-to-end autonomous driving AI algorithms from the perspective of decision optimization, while Dr. Pan Xing, technical director of HaoMo Zhixing, fully explained from the data closed-loop system How the AI ​​large model algorithm learns and optimizes in massive data, and how it demonstrates its amazing capabilities in practice.

毫末智行&清华AIR自动驾驶公开课 潘兴博士现场揭秘毫末的自动驾驶AI实践之路

In the sharing titled "End-to-End Autonomous Driving from the Perspective of Decision Optimization", Dr. Zhan Xianyuan started from the concept of end-to-end and combined with the 30-year development history of the autonomous driving industry to tell everyone about end-to-end autonomous driving. The strategic learning method in the algorithm, combined with the scientific research cooperation practice of Tsinghua AIR and Hao Mo, uses the Al algorithm to gain insight into the development context of the industry and make summary and trend judgments on the development of the industry.

Dr. Zhan Xianyuan pointed out that end-to-end, simply put, is to integrate all architectures and different modules into a complete whole, conduct training directly from input to output, and transmit learning signals forward from the decision-making point. The advantage of the original modularity is that each module is disassembled very cleanly, the modeling goal of each module is very clear, and the interpretability is very good. However, under the modular architecture, each module has its own system for design and optimization. Combining multiple modules together will inevitably lead to accumulation of errors. The advantages of end-to-end lies in the following three points. First of all, the entire end-to-end model can be regarded as a single very large model, so the structure is very simple. All goals are optimization and learning around the ultimate goal of decision-making. The goals are unified at the optimization level. Secondly, it is end-to-end learning from input to final decision output, which can easily implement pure data-driven learning that relies on massive data. Third, because many models are trained in the same system end-to-end, the backbones of different module models can be shared, thereby reducing computational overhead.

毫末智行&清华AIR自动驾驶公开课 潘兴博士现场揭秘毫末的自动驾驶AI实践之路

Dr. Zhan Xianyuan introduced that all end-to-end driving models can be regarded as a large decision-making model. Training such a model requires the use of decision-making optimization algorithms. This involves imitation learning and reinforcement learning. Imitation learning is a mapping directly trained from data using supervised learning methods; reinforcement learning is not simply imitating data. It provides the possibility of transcending the data itself and can be optimized through continuous learning. Find a better decision-making model than the available data.

In Dr. Zhan Xianyuan’s view, the early end-to-end autonomous driving models were all small decision-making models, but today, the end-to-end systems implemented in the industry are all huge models. The online interaction paradigm has gradually extended to completely offline learning. As models become stronger and better, security becomes better and better, and there are slowly some improvements and transitions at the generalization level. In addition, Dr. Zhan Xianyuan also gave a detailed introduction to the cooperation between Tsinghua AIR and Haimou in imitation learning and offline reinforcement learning, and said that these algorithms will gradually be applied to Haimou’s autonomous driving scenario practice.

毫末智行&清华AIR自动驾驶公开课 潘兴博士现场揭秘毫末的自动驾驶AI实践之路

After Dr. Zhan Xianyuan’s sharing, Dr. Pan Xing took “Hai Mo’s Road to Autonomous Driving AI” as the theme and explained the importance of AI algorithms from an industrial perspective through Hao Mo’s specific practice. Dr. Pershing said that as an artificial intelligence technology company dedicated to autonomous driving, Haimou users have driven assisted driving for more than 80 million kilometers. Urban NOH is also in the process of generalization and iteration, and is expected to achieve mass production next year.

毫末智行&清华AIR自动驾驶公开课 潘兴博士现场揭秘毫末的自动驾驶AI实践之路

Pershing said that with the increase in data scale, improvement in algorithm capabilities, and applications under the trend of big models, big data, and big computing power, the current industry is about to enter the data-driven era of autonomous driving 3.0. As autonomous driving products are moving from high-speed scenarios to urban scenarios, the construction of autonomous driving data intelligence systems is the core infrastructure. In order to achieve a closed data loop, companies such as Tesla, Haimo and many domestic companies are also building their own cloud AI capabilities and supercomputing centers to achieve better results through greater computing power and larger-scale data processing capabilities. autonomous driving capabilities.

Currently, Haomo has built its own data intelligence system MANA, and at the beginning of this year built the largest intelligent computing center in China's autonomous driving industry - MANA OASIS Snow Lake·Oasis. Based on MANA OASIS, Haomo released the industry's first autonomous driving generative large model DriveGPT Xuehu·Hairuo in April this year. "As a basic large model, Haimo uses DriveGPT to build further AI capabilities, including data management retrieval, automatic annotation, AIGC simulation data synthesis, etc. Based on these data capabilities and services, we further improve the performance of various modules and algorithms on the vehicle end. capabilities, and ultimately achieve a better autonomous driving product.”

Dr. Pan Xing pointed out that data intelligence is the core of the entire autonomous driving iteration. In this process, massive data assets must be accumulated. Through large AI models, these data assets can be better managed. At the same time, computing power is inevitably needed after data is available. The stable and continuous operation of the intelligent computing center also provides a steady stream of power for the iteration of large models and the improvement of autonomous driving.

毫末智行&清华AIR自动驾驶公开课 潘兴博士现场揭秘毫末的自动驾驶AI实践之路

Dr. Pan Xing said, "With data and computing power, the current car-cloud linkage and joint training methods can effectively improve the effect of car-side algorithms through large models." For example, by using DriveGPT, in the tool chain It can very effectively reduce the cost of the entire labeling and improve the efficiency of labeling. At the same time, DriveGPT can also use large models to directly support the improvement of the capabilities of small models on the car side, and better transfer the large model capabilities of the cloud to the models on the car side.

Dr. Pershing also said, “How to obtain more realistic simulation data efficiently, large models can play a very important role.” Through the use of large models, texture, depth, semantics and other information can be learned very effectively. Through the effective representation of large models, the data can be edited. For example, vehicle obstacles that are not in the original video can be pasted, edited, rotated at will, and put into the video through DriveGPT, thereby obtaining New simulation synthetic data. In addition to applications in the field of perception, large models also play a great role in intelligent driving decision-making and planning. DriveGPT uses user data of human driving to continuously iterate and learn to achieve better driving behavior and decision-making. .

At the same time, Haimo DriveGPT can not only help complete trajectory prediction and image synthesis, but also has the ability to make intelligent decisions. "DriveGPT has the ability to input a video to predict future trajectories and answer questions in the driving decision-making process, and can give explainable decisions." These capabilities make Weimo believe that with the advent of end-to-end autonomous driving, macro decision-making and Micro-behavior, learning and understanding together through models will become a more effective means. Dr. Pershing revealed that next, Haimou will connect the two models of perception and cognition more deeply end-to-end so that they can be integrated into one.

毫末智行&清华AIR自动驾驶公开课 潘兴博士现场揭秘毫末的自动驾驶AI实践之路

HaoMo Zhixing and Tsinghua University Intelligent Industry Research Institute (AIR) organized a high-quality public course on autonomous driving in 4 phases, and this course is the third. In the two previous courses, lecturers from Tsinghua AIR and Hao Mo Zhixing have introduced the development of autonomous driving technologies such as single-vehicle intelligent autonomous driving, vehicle-road collaborative autonomous driving and high-level intelligent road construction to nearly a hundred industry media people. Everyone explained the basic principles of autonomous driving AI technology and the current application trends of large models in autonomous driving. In this high-quality open course on autonomous driving, Haimo and Tsinghua AIR shared more in-depth AI algorithms and principles of autonomous driving AI systems with observers in the autonomous driving industry, and received active questions and exchanges from online and offline guests.

Facing the self-driving stars and the sea, only action can truly achieve the future goal. Through the high-quality open courses on autonomous driving, Haimo and Tsinghua AIR have joined hands with senior media people in the industry to harvest the latest research results and practical experience on autonomous driving A algorithms. They have joined hands across mountains and seas to share the wisdom of AI knowledge and contribute to the autonomous driving industry. Precious technical consensus and knowledge accumulation.

The above is the detailed content of HaoMo Zhixing & Tsinghua AIR self-driving open class Dr. Pan Xing revealed HaoMo's self-driving AI practice on the spot. For more information, please follow other related articles on the PHP Chinese website!

Statement
This article is reproduced at:搜狐. If there is any infringement, please contact admin@php.cn delete
Can't use ChatGPT! Explaining the causes and solutions that can be tested immediately [Latest 2025]Can't use ChatGPT! Explaining the causes and solutions that can be tested immediately [Latest 2025]May 14, 2025 am 05:04 AM

ChatGPT is not accessible? This article provides a variety of practical solutions! Many users may encounter problems such as inaccessibility or slow response when using ChatGPT on a daily basis. This article will guide you to solve these problems step by step based on different situations. Causes of ChatGPT's inaccessibility and preliminary troubleshooting First, we need to determine whether the problem lies in the OpenAI server side, or the user's own network or device problems. Please follow the steps below to troubleshoot: Step 1: Check the official status of OpenAI Visit the OpenAI Status page (status.openai.com) to see if the ChatGPT service is running normally. If a red or yellow alarm is displayed, it means Open

Calculating The Risk Of ASI Starts With Human MindsCalculating The Risk Of ASI Starts With Human MindsMay 14, 2025 am 05:02 AM

On 10 May 2025, MIT physicist Max Tegmark told The Guardian that AI labs should emulate Oppenheimer’s Trinity-test calculus before releasing Artificial Super-Intelligence. “My assessment is that the 'Compton constant', the probability that a race to

An easy-to-understand explanation of how to write and compose lyrics and recommended tools in ChatGPTAn easy-to-understand explanation of how to write and compose lyrics and recommended tools in ChatGPTMay 14, 2025 am 05:01 AM

AI music creation technology is changing with each passing day. This article will use AI models such as ChatGPT as an example to explain in detail how to use AI to assist music creation, and explain it with actual cases. We will introduce how to create music through SunoAI, AI jukebox on Hugging Face, and Python's Music21 library. Through these technologies, everyone can easily create original music. However, it should be noted that the copyright issue of AI-generated content cannot be ignored, and you must be cautious when using it. Let’s explore the infinite possibilities of AI in the music field together! OpenAI's latest AI agent "OpenAI Deep Research" introduces: [ChatGPT]Ope

What is ChatGPT-4? A thorough explanation of what you can do, the pricing, and the differences from GPT-3.5!What is ChatGPT-4? A thorough explanation of what you can do, the pricing, and the differences from GPT-3.5!May 14, 2025 am 05:00 AM

The emergence of ChatGPT-4 has greatly expanded the possibility of AI applications. Compared with GPT-3.5, ChatGPT-4 has significantly improved. It has powerful context comprehension capabilities and can also recognize and generate images. It is a universal AI assistant. It has shown great potential in many fields such as improving business efficiency and assisting creation. However, at the same time, we must also pay attention to the precautions in its use. This article will explain the characteristics of ChatGPT-4 in detail and introduce effective usage methods for different scenarios. The article contains skills to make full use of the latest AI technologies, please refer to it. OpenAI's latest AI agent, please click the link below for details of "OpenAI Deep Research"

Explaining how to use the ChatGPT app! Japanese support and voice conversation functionExplaining how to use the ChatGPT app! Japanese support and voice conversation functionMay 14, 2025 am 04:59 AM

ChatGPT App: Unleash your creativity with the AI ​​assistant! Beginner's Guide The ChatGPT app is an innovative AI assistant that handles a wide range of tasks, including writing, translation, and question answering. It is a tool with endless possibilities that is useful for creative activities and information gathering. In this article, we will explain in an easy-to-understand way for beginners, from how to install the ChatGPT smartphone app, to the features unique to apps such as voice input functions and plugins, as well as the points to keep in mind when using the app. We'll also be taking a closer look at plugin restrictions and device-to-device configuration synchronization

How do I use the Chinese version of ChatGPT? Explanation of registration procedures and feesHow do I use the Chinese version of ChatGPT? Explanation of registration procedures and feesMay 14, 2025 am 04:56 AM

ChatGPT Chinese version: Unlock new experience of Chinese AI dialogue ChatGPT is popular all over the world, did you know it also offers a Chinese version? This powerful AI tool not only supports daily conversations, but also handles professional content and is compatible with Simplified and Traditional Chinese. Whether it is a user in China or a friend who is learning Chinese, you can benefit from it. This article will introduce in detail how to use ChatGPT Chinese version, including account settings, Chinese prompt word input, filter use, and selection of different packages, and analyze potential risks and response strategies. In addition, we will also compare ChatGPT Chinese version with other Chinese AI tools to help you better understand its advantages and application scenarios. OpenAI's latest AI intelligence

5 AI Agent Myths You Need To Stop Believing Now5 AI Agent Myths You Need To Stop Believing NowMay 14, 2025 am 04:54 AM

These can be thought of as the next leap forward in the field of generative AI, which gave us ChatGPT and other large-language-model chatbots. Rather than simply answering questions or generating information, they can take action on our behalf, inter

An easy-to-understand explanation of the illegality of creating and managing multiple accounts using ChatGPTAn easy-to-understand explanation of the illegality of creating and managing multiple accounts using ChatGPTMay 14, 2025 am 04:50 AM

Efficient multiple account management techniques using ChatGPT | A thorough explanation of how to use business and private life! ChatGPT is used in a variety of situations, but some people may be worried about managing multiple accounts. This article will explain in detail how to create multiple accounts for ChatGPT, what to do when using it, and how to operate it safely and efficiently. We also cover important points such as the difference in business and private use, and complying with OpenAI's terms of use, and provide a guide to help you safely utilize multiple accounts. OpenAI

See all articles

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

Video Face Swap

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Hot Tools

DVWA

DVWA

Damn Vulnerable Web App (DVWA) is a PHP/MySQL web application that is very vulnerable. Its main goals are to be an aid for security professionals to test their skills and tools in a legal environment, to help web developers better understand the process of securing web applications, and to help teachers/students teach/learn in a classroom environment Web application security. The goal of DVWA is to practice some of the most common web vulnerabilities through a simple and straightforward interface, with varying degrees of difficulty. Please note that this software

EditPlus Chinese cracked version

EditPlus Chinese cracked version

Small size, syntax highlighting, does not support code prompt function

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

VSCode Windows 64-bit Download

VSCode Windows 64-bit Download

A free and powerful IDE editor launched by Microsoft

Dreamweaver Mac version

Dreamweaver Mac version

Visual web development tools