


Since the beginning of this year, with the continued popularity of ChatGPT, large models have also entered a period of rapid development. Many well-known domestic and foreign technology companies have successively launched independently developed large model products. So what is the technical principle of large models?
On May 18, Professor Chen Xiaoping, director of the Robotics Laboratory of the University of Science and Technology of China, who was invited to participate in the 2023 China Home Appliances Technology Conference (CHEATC2023), shared his research and views. He is also the director of the Artificial Intelligence Ethics of the China Artificial Intelligence Society. Together with the Chairman of the Governance Committee, Professor Chen Xiaoping delivered a keynote speech on "New Developments in Artificial Intelligence: From Large Models to Soft Robots" at this conference, introducing the technical principles of large artificial intelligence models and new developments in the application of artificial intelligence. Technology trends.
Professor Chen Xiaoping of University of Science and Technology of China
"The fundamental principle of big models is to make predictions," Chen Xiaoping said. The development of artificial intelligence has now begun the process of the fourth wave, and data models have also shifted from big data-driven to big training-driven. Different from the previous three waves, the new stage of artificial intelligence has new requirements for the quality, quantity and acquisition methods of training data, and finally forms an example model that can be applied to large-scale real scenes. He emphasized that a large model is an intelligent system integrated by multiple technologies, rather than a simple combination of a single or a few technologies. ”
The rise of large models comes from generative artificial intelligence. Currently, generative artificial intelligence is not just a simple generation of content such as language and images, but also completes intelligence based on the precise processing of human natural language. Human-computer interaction. Chen Xiaoping said: "At this stage, our expectation for machine language processing is that it can speak human language, understand human language, and answer questions, even if the answer may not be correct. Among them, the basic requirement is that the speech must conform to human language habits. "Since there are no scientific standards for human language habits but there are empirical standards, how can machines master and utilize human language habits? Chen Xiaoping said: "The basic research idea and secret of success of large models is: extract language from large-scale human corpora. Traces and used in human-computer natural language interaction.”
The large model extracts semantic elements including characters, words, punctuation marks, etc. from the original human corpus, and then performs semantic review based on the correlation between the preceding and following semantic elements, and finally achieves behavioral prediction. In principle, the greater the number of semantic elements that are looked back, the higher the accuracy of prediction. At least 4,000 tokens can be reviewed by large models, and some models can review up to 100,000 tokens. "Chen Xiaoping said. The large model technology system is based on the pre-trained model, and then uses a specially trained special model to cooperate with the user guidance model to accurately understand and answer the user's questions. The three major models cooperate with each other, and the quality of the artificial intelligence answer can be Achieve substantial improvement.
Although the emergence of large-scale models has brought new innovative directions to artificial intelligence, it is not applicable to all aspects of real-world scenarios. According to Chen Xiaoping, the three major areas of artificial intelligence that China currently needs to conquer are intelligent manufacturing, intelligent agriculture and inclusive elderly care. "Overcoming these three major battles will completely change our global landscape." On the other hand, large models bring huge changes but also bring new challenges. When large models are based on imitations of human functions, they are likely to be thought of as having emotions and consciousness. This is because people habitually apply their understanding of a concept to the overall structure involving that concept, thinking that the information expressed by the structure also has the same meaning, but in fact this is not the case. "Chen Xiaoping said that the application of large models may also have public safety, employment and long-term impacts.
In addition to large-scale models, Professor Chen Xiaoping also achieved new scientific research results on "artificial intelligence in the physical world". At present, the physical form of artificial intelligence we put into application is mainly rigid robots. This kind of robot has high repeatability, but low dexterity and safety. It is suitable for structured environments, but needs to be carried out in unstructured environments. Accurate measurement, modeling and calculation require high technical requirements and are currently not suitable for most industries. In response to these shortcomings of rigid robots, Chen Xiaoping proposed the principle of fusion, under the three basic assumptions that accurate measurement of the operating objects of intelligent robots is not feasible, accurate modeling of the working environment and operating objects is not feasible, and precise decision-making is not feasible. , developed a pneumatic honeycomb network software arm. This kind of arm has good performance in terms of flexibility and load capacity, and can achieve precise control when there is external interference and irregular movement of objects. It is expected that this technology will have broad application prospects in fields such as home services, emotional interaction, and autonomous driving. On the other hand, Chen Xiaoping's team also combined a flexible arm with a rigid machine, resulting in the experimental results of "rigid and soft claws in one", which enabled the robot to achieve control without changing the program and hardware parameters or using force feedback sensors. Precise gripping of multi-shaped objects.
The above is the detailed content of CHEATC2023|Chen Xiaoping, University of Science and Technology of China: From large models to soft robots. For more information, please follow other related articles on the PHP Chinese website!

蚂蚁集团在上海举行的第二届外滩大会上,宣布正式发布旗下的金融大模型据蚂蚁集团介绍,蚂蚁金融大模型基于其自研基础大模型,并针对金融产业进行深度定制,底层算力集群达到万卡规模。目前,该大模型已在蚂蚁集团财富、保险平台全面测试。同时,基于该大模型的两款产品——智能金融助理“支小宝2.0”、服务金融产业专家的智能业务助手“支小助”,也已正式亮相。据介绍,两款大模型产品展示了蚂蚁从基础大模型到行业大模型以及产业应用的全栈布局和进展。本站附两款产品目前进度如下:“支小宝2.0”已经开始内测近半年,将在完成相

今年以来,360集团创始人周鸿祎在所有公开场合的讲话都离不开一个话题,那就是人工智能大模型。他曾自称“GPT的布道者”,对ChatGPT取得的突破赞不绝口,更是坚定看好由此产生的AI技术迭代。作为一个擅于表达的明星企业家,周鸿祎的演讲往往妙语连珠,所以他的“布道”也创造过很多热点话题,确实为AI大模型添了一把火。但对周鸿祎而言,光做意见领袖还不够,外界更关心他执掌的360公司如何应对这波AI新浪潮。事实上,在360内部,周鸿祎也早已掀起一场全员变革,4月份,他发出内部信,要求360每一位员工、每

2023年8月16日,WAVESUMMIT深度学习开发者大会在中国举办,该活动由深度学习技术及应用国家工程研究中心主办,百度飞桨和文心大模型承办。在会上,百度发布了文心大模型、飞桨平台和AI原生应用如流等一系列技术、产品的最新进展和生态成果。百度集团副总裁兼首席信息官李莹发表了主题演讲,她认为当前以AI大模型为核心技术的第四次科技革命将从根本上推动生产力变革,为各行各业提供强大支持,并为企业办公领域带来前所未有的发展机遇基于AI原生思维,李莹宣布,百度智能工作知识管理理念“创新流水线=AIx知识

6月21日,北大光华管理学院联合腾讯,宣布升级“数字中国筑塔计划”,共同推出“企业管理者人工智能通识课”系列课程。在第一课上,腾讯集团高级执行副总裁、云与智慧产业事业群CEO汤道生回顾了AI发展的历史,表示算法创新、算力增强、开源共创三大因素的叠加,构成了AI的“增长飞轮”。大模型的快速进步,推动我们正在进入一个被AI重塑的时代。汤道生表示,大模型只是起点,未来,应用落地的产业变革是更大的图景。企业过去的研发、生产、销售、服务等环节中,有很多依赖人来判断、协调与沟通的地方,今天都值得去看看,哪些

“无产业不AI,无应用不AI。”随着AI(人工智能)大模型技术落地,AI应用遍地开花。连日来,多家企业发布基于大模型的AI应用产品。身处“百模大战”时代,如何打造国产大模型应用产品?如何为大模型提供更普惠的算力、寻找更合适的场景?发布现场图。6月1日,阿里云对外披露通义大模型最新进展,上线聚焦音视频内容的AI新品“通义听悟”,成为国内首个开放公测的大模型应用产品。有专家认为,云计算是打造大模型最合适的形式,而大模型的进化过程,或将会对传统云计算架构开始新一轮的改造。阿里云AI新产品“通义听悟”开

近期,WakeData惟客数据(以下简称“WakeData”)完成了新一轮的产品能力升级。在2022年11月的产品发布会上,已传递出WakeData的“三个坚定”:始终坚定技术投入,全面夯实核心产品的科技能力和自研率;始终坚定国产化适配能力,支持国产芯片、操作系统、数据库、中间件、国密算法等,并在同领域实现对国外厂商的国产化替代;始终坚定拥抱生态,与伙伴共创共赢。WakeData继续新一轮的产品能力升级,凭借过去5年的技术积累,以及在地产、零售、汽车等行业和垂直领域的实践,与战略伙伴联合研发具

最近,重庆人工智能创新中心成功孵化了云从科技的首个大模型解决方案,名为“从容大模型训推一体机”,并已成功部署。作为国内最早布局大模型的云服务商之一,华为不仅致力于深耕算力,打造强大的算力基础设施来支持中国人工智能事业的发展,而且还着眼于通用大模型和行业大模型,真正实现为千行百业和科学研究提供优质的人工智能服务经过重庆人工智能创新中心技术团队、昇腾研发专家和云从科技人工智能研究院的共同努力,一个月内顺利完成了“从容大模型训推一体机”的精度与性能对齐、产品集成与测试等工作,这成为了重庆人工智能创新中


Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

AI Hentai Generator
Generate AI Hentai for free.

Hot Article

Hot Tools

SublimeText3 Chinese version
Chinese version, very easy to use

Dreamweaver Mac version
Visual web development tools

WebStorm Mac version
Useful JavaScript development tools

Notepad++7.3.1
Easy-to-use and free code editor

SecLists
SecLists is the ultimate security tester's companion. It is a collection of various types of lists that are frequently used during security assessments, all in one place. SecLists helps make security testing more efficient and productive by conveniently providing all the lists a security tester might need. List types include usernames, passwords, URLs, fuzzing payloads, sensitive data patterns, web shells, and more. The tester can simply pull this repository onto a new test machine and he will have access to every type of list he needs.
