Behind the carnival of ChatGPT: shortcomings are still there, but there are a lot of inspirations. Here are the things you can do in 2023...-AI-php.cn

Behind the carnival of ChatGPT: shortcomings are still there, but there are a lot of inspirations. Here are the things you can do in 2023...

PHPz

Apr 12, 2023 pm 11:31 PM

chatgpt

In the last month of 2022, OpenAI responded to people’s expectations for a whole year with a popular conversational robot-ChatGPT, although it is not the long-awaited GPT-4.

Anyone who has used ChatGPT can understand that it is a real "hexagon warrior": not only can it be used to chat, search, and translate, but it can also be used to write stories, Write code, debug, even develop small games, take the American college entrance examination... Some people joke that from now on, there will only be two types of artificial intelligence models - ChatGPT and others.

Behind the carnival of ChatGPT: shortcomings are still there, but there are a lot of inspirations. Here are the things you can do in 2023...

## Source: https://twitter.com/Tisoga/status/1599347662888882177

Due to its amazing capabilities, ChatGPT attracted 1 million users in just 5 days after it was launched. Many people boldly predict that if this trend continues, ChatGPT will soon replace search engines such as Google and programming question and answer communities such as Stack Overflow.

Behind the carnival of ChatGPT: shortcomings are still there, but there are a lot of inspirations. Here are the things you can do in 2023...

## Source: https://twitter.com/whoiskatrin/status /1600421531212865536

However, many of the answers generated by ChatGPT are wrong, and you can’t tell them without looking carefully. This will cause the answers to questions to be confused. This "very powerful but error-prone" attribute has given the outside world a lot of room for discussion. Everyone wants to know:

In the sixth "REDtech is coming" technical live broadcast organized by the Xiaohongshu technical team, Li Lei, an expert in the NLP field and an assistant professor at the University of California, Santa Barbara, and Xiaohongshu Zhang Lei, Vice President of Technology of Red Book, and Zhang Debing, Head of Multimedia Intelligent Algorithm of Xiaohongshu Community Department, started a conversation and exchanged and answered hot questions about ChatGPT.

Li Lei graduated from the Computer Science Department of Shanghai Jiao Tong University (ACM class) with a bachelor's degree and a Ph.D. from the Computer Science Department of Carnegie Mellon University. He has served as a postdoctoral researcher at the University of California, Berkeley, a young scientist at Baidu US Deep Learning Laboratory, and a senior director at ByteDance Artificial Intelligence Laboratory.

In 2017, Li Lei won the second prize of the Wu Wenjun Artificial Intelligence Technology Invention Award for his work on the AI writing robot Xiaomingbot. Xiaomingbot also has strong content understanding and text creation capabilities, and can smoothly broadcast sports events and write financial news.

Li Lei’s main research directions are machine learning, data mining and natural language processing. He has published more than 100 papers at top international academic conferences in the fields of machine learning, data mining and natural language processing, and holds more than 20 technical invention patents. He has won the second place in the 2012 ACM SIGKDD Best Doctoral Thesis, the 2017 CCF Distinguished Speaker, the 2019 CCF Green Bamboo Award, and the 2021 ACL Best Paper Award.

Zhang Lei, vice president of technology at Xiaohongshu, graduated from Shanghai Jiao Tong University. He served as vice president of technology at Huanju Times and chief architect of Baidu Fengchao, responsible for Baidu search advertising CTR machine learning Algorithms work. He once served as the China technical leader of the IBM Deep Question Answering (DeepQA) project.

Zhang Debing, the person in charge of the multimedia intelligent algorithm of Xiaohongshu Community Department. He was the chief scientist of GreenTong and the person in charge of multi-modal intelligent creation of Kuaishou. He is in the direction of technology research and business implementation. They all have rich experience and have led the team to win multiple academic competition championships, including the FRVT world championship in the international authoritative face recognition competition, and promote CV, multi-modal and other technologies in TO B scenarios and short videos such as security, retail, and sports. , advertising and other C-side scenarios have been implemented.

The discussion by the three guests not only focused on the current capabilities and problems of ChatGPT, but also looked forward to future trends and prospects. In the following, we sort out and summarize the content of the exchange.

Behind the carnival of ChatGPT: shortcomings are still there, but there are a lot of inspirations. Here are the things you can do in 2023...

## OpenAI co-founder Greg Brockman recently tweeted that 2023 will make 2022 look like AI A dull year for progress and adoption. Image source: https://twitter.com/gdb/status/1609244547460255744

Where does ChatGPT’s powerful capabilities come from?

Like many people who tried ChatGPT, the three guests were also impressed by the powerful capabilities of ChatGPT.

Among them, Zhang Debing gave an example of letting ChatGPT act as a Linux Terminal: telling ChatGPT the approximate machine configuration, and then letting it execute some instructions on this basis. It turned out that ChatGPT can remember It has a long operation history and the logical relationship is very consistent (for example, if you write a few lines of characters into a file, and then let it display which characters have been written in the file, it will be displayed).

Behind the carnival of ChatGPT: shortcomings are still there, but there are a lot of inspirations. Here are the things you can do in 2023...

DeepMind researcher Jonas Degrave uses ChatGPT to act as an example of a Linux Terminal. Picture source: https://www.engraved.blog/building-a-virtual-machine-inside/

This result made Zhang Debing and others suspicious. , did ChatGPT open a terminal in the background to deceive users? So they conducted some tests: Let ChatGPT execute some very complex instructions (such as two for loops, each of which has 1 billion times). If ChatGPT really opens a terminal, it will be stuck for a while. . The result was unexpected: ChatGPT quickly skipped this process and displayed the next result after this command. This made Zhang Debing and others realize that ChatGPT did roughly understand the logic of the entire demo, and it had a certain "thinking" ability.

So, where does this powerful ability come from? Zhang Lei put forward two hypotheses. One hypothesis is that this ability itself is built into the large model, but we have not released it properly before; another hypothesis is that the built-in ability of the large model is not actually that strong, and we need Make some adjustments to it with the help of human power.

Both Zhang Debing and Li Lei agree with the first hypothesis. Because we can intuitively see that the amount of data required to train and fine-tune large models differs by several orders of magnitude. In the "pre-training prompting" paradigm used by GPT-3 and subsequent models, This difference in data volume is even more obvious. Moreover, the in-context learning they use does not even require updating model parameters. It only needs to put a small number of labeled samples in the context of the input text to induce the model to output answers. This seems to indicate that ChatGPT’s powerful capabilities are indeed endogenous.

Behind the carnival of ChatGPT: shortcomings are still there, but there are a lot of inspirations. Here are the things you can do in 2023...

Comparison between traditional fine-tune method and GPT-3’s in-context learning method.

In addition, the power of ChatGPT also relies on a secret weapon-a method called RLHF (Reinforcement Learning with Human Feedback) Training method.

Behind the carnival of ChatGPT: shortcomings are still there, but there are a lot of inspirations. Here are the things you can do in 2023...

According to the official information released by OpenAI, this training method can be divided into three stages [1]:

Supervision strategy model in the cold start phase: randomly select a batch of prompts submitted by test users, rely on professional annotators to give high-quality answers to the specified prompts, and then use these manual annotations The data comes from the Fine-tune GPT 3.5 model, so that GPT 3.5 initially has the ability to understand the intention contained in the instructions;
Training reward model (Reward Model, RM): Randomly sample a batch of prompts submitted by users, then use the first-stage Fine-tune good cold start model to generate K different answers for each prompt, and then let the annotators sort the K results as training data. Use the pair-wise learning to rank mode to train the reward model;
Use reinforcement learning to enhance the ability of the pre-trained model: use the RM model learned in the previous stage and rely on the RM scoring results. Update pre-trained model parameters.

Two of these three stages use manual annotation, which is the so-called "human feedback" in RLHF.

Li Lei said that the results produced by this method are unexpected. When doing machine translation research before, they usually used the BLEU score (a fast, cheap, and language-independent automatic machine translation evaluation method that has a strong correlation with human judgment) to guide the model. This method is effective at times, but as the model gets larger and larger, its effect continues to weaken.

Therefore, The experience they gained from this is that training a very large model like GPT-3 with the help of feedback will not theoretically improve much. However, the stunning results of ChatGPT overturn this experience. Li Lei believes that this is what shocks everyone about ChatGPT and reminds everyone to change their research concepts.

What are the shortcomings of ChatGPT?

However, despite being shocked, the three guests also pointed out some of ChatGPT’s current shortcomings.

First of all, as mentioned before, some of the answers it generates are not accurate enough, and "serious nonsense" will appear from time to time, and it is not very good at logical reasoning.

Behind the carnival of ChatGPT: shortcomings are still there, but there are a lot of inspirations. Here are the things you can do in 2023...

## Picture source: https://m.huxiu.com/article/735909.html

Secondly, if a large model like ChatGPT is to be applied in practice, the deployment cost is quite high. And there is currently no clear evidence that models can maintain such powerful capabilities by reducing their size by an order or two. "If such amazing capabilities can only be maintained on a very large scale, it is still far from application," Zhang Debing said. Finally, ChatGPT may not be SOTA for some specific tasks (such as translation). Although the API of ChatGPT has not been released yet, and we cannot know its capabilities on some benchmarks, Li Lei's students discovered during the test of GPT-3 that although GPT-3 can complete the translation task excellently, it is not as good as the current one. The bilingual models trained separately are still worse (BLEU scores differ by 5 to 10 points). Based on this, Li Lei speculated that ChatGPT may not reach SOTA on some benchmarks, and may even be some distance away from SOTA.

Can ChatGPT replace search engines like Google? What inspiration does it have for AI research?

Among the various discussions about ChatGPT, the topic "Can it replace search engines" may be the hottest one. Recently, the New York Times reported that the popularity of ChatGPT has made Google feel like a powerful enemy. They are worried that if everyone uses chatbots like ChatGPT, no one will click on Google links with ads (in 2021, Google Advertising revenue accounts for 81.4% of total revenue). In a memo and recording obtained by The New York Times, Google CEO Sundar Pichai has been meeting to "define Google's AI strategy" and "disrupt the work of numerous teams within the company in response to ChatGPT." threats”[2].

In this regard, Li Lei believes that it may be a bit early to say replacement now. First of all, there is often a deep gap between the popularity of new technologies and commercial success. In the early years, Google Glass also said that it would become a new generation of interaction method, but it has not been able to fulfill its promise so far. Secondly, ChatGPT does perform better than search engines on some question and answer tasks, but the requirements carried by search engines are not limited to these tasks. Therefore, he believes that we should build products based on the advantages of ChatGPT itself, rather than necessarily aiming to replace existing mature products. The latter is a very difficult thing.

Behind the carnival of ChatGPT: shortcomings are still there, but there are a lot of inspirations. Here are the things you can do in 2023...

##Many AI researchers believe that ChatGPT and search engines can work together, and the two do not replace Replaced relationships, just like the recently popular "youChat" shows. Picture source: https://twitter.com/rasbt/status/1606661571459137539

## Zhang Debing also holds a similar view and believes that ChatGPT will not replace search engines in the short term. It's too realistic. After all, it still has many problems, such as being unable to access Internet resources and producing misleading information. In addition, it is still unclear whether its ability can be generalized to multi-modal search scenarios.

But it is undeniable that the emergence of ChatGPT has indeed given AI researchers a lot of inspiration.

Li Lei pointed out that

The first noteworthy point is the ability of in-context learning. In many previous studies, everyone has ignored how to tap the potential of existing models in some way (for example, the machine translation model is only used for translation, without trying to give it some hints to see if it can generate better translation), but GPT-3 and ChatGPT did it. Therefore, Li Lei is wondering if we can change all previous models to this form of in-context learning and give them some text, image or other forms of prompts so that they can fully exert their capabilities. This will be a A very promising research direction.

The second noteworthy point is the human feedback that plays an important role in ChatGPT. Li Lei mentioned that the success of Google search is actually largely due to its ease of obtaining human feedback (whether to click on the search results). ChatGPT obtains a lot of human feedback by asking people to write answers and rank the answers generated by the model, but this method of obtaining is relatively expensive (some recent research has pointed out this problem). Therefore, Li Lei believes that what we need to consider in the future is how to obtain a large amount of human feedback at low cost and efficiently.

Behind the carnival of ChatGPT: shortcomings are still there, but there are a lot of inspirations. Here are the things you can do in 2023...

## Source: https://twitter.com/yizhongwyz/status/1605382356054859777Xiaohongshu’s new technology for “planting grass”

First of all, this model intuitively demonstrates how the large NLP model compares with the small model in various scenarios such as complex multi-round conversations, generalization of different queries, and chain of thought (Chain of Thought). has been significantly improved, and related capabilities are currently not available on small models.

Zhang Debing believes that these related capabilities of NLP large models may also be tried and verified in cross-modal generation. At present, the cross-modal model still has a significant gap compared to GPT-3 and ChatGPT in model scale, and there are also many works in cross-modal scenarios that demonstrate the improvement of NLP branch expression capabilities, which will affect the sophistication of visual generation results. It helps a lot. If the scale of cross-modal models can be further expanded, the "emergence" of model capabilities may be something worth looking forward to.

Secondly, like the first generation GPT-3, the current multi-modal generation results can often see very good and stunning results when selected, but the generation controllability still has a lot of room for improvement. ChatGPT seems to have improved this problem to a certain extent, and the generated things are more in line with human wishes. Therefore, Zhang Debing pointed out that cross-modal generation may be attempted by referring to many ideas of ChatGPT, such as fine-tuning based on high-quality data, reinforcement learning, etc..

These research results will be applied in Xiaohongshu’s multiple businesses, including intelligent customer service in e-commerce and other scenarios, and more accurate user queries and user notes in search scenarios. Understand, perform intelligent soundtrack, copywriting generation, cross-modal conversion and generative creation of user materials in intelligent creation scenarios. In each scenario, the depth and breadth of applications will continue to be enhanced and expanded as model size is compressed and model accuracy continues to improve.

Xiaohongshu, as a UGC community with 200 million monthly active users, has created a huge amount of multi-modal data collection with the richness and diversity of community content. A large amount of real data has been accumulated in information retrieval, information recommendation, information understanding, especially in intelligent creation-related technologies, as well as underlying multi-modal learning and unified representation learning. It also provides unique and practical innovations in these fields. A vast landing scene.

Xiaohongshu is still one of the few Internet products that still maintains strong growth momentum. Thanks to its product form that pays equal attention to graphics, text and video content, Xiaohongshu has become a popular The fields of modality, audio and video, and search, advertising and promotion will face and create many cutting-edge application problems. This has also attracted a large number of technical talents to join. Many members of the Xiaohongshu technical team have working experience in first-tier manufacturers at home and abroad such as Google, Facebook, and BAT.

These technical challenges will also give technical people the opportunity to fully participate in new fields and even play an important role. In the future, the space for talent growth that Xiaohongshu's technical team can provide will be broader than ever before, and it is also waiting for more outstanding AI technical talents to join.

At the same time, Xiaohongshu also attaches great importance to communication with the industry. "REDtech is coming" is a technology live broadcast column created by Xiaohongshu's technical team for the forefront of the industry. Since the beginning of this year, the Xiaohongshu technical team has conducted in-depth exchanges and dialogues with leaders, experts and scholars in the fields of multi-modality, NLP, machine learning, recommendation algorithms, etc., striving to explore and implement solutions from the dual perspectives of academic research and Xiaohongshu’s practical experience. Discuss valuable technical issues.

The above is the detailed content of Behind the carnival of ChatGPT: shortcomings are still there, but there are a lot of inspirations. Here are the things you can do in 2023.... For more information, please follow other related articles on the PHP Chinese website!

Statement

This article is reproduced at:51CTO.COM. If there is any infringement, please contact admin@php.cn delete

Word文本框没有旋转按钮怎么办Dec 08, 2022 am 09:50 AM

Word文本框没有旋转按钮的解决办法：打开兼容模式文档后按F12键另存为高版本，再打开就可以了。

令人惊艳的4个ChatGPT项目，开源了！Mar 30, 2023 pm 02:11 PM

自从 ChatGPT、Stable Diffusion 发布以来，各种相关开源项目百花齐放，着实让人应接不暇。今天，着重挑选几个优质的开源项目分享给大家，对我们的日常工作、学习生活，都会有很大的帮助。

Word文档拆分后的子文档字体格式变了怎么办Feb 07, 2023 am 11:40 AM

Word文档拆分后的子文档字体格式变了的解决办法：1、在大纲模式拆分文档前，先选中正文内容创建一个新的样式，给样式取一个与众不同的名字；2、选中第二段正文内容，通过选择相似文本的功能将剩余正文内容全部设置为新建样式格式；3、进入大纲模式进行文档拆分，操作完成后打开子文档，正文字体格式就是拆分前新建的样式内容。

学术专用版ChatGPT火了，一键完成论文润色、代码解释、报告生成Apr 04, 2023 pm 01:05 PM

用 ChatGPT 辅助写论文这件事，越来越靠谱了。 ChatGPT 发布以来，各个领域的从业者都在探索 ChatGPT 的应用前景，挖掘它的潜力。其中，学术文本的理解与编辑是一种极具挑战性的应用场景，因为学术文本需要较高的专业性、严谨性等，有时还需要处理公式、代码、图谱等特殊的内容格式。现在，一个名为「ChatGPT 学术优化（chatgpt_academic）」的新项目在 GitHub 上爆火，上线几天就在 GitHub 上狂揽上万 Star。项目地址：https://github.com/

vscode配置中文插件，带你无需注册体验ChatGPT！Dec 16, 2022 pm 07:51 PM

面对一夜爆火的 ChatGPT ，我最终也没抵得住诱惑，决定体验一下，不过这玩意要注册需要外国手机号以及科学上网，将许多人拦在门外，本篇博客将体验当下爆火的 ChatGPT 以及无需注册和科学上网，拿来即用的 ChatGPT 使用攻略，快来试试吧！

30行Python代码就可以调用ChatGPT API总结论文的主要内容Apr 04, 2023 pm 12:05 PM

阅读论文可以说是我们的日常工作之一，论文的数量太多，我们如何快速阅读归纳呢？自从ChatGPT出现以后，有很多阅读论文的服务可以使用。其实使用ChatGPT API非常简单，我们只用30行python代码就可以在本地搭建一个自己的应用。阅读论文可以说是我们的日常工作之一，论文的数量太多，我们如何快速阅读归纳呢？自从ChatGPT出现以后，有很多阅读论文的服务可以使用。其实使用ChatGPT API非常简单，我们只用30行python代码就可以在本地搭建一个自己的应用。使用 Python 和 C

用ChatGPT秒建大模型！OpenAI全新插件杀疯了，接入代码解释器一键getApr 04, 2023 am 11:30 AM

ChatGPT可以联网后，OpenAI还火速介绍了一款代码生成器，在这个插件的加持下，ChatGPT甚至可以自己生成机器学习模型了。上周五，OpenAI刚刚宣布了惊爆的消息，ChatGPT可以联网，接入第三方插件了！而除了第三方插件，OpenAI也介绍了一款自家的插件「代码解释器」，并给出了几个特别的用例：解决定量和定性的数学问题；进行数据分析和可视化；快速转换文件格式。此外，Greg Brockman演示了ChatGPT还可以对上传视频文件进行处理。而一位叫Andrew Mayne的畅销作

ChatGPT教我学习PHP中AOP的实现（附代码）Mar 30, 2023 am 10:45 AM

本篇文章给大家带来了关于php的相关知识，其中主要介绍了我是怎么用ChatGPT学习PHP中AOP的实现，感兴趣的朋友下面一起来看一下吧，希望对大家有帮助。

See all articles

Hot AI Tools

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress images for free

Clothoff.io

AI clothes remover

AI Hentai Generator

Generate AI Hentai for free.

Hot Article

Repo: How To Revive Teammates

1 months agoBy尊渡假赌尊渡假赌尊渡假赌

R.E.P.O. Energy Crystals Explained and What They Do (Yellow Crystal)

2 weeks agoBy尊渡假赌尊渡假赌尊渡假赌

Hello Kitty Island Adventure: How To Get Giant Seeds

1 months agoBy尊渡假赌尊渡假赌尊渡假赌

How Long Does It Take To Beat Split Fiction?

4 weeks agoByDDD

R.E.P.O. Save File Location: Where Is It & How to Protect It?

4 weeks agoByDDD

Hot Tools

Dreamweaver Mac version

Visual web development tools

MinGW - Minimalist GNU for Windows

This project is in the process of being migrated to osdn.net/projects/mingw, you can continue to follow us there. MinGW: A native Windows port of the GNU Compiler Collection (GCC), freely distributable import libraries and header files for building native Windows applications; includes extensions to the MSVC runtime to support C99 functionality. All MinGW software can run on 64-bit Windows platforms.