Home  >  Article  >  Technology peripherals  >  The effect can reach 96% of the equivalent model of OpanAI, and the domestic open source AI language model TigerBot is released

The effect can reach 96% of the equivalent model of OpanAI, and the domestic open source AI language model TigerBot is released

WBOY
WBOYforward
2023-06-10 13:35:101079browse

According to news on June 8, the domestic multi-modal large language model TigerBot was officially released recently, including two versions of 7 billion parameters and 180 billion parameters. It is now open source on GitHub.

效果可达 OpanAI 同等模型 96%,国产开源 AI 语言模型 TigerBot 发布

▲ Picture source TigerBot’s GitHub page

It is reported that the innovation brought by TigerBot mainly lies in:

  • Propose instructions and complete them Innovative algorithms for supervised fine-tuning improve model learnability.
  • Use ensemble and probabilistic modeling methods to achieve controllable facts and creativity.
  • Break through the memory and communication issues in mainstream frameworks such as deep-speed in parallel training.

In addition, this model also makes more suitable optimizations from the tokenizer to the training algorithm for the more irregular distribution of the Chinese language.

Researcher Chen Ye said on the official website of Hubo Technology: "This model can quickly understand what type of questions humans have asked using only a small number of parameters. According to the OpenAI InstructGPT paper on the public NLP data set According to the automatic evaluation, TigerBot-7B has reached 96% of the comprehensive performance of OpenAI models of the same size."

效果可达 OpanAI 同等模型 96%,国产开源 AI 语言模型 TigerBot 发布

▲ Picture source TigerBot's GitHub page

According to According to the report, the performance of TigerBot-7B-base is "better than that of OpenAI's comparable models." The open source code includes basic training and inference code, and the quantification and inference code of the dual-card inference 180B model. The data includes 100G pre-training data and 1G or 1 million pieces of data for supervised fine-tuning.

IT House friends canfind GitHub’s open source projects here.

The above is the detailed content of The effect can reach 96% of the equivalent model of OpanAI, and the domestic open source AI language model TigerBot is released. For more information, please follow other related articles on the PHP Chinese website!

Statement:
This article is reproduced at:51cto.com. If there is any infringement, please contact admin@php.cn delete