


Domestic ChatGPT is open source again! The effect has been greatly upgraded and can also be run on mobile phones.
Recently, the Yuanyu Intelligence team has open sourced another large model of the ChatYuan series: ChatYuan-large-v2, which supports inference on a single consumer-grade graphics card, PC and even mobile phones.
Just now, "domestic ChatGPT" ChatYuan has released a new version.
The updated ChatYuan-large-v2 not only supports Chinese and English bilingual, but also supports the total input and output length of up to 4k.
This is also the research result of Yuanyu Intelligence in the direction of large models after the previous PromptCLUE-base, PromptCLUE- v1-5, and ChatYuan-large-v1 models.
Open source project address:
https://github.com/clue-ai/ChatYuan
Huggingface:
https://huggingface.co/ClueAI/ChatYuan-large-v2
Modelscope:
https://modelscope.cn/models/ClueAI/ChatYuan- large-v2/summary
01 What is ChatYuan-large-v2?
ChatYuan-large-v2 is a large functional conversational language model that supports Chinese and English bilingualism. ChatYuan-large-v2 uses the same technical solution as the v1 version, including instruction fine-tuning, human feedback reinforcement learning, and thinking. Chains and other aspects have been optimized.
ChatYuan-large-v2 is a representative model in the ChatYuan series that achieves high-quality effects with lightweight design. It can achieve the basic effects of the industry's 10B model with only 0.7B parameters, greatly reducing the inference cost and improving Usage efficiency. Users can perform inference on consumer-grade graphics cards, PCs, and even mobile phones (INT4 requires as little as 400M).
At the same time, in order to better improve the user experience, the team has encapsulated tools. Chatyuan-large-v2 has been implemented locally and can be run locally. After downloading, the h5 version can be used directly locally for web page interaction.
02 What are the upgrades to v2?
Based on the original functions of chatyuan-large-v1, the v2 model has been optimized as follows:
- Enhanced basic capabilities: original contextual Q&A and creative writing capabilities Significant improvement.
- Added the ability to refuse to answer: for some dangerous and harmful questions, you have learned to refuse to answer.
- Added new code generation function: basic code generation has been optimized to a certain extent.
- Added a new table generation function: optimized the content and format of the generated table.
- Enhanced mathematical operation capabilities: Basic mathematical operations such as addition and subtraction have been optimized.
- Expand the total length of input and output: the maximum number of length tokens is extended to 4096.
- Enhanced simulation scenario capabilities: you can simulate multi-person conversations or specific scenarios, and perform content creation and contextual interaction in scenarios.
- Added Chinese and English bilingual dialogue capabilities: Newly added Chinese and English bilingual interaction, English creation, translation and other functions.
Rejection ability
Computational reasoning
Simulation scenario
##Table generation
Code generation
- Regarding the basic implementation of basic functions in reasoning, calculation, and code generation, there is still the problem of insufficient training. In some scenarios, logical errors will occur. For example, the code can basically be implemented and has the ability to annotate, but cannot. To ensure simplicity, smoothness and accuracy, visibility needs to be optimized.
- The answers to general knowledge are not precise enough, and there are still inaccuracies in factual knowledge.
- Contextual information processing is still insufficient.
Conclusion
Overall, v2 has greatly improved compared to the v1 open source model in terms of context understanding, content generation, code table generation, etc., just through the 0.7B parameter scale. It can achieve basic effects with tens of billions of parameters in the industry, significantly reduce reasoning costs, and improve usage efficiency.
Yuanyu Intelligent said that the team will firmly adhere to the open source route, and will continue to open source better and larger general large models in the future, continue to build an open source developer ecosystem, and promote the open source development of domestic large models. I hope all friends will criticize Correction.
Invitation for internal product testing
In addition to this open source ChatYuan-large-v2 model, the Yuanyu team officially launched the internal testing of the KnowX product. KnowX is equipped with the ChatYuan line The latest version of the large model capability has excellent performance in context understanding, content generation, code generation, logical reasoning calculation, etc. In order to achieve the reliability, stability and further optimization of the version, product internal testing has now been launched. The number of places is limited. Interested parties Friends can apply in the link below.
Internal beta application channel:
https://wj.qq.com/s2/11984341/e00b/
The above is the detailed content of Domestic ChatGPT is open source again! The effect has been greatly upgraded and can also be run on mobile phones.. For more information, please follow other related articles on the PHP Chinese website!

Running large language models at home with ease: LM Studio User Guide In recent years, advances in software and hardware have made it possible to run large language models (LLMs) on personal computers. LM Studio is an excellent tool to make this process easy and convenient. This article will dive into how to run LLM locally using LM Studio, covering key steps, potential challenges, and the benefits of having LLM locally. Whether you are a tech enthusiast or are curious about the latest AI technologies, this guide will provide valuable insights and practical tips. Let's get started! Overview Understand the basic requirements for running LLM locally. Set up LM Studi on your computer

Guy Peri is McCormick’s Chief Information and Digital Officer. Though only seven months into his role, Peri is rapidly advancing a comprehensive transformation of the company’s digital capabilities. His career-long focus on data and analytics informs

Introduction Artificial intelligence (AI) is evolving to understand not just words, but also emotions, responding with a human touch. This sophisticated interaction is crucial in the rapidly advancing field of AI and natural language processing. Th

Introduction In today's data-centric world, leveraging advanced AI technologies is crucial for businesses seeking a competitive edge and enhanced efficiency. A range of powerful tools empowers data scientists, analysts, and developers to build, depl

This week's AI landscape exploded with groundbreaking releases from industry giants like OpenAI, Mistral AI, NVIDIA, DeepSeek, and Hugging Face. These new models promise increased power, affordability, and accessibility, fueled by advancements in tr

But the company’s Android app, which offers not only search capabilities but also acts as an AI assistant, is riddled with a host of security issues that could expose its users to data theft, account takeovers and impersonation attacks from malicious

You can look at what’s happening in conferences and at trade shows. You can ask engineers what they’re doing, or consult with a CEO. Everywhere you look, things are changing at breakneck speed. Engineers, and Non-Engineers What’s the difference be

Simulate Rocket Launches with RocketPy: A Comprehensive Guide This article guides you through simulating high-power rocket launches using RocketPy, a powerful Python library. We'll cover everything from defining rocket components to analyzing simula


Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

Video Face Swap
Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Hot Tools

SublimeText3 English version
Recommended: Win version, supports code prompts!

VSCode Windows 64-bit Download
A free and powerful IDE editor launched by Microsoft

SAP NetWeaver Server Adapter for Eclipse
Integrate Eclipse with SAP NetWeaver application server.

SublimeText3 Linux new version
SublimeText3 Linux latest version

Dreamweaver CS6
Visual web development tools