


Zhipu AI launches the third-generation large base model ChatGLM3 to adapt to more domestic chips
The news on October 27, 2023 is that Zhipu AI released a new self-developed third-generation large base model ChatGLM3 and related series of products at the China Computer Conference (CNCC). This release marks a major breakthrough for Zhipu AI after launching the 100 billion base conversation model ChatGLM and ChatGLM2
ChatGLM3 is developed using an original multi-stage enhanced pre-training method. This method can make training more complete. According to the evaluation results, in 44 Chinese and English public data set tests, ChatGLM3 ranked first among domestic models of the same size. Zhang Peng, CEO of Zhipu AI, released new products at the press conference and demonstrated the latest product features in real time
ChatGLM3 new technology upgrade with higher performance and lower cost
ChatGLM3 launched by Zhipu AI has become more powerful with richer training data and better training solutions. Compared with ChatGLM2, MMLU increased by 36%, CEval increased by 33%, GSM8K increased by 179%, and BBH increased by 126%
At the same time, ChatGLM3 aims at GPT-4V and has implemented iterative upgrades of several new functions, including CogVLM with multi-modal understanding capabilities - image recognition semantics, which has achieved SOTA on more than 10 international standard image and text evaluation data sets. ; Code enhancement module Code Interpreter generates code and executes it according to user needs, automatically completing complex tasks such as data analysis and file processing; Web search enhancement WebGLM-access search enhancement can automatically find relevant information on the Internet based on questions and provide answers when answering Refer to relevant literature or article links. The semantic and logical capabilities of ChatGLM3 have been greatly enhanced.
ChatGLM3 also integrates the self-developed AgentTuning technology, which activates the model agent capabilities, especially in terms of intelligent planning and execution, which is 1000% improved compared to ChatGLM2; it also enables domestic large models to natively support tool calling, code execution, Complex scenarios such as games, database operations, knowledge graph search and reasoning, and operating systems.
In addition, ChatGLM3 this time launches end test models ChatGLM3-1.5B and ChatGLM3-3B that can be deployed on mobile phones, supporting a variety of mobile phones and vehicle platforms including vivo, Xiaomi, and Samsung, and even supporting CPU chips on mobile platforms. Inference speed can reach 20 tokens/s. In terms of accuracy, the performance of the 1.5B and 3B models is close to that of the ChatGLM2-6B model on public benchmarks.
Based on the latest efficient dynamic reasoning and memory optimization technology, the current reasoning framework of ChatGLM3 is better than the current best open source implementation under the same hardware and model conditions, including vLLM launched by the University of Berkeley and the latest version of Hugging Face TGI , the inference speed is increased by 2-3 times, and the inference cost is doubled, only 0.5 points per thousand tokens, the lowest cost.
This content is for reference only and does not constitute any investment advice. Readers should use their own judgment when using this information and assume responsibility for their own decisions. This website is not responsible for any losses caused by the use of this content
This account does not make any statement or guarantee as to the availability, accuracy, timeliness, validity or completeness of any information published, and hereby disclaims any liability or consequences that may arise from the information. After rewriting: This account makes no representation or warranty as to the availability, accuracy, timeliness, validity or completeness of any information posted, and disclaims any liability or consequences in this statement
2. This account is non-commercial and non-profit. The reproduced content does not mean that you agree with its views and are responsible for its authenticity, nor is it intended to constitute any other guidance. This website is not responsible for any direct or indirect responsibility for any inaccuracies or errors in any information reproduced or published.
3. The information, materials, text, pictures, etc. used in this article come from the Internet, and all reproduced content has been marked with the source. If you find any work that infringes your intellectual property rights or personal legal rights, please contact us and we will modify or delete it in a timely manner
The above is the detailed content of Zhipu AI launches the third-generation large base model ChatGLM3 to adapt to more domestic chips. For more information, please follow other related articles on the PHP Chinese website!

The 2025 Artificial Intelligence Index Report released by the Stanford University Institute for Human-Oriented Artificial Intelligence provides a good overview of the ongoing artificial intelligence revolution. Let’s interpret it in four simple concepts: cognition (understand what is happening), appreciation (seeing benefits), acceptance (face challenges), and responsibility (find our responsibilities). Cognition: Artificial intelligence is everywhere and is developing rapidly We need to be keenly aware of how quickly artificial intelligence is developing and spreading. Artificial intelligence systems are constantly improving, achieving excellent results in math and complex thinking tests, and just a year ago they failed miserably in these tests. Imagine AI solving complex coding problems or graduate-level scientific problems – since 2023

Meta's Llama 3.2: A Leap Forward in Multimodal and Mobile AI Meta recently unveiled Llama 3.2, a significant advancement in AI featuring powerful vision capabilities and lightweight text models optimized for mobile devices. Building on the success o

This week's AI landscape: A whirlwind of advancements, ethical considerations, and regulatory debates. Major players like OpenAI, Google, Meta, and Microsoft have unleashed a torrent of updates, from groundbreaking new models to crucial shifts in le

The comforting illusion of connection: Are we truly flourishing in our relationships with AI? This question challenged the optimistic tone of MIT Media Lab's "Advancing Humans with AI (AHA)" symposium. While the event showcased cutting-edg

Introduction Imagine you're a scientist or engineer tackling complex problems – differential equations, optimization challenges, or Fourier analysis. Python's ease of use and graphics capabilities are appealing, but these tasks demand powerful tools

Meta's Llama 3.2: A Multimodal AI Powerhouse Meta's latest multimodal model, Llama 3.2, represents a significant advancement in AI, boasting enhanced language comprehension, improved accuracy, and superior text generation capabilities. Its ability t

Data Quality Assurance: Automating Checks with Dagster and Great Expectations Maintaining high data quality is critical for data-driven businesses. As data volumes and sources increase, manual quality control becomes inefficient and prone to errors.

Mainframes: The Unsung Heroes of the AI Revolution While servers excel at general-purpose applications and handling multiple clients, mainframes are built for high-volume, mission-critical tasks. These powerful systems are frequently found in heavil


Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

AI Hentai Generator
Generate AI Hentai for free.

Hot Article

Hot Tools

VSCode Windows 64-bit Download
A free and powerful IDE editor launched by Microsoft

SublimeText3 English version
Recommended: Win version, supports code prompts!

Zend Studio 13.0.1
Powerful PHP integrated development environment

mPDF
mPDF is a PHP library that can generate PDF files from UTF-8 encoded HTML. The original author, Ian Back, wrote mPDF to output PDF files "on the fly" from his website and handle different languages. It is slower than original scripts like HTML2FPDF and produces larger files when using Unicode fonts, but supports CSS styles etc. and has a lot of enhancements. Supports almost all languages, including RTL (Arabic and Hebrew) and CJK (Chinese, Japanese and Korean). Supports nested block-level elements (such as P, DIV),

SublimeText3 Mac version
God-level code editing software (SublimeText3)