search
HomeTechnology peripheralsAITsinghua University and Zhipu AI open source GLM-4: launching a new revolution in natural language processing

Since the launch of ChatGLM-6B on March 14, 2023, the GLM series models have received widespread attention and recognition. Especially after ChatGLM3-6B was open sourced, developers are full of expectations for the fourth-generation model launched by Zhipu AI. This expectation has finally been fully satisfied with the release of GLM-4-9B.

The birth of GLM-4-9B

In order to give small models (10B and below) more powerful capabilities, the GLM technical team spent nearly half a year exploring , launched this new fourth-generation GLM series open source model: GLM-4-9B. This model greatly compresses the model size while ensuring accuracy, and has faster inference speed and higher efficiency. There is no end to the exploration of the GLM technical team, and we will continue to work hard to launch more competitive open source

Innovative pre-training technology

In the pre-training process, we introduce large The language model performed data screening and finally obtained 10T of high-quality multilingual data. This amount of data is more than three times that of the ChatGLM3-6B model. In addition, we use FP8 technology for efficient pre-training, which improves training efficiency by 3.5 times compared to the third-generation model. Taking into account the user's storage needs, the parameter size of GLM-4-9B has been increased from 6B to 9B. Ultimately, we increased the pre-training computation by 5 times to maximize performance capabilities under limited storage conditions.

Excellent performance display

GLM-4-9B is a comprehensive technology upgrade tool with more powerful inference performance and better It has the advantages of context processing capabilities, multi-language support, multi-modal processing, and full tool set All Tools calling. These upgrades provide users with more stable, more reliable, and more accurate technical support, and improve users' work efficiency and quality.

The GLM-4-9B series includes multiple versions:

  • Basic version: GLM-4-9B (8K)
  • Conversational version: GLM -4-9B-Chat (128K)
  • Extra-long context version: GLM-4-9B-Chat-1M (1M)
  • Multi-modal version: GLM-4V-9B-Chat (8K)

GLM-4-9B’s powerful abilities

Basic abilities

GLM-4- Based on strong pre-training, 9B’s comprehensive ability in Chinese and English has improved by 40% compared to ChatGLM3-6B. In particular, significant improvements have been achieved in the Chinese alignment capability AlignBench, the instruction compliance capability IFeval, and the engineering code processing capability Natural Code Bench. Even when comparing the Llama 3 8B model with more training volume, GLM-4-9B is not inferior at all and leads in English performance. In the field of Chinese subjects, GLM-4-9B has improved by up to 50% [Performance Evaluation chart].

Long text processing capability

清华大学与智谱AI重磅开源 GLM-4:掀起自然语言处理新革命Pictures

The context length of the GLM-4-9B+ model is expanded from 128K At 1M tokens, it means that it can process input of up to 2 million words at the same time, which is equivalent to the length of two "Dream of Red Mansions" or 125 academic papers. The GLM-4-9B-Chat-1M model successfully demonstrated its excellent ability to non-destructively process long text input in the "needle in the haystack" experiment [illustration of long text experiment].

The following are two demo video cases showing long text processing capabilities:

  1. GLM-4-9B-Chat model: Input 5 PDF files with a total length of about 128K, and give a prompt to write a detailed research report on the development of large models in China. The model can quickly generate high-quality research reports (the video is not accelerated).
  2. GLM-4-9B-Chat-1M Model: Input about 900,000 words of the complete collection of "The Three-Body Problem" and ask the model to write a sequel outline for the novel. The model is reasonably planned and provides a continuation framework (video accelerated 10 times).

Multi-language support

GLM-4-9B+ supports up to 26 languages, including Chinese, English, Russian, etc. We expanded the tokenizer vocabulary size from 65K to 150K, improving coding efficiency by 30%. In multi-language understanding and generation tasks, GLM-4-9B-Chat outperforms Llama-3-8B-Instruct [Multi-language performance comparison chart].

Function Call Capability

The function calling capability of GLM-4-9B has been improved by 40% compared to the previous generation. On the Berkeley Function-Calling Leaderboard, its Function Call The capabilities are comparable to GPT-4 [Function call performance comparison chart].

All Tools full tool call

The "All Tools" capability means that the model can understand and use various external tools (such as code execution, network browsing, and drawing) etc.) to assist in completing the task. At the Zhipu DevDay on January 16, the GLM-4 model was fully upgraded with All Tools capabilities, which can intelligently call web browsers, code interpreters, CogView and other tools to complete complex requests [All Tools task icon].

Multimodal processing

GLM-4V-9B As an open source multimodal model of the GLM-4 base, capable of processing high-resolution input, By directly mixing visual and text data for training, it demonstrates significant multi-modal processing effects and is comparable to GPT-4V in performance. It performs very well when identifying and processing complex multi-modal tasks [Multi-modal application example diagram].

清华大学与智谱AI重磅开源 GLM-4:掀起自然语言处理新革命Picture

Future Outlook

GLM-4-9B has demonstrated its powerful performance in a variety of tasks, It is a major breakthrough in the field of natural language processing. Whether it is academic research or industrial applications, the GLM-4-9B will be your best choice.

We sincerely invite you to join the ranks of GLM-4 users and explore the possibilities brought by this excellent model:

  • GitHub repository
  • Hugging Face model page
  • 魔达Community

The above is the detailed content of Tsinghua University and Zhipu AI open source GLM-4: launching a new revolution in natural language processing. For more information, please follow other related articles on the PHP Chinese website!

Statement
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn
The AI Skills Gap Is Slowing Down Supply ChainsThe AI Skills Gap Is Slowing Down Supply ChainsApr 26, 2025 am 11:13 AM

The term "AI-ready workforce" is frequently used, but what does it truly mean in the supply chain industry? According to Abe Eshkenazi, CEO of the Association for Supply Chain Management (ASCM), it signifies professionals capable of critic

How One Company Is Quietly Working To Transform AI ForeverHow One Company Is Quietly Working To Transform AI ForeverApr 26, 2025 am 11:12 AM

The decentralized AI revolution is quietly gaining momentum. This Friday in Austin, Texas, the Bittensor Endgame Summit marks a pivotal moment, transitioning decentralized AI (DeAI) from theory to practical application. Unlike the glitzy commercial

Nvidia Releases NeMo Microservices To Streamline AI Agent DevelopmentNvidia Releases NeMo Microservices To Streamline AI Agent DevelopmentApr 26, 2025 am 11:11 AM

Enterprise AI faces data integration challenges The application of enterprise AI faces a major challenge: building systems that can maintain accuracy and practicality by continuously learning business data. NeMo microservices solve this problem by creating what Nvidia describes as "data flywheel", allowing AI systems to remain relevant through continuous exposure to enterprise information and user interaction. This newly launched toolkit contains five key microservices: NeMo Customizer handles fine-tuning of large language models with higher training throughput. NeMo Evaluator provides simplified evaluation of AI models for custom benchmarks. NeMo Guardrails implements security controls to maintain compliance and appropriateness

AI Paints A New Picture For The Future Of Art And DesignAI Paints A New Picture For The Future Of Art And DesignApr 26, 2025 am 11:10 AM

AI: The Future of Art and Design Artificial intelligence (AI) is changing the field of art and design in unprecedented ways, and its impact is no longer limited to amateurs, but more profoundly affecting professionals. Artwork and design schemes generated by AI are rapidly replacing traditional material images and designers in many transactional design activities such as advertising, social media image generation and web design. However, professional artists and designers also find the practical value of AI. They use AI as an auxiliary tool to explore new aesthetic possibilities, blend different styles, and create novel visual effects. AI helps artists and designers automate repetitive tasks, propose different design elements and provide creative input. AI supports style transfer, which is to apply a style of image

How Zoom Is Revolutionizing Work With Agentic AI: From Meetings To MilestonesHow Zoom Is Revolutionizing Work With Agentic AI: From Meetings To MilestonesApr 26, 2025 am 11:09 AM

Zoom, initially known for its video conferencing platform, is leading a workplace revolution with its innovative use of agentic AI. A recent conversation with Zoom's CTO, XD Huang, revealed the company's ambitious vision. Defining Agentic AI Huang d

The Existential Threat To UniversitiesThe Existential Threat To UniversitiesApr 26, 2025 am 11:08 AM

Will AI revolutionize education? This question is prompting serious reflection among educators and stakeholders. The integration of AI into education presents both opportunities and challenges. As Matthew Lynch of The Tech Edvocate notes, universit

The Prototype: American Scientists Are Looking For Jobs AbroadThe Prototype: American Scientists Are Looking For Jobs AbroadApr 26, 2025 am 11:07 AM

The development of scientific research and technology in the United States may face challenges, perhaps due to budget cuts. According to Nature, the number of American scientists applying for overseas jobs increased by 32% from January to March 2025 compared with the same period in 2024. A previous poll showed that 75% of the researchers surveyed were considering searching for jobs in Europe and Canada. Hundreds of NIH and NSF grants have been terminated in the past few months, with NIH’s new grants down by about $2.3 billion this year, a drop of nearly one-third. The leaked budget proposal shows that the Trump administration is considering sharply cutting budgets for scientific institutions, with a possible reduction of up to 50%. The turmoil in the field of basic research has also affected one of the major advantages of the United States: attracting overseas talents. 35

All About Open AI's Latest GPT 4.1 Family - Analytics VidhyaAll About Open AI's Latest GPT 4.1 Family - Analytics VidhyaApr 26, 2025 am 10:19 AM

OpenAI unveils the powerful GPT-4.1 series: a family of three advanced language models designed for real-world applications. This significant leap forward offers faster response times, enhanced comprehension, and drastically reduced costs compared t

See all articles

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

Video Face Swap

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Tools

PhpStorm Mac version

PhpStorm Mac version

The latest (2018.2.1) professional PHP integrated development tool

mPDF

mPDF

mPDF is a PHP library that can generate PDF files from UTF-8 encoded HTML. The original author, Ian Back, wrote mPDF to output PDF files "on the fly" from his website and handle different languages. It is slower than original scripts like HTML2FPDF and produces larger files when using Unicode fonts, but supports CSS styles etc. and has a lot of enhancements. Supports almost all languages, including RTL (Arabic and Hebrew) and CJK (Chinese, Japanese and Korean). Supports nested block-level elements (such as P, DIV),

MinGW - Minimalist GNU for Windows

MinGW - Minimalist GNU for Windows

This project is in the process of being migrated to osdn.net/projects/mingw, you can continue to follow us there. MinGW: A native Windows port of the GNU Compiler Collection (GCC), freely distributable import libraries and header files for building native Windows applications; includes extensions to the MSVC runtime to support C99 functionality. All MinGW software can run on 64-bit Windows platforms.

MantisBT

MantisBT

Mantis is an easy-to-deploy web-based defect tracking tool designed to aid in product defect tracking. It requires PHP, MySQL and a web server. Check out our demo and hosting services.

EditPlus Chinese cracked version

EditPlus Chinese cracked version

Small size, syntax highlighting, does not support code prompt function