


IT Home According to news on July 11, Baichuan Intelligence, a subsidiary of Wang Xiaochuan, today released the Baichuan-13B large model, which is known as "13 billion parameters open source and commercially available".
▲ Picture source Baichuang-13B GitHub page
According to the official introduction, Baichuan-13B is an open source commercially available large-scale language model containing 13 billion parameters developed by Baichuan Intelligent after Baichuan-7B. It has achieved the best results among models of the same size on both Chinese and English Benchmarks. . This release includes two versions: pre-training (Baichuan-13B-Base) and alignment (Baichuan-13B-Chat).
▲ Picture source Baichuang-13B GitHub page
Officially claimed that Baichuan-13B has the following characteristics:
- Larger size, more data: Baichuan-13B further expands the number of parameters to 13 billion based on Baichuan-7B, and trains 1.4 trillion tokens on high-quality corpus, exceeding LLaMA-13B by 40%, which is Currently the open source model with the largest amount of training data in 13B size. Supports Chinese and English bilingual, uses ALiBi position encoding, and the context window length is 4096.
- Open source pre-training and alignment models at the same time: The pre-training model is a "base" for developers, while the majority of ordinary users have stronger needs for alignment models with dialogue functions. Therefore, the project also has an alignment model (Baichuan-13B-Chat), which has strong conversational capabilities. It can be used out of the box and can be easily deployed with a few lines of code.
- More efficient reasoning: In order to support the use of a wider range of users, the project has also open sourced the quantized versions of int8 and int4. Compared with the non-quantified version, it greatly reduces the deployment machine resource threshold with almost no effect loss, and can Deployed on consumer-grade graphics cards such as NVIDIA RTX3090.
- Open source, free for commercial use: Baichuan-13B is not only fully open to academic research, but developers can also use it for free after applying by email and obtaining an official commercial license.
Currently, the model has been released on HuggingFace, GitHub, and Model Scope. Interested IT House friends can go and learn more.
The above is the detailed content of Baichuan Intelligent released Baichuan-13B AI model, claiming that '13 billion parameters are open source and can be used commercially'. For more information, please follow other related articles on the PHP Chinese website!

Vibe coding is reshaping the world of software development by letting us create applications using natural language instead of endless lines of code. Inspired by visionaries like Andrej Karpathy, this innovative approach lets dev

Revolutionizing App Development: A Deep Dive into Replit Agent Tired of wrestling with complex development environments and obscure configuration files? Replit Agent aims to simplify the process of transforming ideas into functional apps. This AI-p

February 2025 has been yet another game-changing month for generative AI, bringing us some of the most anticipated model upgrades and groundbreaking new features. From xAI’s Grok 3 and Anthropic’s Claude 3.7 Sonnet, to OpenAI’s G

YOLO (You Only Look Once) has been a leading real-time object detection framework, with each iteration improving upon the previous versions. The latest version YOLO v12 introduces advancements that significantly enhance accuracy

DALL-E 3: A Generative AI Image Creation Tool Generative AI is revolutionizing content creation, and DALL-E 3, OpenAI's latest image generation model, is at the forefront. Released in October 2023, it builds upon its predecessors, DALL-E and DALL-E 2

The $500 billion Stargate AI project, backed by tech giants like OpenAI, SoftBank, Oracle, and Nvidia, and supported by the U.S. government, aims to solidify American AI leadership. This ambitious undertaking promises a future shaped by AI advanceme

Grok 3 – Elon Musk and xAi’s latest AI model is the talk of the town these days. From Andrej Karpathy to tech influencers, everyone is talking about the capabilities of this new model. Initially, access was limited to

Google DeepMind's GenCast: A Revolutionary AI for Weather Forecasting Weather forecasting has undergone a dramatic transformation, moving from rudimentary observations to sophisticated AI-powered predictions. Google DeepMind's GenCast, a groundbreak


Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

AI Hentai Generator
Generate AI Hentai for free.

Hot Article

Hot Tools

Dreamweaver Mac version
Visual web development tools

MantisBT
Mantis is an easy-to-deploy web-based defect tracking tool designed to aid in product defect tracking. It requires PHP, MySQL and a web server. Check out our demo and hosting services.

Notepad++7.3.1
Easy-to-use and free code editor

SAP NetWeaver Server Adapter for Eclipse
Integrate Eclipse with SAP NetWeaver application server.

SublimeText3 Mac version
God-level code editing software (SublimeText3)
