search
HomeTechnology peripheralsAI16 billion parameters, multiple new capabilities, Fudan MOSS is open source

In February this year, Machine Heart reported the news that Fudan University launched the Chinese version of ChatGPT (see "Fudan releases the Chinese version of ChatGPT: MOSS starts testing and hits hot searches, servers are crowded"), which attracted widespread attention . At that time, Professor Qiu Xipeng said that Moss would be open sourced in April.

Yesterday, the open source version of Moss really came.

16 billion parameters, multiple new capabilities, Fudan MOSS is open source

## Project address: https://github.com/OpenLMLab/MOSS

##MOSS is an open source conversation language model that supports Chinese and English bilingualism and multiple plug-ins, but the number of parameters is much smaller than ChatGPT. After v0.0.2, the team continued to adjust it and launched MOSS v0.0.3, which is the current open source version. Compared with earlier versions, the functions have also been updated in many ways.

In the initial test, the basic functions of MOSS are similar to ChatGPT. It can complete various natural language processing tasks according to the instructions entered by the user, including text generation, text summarization, translation, and code generation. , small talk, etc.

After the open beta, the team continued to increase the pre-training of Chinese corpus: "So far, the base language model of MOSS 003 has been trained on 100B Chinese tokens. The total training The number of tokens reached 700B, which also contained about 300B of code."

After the open beta, we also collected some user data. We found that user intentions in the real Chinese world are as disclosed in the OpenAI InstructGPT paper The user prompt distribution of seed regenerated approximately 1.1 million regular conversation data, covering more fine-grained helpfulness data and broader harmlessness data.

Content source: https://www.zhihu.com/question/596908242/answer/2994534005

Currently, the team Three models, moss-moon-003-base, moss-moon-003-sft, and moss-moon-003-sft-plugin, have been uploaded to HuggingFace. In the future, three more models will be open source.

16 billion parameters, multiple new capabilities, Fudan MOSS is open source

According to the project homepage, the moss-moon series model has 16 billion parameters and can be used in a single A100/A800 or two sheets at FP16 accuracy. 3090 graphics card running, can run on a single 3090 graphics card at INT4/8 precision.

The team also stated that due to the small number of model parameters and the autoregressive generation paradigm, MOSS may still generate misleading replies containing factual errors or harmful content containing bias/discrimination, please Be careful to identify and use content generated by MOSS, and do not spread harmful content generated by MOSS to the Internet.

New capabilities

In MOSS v0.0.3, the team added a number of new capabilities.

The team constructed approximately 300,000 plug-in-enhanced conversation data, including search engines, Vincent diagrams, calculators, equation solvers, etc. Regarding how to use the plug-in version of MOSS, the subsequent team will announce it on GitHub.

16 billion parameters, multiple new capabilities, Fudan MOSS is open source

MOSS v0.0.3 now introduces the ability to use multiple plugins.

The following figure shows the ability to call the search engine:

16 billion parameters, multiple new capabilities, Fudan MOSS is open source

The following figure shows The ability to call the equation solver:

16 billion parameters, multiple new capabilities, Fudan MOSS is open source

The following image demonstrates the ability to generate images from text:

16 billion parameters, multiple new capabilities, Fudan MOSS is open source

Project author Sun Tianxiang added that MOSS 003’s ability to support plug-in activation is controlled through meta instruction, similar to the system prompt in gpt-3.5-turbo. "Because it is controlled by the model, it cannot guarantee 100% control rate, and there are still some defects such as inaccurate calling of multi-selected plug-ins and plug-ins fighting with each other. We are developing new models as soon as possible to alleviate these problems."

Download and install

Download the contents of this warehouse to local/remote server:

git clone https://github.com/OpenLMLab/MOSS.gitcd MOSS

Create conda environment:

conda create --name moss pythnotallow=3.8
conda activate moss

Installation dependencies:

pip install -r requirements.txt

The torch and transformers versions are not recommended to be lower than the recommended versions.

According to the agreement, open source MOSS can be used for commercial purposes:

16 billion parameters, multiple new capabilities, Fudan MOSS is open source

In addition, developers can To call MOSS services through API, the team will consider providing services through API interface according to the current service pressure. For the interface format, please refer to: https://github.com/OpenLMLab/MOSS/blob/main/moss_api.pdf

Currently, there are developers creating based on open source content, such as video Q&A through VideoChat.

VideoChat is a multi-functional video question and answer tool that combines the functions of motion recognition, visual subtitles and StableLM. The tool generates dense, descriptive subtitles for any object and action in a video, offering a range of language styles to suit different user preferences. It supports users to have conversations of varying lengths, moods, and language authenticity.

16 billion parameters, multiple new capabilities, Fudan MOSS is open source

## Project address: https://github.com/OpenGVLab/Ask-Anything/tree/main/video_chat_with_MOSS

The above is the detailed content of 16 billion parameters, multiple new capabilities, Fudan MOSS is open source. For more information, please follow other related articles on the PHP Chinese website!

Statement
This article is reproduced at:51CTO.COM. If there is any infringement, please contact admin@php.cn delete
How to Build Your Personal AI Assistant with Huggingface SmolLMHow to Build Your Personal AI Assistant with Huggingface SmolLMApr 18, 2025 am 11:52 AM

Harness the Power of On-Device AI: Building a Personal Chatbot CLI In the recent past, the concept of a personal AI assistant seemed like science fiction. Imagine Alex, a tech enthusiast, dreaming of a smart, local AI companion—one that doesn't rely

AI For Mental Health Gets Attentively Analyzed Via Exciting New Initiative At Stanford UniversityAI For Mental Health Gets Attentively Analyzed Via Exciting New Initiative At Stanford UniversityApr 18, 2025 am 11:49 AM

Their inaugural launch of AI4MH took place on April 15, 2025, and luminary Dr. Tom Insel, M.D., famed psychiatrist and neuroscientist, served as the kick-off speaker. Dr. Insel is renowned for his outstanding work in mental health research and techno

The 2025 WNBA Draft Class Enters A League Growing And Fighting Online HarassmentThe 2025 WNBA Draft Class Enters A League Growing And Fighting Online HarassmentApr 18, 2025 am 11:44 AM

"We want to ensure that the WNBA remains a space where everyone, players, fans and corporate partners, feel safe, valued and empowered," Engelbert stated, addressing what has become one of women's sports' most damaging challenges. The anno

Comprehensive Guide to Python Built-in Data Structures - Analytics VidhyaComprehensive Guide to Python Built-in Data Structures - Analytics VidhyaApr 18, 2025 am 11:43 AM

Introduction Python excels as a programming language, particularly in data science and generative AI. Efficient data manipulation (storage, management, and access) is crucial when dealing with large datasets. We've previously covered numbers and st

First Impressions From OpenAI's New Models Compared To AlternativesFirst Impressions From OpenAI's New Models Compared To AlternativesApr 18, 2025 am 11:41 AM

Before diving in, an important caveat: AI performance is non-deterministic and highly use-case specific. In simpler terms, Your Mileage May Vary. Don't take this (or any other) article as the final word—instead, test these models on your own scenario

AI Portfolio | How to Build a Portfolio for an AI Career?AI Portfolio | How to Build a Portfolio for an AI Career?Apr 18, 2025 am 11:40 AM

Building a Standout AI/ML Portfolio: A Guide for Beginners and Professionals Creating a compelling portfolio is crucial for securing roles in artificial intelligence (AI) and machine learning (ML). This guide provides advice for building a portfolio

What Agentic AI Could Mean For Security OperationsWhat Agentic AI Could Mean For Security OperationsApr 18, 2025 am 11:36 AM

The result? Burnout, inefficiency, and a widening gap between detection and action. None of this should come as a shock to anyone who works in cybersecurity. The promise of agentic AI has emerged as a potential turning point, though. This new class

Google Versus OpenAI: The AI Fight For StudentsGoogle Versus OpenAI: The AI Fight For StudentsApr 18, 2025 am 11:31 AM

Immediate Impact versus Long-Term Partnership? Two weeks ago OpenAI stepped forward with a powerful short-term offer, granting U.S. and Canadian college students free access to ChatGPT Plus through the end of May 2025. This tool includes GPT‑4o, an a

See all articles

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

AI Hentai Generator

AI Hentai Generator

Generate AI Hentai for free.

Hot Article

R.E.P.O. Energy Crystals Explained and What They Do (Yellow Crystal)
1 months agoBy尊渡假赌尊渡假赌尊渡假赌
R.E.P.O. Best Graphic Settings
1 months agoBy尊渡假赌尊渡假赌尊渡假赌
Will R.E.P.O. Have Crossplay?
1 months agoBy尊渡假赌尊渡假赌尊渡假赌

Hot Tools

Atom editor mac version download

Atom editor mac version download

The most popular open source editor

MantisBT

MantisBT

Mantis is an easy-to-deploy web-based defect tracking tool designed to aid in product defect tracking. It requires PHP, MySQL and a web server. Check out our demo and hosting services.

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use