Nvidia, Hugging Face and ServiceNow release new StarCoder2 LLM for code generation
The models, currently available in three sizes, were trained on more than 600 programming languages, including low-resource languages, to help enterprises in their development workflows. They were developed under the open BigCode project, a joint initiative of ServiceNow and Hugging Face to ensure the responsible development and use of large code language models, and are provided royalty-free under the Open RAIL-M license.
The launch of StarCoder2 demonstrates the power of open scientific collaboration combined with responsible AI practices and an ethical data supply chain. Harm de Vries, head of the StarCoder2 development team at ServiceNow and co-lead of BigCode, said in a statement that the new open-access models not only improve on previous GenAI performance but also boost developer productivity, making the benefits of code-generation AI accessible to businesses of any size so they can realize their full potential.
BigCode’s latest offering is more than an upgrade to the original StarCoder LLM: it introduces three model sizes, 3B, 7B and 15B, and expands the supported programming languages to 619. The training dataset for the models, known as The Stack, has grown to nearly seven times the size of the previous version. This continued evolution gives developers more powerful and comprehensive tools for a wide variety of programming tasks, and the sustained investment behind BigCode has made it a platform developers trust, opening up new possibilities for the industry as a whole.
The BigCode community used the latest generation of training techniques to ensure the models can understand and generate content in low-resource programming languages such as COBOL, as well as mathematics and program source code discussions. This is critical to helping users work across diverse programming languages and code conversations.
The 3-billion-parameter model was trained with ServiceNow’s Fast-LLM framework, while the 7B model was developed with Hugging Face’s nanotron framework. Both are designed to deliver high performance for text-to-code and text-to-workflow generation while requiring fewer computing resources.
Meanwhile, the largest, 15-billion-parameter model was trained and optimized with the end-to-end, cloud-native NVIDIA NeMo framework and NVIDIA TensorRT-LLM software.
While it remains to be seen how these models perform across different coding scenarios, the companies note that the smallest 3B model alone performs on par with the original 15B StarCoder LLM.
Depending on their needs, enterprise teams can use any of these models and further fine-tune them on enterprise data for different use cases. This can be any specialized task, from application source code generation, workflow generation and text summarization to code completion, advanced code summarization and code snippet retrieval.
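The original StarCoder family handles code completion through fill-in-the-middle (FIM) prompting, where sentinel tokens mark the code before and after the gap to be filled. Assuming StarCoder2 keeps this convention (an assumption, not confirmed by the article), the prompt construction can be sketched as:

```python
# Fill-in-the-middle (FIM) prompt construction, a convention from the original
# StarCoder family; whether StarCoder2 uses the same sentinel tokens is an
# assumption here.
FIM_PREFIX = "<fim_prefix>"
FIM_SUFFIX = "<fim_suffix>"
FIM_MIDDLE = "<fim_middle>"

def build_fim_prompt(prefix: str, suffix: str) -> str:
    """Ask the model to generate the code that belongs between prefix and suffix."""
    return f"{FIM_PREFIX}{prefix}{FIM_SUFFIX}{suffix}{FIM_MIDDLE}"

# The model is expected to continue generation after <fim_middle> with the
# missing body of the function.
prompt = build_fim_prompt("def add(a, b):\n    return ", "\n")
```

This keeps the surrounding file context in the prompt, which is what lets a completion model produce edits that fit the repository rather than isolated snippets.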
The companies emphasized that the models were trained more extensively and deeply to deliver more context-aware, accurate predictions, including a better understanding of the surrounding repository. Ultimately, these efforts accelerate development, letting engineers and developers focus their energy on more critical tasks.
Jonathan Cohen, vice president of applied research at Nvidia, said in a press statement: "Because every software ecosystem has a proprietary programming language, Code LLM can drive breakthroughs in efficiency and innovation in every industry."
“NVIDIA’s collaboration with ServiceNow and Hugging Face introduces a safe, responsibly developed model and supports broader access to responsible GenAI, which we hope will benefit communities around the world,” he added.
As mentioned before, all models in the StarCoder2 series are provided under the Open RAIL-M license and can be accessed and used royalty-free. Supporting code can be found in the BigCode project’s GitHub repository. Alternatively, teams can download and use all three models from Hugging Face.
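As a hedged sketch of the Hugging Face download path, the snippet below loads a checkpoint with the `transformers` library and samples a completion. The repo id `bigcode/starcoder2-3b` follows BigCode’s naming for the original StarCoder and is an assumption here; substitute whatever id the BigCode organization actually publishes.

```python
# Sketch: loading a StarCoder2 checkpoint from the Hugging Face Hub and
# generating a completion. Repo id is an assumption based on BigCode's naming.
def complete(prompt: str,
             model_id: str = "bigcode/starcoder2-3b",
             max_new_tokens: int = 64) -> str:
    # Lazy import: the weights are large, so nothing is downloaded until called.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)
    inputs = tokenizer(prompt, return_tensors="pt")
    output = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output[0], skip_special_tokens=True)

# Example (triggers a multi-gigabyte download on first use):
# print(complete("def quicksort(arr):"))
```

Fine-tuning on enterprise data would follow the same loading path before applying a standard training loop or an adapter method.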
In addition, the NVIDIA-trained 15B model will also appear on NVIDIA AI Foundation, allowing developers to experiment with it directly from their browser or through API endpoints.
While StarCoder is not the first entrant in the field of AI-driven code generation, the wide range of options in the project’s latest generation will certainly let enterprises leverage LLMs in application development while also saving on compute.
Other notable players in the space include OpenAI, whose Codex model powers the GitHub Copilot service, and Amazon, which offers the CodeWhisperer tool. There is also stiff competition from Replit, which has several small AI coding models on Hugging Face, and Codeium, which recently raised $65 million in Series B funding at a $500 million valuation.
The above is the detailed content of Nvidia, Hugging Face and ServiceNow release new StarCoder2 LLM for code generation. For more information, please follow other related articles on the PHP Chinese website!