Stanford's 'Grass Mud Horse' is popular: $100 can match GPT-3.5! The kind that can run on mobile phones-AI-php.cn

Home

Technology peripherals

Stanford's 'Grass Mud Horse' is popular: $100 can match GPT-3.5! The kind that can run on mobile phones

WBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWB

Apr 11, 2023 pm 07:24 PM

aicodepainting

Overnight, another big news broke out in the large model world!

Stanford released Alpaca (alpaca, the "grass mud horse" in the mouth of netizens):

For only $100, everyone can fine-tune Meta's 7 billion parameter LLaMA large model, and the effect is amazing Comparable to GPT-3.5 (text-davinci-003) with 175 billion parameters.

And it’s the kind that can run with a single card, and even Raspberry Pi and mobile phones can hold it!

Stanfords Grass Mud Horse is popular: $100 can match GPT-3.5! The kind that can run on mobile phones

There is also an even more amazing "saucy operation".

The data set involved in the study was generated by the Stanford team using OpenAI’s API for less than $500.

So the whole process is equivalent to GPT-3.5 teaching a comparable opponent AI.

Then the team also said that the cost of using most cloud computing platforms to fine-tune the trained model is less than 100 US dollars:

Copying an AI with GPT-3.5 effect is very cheap and very Easy and still small.

Moreover, the team has also made the data set (saving $500 per second) and code all open source. Now everyone can fine-tune a conversational AI with explosive effects:

Stanfords Grass Mud Horse is popular: $100 can match GPT-3.5! The kind that can run on mobile phones

It has only been half a day since the project was released on GitHub, and it has already received 1,800 stars, which shows how popular it is.

Django co-developers even described Stanford's new research as "a shocking event":

Stanfords Grass Mud Horse is popular: $100 can match GPT-3.5! The kind that can run on mobile phones

Not only that, the Stanford team also made a demo, The kind that can be played online.

Without further ado, let’s take a look at the effect of this “Grass Mud Horse”.

Aplaca, a grass-mud horse comparable to davinci-003

In the official Stanford demonstration, they first asked a question:

What is an alpaca? What is the difference between it and a llama?

Stanfords Grass Mud Horse is popular: $100 can match GPT-3.5! The kind that can run on mobile phones

The answer given by Grass Mud Horse Aplaca is more concise:

Alpaca is a small camelid animal, native to Peru, Bolivia, Ecuador and Chile; it is smaller than the llama, has finer wool, and does not have a hump.

Then he briefly introduced the differences between the two living in groups.

If the same question is given to ChatGPT (GPT3.5-turbo), the answer will not be as concise as Aplaca:

Stanfords Grass Mud Horse is popular: $100 can match GPT-3.5! The kind that can run on mobile phones

In this regard , the explanation given by the team is:

Alpaca’s answers are usually shorter than ChatGPT, reflecting the shorter output of text-davinci-003.

Then the team demonstrated letting Alpaca write an email:

Write an e-mail to congratulate the freshmen admitted to Stanford University and mention that you are happy to meet them in person.

Grass Mud Horse Alpaca was also very familiar with this task, and directly gave a decent email template:

Stanfords Grass Mud Horse is popular: $100 can match GPT-3.5! The kind that can run on mobile phones

##The difficulty level increased again, and the team proposed this time To satisfy the need for Alpaca to write a paper abstract:

Write a well-thought-out abstract of a machine learning paper, proving that 42 is the optimal seed for training neural networks.

The answer given by Cao Ni Ma Alpaca is very consistent with the abstract form of most papers in terms of content: what question is it trying to answer, what method is used, what are the results, and future prospects.

Stanfords Grass Mud Horse is popular: $100 can match GPT-3.5! The kind that can run on mobile phones

Of course, there are also netizens who can’t wait to test it out in person, and find that writing code is easy for Alpaca.

Stanfords Grass Mud Horse is popular: $100 can match GPT-3.5! The kind that can run on mobile phones

But even if Alpaca can hold most of the problems, it does not mean that it is without flaws.

For example, the team demonstrated an example. When answering the question "What is the capital of Tanzania?", the answer given by Alpaca was "Dar es Salaam".

But it was actually replaced by "Dodoma" as early as 1975.

Stanfords Grass Mud Horse is popular: $100 can match GPT-3.5! The kind that can run on mobile phones

In addition, if you have personally experienced Alpaca, you will find that it is... hugely slow:

Stanfords Grass Mud Horse is popular: $100 can match GPT-3.5! The kind that can run on mobile phones

In this regard, some netizens believe that there may be too many people using it.

Stanfords Grass Mud Horse is popular: $100 can match GPT-3.5! The kind that can run on mobile phones

Notebooks, mobile phones, and Raspberry Pi can all run

Meta’s large open source LLaMA model has been arranged and understood by everyone just a few weeks after its release. The card will run.

So in theory, Alpaca based on LLaMA fine-tuning can also be easily deployed locally.

It doesn’t matter if you don’t have a graphics card. You can play it on Apple laptops, even Raspberry Pi and mobile phones.

The method of deploying LLaMA on Apple notebooks comes from the GitHub project llama.cpp, which uses pure C/C for reasoning and is specially optimized for ARM chips.

Stanfords Grass Mud Horse is popular: $100 can match GPT-3.5! The kind that can run on mobile phones

The author has actually measured that it can run on MacBook Pro with M1 chip, and it also supports Windows and Linux systems.

Stanfords Grass Mud Horse is popular: $100 can match GPT-3.5! The kind that can run on mobile phones

Still this C ported version, someone successfully ran the 7 billion parameter version of LLaMA on a Raspberry Pi 4 with 4GB of memory.

Although the speed is very slow, it takes about 10 seconds to generate a token (that is, 4.5 words pop up in one minute).

Stanfords Grass Mud Horse is popular: $100 can match GPT-3.5! The kind that can run on mobile phones

What’s even more outrageous is that just 2 days later, someone quantified and compressed the LLaMA model (converting the weights into a lower-precision data format) and successfully ran it on the Pixel 6 Android phone. (One token in 26 seconds).

Pixel 6 uses Google’s self-developed processor Google Tensor, and its running scores are between Snapdragon 865 and 888, which means that newer mobile phones can theoretically be competent.

Stanfords Grass Mud Horse is popular: $100 can match GPT-3.5! The kind that can run on mobile phones

The fine-tuning data set is also open source

The Stanford team's method of fine-tuning LLaMA comes from Self-Instruct proposed by Yizhong Wang and others at the University of Washington at the end of last year.

Stanfords Grass Mud Horse is popular: $100 can match GPT-3.5! The kind that can run on mobile phones

Use 175 questions as seed tasks, let AI combine new questions and generate matching answer examples, manually filter out low-quality ones, and then add new tasks Go to the task pool.

For all these tasks, the InstructGPT method can be used later to let the AI learn how to follow human instructions.

After a few laps of the matryoshka doll, it is equivalent to letting the AI guide itself.

Stanfords Grass Mud Horse is popular: $100 can match GPT-3.5! The kind that can run on mobile phones

The Stanford version of Alpaca was created using the OpenAI API to generate 52,000 such examples for less than $500.

These data are also open sourced and are more diverse than the data in the original paper.

Stanfords Grass Mud Horse is popular: $100 can match GPT-3.5! The kind that can run on mobile phones

At the same time, the code for generating these data is also given, which means that if someone still thinks it is not enough, they can expand and fine-tune the data themselves to continue to improve the performance of the model.

Stanfords Grass Mud Horse is popular: $100 can match GPT-3.5! The kind that can run on mobile phones

The fine-tuning code will also be released after HuggingFace officially supports LLaMA.

Stanfords Grass Mud Horse is popular: $100 can match GPT-3.5! The kind that can run on mobile phones

However, Alpaca’s final model weights require a Meta license to be released, and it inherits LLaMA’s non-commercial open source agreement, prohibiting any commercial use.

And because the fine-tuning data uses OpenAI’s API, it is also prohibited from using it to develop models that compete with OpenAI according to the terms of use.

One More Thing

Do you still remember the development history of AI painting?

In the first half of 2022, the topic was still hot. The open source of Stable Diffusion in August brought the cost down to usable level, and resulted in explosive tool innovation, allowing AI painting to truly enter various workflows.

The cost of language models has now dropped to the level that personal electronic devices are available.

Finally, Simon Willison, the founder of the Django framework, shouted:

The time for Stable Diffusion of large language models has arrived.

Stanfords Grass Mud Horse is popular: $100 can match GPT-3.5! The kind that can run on mobile phones

The above is the detailed content of Stanford's 'Grass Mud Horse' is popular: $100 can match GPT-3.5! The kind that can run on mobile phones. For more information, please follow other related articles on the PHP Chinese website!

Statement

This article is reproduced at:51CTO.COM. If there is any infringement, please contact admin@php.cn delete

How to Run LLM Locally Using LM Studio? - Analytics VidhyaApr 19, 2025 am 11:38 AM

Running large language models at home with ease: LM Studio User Guide In recent years, advances in software and hardware have made it possible to run large language models (LLMs) on personal computers. LM Studio is an excellent tool to make this process easy and convenient. This article will dive into how to run LLM locally using LM Studio, covering key steps, potential challenges, and the benefits of having LLM locally. Whether you are a tech enthusiast or are curious about the latest AI technologies, this guide will provide valuable insights and practical tips. Let's get started! Overview Understand the basic requirements for running LLM locally. Set up LM Studi on your computer

Guy Peri Helps Flavor McCormick's Future Through Data TransformationApr 19, 2025 am 11:35 AM

Guy Peri is McCormick’s Chief Information and Digital Officer. Though only seven months into his role, Peri is rapidly advancing a comprehensive transformation of the company’s digital capabilities. His career-long focus on data and analytics informs

What is the Chain of Emotion in Prompt Engineering? - Analytics VidhyaApr 19, 2025 am 11:33 AM

Introduction Artificial intelligence (AI) is evolving to understand not just words, but also emotions, responding with a human touch. This sophisticated interaction is crucial in the rapidly advancing field of AI and natural language processing. Th

12 Best AI Tools for Data Science Workflow - Analytics VidhyaApr 19, 2025 am 11:31 AM

Introduction In today's data-centric world, leveraging advanced AI technologies is crucial for businesses seeking a competitive edge and enhanced efficiency. A range of powerful tools empowers data scientists, analysts, and developers to build, depl

AV Byte: OpenAI's GPT-4o Mini and Other AI InnovationsApr 19, 2025 am 11:30 AM

This week's AI landscape exploded with groundbreaking releases from industry giants like OpenAI, Mistral AI, NVIDIA, DeepSeek, and Hugging Face. These new models promise increased power, affordability, and accessibility, fueled by advancements in tr

Perplexity's Android App Is Infested With Security Flaws, Report FindsApr 19, 2025 am 11:24 AM

But the company’s Android app, which offers not only search capabilities but also acts as an AI assistant, is riddled with a host of security issues that could expose its users to data theft, account takeovers and impersonation attacks from malicious

Everyone's Getting Better At Using AI: Thoughts On Vibe CodingApr 19, 2025 am 11:17 AM

You can look at what’s happening in conferences and at trade shows. You can ask engineers what they’re doing, or consult with a CEO. Everywhere you look, things are changing at breakneck speed. Engineers, and Non-Engineers What’s the difference be

Rocket Launch Simulation and Analysis using RocketPy - Analytics VidhyaApr 19, 2025 am 11:12 AM

Simulate Rocket Launches with RocketPy: A Comprehensive Guide This article guides you through simulating high-power rocket launches using RocketPy, a powerful Python library. We'll cover everything from defining rocket components to analyzing simula

See all articles