search
HomeTechnology peripheralsAISam Altman talks about OpenAI: Facing GPU shortage panic, GPT-3 may be open source

Since the advent of ChatGPT, large models and AI technology have attracted widespread attention around the world. On the one hand, people marvel at the emerging capabilities of large models; on the other hand, they are concerned about the controllability and future development of artificial intelligence. This year, many experts in the AI ​​field, including Turing Award winners Geoffrey Hinton and Yoshua Bengio, have jointly warned that large AI models will cause a series of risks, and some have even called for a halt to the development of subsequent large AI models for GPT-4. .

OpenAI, as the company behind large models such as ChatGPT and GPT-4, has undoubtedly been pushed to the forefront. OpenAI CEO Sam Altman is currently on a global speaking tour to dispel people's "fear" about artificial intelligence and listen to the opinions of developers and users of OpenAI products.

Sam Altman谈OpenAI:面临GPU短缺恐慌,GPT-3或将开源

According to "Fortune" report, in May Sam Altman met behind closed doors with some developers and startup founders, And talked about OpenAI’s roadmap and challenges. One of the participants in this closed-door meeting, Raza Habib, co-founder and CEO of Humanloop, recently mentioned OpenAI’s product planning and development bottlenecks in a blog.

The original blog has been deleted, but some netizens have uploaded a snapshot (copy) of the blog. Let’s take a look at the specific content of the blog:

OpenAI is now facing The biggest problem is that it is limited by GPU

Currently OpenAI faces very severe GPU limitations, which also delays the implementation of some of their short-term plans. The most common customer complaints these days are about the reliability and speed of APIs. Sam acknowledged the problem, explaining that most of the issues customers complained about were due to GPU shortages.

When it comes to processing text, the longer 32k context is not yet available to more people. Now OpenAI has not completely overcome the O (n^2) expansion problem of the attention mechanism. Although OpenAI seems to be able to achieve 100k-1M token context window (within this year) text processing soon, larger text processing windows require further progress. research breakthroughs.

Not only that, but currently, the fine-tuning API is also limited by GPU supply. OpenAI does not yet use efficient fine-tuning methods like Adapters or LoRa, so fine-tuning is very computationally intensive to run and manage. Sam revealed that better fine-tuning technology will be introduced in the future, and they may even provide a community dedicated to research models.

Additionally, dedicated capacity provision is limited by GPU supply. OpenAI also offers dedicated capacity, giving customers a private copy of their model. To use the service, customers must be willing to commit $100,000 up front.

OpenAI’s near-term roadmap

During the conversation, Sam shared the near-term roadmap of OpenAI API, which is mainly divided into two stages:

Road to 2023:

  • #OpenAI’s top priority is to launch a cheaper, faster GPT-4 — Overall, OpenAI’s goals are Reduce the cost of intelligence as much as possible, so the cost of the API will decrease over time.
  • Longer context windows - In the near future, context windows may be as high as 1 million tokens.
  • Nudge API - The Nudge API will be extended to the latest models, but its exact form will be determined by the developers.
  • Status API - Now when calling the chat API, you have to go through the same session history over and over again and pay for the same token again and again. A future version of the API will be able to remember session history.
  • Road to 2024:
  • Multimodal - This was demonstrated as part of the GPT-4 release, but in more It won't be scalable to everyone until the GPU comes online.

The plugin does not have a PMF and will not appear in the API anytime soon

Many developers are interested in accessing the ChatGPT plugin through the API, but Sam said He doesn't think these plugins will be released anytime soon. Use of plugins other than browsing suggests they don't have PMF yet. Sam pointed out that many people want their applications to be within ChatGPT, but what they really want is ChatGPT within their applications.

OpenAI will avoid competing with its customers except with ChatGPT-like competitors

Many developers say that when OpenAI releases new products, they are nervous about applications built using the OpenAI API because OpenAI may eventually release a competing product. Sam said that OpenAI will not release more products besides ChatGPT. He said there are a lot of great platform companies that have a killer app, and ChatGPT will allow them to make their APIs better by becoming customers of their own products. The vision for ChatGPT is to be a super-intelligent work assistant, but there are many other GPT use cases that OpenAI won’t be getting into.

Regulation is necessary, but so is open source

Although Sam advocates for regulation of future models, he does not believe that existing models are dangerous, And think regulating or banning them would be a huge mistake. He once again emphasized the importance of open source and said that OpenAI is considering open source GPT-3. Part of the reason why OpenAI has been slow to open source is because they feel that not many people and companies have the ability to properly manage such large language models.

The law of scaling still exists

Many recent articles have claimed that "the era of giant artificial intelligence models is over." Sam said that didn't convey exactly what he meant.

OpenAI’s internal data shows that the law of scaling still holds, and increasing the size of the model will continue to improve performance. However, model size cannot always increase at the same scale, as OpenAI has already increased model size millions of times in just a few years, and continuing to do so will be unsustainable. But that doesn’t mean OpenAI will stop trying to make its models bigger, but it does mean they might double or triple in size every year instead of growing by orders of magnitude.

The fact that the extended model is still valid has important implications for the development of AGI. The idea of ​​scaling is that we probably already have most of the elements needed to build an AGI, and most of the remaining work will be taking existing methods and scaling them to larger models and larger data sets. If the era of model extensions is over, it will be even longer before we reach AGI. The fact that the law of scaling still applies implies that we will achieve AGI in less time.

The above is the detailed content of Sam Altman talks about OpenAI: Facing GPU shortage panic, GPT-3 may be open source. For more information, please follow other related articles on the PHP Chinese website!

Statement
This article is reproduced at:51CTO.COM. If there is any infringement, please contact admin@php.cn delete
Does Hugging Face's 7B Model OlympicCoder Beat Claude 3.7?Does Hugging Face's 7B Model OlympicCoder Beat Claude 3.7?Apr 23, 2025 am 11:49 AM

Hugging Face's OlympicCoder-7B: A Powerful Open-Source Code Reasoning Model The race to develop superior code-focused language models is intensifying, and Hugging Face has joined the competition with a formidable contender: OlympicCoder-7B, a product

4 New Gemini Features You Can't Afford to Miss4 New Gemini Features You Can't Afford to MissApr 23, 2025 am 11:48 AM

How many of you have wished AI could do more than just answer questions? I know I have, and as of late, I’m amazed by how it’s transforming. AI chatbots aren’t just about chatting anymore, they’re about creating, researchin

Camunda Writes New Score For Agentic AI OrchestrationCamunda Writes New Score For Agentic AI OrchestrationApr 23, 2025 am 11:46 AM

As smart AI begins to be integrated into all levels of enterprise software platforms and applications (we must emphasize that there are both powerful core tools and some less reliable simulation tools), we need a new set of infrastructure capabilities to manage these agents. Camunda, a process orchestration company based in Berlin, Germany, believes it can help smart AI play its due role and align with accurate business goals and rules in the new digital workplace. The company currently offers intelligent orchestration capabilities designed to help organizations model, deploy and manage AI agents. From a practical software engineering perspective, what does this mean? The integration of certainty and non-deterministic processes The company said the key is to allow users (usually data scientists, software)

Is There Value In A Curated Enterprise AI Experience?Is There Value In A Curated Enterprise AI Experience?Apr 23, 2025 am 11:45 AM

Attending Google Cloud Next '25, I was keen to see how Google would distinguish its AI offerings. Recent announcements regarding Agentspace (discussed here) and the Customer Experience Suite (discussed here) were promising, emphasizing business valu

How to Find the Best Multilingual Embedding Model for Your RAG?How to Find the Best Multilingual Embedding Model for Your RAG?Apr 23, 2025 am 11:44 AM

Selecting the Optimal Multilingual Embedding Model for Your Retrieval Augmented Generation (RAG) System In today's interconnected world, building effective multilingual AI systems is paramount. Robust multilingual embedding models are crucial for Re

Musk: Robotaxis In Austin Need Intervention Every 10,000 MilesMusk: Robotaxis In Austin Need Intervention Every 10,000 MilesApr 23, 2025 am 11:42 AM

Tesla's Austin Robotaxi Launch: A Closer Look at Musk's Claims Elon Musk recently announced Tesla's upcoming robotaxi launch in Austin, Texas, initially deploying a small fleet of 10-20 vehicles for safety reasons, with plans for rapid expansion. H

AI's Shocking Pivot: From Work Tool To Digital Therapist And Life CoachAI's Shocking Pivot: From Work Tool To Digital Therapist And Life CoachApr 23, 2025 am 11:41 AM

The way artificial intelligence is applied may be unexpected. Initially, many of us might think it was mainly used for creative and technical tasks, such as writing code and creating content. However, a recent survey reported by Harvard Business Review shows that this is not the case. Most users seek artificial intelligence not just for work, but for support, organization, and even friendship! The report said that the first of AI application cases is treatment and companionship. This shows that its 24/7 availability and the ability to provide anonymous, honest advice and feedback are of great value. On the other hand, marketing tasks (such as writing a blog, creating social media posts, or advertising copy) rank much lower on the popular use list. Why is this? Let's see the results of the research and how it continues to be

Companies Race Toward AI Agent AdoptionCompanies Race Toward AI Agent AdoptionApr 23, 2025 am 11:40 AM

The rise of AI agents is transforming the business landscape. Compared to the cloud revolution, the impact of AI agents is predicted to be exponentially greater, promising to revolutionize knowledge work. The ability to simulate human decision-maki

See all articles

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

Video Face Swap

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Tools

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor

DVWA

DVWA

Damn Vulnerable Web App (DVWA) is a PHP/MySQL web application that is very vulnerable. Its main goals are to be an aid for security professionals to test their skills and tools in a legal environment, to help web developers better understand the process of securing web applications, and to help teachers/students teach/learn in a classroom environment Web application security. The goal of DVWA is to practice some of the most common web vulnerabilities through a simple and straightforward interface, with varying degrees of difficulty. Please note that this software

MantisBT

MantisBT

Mantis is an easy-to-deploy web-based defect tracking tool designed to aid in product defect tracking. It requires PHP, MySQL and a web server. Check out our demo and hosting services.

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

mPDF

mPDF

mPDF is a PHP library that can generate PDF files from UTF-8 encoded HTML. The original author, Ian Back, wrote mPDF to output PDF files "on the fly" from his website and handle different languages. It is slower than original scripts like HTML2FPDF and produces larger files when using Unicode fonts, but supports CSS styles etc. and has a lot of enhancements. Supports almost all languages, including RTL (Arabic and Hebrew) and CJK (Chinese, Japanese and Korean). Supports nested block-level elements (such as P, DIV),