


News on June 14th, OpenAI today announced a major update to its large language model API (including GPT-4 and gpt-3.5-turbo), including new function calling functions, Reduce usage costs and provide a 16,000 token version for the gpt-3.5-turbo model.
Large language model (LLM) is an artificial intelligence technology that can process natural language. Its "context window" is equivalent to a short-term memory that can store input content or chat robot dialogue. content. In language models, increasing the context window size has become a technical race, and Anthropic recently announced that its Claude language model can provide 75,000 token context window options. In addition, OpenAI has also developed a 32,000 token GPT-4 version, but it has not yet been publicly launched.
OpenAI has just launched a new 16000 context window version of gpt-3.5-turbo, named "gpt-3.5-turbo-16k", which can handle inputs up to 16000 tokens in length, which means that Processing approximately 20 pages of text at a time, this is a big improvement for developers who need to model processing and generate larger blocks of text.
In addition to this change, OpenAI also lists at least four other major new features:
- Introducing function calling capabilities in the Chat Completions API
- Improved and "more steerable" versions of #GPT-4 and gpt-3.5-turbo
- Reduced the price of the "ada" embedded model by 75%
- Reduced gpt -The input token price of the 3.5-turbo model has been reduced by 25%
The function call function makes it easier for developers to build functions that can call external tools, convert natural language into external API calls, or perform database queries. chatbot. For example, it can convert an input like "Send an email to Anya to see if she wants coffee next Friday" into a function call like "send_email (to: string, body: string)". In particular, this feature also makes it easier for API users to generate output in JSON format, something that was previously difficult to achieve.
Regarding the "more controllable" aspect, which is a technical term for how to make LLM behave the way you want, OpenAI said its new "gpt-3.5-turbo-0613" model will include " More reliable control via system messages" functionality. System messages are a special instruction input in the API that tell the model how to behave, such as "You are Grimes and you only talk about milkshakes."
In addition to functional improvements, OpenAI also provides considerable cost reduction. Notably, the token price of the popular gpt-3.5-turbo model is reduced by 25%. This means that developers can now use this model for about $0.0015 per 1000 tokens and $0.002 per 1000 tokens, which equates to about 700 pages of text per dollar. The gpt-3.5-turbo-16k model is priced at US$0.003 per 1000 tokens and US$0.004 per 1000 tokens.
In addition, IT House noticed that OpenAI also reduced the price of its "text-embedding-ada-002" embedding model by 75%. Embedding models are a technique that allows computers to understand words and concepts, converting natural language into a digital language that machines can understand, which is important for tasks such as searching for text and recommending relevant content.
Because OpenAI is constantly updating its models, old models won’t persist. Today, the company also announced it is starting to retire some earlier versions of its models, including gpt-3.5-turbo-0301 and gpt-4-0314. Developers can continue to use these models until September 13, after which these older models will no longer be available, the company said. It is worth noting that OpenAI’s GPT-4 API is still on the waiting list and is not yet fully open.
The above is the detailed content of OpenAI launches major update to GPT chatbot API for developers and lowers price. For more information, please follow other related articles on the PHP Chinese website!

译者|布加迪审校|重楼本文介绍了如何使用GroqLPU推理引擎在JanAI和VSCode中生成超快速响应。每个人都致力于构建更好的大语言模型(LLM),例如Groq专注于AI的基础设施方面。这些大模型的快速响应是确保这些大模型更快捷地响应的关键。本教程将介绍GroqLPU解析引擎以及如何在笔记本电脑上使用API和JanAI本地访问它。本文还将把它整合到VSCode中,以帮助我们生成代码、重构代码、输入文档并生成测试单元。本文将免费创建我们自己的人工智能编程助手。GroqLPU推理引擎简介Groq

LeanCopilot,让陶哲轩等众多数学家赞不绝口的这个形式化数学工具,又有超强进化了?就在刚刚,加州理工教授AnimaAnandkumar宣布,团队发布了LeanCopilot论文的扩展版本,并且更新了代码库。图片论文地址:https://arxiv.org/pdf/2404.12534.pdf最新实验表明,这个Copilot工具,可以自动化80%以上的数学证明步骤了!这个纪录,比以前的基线aesop还要好2.3倍。并且,和以前一样,它在MIT许可下是开源的。图片他是一位华人小哥宋沛洋,他是

图片来源@视觉中国文|王吉伟从“人+RPA”到“人+生成式AI+RPA”,LLM如何影响RPA人机交互?换个角度,从人机交互看LLM如何影响RPA?影响程序开发与流程自动化人机交互的RPA,现在也要被LLM改变了?LLM如何影响人机交互?生成式AI怎么改变RPA人机交互?一文看明白:大模型时代来临,基于LLM的生成式AI正在快速变革RPA人机交互;生成式AI重新定义人机交互,LLM正在影响RPA软件架构变迁。如果问RPA对程序开发以及自动化有哪些贡献,其中一个答案便是它改变了人机交互(HCI,h

Plaud, the company behind the Plaud Note AI Voice Recorder (available on Amazon for $159), has announced a new product. Dubbed the NotePin, the device is described as an AI memory capsule, and like the Humane AI Pin, this is wearable. The NotePin is

想了解更多AIGC的内容,请访问:51CTOAI.x社区https://www.51cto.com/aigc/译者|晶颜审校|重楼不同于互联网上随处可见的传统问题库,这些问题需要跳出常规思维。大语言模型(LLM)在数据科学、生成式人工智能(GenAI)和人工智能领域越来越重要。这些复杂的算法提升了人类的技能,并在诸多行业中推动了效率和创新性的提升,成为企业保持竞争力的关键。LLM的应用范围非常广泛,它可以用于自然语言处理、文本生成、语音识别和推荐系统等领域。通过学习大量的数据,LLM能够生成文本

图检索增强生成(GraphRAG)正逐渐流行起来,成为传统向量搜索方法的有力补充。这种方法利用图数据库的结构化特性,将数据以节点和关系的形式组织起来,从而增强检索信息的深度和上下文关联性。图在表示和存储多样化且相互关联的信息方面具有天然优势,能够轻松捕捉不同数据类型间的复杂关系和属性。而向量数据库则处理这类结构化信息时则显得力不从心,它们更专注于处理高维向量表示的非结构化数据。在RAG应用中,结合结构化化的图数据和非结构化的文本向量搜索,可以让我们同时享受两者的优势,这也是本文将要探讨的内容。构

随着开源大型语言模型的性能不断提高,编写和分析代码、推荐、文本摘要和问答(QA)对的性能都有了很大的提高。但是当涉及到QA时,LLM通常会在未训练数据的相关的问题上有所欠缺,很多内部文件都保存在公司内部,以确保合规性、商业秘密或隐私。当查询这些文件时,会使得LLM产生幻觉,产生不相关、捏造或不一致的内容。一种处理这一挑战的可行技术是检索增强生成(RAG)。它涉及通过引用训练数据源之外的权威知识库来增强响应的过程,以提升生成的质量和准确性。RAG系统包括一个检索系统,用于从语料库中检索相关文档片段

Google AI has started to provide developers with access to extended context windows and cost-saving features, starting with the Gemini 1.5 Pro large language model (LLM). Previously available through a waitlist, the full 2 million token context windo


Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

AI Hentai Generator
Generate AI Hentai for free.

Hot Article

Hot Tools

SAP NetWeaver Server Adapter for Eclipse
Integrate Eclipse with SAP NetWeaver application server.

EditPlus Chinese cracked version
Small size, syntax highlighting, does not support code prompt function

Dreamweaver Mac version
Visual web development tools

Notepad++7.3.1
Easy-to-use and free code editor

VSCode Windows 64-bit Download
A free and powerful IDE editor launched by Microsoft
