search
HomeTechnology peripheralsAIChatGPT-4 is released! Improved accuracy, able to beat 90% of humans on the SAT

ChatGPT-4 is released! Improved accuracy, able to beat 90% of humans on the SAT

News on March 15th, on Tuesday, local time in the United States, the artificial intelligence research company OpenAI released its next-generation large-scale language model GPT-4, which is its new model that supports ChatGPT and Xinbi The latest AI large-scale language model for applications such as The company said the model performed beyond "human levels" in a number of professional tests.

OpenAI claims that ChatGPT-4 is larger than the pre-iteration GPT-3.5, which means it has received more data training and has more weights (parameters) in the model file, which also makes it operating costs are higher. The company claims that the model is "more creative and collaborative than ever" and "can solve difficult problems more accurately." It can parse text and image input, although it can only respond with text.

Many researchers in the field currently believe that many of the recent advances in AI have come from running ever-larger models on thousands of supercomputers, a training process that can cost tens of millions of dollars. GPT-4 is an example of the focus on “scaling up” to achieve better results.

OpenAI admitted that the company used Azure, Microsoft’s cloud computing platform, to train its models. Microsoft has invested billions of dollars in OpenAI. Citing competitive reasons, OpenAI did not release details such as the exact size of the model or the hardware used to train it, which could be used to reconstruct the model. OpenAI’s GPT large language model powers many of the AI ​​demos that have wowed people in the tech industry over the past six months, including Bing’s AI chatbot

​and ChatGPT.

ChatGPT-4 is a preview of the latest advances in language models that may start trickling down to consumer products like chatbots in the coming weeks. Microsoft said on Tuesday that Bing's AI chatbot uses the GPT-4 model.

OpenAI claims that the new model will produce fewer factually incorrect answers, go off topic less often, talk less about forbidden topics, and even perform better than humans on many standardized tests.

For example, the company said that GPT-4 ranked in the top 10% of all candidates in the mock bar exam, in the top 7% in the SAT reading test, and in the top 11 in the SAT math test. %.

However, OpenAI warns that the new model is not perfect yet and in many cases is less capable than humans. For example, GPT-4 still suffers from so-called “hallucinations” or fabricated stories and is factually unreliable. When it makes a mistake, it still tends to insist that it is right. OpenAI CEO Sam Altman tweeted that GPT-4 "still has flaws and big limitations," but that "it still impresses you the first time you use it." ."

OpenAI said in a blog post: "GPT-4 still has many known limitations that we are working hard to address, such as social bias, hallucinations, and hostile replies. In a casual conversation, The difference between GPT-3.5 and GPT-4 is small. And when the complexity of the task reaches a sufficient threshold, the difference becomes apparent: GPT-4 is more reliable, more creative, and capable than GPT-3.5 More nuanced instructions."

OpenAI said it has worked with multiple companies to integrate GPT-4 into their products, including Duolingo, Stripe, and Khan Academy, among others. The new model is available to users through OpenAI’s $20-a-month ChatGPT subscription service, ChatGPT Plus, and powers Microsoft’s Bing chatbot. At the same time, GPT-4 will also be available as part of an API that allows programmers to integrate AI into their own applications.

The above is the detailed content of ChatGPT-4 is released! Improved accuracy, able to beat 90% of humans on the SAT. For more information, please follow other related articles on the PHP Chinese website!

Statement
This article is reproduced at:51CTO.COM. If there is any infringement, please contact admin@php.cn delete
Meta's New AI Assistant: Productivity Booster Or Time Sink?Meta's New AI Assistant: Productivity Booster Or Time Sink?May 01, 2025 am 11:18 AM

Meta has joined hands with partners such as Nvidia, IBM and Dell to expand the enterprise-level deployment integration of Llama Stack. In terms of security, Meta has launched new tools such as Llama Guard 4, LlamaFirewall and CyberSecEval 4, and launched the Llama Defenders program to enhance AI security. In addition, Meta has distributed $1.5 million in Llama Impact Grants to 10 global institutions, including startups working to improve public services, health care and education. The new Meta AI application powered by Llama 4, conceived as Meta AI

80% Of Gen Zers Would Marry An AI: Study80% Of Gen Zers Would Marry An AI: StudyMay 01, 2025 am 11:17 AM

Joi AI, a company pioneering human-AI interaction, has introduced the term "AI-lationships" to describe these evolving relationships. Jaime Bronstein, a relationship therapist at Joi AI, clarifies that these aren't meant to replace human c

AI Is Making The Internet's Bot Problem Worse. This $2 Billion Startup Is On The Front LinesAI Is Making The Internet's Bot Problem Worse. This $2 Billion Startup Is On The Front LinesMay 01, 2025 am 11:16 AM

Online fraud and bot attacks pose a significant challenge for businesses. Retailers fight bots hoarding products, banks battle account takeovers, and social media platforms struggle with impersonators. The rise of AI exacerbates this problem, rende

Selling To Robots: The Marketing Revolution That Will Make Or Break Your BusinessSelling To Robots: The Marketing Revolution That Will Make Or Break Your BusinessMay 01, 2025 am 11:15 AM

AI agents are poised to revolutionize marketing, potentially surpassing the impact of previous technological shifts. These agents, representing a significant advancement in generative AI, not only process information like ChatGPT but also take actio

How Computer Vision Technology Is Transforming NBA Playoff OfficiatingHow Computer Vision Technology Is Transforming NBA Playoff OfficiatingMay 01, 2025 am 11:14 AM

AI's Impact on Crucial NBA Game 4 Decisions Two pivotal Game 4 NBA matchups showcased the game-changing role of AI in officiating. In the first, Denver's Nikola Jokic's missed three-pointer led to a last-second alley-oop by Aaron Gordon. Sony's Haw

How AI Is Accelerating The Future Of Regenerative MedicineHow AI Is Accelerating The Future Of Regenerative MedicineMay 01, 2025 am 11:13 AM

Traditionally, expanding regenerative medicine expertise globally demanded extensive travel, hands-on training, and years of mentorship. Now, AI is transforming this landscape, overcoming geographical limitations and accelerating progress through en

Key Takeaways From Intel Foundry Direct Connect 2025Key Takeaways From Intel Foundry Direct Connect 2025May 01, 2025 am 11:12 AM

Intel is working to return its manufacturing process to the leading position, while trying to attract fab semiconductor customers to make chips at its fabs. To this end, Intel must build more trust in the industry, not only to prove the competitiveness of its processes, but also to demonstrate that partners can manufacture chips in a familiar and mature workflow, consistent and highly reliable manner. Everything I hear today makes me believe Intel is moving towards this goal. The keynote speech of the new CEO Tan Libo kicked off the day. Tan Libai is straightforward and concise. He outlines several challenges in Intel’s foundry services and the measures companies have taken to address these challenges and plan a successful route for Intel’s foundry services in the future. Tan Libai talked about the process of Intel's OEM service being implemented to make customers more

AI Gone Wrong? Now There's Insurance For ThatAI Gone Wrong? Now There's Insurance For ThatMay 01, 2025 am 11:11 AM

Addressing the growing concerns surrounding AI risks, Chaucer Group, a global specialty reinsurance firm, and Armilla AI have joined forces to introduce a novel third-party liability (TPL) insurance product. This policy safeguards businesses against

See all articles

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

Video Face Swap

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Tools

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

MantisBT

MantisBT

Mantis is an easy-to-deploy web-based defect tracking tool designed to aid in product defect tracking. It requires PHP, MySQL and a web server. Check out our demo and hosting services.

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

SublimeText3 Linux new version

SublimeText3 Linux new version

SublimeText3 Linux latest version

SecLists

SecLists

SecLists is the ultimate security tester's companion. It is a collection of various types of lists that are frequently used during security assessments, all in one place. SecLists helps make security testing more efficient and productive by conveniently providing all the lists a security tester might need. List types include usernames, passwords, URLs, fuzzing payloads, sensitive data patterns, web shells, and more. The tester can simply pull this repository onto a new test machine and he will have access to every type of list he needs.