Stanford University releases AI foundation model transparency index; Llama 2 ranks first but 'fails'


WBOY

Oct 21, 2023, 08:17 AM

IT House News, October 20: Stanford University recently released a "Transparency Index" for AI foundation models. The highest-scoring model is Meta's Llama 2, but its transparency score is only 54%, so the researchers believe that almost all AI models on the market "lack transparency."

The research was reportedly led by Rishi Bommasani of the Center for Research on Foundation Models (CRFM) at Stanford HAI, and examined 10 of the most popular foundation models overseas:

Meta's Llama 2

BigScience's BloomZ

OpenAI's GPT-4

Stability AI's Stable Diffusion

Anthropic PBC's Claude

Google's PaLM 2

Cohere's Command

AI21 Labs' Jurassic-2

Inflection AI's Inflection

Amazon's Titan

Rishi Bommasani believes that "lack of transparency" has long been a problem for the AI industry. As for the specific transparency indicators, IT House found that they mainly cover the copyright status of model training data sets, the computing resources used to train the model, the credibility of model-generated content, the model's own capabilities, the risk of the model being induced to generate harmful content, and the privacy of users of the model, for a total of 100 indicators.

The final results show that Meta's Llama 2 topped the list with a transparency score of 54%, while OpenAI's GPT-4 scored only 48% and Google's PaLM 2 ranked fifth with 40%.

▲ Image source: Stanford University

Among the specific indicator categories, the ten models scored best on "Model Basics", which mainly covers whether the model's modality, scale, and architecture are accurately described, with an average transparency of 63%. The worst-scoring category is "Impact", which mainly evaluates whether the foundation model will "retrieve user information for evaluation", with an average transparency of only 11%.

CRFM Director Percy Liang said that the transparency of commercial foundation models is very important for advancing AI legislation, as well as for the related industries and academia.

Rishi Bommasani said that lower model transparency makes it harder for companies to know whether they can safely rely on these models, and harder for researchers to use them in their research.

Rishi Bommasani ultimately concluded that all ten foundation models "fail" on transparency. Although Meta's Llama 2 has the highest score, it does not meet outside expectations: "A model's transparency must reach at least 82% to be recognized by the outside world."
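To make the arithmetic behind these figures concrete, here is a minimal Python sketch. It uses only the three scores, the 100-indicator total, and the 82% threshold quoted in this article. It assumes each indicator is an equally weighted yes/no item, which the article does not state, and the function names are hypothetical, chosen for illustration only.

```python
# Scores reported in this article (percent of indicators satisfied).
REPORTED_SCORES = {"Llama 2": 54, "GPT-4": 48, "PaLM 2": 40}
TOTAL_INDICATORS = 100  # total number of indicators, per the article
PASS_THRESHOLD = 82     # percent, as quoted from Rishi Bommasani


def indicators_met(score_percent, total=TOTAL_INDICATORS):
    """Convert a percentage score to a count of satisfied indicators.

    Assumption: every indicator is binary and equally weighted.
    """
    return round(score_percent * total / 100)


def shortfall(score_percent, threshold=PASS_THRESHOLD):
    """Percentage points a score falls below the threshold (0 if it passes)."""
    return max(0, threshold - score_percent)


def summarize(scores):
    """Return (model, indicators met, shortfall) tuples, highest score first."""
    ranked = sorted(scores.items(), key=lambda item: item[1], reverse=True)
    return [(name, indicators_met(pct), shortfall(pct)) for name, pct in ranked]


for name, met, gap in summarize(REPORTED_SCORES):
    print(f"{name}: {met}/{TOTAL_INDICATORS} indicators, "
          f"{gap} points below the {PASS_THRESHOLD}% threshold")
```

Under these assumptions, even the top-ranked model would need to satisfy 28 more indicators to reach the quoted threshold.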

Statement

This article is reproduced from Sohu (搜狐). In case of any infringement, please contact admin@php.cn to have it removed.
