
OpenAI guidance allows boards to restrict CEOs from releasing new models to guard against AI risks


To guard against the serious risks that artificial intelligence (AI) may pose, OpenAI has decided to give its board of directors greater authority to oversee safety matters and to exercise strict oversight of CEO Sam Altman, who emerged victorious from an internal power struggle just last month.

On Monday, December 18, Eastern Time, OpenAI released a set of guidelines designed to track, evaluate, forecast, and protect against catastrophic risks posed by increasingly powerful AI models. OpenAI defines "catastrophic risk" as any risk that could result in hundreds of billions of dollars in economic damage or in severe harm or death for many people.

The 27-page document, known as the "Preparedness Framework," states that even if company leadership, including the CEO or a person designated by leadership, believes an AI model slated for release is safe, the board of directors may still choose to delay its release. In other words, while OpenAI's CEO makes the day-to-day decisions, the board will be kept informed of any risks that are discovered and holds the power to veto the CEO's decisions.

Beyond its provisions on the authority of company leadership and the board, the Preparedness Framework calls for a matrix approach to documenting the level of risk that frontier AI models pose across several categories. These risks include bad actors using AI models to create malware, launch social engineering attacks, or spread harmful information about nuclear or biological weapons.

Specifically, OpenAI sets risk thresholds in four categories: cybersecurity; CBRN (chemical, biological, radiological, and nuclear threats); persuasion; and model autonomy. Both before and after mitigation measures are applied, OpenAI rates each category at one of four levels: low, medium, high, or critical.


Under the framework, only models rated "medium" or below after mitigations may be deployed, and only models rated "high" or below after mitigations may continue to be developed. If a risk cannot be brought below "critical," the company will stop developing the model. OpenAI will also apply additional security measures to models assessed at "high" or "critical" risk until the risk is mitigated.
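To make the threshold logic concrete, here is a minimal sketch in Python of how such a scorecard and its gating rules could be represented. The category and level names come from the framework as described above; the class and function names and the example scores are purely illustrative and are not part of OpenAI's document.

```python
from enum import IntEnum

# Risk levels described in the framework, ordered so they can be compared.
class Risk(IntEnum):
    LOW = 0
    MEDIUM = 1
    HIGH = 2
    CRITICAL = 3

# The four tracked risk categories named in the framework.
CATEGORIES = ("cybersecurity", "cbrn", "persuasion", "model_autonomy")

def overall(scores: dict) -> Risk:
    """Treat a model's overall rating as its worst (highest) category score."""
    return max(scores[c] for c in CATEGORIES)

def can_deploy(post_mitigation: dict) -> bool:
    # Only models rated "medium" or below after mitigations may be deployed.
    return overall(post_mitigation) <= Risk.MEDIUM

def can_continue_development(post_mitigation: dict) -> bool:
    # Only models rated "high" or below after mitigations may be developed further.
    return overall(post_mitigation) <= Risk.HIGH

def needs_extra_security(pre_mitigation: dict) -> bool:
    # Models assessed "high" or "critical" receive additional security measures.
    return overall(pre_mitigation) >= Risk.HIGH

# Hypothetical post-mitigation scorecard, for illustration only.
post = {
    "cybersecurity": Risk.MEDIUM,
    "cbrn": Risk.LOW,
    "persuasion": Risk.HIGH,
    "model_autonomy": Risk.LOW,
}
print(can_deploy(post))                # False: "persuasion" is still rated high
print(can_continue_development(post))  # True: no category is rated critical
```

In this sketch the overall rating is the maximum across categories, so a single high-risk category is enough to block deployment, which mirrors the threshold rules described in the article.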

OpenAI divides its safety work among three teams. The Safety Systems team focuses on mitigating and addressing risks posed by current products such as GPT-4. The Superalignment team is concerned with problems that may arise once future systems surpass human capabilities. In addition, there is a new Preparedness team, led by Aleksander Madry, a professor in the Department of Electrical Engineering and Computer Science (EECS) at the Massachusetts Institute of Technology (MIT).

The new team will evaluate the development and deployment of powerful models. It will be specifically responsible for overseeing the technical work and operational structure behind safety decisions: driving the technical work of probing the limits of frontier model capabilities, running evaluations, and synthesizing the related reports.

Madry said his team will regularly assess the risk level of OpenAI's most advanced, not-yet-released AI models and submit monthly reports to OpenAI's internal Safety Advisory Group (SAG). The SAG will review the team's work and make recommendations to CEO Altman and the company's board of directors.

According to the guidance released on Monday, Altman and his leadership team can use these reports to decide whether to release a new AI system, but the board retains the power to overturn their decision.

Madry's team currently has only four members, but he is actively recruiting. He expects it to grow to 15 to 20 people, roughly the size of the existing Safety Systems and Superalignment teams.

Madry hopes other AI companies will assess the risks of their models in a similar way, and believes the approach could become a template for regulation.

