How did Microsoft integrate GPT-4 so quickly? The project team even worked overtime on weekends
April 6 news: Microsoft has integrated OpenAI’s GPT-4 into the Microsoft 365 productivity application suite. This was a very difficult task, but the company wanted to finish it as soon as possible. To that end, Microsoft mobilized hundreds of employees, asked them to work long hours, had them brainstorm product solutions together, and developed three working modes for the artificial intelligence assistant based on a unified design framework.
Over the past few months, one date was mentioned countless times in Microsoft conference rooms and hallways: March 16, the day Microsoft announced that it would bring a generative AI model co-developed with OpenAI into the Microsoft 365 productivity application suite. With GPT-4 integrated, every productivity application, including Outlook, Word and Teams, will have a Copilot (intelligent co-pilot) based on generative artificial intelligence. Currently, more than 20 Microsoft customers are testing this technology.
Integrating an AI assistant into so many apps is a daunting task, but Microsoft hoped to get it done quickly. In November last year, OpenAI released ChatGPT, which caused a global sensation and set off an artificial intelligence race. Companies are racing against time to launch new artificial intelligence products and features to seize market opportunities. Even Microsoft, which had already partnered with and invested in OpenAI, is no exception.
Microsoft design director Jon Friedman is responsible for Copilot product design in Microsoft 365. He said the project required long hours from hundreds of Microsoft employees, including designers, engineers, product managers, marketers, data scientists, and ethics teams. The project lasted several months and even required overtime work on weekends. Friedman said that building such a large project in such a short time required everyone to put aside their egos and work together.
Friedman said: "It's exciting that we can do some really bold and big things together. While we have a lot of experience with artificial intelligence, this particular generative artificial intelligence is more powerful, so I think everyone is working with a learning mentality."
New User Experience
The challenge in designing a user interface like Copilot is deciding how and when to present this new artificial intelligence assistant while people are doing their usual work in applications such as Word and PowerPoint.
Friedman said that initially, designing an artificial intelligence assistant that could be called up from various productivity applications was just "a vague idea." But as the design team gained a deeper understanding of how artificial intelligence assistants could be applied in real business settings, the idea became clearer. First, the design team needed to find specific use cases where AI could significantly save users time or stimulate creativity in some way. This was the first step in the user experience design process.
The people who know the relevant use cases best include the engineers, product managers, designers, and computer scientists of each productivity application, and Friedman's design team worked collaboratively with them. When the Copilot project started, he asked all product teams to brainstorm ways to use generative AI to improve the capabilities of productivity applications. Next, Friedman established a special horizontal design team to work with all application teams to demonstrate Copilot's effects in each application.
As individual application teams began to develop use cases, Friedman said, the horizontal design team began to notice commonalities among them, namely AI use cases that were relevant to multiple applications.
Friedman recalled the brainstorming process at the time and said: "Our discussions were very valuable... We discussed the specific functional requirements of Copilot, such as how the new generative language model can help us better accomplish the task of writing email summaries."
As cross-application use cases became clearer, the horizontal design team began to believe that the AI assistant function did not need to be different for each application.
Friedman said: "Because you have a lot of people...looking at each scenario, you can roughly make this judgment, such as 'Ah, this thing showed up here too.'"
So they began to conceive of a design framework that would allow a universal assistant to work in several different, predictable ways across applications.
The design team led by Friedman created a deep documentation library designed to help designers across the project create entry points for artificial intelligence in a given application. The documents guide designers in determining how to invoke Copilot based on the different tasks a user may be involved in. Friedman said: "There is a concept that Copilot should appear at the right level and do the right job."
The design framework stipulates that Copilot can be displayed in three ways in the application user interface.
The first is an immersive user experience that allows the AI assistant to focus on a specific business project rather than a specific application, so that it can pull data or points that serve the task at hand from multiple applications. For example, Copilot might collect project milestones or risk points from team meetings, slides, or email content, and then summarize them in a project plan document.
The "immersive" experience mode is Copilot's most powerful feature in the productivity application suite, and it may also be the most influential. Rob Enderle, principal analyst at market consulting firm Enderle Group, believes it may also help solve a long-standing problem with Microsoft's productivity suite: individual applications are not tightly integrated with each other. Enderle said the reason may be that Microsoft originally acquired these applications from other companies, so they did not share code bases with each other. But Copilot can cover all applications, at least giving users a sense that these applications can work together for certain tasks.
Friedman said the second mode of presentation is "assistive," meaning Copilot acts like the "sidecar" on a motorcycle, helping users get the most out of a specific application by calling its functions. For example, in PowerPoint, Copilot can show users how the application's deep graphics capabilities can be used to describe complex data sets; in Outlook, Copilot can help users understand the most important content in an email; in Word, Copilot can provide feedback on how to better write documents and fit specific writing styles.
The third is Copilot’s “embedded” presentation, in which artificial intelligence can exert its generative and creative capabilities inside applications. For example, Copilot may appear in a pop-up window in a Word document. "It's like a random experience," Friedman said. "When you're immersed in work, Copilot can help you get past writer's block, or automatically help you start a slideshow from text content."
Friedman said Copilot’s horizontal design team began to use the concept of “three levels” to describe the work, and members of the various application teams gradually accepted the framework.
"We’ve shared this framework with CEO Nadella and the rest of the company’s executive team, and it’s basically been bought in by everyone," he said. "This idea can be applied to three different levels of work."
The name Copilot was not created specifically for Microsoft 365. Microsoft-owned GitHub used the name for its programming assistant in 2021, and some of that assistant's functions also used OpenAI's large language model. Microsoft is creating a consistent Copilot assistant that can perform certain standardized functions across the different applications in its productivity suite. For long-time users of productivity apps, this consistency will likely reduce the feeling of unfamiliarity they experience when the new version of Microsoft 365 officially launches. By using a unified assistant, users can move between applications more conveniently, thereby improving work efficiency.
As generative artificial intelligence is further integrated into Microsoft's consumer and enterprise products, the Copilot brand and concept may be extended to the Windows operating system and even other Microsoft products such as LinkedIn.
“Nadella liked the name very much because it perfectly describes what the AI assistant does,” Friedman said. "It exists to aid you and guide you in many tasks..."