Apple's large-model Siri may be different from what you think

WBOY (Original)
2024-08-21 17:38:57, 700 views

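Apple's AI is late, but it is arriving.

According to The Wall Street Journal, Apple is in talks with Baidu about integrating generative AI into iPhones and other devices sold in the Chinese market.

Nothing has been officially confirmed yet, but two things can be taken as settled at this point:

The iPhone 16, iOS 18, and macOS will ship with AI features

The large models on Apple devices will be supplied by different vendors inside and outside China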

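Compared with Chinese brands that already ship AI assistants, Apple is, unsurprisingly, more than half a year late again. Being a step behind has long seemed to be Apple's label, yet the company usually manages to deliver some surprises as it moves steadily forward.

However, progress in large AI models is now measured in weeks, even days. Is Apple's late arrival another case of a latecomer pulling ahead, or the start of falling behind in a new era?

A somewhat compromised plan: getting on board comes first

On the last day of last month, Apple announced in a brief 12-minute meeting that it was abandoning its car project and going all in on AI, with many members of the car team to be reassigned to the AI division.

Project Titan, a decade in the making, fell in what would have been its final year before entering the new energy vehicle market. For a car market that will soon be crowded with contenders, that is something of a pity, but for the long-term development of a technology company it is a far-sighted and correct choice.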

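AI is a foundational technology. At a time when every large company is embracing AI, willingly or not, Apple's decision to cut its losses follows the times. How to embrace AI, and what kind of AI can win a place in a market that is steadily being carved up, are the first problems it has to solve.

For markets outside China, Apple is actively negotiating with Google to bring a large AI model to iOS 18 and deliver AI features that other brands have had for some time.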

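Although "the two parties have not decided the terms or branding of an AI agreement, or finalized how it would be implemented," among the candidate partners, which also include OpenAI and Anthropic, Google and Gemini are probably the best fit for Apple and the iPhone.

Samsung's Galaxy S24 series, released this February, drew attention for its AI features. Call translation, creative writing, and similar features caught up with the average level of Chinese brands, while Circle to Search shortens the search path and may well become the main direction for AI phones.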

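The overseas version of the S24 series relies on the Gemini large model to deliver these features.

In terms of experience, Google has already completed an initial trial on the flagship line with the highest global shipments. Compared with vendors that became popular on PC or the web, it has a better understanding of how a large model on a phone should handle usage habits, use cases, and app integration.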

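Moreover, Google itself is more eager to win Apple's business.

According to International Data Corporation (IDC), Samsung held 19.4% of the global smartphone market in 2023, while Apple took the top spot with 20.1%.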

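If it wins Apple, Gemini would be installed on roughly 40% of smartphones worldwide (19.4% + 20.1% = 39.5%, counting every Samsung and Apple phone), a major boost for a large-model company facing fierce competition.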

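Google is not the only one laughing in its sleep; so is Apple.

Unlike other vendors that emphasize in-house development, Apple has its own reasons for bringing AI to its devices through partnerships from the start.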

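First, given its late start and slow progress, borrowing existing technology is a smart way to compete for the market quickly. Partnering with Google reduces R&D costs and lets Apple collect hefty placement fees, and it can also ease the regulatory pressure both companies currently face.

Second, AIGC technology is impressive, but in deployment it has been widely criticized for ethical and privacy shortcomings. Handing it to a mature third party, especially Google, which has already tested the waters successfully on Samsung phones, saves effort and worry and reduces reputational and liability risk.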

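Another hurdle is localization. Each country and region has different regulatory and legal requirements for large AI models, and lawful, compliant deployment is a precondition for competing in a market and developing the technology. That is what gave rise to the two-track "domestic plus international" approach.

Apple is choosing this route because Samsung's partnership with Baidu has already shown results, making it a path that has been "verified."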

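The AI features on the China version of the Samsung S24 series are in fact assembled from several vendors' technologies: Circle to Search is provided by Baidu and JD.com; AI photo editing is handled by MiracleVision, the large model from Meitu Xiuxiu; and article summarization and AI writing use Baidu's ERNIE Bot (Wenxin Yiyan) model.

Whether Apple will also work with multiple vendors remains to be seen, but the partnership with Baidu is as good as settled.

Finally, what Apple wants to build is not just an intelligent voice assistant but a complete AI device. However, according to reporting from MacRumors, Apple's large model, given its current in-house progress and results, is still far from the level of Google, OpenAI, and others.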

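Rather than rushing out a chatbot before it is ready, Apple is better off using a mature solution as a stopgap, buying more research time and room for progress for its own large model.

Today's market matters, but future core technology is the foundation

Partnerships are the first step in taking Apple's AI global; the ultimate goal is a fully in-house large AI model.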

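This is an expensive and exhausting undertaking. Never mind falling behind: even progressing a little too slowly could mean being overtaken by next week. A competitive large model often translates into market dominance and pricing power down the line.

Tim Cook has said:

We will break new ground in generative AI, a technology we believe can redefine the future.

In fact, large-model research has been on Apple's agenda all along.

On the 15th of this month, Apple engineers quietly published a research paper detailing the development of a new generative AI model called MM1.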

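MM1 is a family of multimodal LLMs with up to 30B (30 billion) parameters, and it is Apple's latest research result in multimodal large models.

Overall, Apple's in-house model still trails Gemini and GPT-4V in benchmark results, does not produce output as striking as Sora's, and does not chart an entirely new technical path.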

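However, by controlling for individual data variables, the research identifies through comparison the handful of factors that matter most to the quality of the model's output. Put simply, the model is not innately powerful, but it is good at observing, experimenting, and summarizing, and through repeated trials it can still achieve good results.

MM1 comprises dense models and MoE (mixture-of-experts) variants. When an input enters the MoE, the routing layer acts as a dispatch center and decides whether it should go "to the east market for a fine horse" or "to the west market for a saddle," sending each input to the appropriate expert.

Breaking problems down and sorting them this way also improves computational efficiency and reduces energy use at run time.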
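The dispatch idea can be illustrated with a toy top-k router. This is only a minimal sketch of generic mixture-of-experts routing, not MM1's actual architecture; the `ROUTER` weights, the two toy `EXPERTS`, and all function names are hypothetical.

```python
import math

def softmax(scores):
    """Turn raw router scores into probabilities that sum to 1."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def route(token, router_weights, top_k=1):
    """Score every expert for one token and keep only the top_k of them."""
    scores = [sum(w * x for w, x in zip(row, token)) for row in router_weights]
    probs = softmax(scores)
    chosen = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:top_k]
    norm = sum(probs[i] for i in chosen)
    # Renormalize so the gates of the selected experts sum to 1.
    return [(i, probs[i] / norm) for i in chosen]

def moe_layer(token, router_weights, experts, top_k=1):
    """Run only the selected experts and blend their outputs by gate weight."""
    output = [0.0] * len(token)
    for index, gate in route(token, router_weights, top_k):
        expert_out = experts[index](token)
        output = [o + gate * e for o, e in zip(output, expert_out)]
    return output

# Two toy "experts" (hypothetical): one doubles the input, one negates it.
EXPERTS = [lambda v: [2 * x for x in v], lambda v: [-x for x in v]]
# Hypothetical router: expert 0 prefers the first feature, expert 1 the second.
ROUTER = [[1.0, 0.0], [0.0, 1.0]]

print(moe_layer([3.0, 0.5], ROUTER, EXPERTS))  # [6.0, 1.0], handled by expert 0
print(moe_layer([0.5, 3.0], ROUTER, EXPERTS))  # [-0.5, -3.0], handled by expert 1
```

With `top_k=1`, each input runs through a single expert, which is where the compute savings come from: the other experts' parameters are never touched for that input.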

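The paper represents a milestone in Apple's AI research. MM1 has neither upended the industry nor stunned the world, but behind the dense technical jargon Apple's progress is still visible:

Our way of working has always been to do the work first and talk about it afterward, rather than getting out ahead of ourselves. (Tim Cook)

Having disclosed few technical details, Apple is also planning another move: on-device large models.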

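As early as the end of last year, in a paper titled "LLM in a flash: Efficient Large Language Model Inference with Limited Memory," Apple proposed a method for running large models on memory-constrained devices such as the iPhone.

The researchers say that, using recent flash-memory techniques, they successfully deployed LLMs (large language models) on iPhones and other memory-constrained devices.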

The project is referred to as Apple GPT. Its key capability is storing LLM data directly in flash memory, which would allow the model to be integrated into features such as Siri. Compared with conventional loading, the new technique speeds up inference by up to 5 times on the CPU and up to 25 times on the GPU.

The researchers said their efficiency methods make it possible to run AI models up to twice the size of the memory available on a device such as the iPhone.

In other words, running large models on-device is feasible: the approach reduces the amount of data transferred from flash memory, increases the throughput of each transfer, and keeps the LLM data stored directly in flash.
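As a rough illustration of "keep the weights in storage and fetch only what is needed," the sketch below memory-maps a small weight file and reads just the requested rows. It is a loose analogy under stated assumptions, not the paper's actual method: the file layout, the `ROWS` and `COLS` sizes, and the choice of which rows count as "active" are all hypothetical.

```python
import mmap
import os
import struct
import tempfile

ROWS, COLS = 8, 4            # toy weight matrix shape (hypothetical)
ROW_BYTES = COLS * 4         # each value is stored as a 4-byte float32

def write_weights(path):
    """Write a toy matrix to disk; row r holds r, r + 0.5, r + 1.0, ..."""
    with open(path, "wb") as f:
        for r in range(ROWS):
            f.write(struct.pack(f"<{COLS}f", *[r + 0.5 * c for c in range(COLS)]))

def load_rows(path, wanted_rows):
    """Read only the requested rows from storage instead of the whole file."""
    loaded = {}
    with open(path, "rb") as f, mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ) as mm:
        for r in wanted_rows:
            start = r * ROW_BYTES
            loaded[r] = list(struct.unpack(f"<{COLS}f", mm[start:start + ROW_BYTES]))
    return loaded

with tempfile.TemporaryDirectory() as tmp:
    weights_path = os.path.join(tmp, "weights.bin")
    write_weights(weights_path)
    total_bytes = os.path.getsize(weights_path)
    active = load_rows(weights_path, [2, 5])   # pretend only these rows are needed
    bytes_read = len(active) * ROW_BYTES

print(active[2])                 # [2.0, 2.5, 3.0, 3.5]
print(bytes_read, total_bytes)   # 32 128
```

Here only 32 of the file's 128 bytes are copied into memory. A real system would also have to decide which rows are needed and batch its reads into larger chunks, which this toy example does not attempt.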

Technology aside, Siri is the bridge between us and AI

Slow progress, little news, and a broad strategic footprint: that is the impression Apple's exploration of AI gives.

Whenever an Apple technology appears to lag behind the market and its competitors, it creates the illusion that Apple "started too late." In fact, a look through the relevant news and patent filings shows that Apple was often among the first to stake out the field, or even the very first.

As of 2023, Apple had acquired 32 AI companies, more than any other technology giant. The acquisition of Siri can be regarded as the start of Apple's move into AI.

In 2010, a phone call from Steve Jobs to Dag Kittlaus, the "father of Siri," led to Siri joining Apple in a deal worth more than US$200 million and then arriving on the iPhone.

Siri was originally positioned as an assistant for getting information quickly and accurately and for handling complex tasks.

In its earliest version, Siri could connect to 42 web services, from the restaurant review site Yelp and the ticketing site StubHub to the movie review site Rotten Tomatoes and the computational engine Wolfram Alpha.

Given a request, Siri would pull together the relevant information and reply to the user. It could help users buy tickets, book a restaurant, or hail a cab without opening another app.

The "AI features" now being heavily promoted by the AI Pin and other smart assistants look like the basic operations Siri offered more than ten years ago.

But the explosive development of large AI models has left Siri's actual experience far behind.

An intelligent assistant passively imitates a person: it answers when asked and responds when requested.

An AI device, by contrast, actively reaches out to the user. Drawing on personal habits and preferences, it summarizes past behavior, reasons over it, and offers the most suitable suggestions and answers for the time and place. It also keeps learning and optimizing until it becomes a truly personal assistant.

Overall, Apple is late only in relative terms, because AI phones are still at an early stage of development.

Admittedly, most Chinese brands have already pushed into AI devices, with broadly similar features and different specialties. Yet the usability of each large model is only passable. Apart from a few specific features, such as AI object removal in OPPO's photo album, Samsung's real-time call translation, and AI calling in Xiaomi's Xiao Ai, most of the experience still falls short of standalone AI apps.

Beyond vendors' technical breakthroughs, this also depends on the interfaces that apps open up. For example, a phone that cannot summarize WeChat voice calls loses a large area of everyday use.

So integrating large models with systems and apps, and exploring new ways of interacting, still has a long way to go. Until that happens, AI features are not yet at the point of influencing consumers' purchasing decisions.

In its first year with AI, Siri's goal is to close a gap of more than half a year with other AI assistants. And as an important part of Apple's future strategy, Siri leaves us looking forward to what "One more thing" it will bring in June.
