


On April 20, Mobvoi held its 2023 AIGC strategy conference in Beijing under the theme "AGI·Advent". At the event, Mobvoi announced the internal beta of its self-developed large model "Sequence Monkey" and put forward the view that CoPilot will be everywhere. Building on the model's capabilities, it also unveiled a CoPilot product matrix for creators, the upgraded consumer-facing (C-side) voice assistant "Magic Xiaowen", and an internal beta exploring enterprise-specific large models for business (B-side) customers. The CoPilot product matrix for creators comprises four AIGC products: the AI writing platform "Qiaowen", the AI painting platform "Yihua", the AI dubbing platform "Magic Sound Workshop", and the digital human video and live broadcast platform "Wonderful Yuan".
Self-developed large model "Sequence Monkey" to help AGI "arrive"
Mobvoi's large model "Sequence Monkey" is a large language model with multi-modal generation capabilities. Its language-centered capability system covers six dimensions: knowledge, dialogue, mathematics, logic, reasoning, and planning. It can support different tasks at the same time, including text generation, image generation, 3D content generation, speech generation, and speech recognition.
At present, the "Sequence Monkey" large model already has a degree of natural language understanding, knowledge, logic, and reasoning ability. It can quickly give accurate answers to relatively complex questions such as "Which provincial capital has more people, Hunan's or Hubei's?"
According to reports, the Chinese name "Sequence Monkey" of Mobvoi's self-developed large model was inspired by the "Infinite Monkey" theorem proposed by mathematician Émile Borel in the early 20th century. According to this theorem, if a group of monkeys hit typewriter keys at random for long enough, they would eventually type out a complete work of Shakespeare. The theorem rests on probability theory and combinatorics and illustrates that an event with non-zero probability becomes almost certain given enough attempts. The construction principle of Mobvoi's self-developed large model is similar to that of the "Infinite Monkey": massive text sequences are continuously trained on with Mobvoi's own algorithms and digested and understood with large-scale computing power, and the result is "Sequence Monkey".
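The arithmetic behind the theorem can be sketched in a few lines. This is only an illustration of the probability argument, not of how "Sequence Monkey" is trained. It assumes each keystroke is uniform and independent, and that typing happens in non-overlapping blocks the length of the target text. The function names and the 27-key typewriter are hypothetical choices made for this example.

```python
import math
from fractions import Fraction


def match_probability(target: str, alphabet_size: int) -> Fraction:
    """Exact chance that len(target) uniformly random keystrokes spell target."""
    return Fraction(1, alphabet_size) ** len(target)


def prob_at_least_once(target: str, alphabet_size: int, attempts: int) -> float:
    """Chance that at least one of `attempts` independent blocks matches target.

    Computes 1 - (1 - p)^attempts via log1p/expm1 so tiny p does not underflow.
    """
    p = float(match_probability(target, alphabet_size))
    return -math.expm1(attempts * math.log1p(-p))


if __name__ == "__main__":
    # Hypothetical typewriter: 26 letters plus a space bar.
    phrase = "to be"
    for n in (10**6, 10**8, 10**10):
        print(n, prob_at_least_once(phrase, 27, n))
```

The probability of a single block matching is tiny, but the chance of at least one match climbs toward 1 as the number of attempts grows, which is the point the theorem makes.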
Li Zhifei, founder and CEO of Mobvoi, said that large models are not just about a large number of parameters. Today's large models are an in-depth modeling of Internet text, and Internet text is a mapping of world knowledge, so a large model is a cognitive model and a modeling of language. Language is the boundary of thinking, so large models have unlimited room for imagination. "Sequence Monkey" already demonstrated "emergent" abilities during training. It is currently in the "enlightenment" stage and will improve faster and faster in the future. He also believes that being a human's CoPilot will be the best "job" for large models, and that CoPilot will be everywhere in the future.
One-stop CoPilot product matrix, opening up the entire content creation process
Based on the "Sequence Monkey" large model, Mobvoi has explored a variety of creator-oriented AIGC products and applications during its internal beta, building a one-stop CoPilot product matrix for creators that covers the entire content creation process. Mobvoi says it has achieved a "trinity" of technology, products, and commercialization, with over 5 million registered users and over one million paying users worldwide.
Qiaowen-Your AI Writing Assistant (write.mobvoi.com)
As the first CoPilot product unveiled at the conference, "Qiaowen" offers AI writing capabilities covering four major content creation scenarios: workplace, marketing, new media, and creative writing. It can continuously provide users with inspiration and creative direction when they write year-end summaries, customer service scripts, screenplays, advertising copy, and other content.
To help users create content more efficiently, "Qiaowen" offers eight AI editing functions: style transformation, key point extraction, proofreading and error correction, continuation, rewriting, expansion, abbreviation, and translation. "Qiaowen" can also automatically generate pictures during writing, giving users a writing experience that combines pictures and text.
Yihua-Your AI Painting Assistant (paint.mobvoi.com)
For designers, illustrators, and other creators who have strong design needs beyond text, Mobvoi is exploring the "Yihua" AI painting platform in its internal beta. "Yihua" supports 8 creative styles, including anime (two-dimensional), steampunk, and illustration. Users only need to enter text, and "Yihua" can generate 8 2K high-resolution images with realistic light and shadow and rich details at one time.
In addition to text-to-image generation, it also offers AI drawing capabilities such as image-to-image generation, animation generation, and personalized avatar generation, which greatly enriches the ways users can create. For enterprise users, "Yihua" also supports exclusive model customization, allowing them to customize the model style to their own needs, and supports multi-person collaborative production to better meet enterprise drawing needs.
Currently, "Yihua" has reached an agreement to explore cooperation with the home decoration design platform Kujiale. On the Kujiale platform, users can describe their needs in words through "Yihua" to easily change the decoration style, adjust the position of furniture, and so on, and then design the decoration plan they like.
Magic Sound Workshop-Your AI Dubbing Assistant (moyin.com)
For AI dubbing scenarios, Mobvoi has launched a new generation of its AI dubbing product "Magic Sound Workshop". "Magic Sound Workshop" (DupDub in its overseas version) is described as a world-leading, full-process, one-stop AI dubbing platform, offering users more than 1,000 timbres, more than 2,000 voice styles, and more than 20 dialects and foreign languages.
With the support of large model technology, "Magic Sound Workshop" is billed as the world's first dubbing platform equipped with large-model AI writing functions, covering multiple scenarios such as AI writing, AI dubbing, and editing. With its assistance, users can easily complete content creation that integrates copywriting and dubbing, such as film and television commentary, audio books, online education, and news broadcasts. At present, "Magic Sound Workshop" has partnered with leading companies in many industries, such as WeChat Reading, Juvenile Get, and Volkswagen.
To provide users with a better dubbing experience, "Magic Sound Workshop" supports adjusting the selected voice across 7 emotions, including calm, sad, and happy, and transferring it across 10 character types. It has also opened personalized AI voice editing functions such as rhythm adjustment, local speed change, and multi-person dubbing, allowing users to edit sound the way they would edit a document in Word.
In addition to selecting and editing voices, "Magic Sound Workshop" will also launch a "pinch sound" function based on the large model's language generation capabilities, allowing users to freely choose voice characteristics such as gender, age, language, style, and emotion, and create a voice they like from scratch.
Wonderful Yuan-Your AI Digital Avatar (weta365.com)
Building on its ability to generate text, images, sound, and more, and to help video content creators produce novel, lively, and interesting works faster and better, Mobvoi is exploring the AI digital human creation and live broadcast platform "Wonderful Yuan" in its internal beta.
According to reports, the "Wonderful Yuan" platform currently has over 100 digital humans, over 1,000 3D digital assets, and over 1,000 voices. With multi-modal generation technology, the platform currently supports three forms of digital human generation: picture modeling (2D digital human), video modeling (2.5D digital human), and 3D modeling (3D digital human). Its image cloning function needs only 5 minutes of real-person video footage to reproduce the user's appearance and demeanor 1:1, creating a digital clone with a consistent voice and natural movements.
From individuals to enterprises, CoPilot will be everywhere
Mobvoi has deep experience in the field of voice assistants, and its research on AI voice can be traced back ten years. Mobvoi released its first voice assistant "Xiaowen Assistant" in 2014, iterated to "Xiaowen Secretary" in 2015, launched a full-scenario VPA in 2017, and upgraded the VPA in 2019. After ten years of accumulation and development, the CoPilot "Magic Xiaowen", now in internal beta and presented at this conference, can be understood as a combination of Siri and ChatGPT.
"Hello, where is the nearest hot pot restaurant?" "Hello, can you tell me the weather in Beijing tomorrow?" People are accustomed to looking up such information before making everyday decisions about food, clothing, housing, and transportation. As an intelligent voice assistant developed for individual users, "Magic Xiaowen" can help users look up encyclopedia information, weather, restaurants, and hotels, and can also chat freely with users, making information quicker and more convenient to obtain.
According to Mobvoi, "CoPilot" derives from the concept of a super assistant. It will have a highly intelligent brain based on a large model, able to analyze massive amounts of data and communicate with humans instantly and accurately. It will also have a pleasant voice and an attractive appearance, and can run on any hardware platform, such as mobile phones, watches, and in-car systems. It can also be adapted to various industries, acting as a high-quality teacher, a knowledgeable lawyer, a professional doctor, or a financial customer service agent, offering professional knowledge whenever and wherever it is needed and sharing the workload.
"Sequence Monkey" opens up the ecosystem and empowers more industries
Built on the underlying capabilities of the "Sequence Monkey" large model, Mobvoi "CoPilot" serves B-side users and vertical fields. It not only provides general capability support and digital human image customization services, but will also open up its role capabilities and continue to iterate its data interfaces. Enterprise users can log in on the web to call API services and upload documents to train on industry-specific content, achieving customized voice interaction. The URL of Sequence Monkey is openapi.mobvoi.com.
Currently, Mobvoi has partnered with a first batch of internal beta partners in ten major industries, including automobiles, education, law firms, finance, medical care, and tourism. In the future, Mobvoi "CoPilot" will gradually empower more industries, helping more companies have their own exclusive large models and create their own exclusive "CoPilot".



