He Zhongjiang, General Manager of China Telecom Artificial Intelligence: Supernatural Voice 2.0 will be released in 2024-AI-php.cn

He Zhongjiang, General Manager of China Telecom Artificial Intelligence: Supernatural Voice 2.0 will be released in 2024

WBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWB

Nov 10, 2023 pm 08:21 PM

AIYearnatural speech

On the afternoon of November 10, He Zhongjiang, General Manager of China Telecom Artificial Intelligence, interpreted the products and ideas of General Large Model at the Artificial Intelligence and Data Industry Development Cooperation Forum

He Zhongjiang, General Manager of China Telecom Artificial Intelligence: Supernatural Voice 2.0 will be released in 2024

He Zhongjiang first shared his views on general artificial intelligence. He believed that general artificial intelligence refers to the ability to see, listen, and think like humans. Being able to see requires visual technology, and being able to listen requires voice technology. After the information and voice information are collected into the brain, the brain processes and judges it and provides decision-making ideas. The general large model plays the role of the brain. Today's massive data, advanced algorithms, and solid computing power will also promote the large-scale development of large models.

After explaining the basic views, He Zhongjiang gave a detailed explanation from the China Telecom Star Semantic Model and the China Telecom Star Multimodal Model. The China Telecom Star Semantic Large Model is the core of general artificial intelligence. It has better capabilities and can alleviate multiple rounds of hallucinations, reducing the "hallucination rate" by 40%. In the future, China Telecom's star semantic large model can empower 2B2G services externally, improve quality and efficiency, and optimize experience; it can be fully applied internally, improve production collaboration efficiency, and have richer applications. He Zhongjiang also revealed that China Telecom’s AI team will also participate in the open source and open source process. It will open source the tens of billions model before the end of this year and the hundreds of billions model in April next year. All underlying codes will be open sourced.

When He Zhongjiang introduced China Telecom’s Xingchen multi-modal large model, he said that China Telecom has trained more than 1.2 billion image and text pairs, using a mixed precision strategy to significantly improve GPU efficiency and speed up inference by 4.5 times. The multi-modal large model will As the basic capability base for the next generation of digital people.

By comparing Wanhao intelligent customer service voice with Supernatural TTS1.0, He Zhongjiang said that China Telecom Xingchen Voice Large Model 1.0 can achieve naturalness comparable to real people, real-time streaming into a suitable voice; the first packet response time is less than 50 milliseconds; it supports extremely Small data volume sound conversion and customization, thereby achieving better, faster and more flexible. He also revealed that Supernatural Speech Synthesis 2.0 will be released in mid-2024.

China Telecom HR is based on the China Telecom Star multi-modal large model, and uses basic digital avatars to display functions such as arbitrary matching of makeup accessories and personalized generation and customization. He Zhongjiang said that with the continuous enhancement of large-scale model technology and the continuous enrichment of knowledge, digital people in the virtual space and robots in the real world will have an increasing impact on people's production, operation and life, and the era of artificial intelligence is about to truly come!

Operator Finance (official WeChat public account yyscjrd) - a mainstream financial website, a website that comprehensively covers technology, finance, securities, automobiles, real estate, food, medicine, daily chemicals, wine and other consumer products .

The above is the detailed content of He Zhongjiang, General Manager of China Telecom Artificial Intelligence: Supernatural Voice 2.0 will be released in 2024. For more information, please follow other related articles on the PHP Chinese website!

Statement

This article is reproduced at:搜狐. If there is any infringement, please contact admin@php.cn delete

As AI Use Soars, Companies Shift From SEO To GEOMay 05, 2025 am 11:09 AM

With the explosion of AI applications, enterprises are shifting from traditional search engine optimization (SEO) to generative engine optimization (GEO). Google is leading the shift. Its "AI Overview" feature has served over a billion users, providing full answers before users click on the link. [^2] Other participants are also rapidly rising. ChatGPT, Microsoft Copilot and Perplexity are creating a new “answer engine” category that completely bypasses traditional search results. If your business doesn't show up in these AI-generated answers, potential customers may never find you—even if you rank high in traditional search results. From SEO to GEO – What exactly does this mean? For decades

Big Bets On Which Of These Pathways Will Push Today's AI To Become Prized AGIMay 05, 2025 am 11:08 AM

Let's explore the potential paths to Artificial General Intelligence (AGI). This analysis is part of my ongoing Forbes column on AI advancements, delving into the complexities of achieving AGI and Artificial Superintelligence (ASI). (See related art

Do You Train Your Chatbot, Or Vice Versa?May 05, 2025 am 11:07 AM

Human-computer interaction: a delicate dance of adaptation Interacting with an AI chatbot is like participating in a delicate dance of mutual influence. Your questions, responses, and preferences gradually shape the system to better meet your needs. Modern language models adapt to user preferences through explicit feedback mechanisms and implicit pattern recognition. They learn your communication style, remember your preferences, and gradually adjust their responses to fit your expectations. Yet, while we train our digital partners, something equally important is happening in the reverse direction. Our interactions with these systems are subtly reshaping our own communication patterns, thinking processes, and even expectations of interpersonal conversations. Our interactions with AI systems have begun to reshape our expectations of interpersonal interactions. We adapted to instant response,

California Taps AI To Fast-Track Wildfire Recovery PermitsMay 04, 2025 am 11:10 AM

AI Streamlines Wildfire Recovery Permitting Australian tech firm Archistar's AI software, utilizing machine learning and computer vision, automates the assessment of building plans for compliance with local regulations. This pre-validation significan

What The US Can Learn From Estonia's AI-Powered Digital GovernmentMay 04, 2025 am 11:09 AM

Estonia's Digital Government: A Model for the US? The US struggles with bureaucratic inefficiencies, but Estonia offers a compelling alternative. This small nation boasts a nearly 100% digitized, citizen-centric government powered by AI. This isn't

Wedding Planning Via Generative AIMay 04, 2025 am 11:08 AM

Planning a wedding is a monumental task, often overwhelming even the most organized couples. This article, part of an ongoing Forbes series on AI's impact (see link here), explores how generative AI can revolutionize wedding planning. The Wedding Pl

What Are Digital Defense AI Agents?May 04, 2025 am 11:07 AM

Businesses increasingly leverage AI agents for sales, while governments utilize them for various established tasks. However, consumer advocates highlight the need for individuals to possess their own AI agents as a defense against the often-targeted

A Business Leader's Guide To Generative Engine Optimization (GEO)May 03, 2025 am 11:14 AM

Google is leading this shift. Its "AI Overviews" feature already serves more than one billion users, providing complete answers before anyone clicks a link.[^2] Other players are also gaining ground fast. ChatGPT, Microsoft Copilot, and Pe

See all articles

Hot AI Tools

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress images for free

Clothoff.io

AI clothes remover

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

How to fix KB5055523 fails to install in Windows 11?

3 weeks agoByDDD

How to fix KB5055518 fails to install in Windows 10?

3 weeks agoByDDD

Roblox: Dead Rails - How To Tame Wolves

4 weeks agoByDDD

Strength Levels for Every Enemy & Monster in R.E.P.O.

4 weeks agoBy尊渡假赌尊渡假赌尊渡假赌

Roblox: Grow A Garden - Complete Mutation Guide

2 weeks agoByDDD

Hot Tools

mPDF

mPDF is a PHP library that can generate PDF files from UTF-8 encoded HTML. The original author, Ian Back, wrote mPDF to output PDF files "on the fly" from his website and handle different languages. It is slower than original scripts like HTML2FPDF and produces larger files when using Unicode fonts, but supports CSS styles etc. and has a lot of enhancements. Supports almost all languages, including RTL (Arabic and Hebrew) and CJK (Chinese, Japanese and Korean). Supports nested block-level elements (such as P, DIV),