Home >Technology peripherals >AI >The fake one looks like the real thing, Tiangong Music's big model brings a subversive AI experience

The fake one looks like the real thing, Tiangong Music's big model brings a subversive AI experience

王林
王林forward
2024-04-03 21:25:102413browse

Yesterday, Kunlun Wanwei’s large-scale AI music generation model “Tiangong SkyMusic” launched a free test invitation event. The media, industry experts and interested music practitioners are sincerely invited to experience SOTA's music model products. This product enables users to have an immersive experience while experiencing the emotional expression of human voices together.

After the invitation test started, the majority of users have high expectations for the "Tiangong SkyMusic" AI music generation large model. The staff received hundreds of thousands of test applications in a very short period of time, including many professional music creators, media and industry experts. At the same time, a large number of test applications are continuously sent to the backend. Among the applications, there are many professional music creators, media and industry experts, and there are also a large number of test applications that need to be continuously screened and reviewed. Many of the applicants include many professional music creators, media and industry experts, who continued to provide valuable feedback and opinions during the testing process

At the same time , we have also received a lot of real feedback and high praise from users:

"The vocals are very clear and the lyrical melody is good"

"It sounds great!"

"This is made by heaven?? It's amazing!"

"The song "Wukong" is sung with a sense of breath. , the emotions are very good, and it basically represents the pinnacle of the ability to generate emotions and make it look real."

"Tiangong SkyMusic's high-pitched singing skills are beyond my imagination, very excellent."

"It's too powerful. 1. The user base of AI music is very large; 2. The generated music can be used repeatedly; 3. It is easy to do social fission"

"The chorus part is so smooth and the beat is so good"

"I didn't expect that the Chinese team could make something better than foreign ones"

The fake one looks like the real thing, Tiangong Musics big model brings a subversive AI experienceUser AI Music Generation Works

Since the enthusiastic feedback from the majority of users has shown us the industry’s high expectations for the "Tiangong SkyMusic" AI music generation large model, it has also allowed us to see the progress in In the exploration direction of AGI large models focusing on "intelligence", the importance of "emotional AGI" is important.

Compared with text and pictures, audio content is the best way to understand human emotions, and music is the most abundant content carrier for expressing human emotions and the most unrestricted by region and culture. No matter the changes of the times, no matter the Whether it is war or disaster, people can always convey their feelings and obtain emotional comfort through music. This is the original intention of "Tiangong SkyMusic", and it is also an important direction that Kunlun Wanwei Emotion AGI continues to explore.

We would like to thank all users who actively participated and gave enthusiastic feedback. Thank you for your support, encouragement and companionship. We will continue to iterate, optimize and broaden the capabilities of "Tiangong SkyMusic" to make the model more powerful. Multi-modal emotional understanding and expression capabilities bring users a better AI music experience.

Finally, we will soon provide the "Tiangong SkyMusic" music creation prompt guide, and provide more AI music demos and usage techniques, so as to explore the powerful charm of AI music with users!

About "Tiangong SkyMusic" and "Tiangong 3.0"

"Tiangong SkyMusic" AI music generation large model is based on Kunlun Wanwei "Tiangong 3.0" Super large model creation. On April 17, "Tiangong SkyMusic" will open public beta simultaneously with "Tiangong 3.0".

Application webpage: https://rg975ojk5z.feishu.cn/share/base/form/shrcnTcBRpGzv5Sx9xAGd5V97Md

"Tiangong 3.0" uses a 400 billion-level parameter MoE hybrid expert model , and will be simultaneously selected as open source. It is one of the MoE models with the largest model parameters and the strongest performance in the world. Compared with the previous generation "Tiangong 2.0" MoE large model, "Tiangong 3.0" has amazing performance improvements in areas such as model semantic understanding, logical reasoning, versatility, generalization, uncertainty knowledge, and learning capabilities. Its model technical knowledge ability has increased by more than 20%, and its mathematics/reasoning/coding/cultural and creative abilities have increased by more than 30%. At the same time, "Tiangong 3.0" has added the ability to search enhancements, research modes, call codes and draw charts, call online searches multiple times, etc., and has trained the agent capabilities of the model in a targeted manner, so that "Tiangong 3.0" can independently complete Plan, call, and combine external tools and information to accurately and efficiently complete various complex needs such as industrial analysis and product comparison, bringing a new disruptive artificial intelligence experience.

"Tiangong SkyMusic" is currently the first and only publicly available large-scale AI music generation model in China. It adopts a Sora-like model architecture in the music audio field, and Large-scale Transformer is responsible for composing music. , to learn the contextual dependencies of Music Patches and achieve music controllability; Diffusion Transformer is responsible for singing and restoring Music Patches into high-quality audio through LDM, so that "Tiangong SkyMusic" can support the generation of 80 seconds 44100Hz sampling rate dual sound stereo songs. This model architecture works extremely well in the fields of video, audio, and music. The Kunlun Wanwei team will also gradually iterate and add new capabilities in the future, so that the model has multi-modal emotional understanding and expression capabilities.

"Tiangong SkyMusic" test application website: https://rg975ojk5z.feishu.cn/share/base/form/shrcnTcBRpGzv5Sx9xAGd5V97Md

"Tiangong SkyMusic" has the following five characteristics:

1. High-quality AI music

"Tiangong SkyMusic" can generate 80 seconds 44100Hz sampling rate two-channel stereo AI songs, and can generate corresponding song styles according to the lyrics style input by the user .

2. Human voices are "fake and real"

Vocal synthesis is the most important dimension in AI music generation that best reflects the generation effect and quality. The AI ​​vocal synthesis of "Tiangong SkyMusic" can reach the industry's top SOTA level. The Chinese proficiency is extremely good, and the pronunciation is clear and no abnormal noise. Its Chinese singing effect is significantly better than that of foreign products, leading the world level.

3. Lyric paragraph control

"Tiangong SkyMusic" can control songs through lyrics, so that the generated songs can clearly distinguish the emotional changes of different lyric paragraphs. Reflect the differences between verses and chorus, intro and verse.

4. Various music styles

"Tiangong SkyMusic" supports rap, folk, funk, ancient style, electronic and other music styles. Users are creating music , you can use the reference audio to formulate the desired music style.

5. Intelligent expression of music - learning of singing skills

"Tiangong SkyMusic" can also learn vibrato, opera, singing, male and female duets, automatic harmony, etc. A singing technique that allows users to create more appropriate emotional expressions in songs.

In 2023, driven by the strategy of “All in AGI and AIGC”, Kunlun Wanwei has made a lot of progress in the field of artificial intelligence, gradually forming AI large models, AI search, AI music, AI animation, AI business matrix such as AI social networking and AI games.

Currently, Kunlun Wanwei has created a comprehensive set of AI search, AI writing, AI long text reading, AI dialogue, AI speech synthesis, AI image generation, AI comic creation, AI image recognition, AI music generation, AI The "Tiangong 3.0" multi-modal "Super Model" (Super Model), which integrates multiple capabilities such as code writing and AI table generation, has become a new milestone in the AI ​​industry.

The above is the detailed content of The fake one looks like the real thing, Tiangong Music's big model brings a subversive AI experience. For more information, please follow other related articles on the PHP Chinese website!

Statement:
This article is reproduced at:jiqizhixin.com. If there is any infringement, please contact admin@php.cn delete