search
HomeTechnology peripheralsAIThe 'golden partner' of large models is here! Tencent Cloud officially releases AI native vector database, providing 1 billion-level vector retrieval capabilities

On July 4, Tencent Cloud officially released the AI ​​native (AI Native) vector database Tencent Cloud VectorDB. This database can be widely used in scenarios such as large model training, inference, and knowledge base supplementation. It is the first vector database in China that provides full life cycle AI from the access layer, computing layer, to storage layer.

Known in the industry as the "hippocampus" of large models, vector databases are specifically designed to store and query vector data. According to reports, Tencent Cloud's vector database supports up to 1 billion vector retrieval scale, with latency controlled at the millisecond level. Compared with traditional stand-alone plug-in databases, the retrieval scale is increased by 10 times, and it also has a peak query capacity of one million levels per second (QPS).

Tencent Cloud defines AI Native vector database

With the advent of the big model era, embracing big models has become a necessity for enterprises.

By vectorizing data for storage, vector databases can significantly improve efficiency and reduce costs. It can solve the problems of high pre-training costs for large models, no "long-term memory", insufficient knowledge updates, and complex prompt word engineering. It breaks through the time and space limitations of large models and accelerates the implementation of large models in industry scenarios.

Statistics show that using Tencent Cloud Vector Database for classification, deduplication and cleaning of large model pre-training data can achieve a 10 times improvement in efficiency compared to traditional methods. If the vector database is used as an external knowledge base for model reasoning, Then the cost can be reduced by 2-4 orders of magnitude.

It is worth noting that Tencent Cloud has redefined the development paradigm of AI Native and provided a comprehensive AI solution for the access layer, computing layer, and storage layer, enabling users to use vector databases throughout the entire life cycle. Apply to AI capabilities.

Specifically, at the access layer, Tencent Cloud Vector Database supports the input of natural language text, adopts the "scalar vector" query method, supports full memory indexing, and supports up to one million queries per second (QPS). ; At the computing layer, the AI ​​Native development paradigm can realize full-scale data AI calculations, and one-stop solves problems such as text segmentation (segmentation) and vectorization (embedding) when enterprises build private domain knowledge bases; at the storage layer, Tencent Cloud Vector database supports intelligent storage distribution of data, helping enterprises reduce storage costs by 50%.

The golden partner of large models is here! Tencent Cloud officially releases AI native vector database, providing 1 billion-level vector retrieval capabilities

It used to take about a month for enterprises to access a large model. After using Tencent Cloud Vector Database, it can be completed in 3 days, which greatly reduces the enterprise's access costs.

It is understood that the vectorization capability (embedding) of Tencent Cloud Vector Database has been recognized by authoritative organizations many times. In 2021, it topped the MS MARCO list and related results have been published in the NLP Summit ACL.

Luo Yun, deputy general manager of Tencent Cloud Database, said that the era of AI Native has arrived. "Vector database large model data" and the three will produce a "flywheel effect" and jointly help enterprises enter the AI ​​Native era. )era.

Tencent Cloud Vector Database helps data access efficiency increase by 10 times

Tencent Cloud Vector Database is based on Tencent Group’s vector engine (OLAMA), which processes hundreds of billions of searches every day. After practice in Tencent’s internal massive scenarios, the efficiency of data access to AI is also 10 times higher than that of traditional solutions, and the operational stability is as high as 99.99%, it has been used in more than 30 national-level products such as Tencent Video, QQ Browser, QQ Music, etc.

Tencent Cloud vector database can effectively help products improve operational efficiency. Data shows that after using Tencent Cloud Vector Database, the per capita listening time of QQ Music increased by 3.2%, the per capita effective exposure time of Tencent Video increased by 1.74%, and the cost of QQ Browser decreased by 37.9%.

Take the application of Tencent Video as an example. Images, audio, title text and other contents in the video library use Tencent Cloud vector database. The average monthly retrieval and calculation volume is up to 20 billion times, which effectively meets the needs of copyright protection and original identification. , similarity retrieval and other scenario requirements.

Large model accelerated vector databases have entered a period of rapid development. According to Northeast Securities’ forecast, the global vector database market is expected to reach US$50 billion by 2030, and the domestic vector database market is expected to exceed RMB 60 billion.

Vector databases can help enterprises use large models more efficiently and conveniently to maximize the value of data. With the continuous development and popularization of large models, AI Native vector databases will become the standard for enterprise data processing.

The above is the detailed content of The 'golden partner' of large models is here! Tencent Cloud officially releases AI native vector database, providing 1 billion-level vector retrieval capabilities. For more information, please follow other related articles on the PHP Chinese website!

Statement
This article is reproduced at:搜狐. If there is any infringement, please contact admin@php.cn delete
The Hidden Dangers Of AI Internal Deployment: Governance Gaps And Catastrophic RisksThe Hidden Dangers Of AI Internal Deployment: Governance Gaps And Catastrophic RisksApr 28, 2025 am 11:12 AM

The unchecked internal deployment of advanced AI systems poses significant risks, according to a new report from Apollo Research. This lack of oversight, prevalent among major AI firms, allows for potential catastrophic outcomes, ranging from uncont

Building The AI PolygraphBuilding The AI PolygraphApr 28, 2025 am 11:11 AM

Traditional lie detectors are outdated. Relying on the pointer connected by the wristband, a lie detector that prints out the subject's vital signs and physical reactions is not accurate in identifying lies. This is why lie detection results are not usually adopted by the court, although it has led to many innocent people being jailed. In contrast, artificial intelligence is a powerful data engine, and its working principle is to observe all aspects. This means that scientists can apply artificial intelligence to applications seeking truth through a variety of ways. One approach is to analyze the vital sign responses of the person being interrogated like a lie detector, but with a more detailed and precise comparative analysis. Another approach is to use linguistic markup to analyze what people actually say and use logic and reasoning. As the saying goes, one lie breeds another lie, and eventually

Is AI Cleared For Takeoff In The Aerospace Industry?Is AI Cleared For Takeoff In The Aerospace Industry?Apr 28, 2025 am 11:10 AM

The aerospace industry, a pioneer of innovation, is leveraging AI to tackle its most intricate challenges. Modern aviation's increasing complexity necessitates AI's automation and real-time intelligence capabilities for enhanced safety, reduced oper

Watching Beijing's Spring Robot RaceWatching Beijing's Spring Robot RaceApr 28, 2025 am 11:09 AM

The rapid development of robotics has brought us a fascinating case study. The N2 robot from Noetix weighs over 40 pounds and is 3 feet tall and is said to be able to backflip. Unitree's G1 robot weighs about twice the size of the N2 and is about 4 feet tall. There are also many smaller humanoid robots participating in the competition, and there is even a robot that is driven forward by a fan. Data interpretation The half marathon attracted more than 12,000 spectators, but only 21 humanoid robots participated. Although the government pointed out that the participating robots conducted "intensive training" before the competition, not all robots completed the entire competition. Champion - Tiangong Ult developed by Beijing Humanoid Robot Innovation Center

The Mirror Trap: AI Ethics And The Collapse Of Human ImaginationThe Mirror Trap: AI Ethics And The Collapse Of Human ImaginationApr 28, 2025 am 11:08 AM

Artificial intelligence, in its current form, isn't truly intelligent; it's adept at mimicking and refining existing data. We're not creating artificial intelligence, but rather artificial inference—machines that process information, while humans su

New Google Leak Reveals Handy Google Photos Feature UpdateNew Google Leak Reveals Handy Google Photos Feature UpdateApr 28, 2025 am 11:07 AM

A report found that an updated interface was hidden in the code for Google Photos Android version 7.26, and each time you view a photo, a row of newly detected face thumbnails are displayed at the bottom of the screen. The new facial thumbnails are missing name tags, so I suspect you need to click on them individually to see more information about each detected person. For now, this feature provides no information other than those people that Google Photos has found in your images. This feature is not available yet, so we don't know how Google will use it accurately. Google can use thumbnails to speed up finding more photos of selected people, or may be used for other purposes, such as selecting the individual to edit. Let's wait and see. As for now

Guide to Reinforcement Finetuning - Analytics VidhyaGuide to Reinforcement Finetuning - Analytics VidhyaApr 28, 2025 am 09:30 AM

Reinforcement finetuning has shaken up AI development by teaching models to adjust based on human feedback. It blends supervised learning foundations with reward-based updates to make them safer, more accurate, and genuinely help

Let's Dance: Structured Movement To Fine-Tune Our Human Neural NetsLet's Dance: Structured Movement To Fine-Tune Our Human Neural NetsApr 27, 2025 am 11:09 AM

Scientists have extensively studied human and simpler neural networks (like those in C. elegans) to understand their functionality. However, a crucial question arises: how do we adapt our own neural networks to work effectively alongside novel AI s

See all articles

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

Video Face Swap

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Tools

SecLists

SecLists

SecLists is the ultimate security tester's companion. It is a collection of various types of lists that are frequently used during security assessments, all in one place. SecLists helps make security testing more efficient and productive by conveniently providing all the lists a security tester might need. List types include usernames, passwords, URLs, fuzzing payloads, sensitive data patterns, web shells, and more. The tester can simply pull this repository onto a new test machine and he will have access to every type of list he needs.

WebStorm Mac version

WebStorm Mac version

Useful JavaScript development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

Dreamweaver Mac version

Dreamweaver Mac version

Visual web development tools

Atom editor mac version download

Atom editor mac version download

The most popular open source editor