search
HomeTechnology peripheralsAIThe 'golden partner' of large models is here! Tencent Cloud officially releases AI native vector database, providing 1 billion-level vector retrieval capabilities

On July 4, Tencent Cloud officially released the AI ​​native (AI Native) vector database Tencent Cloud VectorDB. This database can be widely used in scenarios such as large model training, inference, and knowledge base supplementation. It is the first vector database in China that provides full life cycle AI from the access layer, computing layer, to storage layer.

Known in the industry as the "hippocampus" of large models, vector databases are specifically designed to store and query vector data. According to reports, Tencent Cloud's vector database supports up to 1 billion vector retrieval scale, with latency controlled at the millisecond level. Compared with traditional stand-alone plug-in databases, the retrieval scale is increased by 10 times, and it also has a peak query capacity of one million levels per second (QPS).

Tencent Cloud defines AI Native vector database

With the advent of the big model era, embracing big models has become a necessity for enterprises.

By vectorizing data for storage, vector databases can significantly improve efficiency and reduce costs. It can solve the problems of high pre-training costs for large models, no "long-term memory", insufficient knowledge updates, and complex prompt word engineering. It breaks through the time and space limitations of large models and accelerates the implementation of large models in industry scenarios.

Statistics show that using Tencent Cloud Vector Database for classification, deduplication and cleaning of large model pre-training data can achieve a 10 times improvement in efficiency compared to traditional methods. If the vector database is used as an external knowledge base for model reasoning, Then the cost can be reduced by 2-4 orders of magnitude.

It is worth noting that Tencent Cloud has redefined the development paradigm of AI Native and provided a comprehensive AI solution for the access layer, computing layer, and storage layer, enabling users to use vector databases throughout the entire life cycle. Apply to AI capabilities.

Specifically, at the access layer, Tencent Cloud Vector Database supports the input of natural language text, adopts the "scalar vector" query method, supports full memory indexing, and supports up to one million queries per second (QPS). ; At the computing layer, the AI ​​Native development paradigm can realize full-scale data AI calculations, and one-stop solves problems such as text segmentation (segmentation) and vectorization (embedding) when enterprises build private domain knowledge bases; at the storage layer, Tencent Cloud Vector database supports intelligent storage distribution of data, helping enterprises reduce storage costs by 50%.

The golden partner of large models is here! Tencent Cloud officially releases AI native vector database, providing 1 billion-level vector retrieval capabilities

It used to take about a month for enterprises to access a large model. After using Tencent Cloud Vector Database, it can be completed in 3 days, which greatly reduces the enterprise's access costs.

It is understood that the vectorization capability (embedding) of Tencent Cloud Vector Database has been recognized by authoritative organizations many times. In 2021, it topped the MS MARCO list and related results have been published in the NLP Summit ACL.

Luo Yun, deputy general manager of Tencent Cloud Database, said that the era of AI Native has arrived. "Vector database large model data" and the three will produce a "flywheel effect" and jointly help enterprises enter the AI ​​Native era. )era.

Tencent Cloud Vector Database helps data access efficiency increase by 10 times

Tencent Cloud Vector Database is based on Tencent Group’s vector engine (OLAMA), which processes hundreds of billions of searches every day. After practice in Tencent’s internal massive scenarios, the efficiency of data access to AI is also 10 times higher than that of traditional solutions, and the operational stability is as high as 99.99%, it has been used in more than 30 national-level products such as Tencent Video, QQ Browser, QQ Music, etc.

Tencent Cloud vector database can effectively help products improve operational efficiency. Data shows that after using Tencent Cloud Vector Database, the per capita listening time of QQ Music increased by 3.2%, the per capita effective exposure time of Tencent Video increased by 1.74%, and the cost of QQ Browser decreased by 37.9%.

Take the application of Tencent Video as an example. Images, audio, title text and other contents in the video library use Tencent Cloud vector database. The average monthly retrieval and calculation volume is up to 20 billion times, which effectively meets the needs of copyright protection and original identification. , similarity retrieval and other scenario requirements.

Large model accelerated vector databases have entered a period of rapid development. According to Northeast Securities’ forecast, the global vector database market is expected to reach US$50 billion by 2030, and the domestic vector database market is expected to exceed RMB 60 billion.

Vector databases can help enterprises use large models more efficiently and conveniently to maximize the value of data. With the continuous development and popularization of large models, AI Native vector databases will become the standard for enterprise data processing.

The above is the detailed content of The 'golden partner' of large models is here! Tencent Cloud officially releases AI native vector database, providing 1 billion-level vector retrieval capabilities. For more information, please follow other related articles on the PHP Chinese website!

Statement
This article is reproduced at:搜狐. If there is any infringement, please contact admin@php.cn delete
AI技术加速迭代:周鸿祎视角下的大模型战略AI技术加速迭代:周鸿祎视角下的大模型战略Jun 15, 2023 pm 02:25 PM

今年以来,360集团创始人周鸿祎在所有公开场合的讲话都离不开一个话题,那就是人工智能大模型。他曾自称“GPT的布道者”,对ChatGPT取得的突破赞不绝口,更是坚定看好由此产生的AI技术迭代。作为一个擅于表达的明星企业家,周鸿祎的演讲往往妙语连珠,所以他的“布道”也创造过很多热点话题,确实为AI大模型添了一把火。但对周鸿祎而言,光做意见领袖还不够,外界更关心他执掌的360公司如何应对这波AI新浪潮。事实上,在360内部,周鸿祎也早已掀起一场全员变革,4月份,他发出内部信,要求360每一位员工、每

蚂蚁集团推出金融大模型产品:金融助理“支小宝 2.0”和业务助手“支小助”,已完成备案并即将上线蚂蚁集团推出金融大模型产品:金融助理“支小宝 2.0”和业务助手“支小助”,已完成备案并即将上线Sep 10, 2023 pm 06:13 PM

蚂蚁集团在上海举行的第二届外滩大会上,宣布正式发布旗下的金融大模型据蚂蚁集团介绍,蚂蚁金融大模型基于其自研基础大模型,并针对金融产业进行深度定制,底层算力集群达到万卡规模。目前,该大模型已在蚂蚁集团财富、保险平台全面测试。同时,基于该大模型的两款产品——智能金融助理“支小宝2.0”、服务金融产业专家的智能业务助手“支小助”,也已正式亮相。据介绍,两款大模型产品展示了蚂蚁从基础大模型到行业大模型以及产业应用的全栈布局和进展。本站附两款产品目前进度如下:“支小宝2.0”已经开始内测近半年,将在完成相

百度CIO李莹:大模型是企业办公领域的重要机遇,AI的原生重构将改变智能工作方式百度CIO李莹:大模型是企业办公领域的重要机遇,AI的原生重构将改变智能工作方式Aug 18, 2023 pm 11:49 PM

2023年8月16日,WAVESUMMIT深度学习开发者大会在中国举办,该活动由深度学习技术及应用国家工程研究中心主办,百度飞桨和文心大模型承办。在会上,百度发布了文心大模型、飞桨平台和AI原生应用如流等一系列技术、产品的最新进展和生态成果。百度集团副总裁兼首席信息官李莹发表了主题演讲,她认为当前以AI大模型为核心技术的第四次科技革命将从根本上推动生产力变革,为各行各业提供强大支持,并为企业办公领域带来前所未有的发展机遇基于AI原生思维,李莹宣布,百度智能工作知识管理理念“创新流水线=AIx知识

腾讯汤道生:大模型只是起点,产业落地是AI更大的应用场景腾讯汤道生:大模型只是起点,产业落地是AI更大的应用场景Jun 22, 2023 pm 04:18 PM

6月21日,北大光华管理学院联合腾讯,宣布升级“数字中国筑塔计划”,共同推出“企业管理者人工智能通识课”系列课程。在第一课上,腾讯集团高级执行副总裁、云与智慧产业事业群CEO汤道生回顾了AI发展的历史,表示算法创新、算力增强、开源共创三大因素的叠加,构成了AI的“增长飞轮”。大模型的快速进步,推动我们正在进入一个被AI重塑的时代。汤道生表示,大模型只是起点,未来,应用落地的产业变革是更大的图景。企业过去的研发、生产、销售、服务等环节中,有很多依赖人来判断、协调与沟通的地方,今天都值得去看看,哪些

华为小艺AI助手将实现强大的大模型能力华为小艺AI助手将实现强大的大模型能力Aug 15, 2023 pm 12:05 PM

华为手机官方微博在8月4日宣布,通过盘古大模型的底层能力,HarmonyOS将为小艺带来更强大的AI能力

科大讯飞人工智能大模型升级,另有6家上市公司也已布局大模型科大讯飞人工智能大模型升级,另有6家上市公司也已布局大模型Jun 10, 2023 am 08:11 AM

近日,科大讯飞公告其构建的“讯飞星火认知大模型”将举行升级发布会,推出该人工智能大模型的V1.5(1.5版本)。此前,朗玛信息也因推出“朗玛•39AI全科医生”大模型产品举行发布会。此外,还有5家上市公司也在与投资者沟通交流中,披露已布局AI(人工智能)大模型的信息。来源:摄图网科大讯飞的“讯飞星火认知大模型”升级至1.5版近日,科大讯飞股份有限公司(证券简称:科大讯飞;证券代码:002230.SZ)披露了《关于讯飞星火认知大模型升级发布会的提示性公告》。公告显示,2023年5月6日,科大讯飞举

多家企业发布基于大模型的AI产品,大模型应用落地哪家强?多家企业发布基于大模型的AI产品,大模型应用落地哪家强?Jun 03, 2023 pm 09:56 PM

“无产业不AI,无应用不AI。”随着AI(人工智能)大模型技术落地,AI应用遍地开花。连日来,多家企业发布基于大模型的AI应用产品。身处“百模大战”时代,如何打造国产大模型应用产品?如何为大模型提供更普惠的算力、寻找更合适的场景?发布现场图。6月1日,阿里云对外披露通义大模型最新进展,上线聚焦音视频内容的AI新品“通义听悟”,成为国内首个开放公测的大模型应用产品。有专家认为,云计算是打造大模型最合适的形式,而大模型的进化过程,或将会对传统云计算架构开始新一轮的改造。阿里云AI新产品“通义听悟”开

成功孵化首个大型模型解决方案的重庆人工智能创新中心成功孵化首个大型模型解决方案的重庆人工智能创新中心Aug 06, 2023 pm 09:01 PM

最近,重庆人工智能创新中心成功孵化了云从科技的首个大模型解决方案,名为“从容大模型训推一体机”,并已成功部署。作为国内最早布局大模型的云服务商之一,华为不仅致力于深耕算力,打造强大的算力基础设施来支持中国人工智能事业的发展,而且还着眼于通用大模型和行业大模型,真正实现为千行百业和科学研究提供优质的人工智能服务经过重庆人工智能创新中心技术团队、昇腾研发专家和云从科技人工智能研究院的共同努力,一个月内顺利完成了“从容大模型训推一体机”的精度与性能对齐、产品集成与测试等工作,这成为了重庆人工智能创新中

See all articles

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

AI Hentai Generator

AI Hentai Generator

Generate AI Hentai for free.

Hot Tools

SAP NetWeaver Server Adapter for Eclipse

SAP NetWeaver Server Adapter for Eclipse

Integrate Eclipse with SAP NetWeaver application server.

SublimeText3 Linux new version

SublimeText3 Linux new version

SublimeText3 Linux latest version

MinGW - Minimalist GNU for Windows

MinGW - Minimalist GNU for Windows

This project is in the process of being migrated to osdn.net/projects/mingw, you can continue to follow us there. MinGW: A native Windows port of the GNU Compiler Collection (GCC), freely distributable import libraries and header files for building native Windows applications; includes extensions to the MSVC runtime to support C99 functionality. All MinGW software can run on 64-bit Windows platforms.

WebStorm Mac version

WebStorm Mac version

Useful JavaScript development tools

VSCode Windows 64-bit Download

VSCode Windows 64-bit Download

A free and powerful IDE editor launched by Microsoft