search
HomeTechnology peripheralsAI01Wang's self-developed full navigation map vector database swept 6 first places on the authoritative list

On March 11, 011Wang announced the launch of a new vector database "Descartes" based on full navigation graphs, which has won the first place in 6 data set evaluations of the authoritative list ANN-Benchmarks.

Vector database, also known as the information retrieval technology in the AI ​​era, is one of the core technologies of Retrieval-Augmented Generation (RAG). For large model application developers, the vector database is a very important infrastructure, which affects the performance of large models to a certain extent.

In the offline test of the international authoritative evaluation platform ANN-Benchmarks, Zero One Everything Descartes vector database ranked first among the 6 data set evaluations It has a significant performance improvement compared to the first place in the industry on the previous list, and the performance improvement on some data sets is even more than 2 times.

Zero One Everything said that the Cartesian vector database will be used in AI products that will be officially launched in the near future, and will also be provided to developers in combination with tools in the future.

Vector database becomes AI 2.0 infrastructure
Won the favor of the capital market

With the advent of the AI ​​2.0 era represented by large models, the amount of multi-modal unstructured data such as pictures, videos, and natural languages ​​has increased sharply, which is different from the traditional methods used to process structured data. database. Vector database is specially used to store, manage, query and retrieve vectorized unstructured data; it is like an external memory disk that can be called by large models at any time to form "long-term memory", also nicknamed large model memory The "hippocampus".

# Large models naturally have four flaws. Vector databases are like tailor-made "special medicine" that can accurately solve every pain point.

  • #Real-time information: Large models take a long time to train, update slowly, and cannot reflect the latest information. There is a "deadline" challenge for their knowledge. The vector database adopts a lightweight update mechanism that can quickly supplement the latest information.
  • Privacy protection: Users’ security and privacy data should not be directly provided to large model training, otherwise there will be a risk of leakage. Vector data cracks privacy by acting as an intermediate carrier for information transmission in the inference stage. Difficulties in protection.
  • Illusion correction: Large models often exhibit inference distortion or hallucination phenomena. Such problems can be effectively corrected and alleviated through the rich knowledge reference provided by the vector database.
  • Inference efficiency: The cost of inference for large models is high. The vector database can be used as a caching mechanism to avoid the need to re-execute complex inference calculations for each query request, greatly saving computing resources.

The technological changes and platform changes initiated by AI 2.0 have further strengthened the role of vector databases. Related products from major manufacturers such as Google, Microsoft, and Meta have come out one after another, and startups such as Zilliz, Pinecone, Weaviate, and Qdrant have also emerged. In 2023, OpenAI's vector database partner Pinecone completed a Series B financing of US$138 million, and domestic start-up Fabarta ArcNeural also completed a Pre-A round of financing of hundreds of millions of yuan.
Challenging the authoritative list
Winning the first place in six evaluations

ANN-Benchmarks is the most authoritative vector database performance testing tool in the industry. It can show the performance of different algorithms under different real data sets.

The following 6 evaluation data sets cover glove-25-angular, glove-100-angular, sift-128-euclidean, nytimes-256-angular, fashion-mnist -784-euclidean, gist-960-euclidean six major data sets, the abscissa represents recall, the ordinate represents QPS (number of requests processed per second), the closer the curve position is to the upper right corner, the better the algorithm performance, zero ten thousand The Cartesian vector database ranks highest in all 6 dataset evaluations.

01Wangs self-developed full navigation map vector database swept 6 first places on the authoritative list

01Wangs self-developed full navigation map vector database swept 6 first places on the authoritative list

01Wangs self-developed full navigation map vector database swept 6 first places on the authoritative list

01Wangs self-developed full navigation map vector database swept 6 first places on the authoritative list

#

01Wangs self-developed full navigation map vector database swept 6 first places on the authoritative list


"Throughput QPS" is an important indicator to measure the query processing capabilities of an information retrieval system (such as a search engine or database). Based on the TOP1 of the original list, the Zero-One Thousand Things Cartesian Vector Database has achieved significant performance improvements. The performance improvement on some data sets is more than 2 times. In the gist-960-euclidean data set dimension, it is significantly ahead of the original TOP1 of the list. 286%.

01Wangs self-developed full navigation map vector database swept 6 first places on the authoritative list

#: Poly Herigid Cartesial Pign Database and the original list TOP1 QPS Performance Comparison

##Technical Reveal

What is curious is how Descartes achieves the above excellent performance?

As we all know, RAG is a technology that combines retrieval and generation. It enhances the generation capabilities of language models by retrieving queried information from massive data. Similar to traditional retrieval methods, in essence, RAG vector retrieval mainly solves two major problems:

1. Reduce the candidate set for retrieval inspection by establishing a certain index structure ;
2. Reduce the complexity of single vector calculation.

Zero One Thousand Cartesian Vector Database has significant comparative advantages over the industry in processing complex queries, improving retrieval efficiency, and optimizing data storage. In response to the first question, the Zero One World team has two major killers:

  • Leading full navigation map technology. The current industry status quo is mainly through hashing, KD-Tree, VP-Tree and other methods. The navigation effect is not accurate enough and the cropping strength is not enough. The global multi-layer thumbnail navigation technology developed by Zero One Wish, and the coordinate system navigation on the map, It can not only ensure accuracy, but also clip a large number of irrelevant vectors.
  • The first adaptive neighbor selection strategy to fill the gap in the industry. The self-developed adaptive neighbor selection strategy of 01Wuxing breaks through the limitations of relying only on real topk or fixed edge selection strategies in the past. The new strategy allows each node to dynamically select the best one based on the distribution characteristics of itself and its neighbors. Neighbor edges converge closer to the target vector faster, thereby improving RAG vector retrieval performance by 15%-30%.

In response to the second question, Zero One Wish adopts a two-level quantization scheme to enhance RAG. Zero One Thousand uses two-level quantization to reduce computational complexity. At the same time, columnar storage fully utilizes the concurrency capabilities of SIMD to further leverage hardware capabilities. Compared with traditional PQ table lookup, the performance is greatly improved to 2-3 times.

In addition, Zero-One Everything also has full-stack vector technology solutions such as index structure optimization and connectivity guarantee to improve the performance of Cartesian vector databases.

Full stack vector technology: higher accuracy and stronger performance

Through the above With the support of full-stack vector technology, Zero-One Cartesian Vector Database not only topped the authoritative list of ANN-Benchmarks in 6 evaluations. It also has core advantages such as higher accuracy and stronger performance in practical application scenarios.

#Zero One Everything Cartesian Vector Database currently focuses on high-performance vector databases. High-performance vector databases usually refer to vector data sets with scales of tens of millions or less (such as 20 million 128-dimensional floating-point vectors). Generally speaking, high-performance vector databases can easily handle 80 to 90 percent of daily scenarios. For example, it helps corporate customers build private domain knowledge bases and intelligent customer service systems; in the field of autonomous driving, the use of high-performance vector databases can accelerate autonomous driving model training, etc.

Zero One All High Performance Vector Database has the following advantages:

  • Ultra-high precision: Based on multi-layer thumbnails and coordinate systems, inter-layer navigation and on-map orientation navigation are realized, as well as graph connectivity guarantee, achieving an accuracy greater than 99%. Under the same performance, the accuracy is significantly ahead of the industry level.
  • Super high performance: Efficient edge selection and cropping technology, tens of millions of database ms response.

Take the e-commerce recommendation scenario as an example. The number of products on the shelves may be tens of millions, and each product can be expressed by a vector. Even if the number of vectors in the library is not very large, if the e-commerce user base is very large and the number of user requests per second at peak times is very large, it may reach hundreds of thousands or even millions of QPS. The use of high-performance vector databases can effectively improve the recommendation effect of search and advertising services in e-commerce scenarios, making everyone unable to help but keep buying.

#Zero Yiwu said that the Cartesian vector database is the team’s initial attempt based on RAG and will be effectively used in AI productivity products released in the near future. In the future, after each major model is optimized to a certain extent, the capabilities of the vector database may determine the ceiling of each major model. Zero One Wish will continue to focus on R&D and sharing in the future to bring better technology and experience to users.

The above is the detailed content of 01Wang's self-developed full navigation map vector database swept 6 first places on the authoritative list. For more information, please follow other related articles on the PHP Chinese website!

Statement
This article is reproduced at:机器之心. If there is any infringement, please contact admin@php.cn delete
Tesla's Robovan Was The Hidden Gem In 2024's Robotaxi TeaserTesla's Robovan Was The Hidden Gem In 2024's Robotaxi TeaserApr 22, 2025 am 11:48 AM

Since 2008, I've championed the shared-ride van—initially dubbed the "robotjitney," later the "vansit"—as the future of urban transportation. I foresee these vehicles as the 21st century's next-generation transit solution, surpas

Sam's Club Bets On AI To Eliminate Receipt Checks And Enhance RetailSam's Club Bets On AI To Eliminate Receipt Checks And Enhance RetailApr 22, 2025 am 11:29 AM

Revolutionizing the Checkout Experience Sam's Club's innovative "Just Go" system builds on its existing AI-powered "Scan & Go" technology, allowing members to scan purchases via the Sam's Club app during their shopping trip.

Nvidia's AI Omniverse Expands At GTC 2025Nvidia's AI Omniverse Expands At GTC 2025Apr 22, 2025 am 11:28 AM

Nvidia's Enhanced Predictability and New Product Lineup at GTC 2025 Nvidia, a key player in AI infrastructure, is focusing on increased predictability for its clients. This involves consistent product delivery, meeting performance expectations, and

Exploring the Capabilities of Google's Gemma 2 ModelsExploring the Capabilities of Google's Gemma 2 ModelsApr 22, 2025 am 11:26 AM

Google's Gemma 2: A Powerful, Efficient Language Model Google's Gemma family of language models, celebrated for efficiency and performance, has expanded with the arrival of Gemma 2. This latest release comprises two models: a 27-billion parameter ver

The Next Wave of GenAI: Perspectives with Dr. Kirk Borne - Analytics VidhyaThe Next Wave of GenAI: Perspectives with Dr. Kirk Borne - Analytics VidhyaApr 22, 2025 am 11:21 AM

This Leading with Data episode features Dr. Kirk Borne, a leading data scientist, astrophysicist, and TEDx speaker. A renowned expert in big data, AI, and machine learning, Dr. Borne offers invaluable insights into the current state and future traje

AI For Runners And Athletes: We're Making Excellent ProgressAI For Runners And Athletes: We're Making Excellent ProgressApr 22, 2025 am 11:12 AM

There were some very insightful perspectives in this speech—background information about engineering that showed us why artificial intelligence is so good at supporting people’s physical exercise. I will outline a core idea from each contributor’s perspective to demonstrate three design aspects that are an important part of our exploration of the application of artificial intelligence in sports. Edge devices and raw personal data This idea about artificial intelligence actually contains two components—one related to where we place large language models and the other is related to the differences between our human language and the language that our vital signs “express” when measured in real time. Alexander Amini knows a lot about running and tennis, but he still

Jamie Engstrom On Technology, Talent And Transformation At CaterpillarJamie Engstrom On Technology, Talent And Transformation At CaterpillarApr 22, 2025 am 11:10 AM

Caterpillar's Chief Information Officer and Senior Vice President of IT, Jamie Engstrom, leads a global team of over 2,200 IT professionals across 28 countries. With 26 years at Caterpillar, including four and a half years in her current role, Engst

New Google Photos Update Makes Any Photo Pop With Ultra HDR QualityNew Google Photos Update Makes Any Photo Pop With Ultra HDR QualityApr 22, 2025 am 11:09 AM

Google Photos' New Ultra HDR Tool: A Quick Guide Enhance your photos with Google Photos' new Ultra HDR tool, transforming standard images into vibrant, high-dynamic-range masterpieces. Ideal for social media, this tool boosts the impact of any photo,

See all articles

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

Video Face Swap

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Tools

Atom editor mac version download

Atom editor mac version download

The most popular open source editor

SublimeText3 English version

SublimeText3 English version

Recommended: Win version, supports code prompts!

mPDF

mPDF

mPDF is a PHP library that can generate PDF files from UTF-8 encoded HTML. The original author, Ian Back, wrote mPDF to output PDF files "on the fly" from his website and handle different languages. It is slower than original scripts like HTML2FPDF and produces larger files when using Unicode fonts, but supports CSS styles etc. and has a lot of enhancements. Supports almost all languages, including RTL (Arabic and Hebrew) and CJK (Chinese, Japanese and Korean). Supports nested block-level elements (such as P, DIV),

DVWA

DVWA

Damn Vulnerable Web App (DVWA) is a PHP/MySQL web application that is very vulnerable. Its main goals are to be an aid for security professionals to test their skills and tools in a legal environment, to help web developers better understand the process of securing web applications, and to help teachers/students teach/learn in a classroom environment Web application security. The goal of DVWA is to practice some of the most common web vulnerabilities through a simple and straightforward interface, with varying degrees of difficulty. Please note that this software

MinGW - Minimalist GNU for Windows

MinGW - Minimalist GNU for Windows

This project is in the process of being migrated to osdn.net/projects/mingw, you can continue to follow us there. MinGW: A native Windows port of the GNU Compiler Collection (GCC), freely distributable import libraries and header files for building native Windows applications; includes extensions to the MSVC runtime to support C99 functionality. All MinGW software can run on 64-bit Windows platforms.