search
HomeDatabaseMysql TutorialWhat should I do if the Mysql index fails? Brief analysis of failure reasons

This article will record for everyone Mysql Index failure, analyze the reasons for Mysql index failure, I hope it will be helpful to everyone!

What should I do if the Mysql index fails? Brief analysis of failure reasons

This article contains Mysql's Where condition query execution process, range query to stop the joint index from matching, table return operation analysis, common index failure scenarios, Extra analysis and other knowledge. [Related recommendations: mysql video tutorial]

Background

A full query appeared in the data table with 60 million data volume, and the sql was reproduced The statement discovery query does not use the index but uses the entire table query to find out the cause of the index failure.

# sql语句
EXPLAIN SELECT count(*) FROM order_recipient_extend_tab WHERE start_date>&#39;1628442000&#39; and start_date<&#39;1631120399&#39; and station_id=&#39;1809&#39; and status=&#39;2&#39;;

What should I do if the Mysql index fails? Brief analysis of failure reasons

The order_recipient_extend_tab table has 60 million data. The query fields for slow query include start_date, station_id, and status. According to the original intention of the index design, the index that actually fails is:

Union index Field 1 Field 2 Field 3
idx_date_station_driver start_date station_id driver_id

Where conditional query execution process

Understand how Mysql executes where conditional query, and you can quickly and clearly understand the reasons for index failure. The index with high matching degree in this slow query is idx_date_station_driver. Analyze the execution process of where condition query in this slow query.

Mysql's where condition extraction rules can be summarized into three major categories: Index Key (First Key & Last Key), Index Filter, Table Filter .

Index Key

Index Key is used to determine the scope of this sql query on the index tree. A range includes the start and end, Index First Key is used to locate the starting range of the index query, and Index Last Key is used to locate the ending range of the index query.

  • Index First Key

    Extraction rules: Starting from the first field of the index, check whether the field exists in the where condition, if it exists and the condition is =, >=, then add the corresponding condition to Index First Key, and continue reading the next field of the index; if it exists and the condition is >, add the corresponding condition to Index First Key, and then terminate Index First Key. Extract; if it does not exist, also terminate the extraction of Index First Key.

  • Index Last Key

    is just the opposite of Index First Key. Extraction rules: start from the first field of the index and check whether it exists in the where condition. If If exists and the condition is =,

According to the Index Key extraction rules, the Index Last Key extracted in this slow query is: start_date>'1628442000', and the Index Last Key is: start_date

Index First Key is only used to locate the starting range of the index. Use the Index First Key condition, starting from the root node of the index B tree, and use the binary search method to quickly index to the correct leaf node position. During the Where query process, Index First Key is only judged once.

Index Last Key is used to locate the ending range of the index. Therefore, for each index record read after the starting range, it is necessary to determine whether it has exceeded the range of Index Last Key. If it exceeds, the current The query ends.

Index Filter

In the index range determined by Index Key, not all index records meet the query conditions. For example, in the Index Last Key and Index Last Key ranges, not all index records satisfy station_id = '1809'. At this time you need to use Index Filter.

Index Filter, also known as index pushdown, is used to filter records that do not meet the query conditions in the index query range. For each record in the index range, it needs to be compared with the Index Filter. If it does not meet the Index Filter, it will be discarded directly and continue to read the next record in the index.

Extraction rules of Index Filter: Starting from the first field of the index, check whether it exists in the where condition. If it exists and the condition is only =, skip the first field and continue to check the next field of the index. , the next index column adopts the same extraction rules (explanation: fields with the condition = have been filtered out in the Index Key); if it exists and the condition is >=, >,

According to the extraction rules of Index Filter, the Index Filter extracted in this slow query is: station_id='1809'. In the index query range determined by the Index Key, station_id='1809' needs to be compared when traversing the index records. If this condition is not met, it will be lost directly and the next record in the index will continue to be read.

Table Filter

Table Filter is used to filter out data that cannot be filtered by the index. After the entire row of records is queried in the secondary index through the primary key Returning to the table, it is judged whether the record meets the Table Filter conditions. If it does not meet the conditions, it will be lost and the next record will continue to be judged.

The extraction rules are very simple: all query conditions that do not belong to the index fields are classified into Table Filter. According to the extraction rules of Table Filter, the Table Filter in this query is: status=‘2’.

Summary and supplement

Index Key is used to determine the scope of the index scan; Index Filter is used to filter in the index; Table Filter needs to be returned to the table on the Mysql server to filter.

Index Key and Index Filter occur at the InnoDB storage layer, and Table Filter occurs at the Mysql Server layer.

Before MySQL5.6, there was no distinction between Index Filter and Table Filter. All index records within the range of Index First Key and Index Last Key were returned to the table to read the complete record, and then returned to the MySQL Server layer for processing. filter.

In MySQL 5.6 and later, the Index Filter is separated from the Table Filter. The Index Filter drops to the storage engine layer of InnoDB for filtering, which reduces the interaction overhead of returning tables and returning records to the MySQL Server layer, and improves the execution efficiency of SQL.

Analyze the cause of index failure

The first is count(). At this time, the wildcard * will not expand all columns after optimization, and will actually ignore all The column directly counts the number of rows. So if you only want to collect the number of rows, it is best to use count().

Next analyze the where statement. Assume that this slow query uses the secondary index idx_date_station_driver. According to the execution process of the above where condition query, the Index First Key of the slow query is start_date>'1628442000', and the Index Last Key is: start_date

After extracting the Index First Key, locating the index starting range on the index B tree is the process of index matching . Use the binary search method on the index B tree to quickly locate the start range that meets the query conditions. Leaf nodes. Through the above Where condition query execution process, we know the where condition of the slow query (start_date>'1628442000' and start_date, only Matches the first field of index <code>idx_date_station_driver(start_date, station_id, driver_id), that is, only matches idx_date_station_driver(start_date), station_id='1809' precise query does not affect the match On the index, it plays a role in the Index Filter, that is, the index push-down process. In fact, this is because the range query causes the joint index to stop matching .

Range query causes the joint index to stop matching

Why does the range query cause the joint index to stop matching? This involves the leftmost prefix matching principle. Assuming that a joint index index(a, b) is established, a will be sorted first, and if a is equal, b will be sorted, as shown in the figure below. On this index tree, a is globally ordered, while b is in a globally unordered and locally ordered state. From a global perspective, the value of b is 1, 2, 1, 4, 1, 2, and only b=2 query conditions cannot directly use this index; from a local perspective, when the value of a is determined , b is an ordered state, a=2 && b=4 can use this index. Therefore, the fundamental reason why the range query causes the joint index to stop matching is that the ordered state of the non-first field on the index tree depends on the equality of the previous field, and range query destroys the local ordered state of the next index field, Causes the index to stop matching.

What should I do if the Mysql index fails? Brief analysis of failure reasons

Range query stops the joint index from matching, and cannot filter out data with station_id not equal to '1809' when the index matches, resulting in Mysql scanning range on the index Index First Key and Index Last Key are completely determined by start_timestamp_of_date time. The start_timestamp_of_date range query can filter out 73% of the data volume, while the station_id='1809' precise query can filter out 99% of the data volume.

Query conditions Data volume Percentage
All data 63.67 million 100%
start_timestamp_of_date>'1628442000' and start_timestamp_of_date 17.42 million 27.35%
station_id='1809' 80,000 0.16%

Overhead of table return operation

Since the status field is not on the index idx_date_station_driver field, it is necessary to return the table to query the index filtered data, which is judged at the Mysql service layer Whether the data meets the query conditions.

Mysql's optimizer will first estimate the cost of indexing with a high matching degree when executing a sql statement. If the cost of indexing is greater than searching the entire table, then Mysql will choose a full table scan. This conclusion may be counter-intuitive. In our impression, indexes are used to improve query efficiency. There are mainly two factors involved here:

  • When the query condition or the field being searched is not on the field of the secondary index, the table return operation will be performed, which will be: secondary index primary key index.

  • The performance of disk random I/O is lower than sequential I/O. Table return queries are random I/O on the primary key index, and full table scans are sequential I/O on the primary key index.

#Do experiments to analyze whether the cost of table return operations is the direct cause of index failure?

Remove the status='0' query condition and explain to see if the query uses the index idx_date_station_driver. The result is as shown in the figure below. The overhead of the table return operation is reduced, and the index does not become invalid.

What should I do if the Mysql index fails? Brief analysis of failure reasons

Summary

Combined with the above analysis, the reason for the index failure is summarized: the range query causes the joint index to stop matching, and the index matches the filtered data Not enough, causing the MySQL optimizer to estimate that the table return operation cost of Table Filter is greater than that of the full table query, so the full table query was chosen. The range query causing the joint index to stop matching is the culprit of index failure, and the cost of table return operations is the direct cause of index failure.

Optimize Index

The culprit of the slow query index failure is that the range query causes the joint index to stop matching. You only need to adjust the fields of the range query to the fields of the precise query. Later, the

joint indexidx_date_station_driver(start_date, station_id, driver_id) will be modified to idx_station_date_driver(station_id, start_date, driver_id). The optimized results are shown in the figure below.

What should I do if the Mysql index fails? Brief analysis of failure reasons

Expansion

Common scenarios of index failure

  • Violates the leftmost prefix matching principle. For example, there is an index index(a,b), but the query condition only has the b field.

  • Perform any operation on the index column, including calculations, functions, type conversions, etc.

  • Range queries cause the union index to stop matching.

  • Reduce the use of select*. To avoid unnecessary table return operation overhead, try to use covering indexes.

  • Use not equal to (!=, ) and use the or operation.

  • The string index without single quotes is invalid.

  • like starts with wildcard '�c'. Note that like ‘abc%’ can be indexed.

  • order by violates the leftmost matching principle and includes non-index field sorting, which will result in file sorting.

  • group by violates the leftmost matching principle and contains non-index field grouping, which will result in the generation of a temporary table.

Explain analysis

The analysis of slow query is inseparable from the mysql explain statement. explain mainly focuses on two fields: Type and Extra.

Type represents the way to access data, and Extra represents the way to filter and organize data. Listed here for easy search.

##ExtraALLFull table scanUsing indexUse covering index, no need to return the table, no need for Mysql service layer filteringindexFull scan of the index treeUsing whereGet data from the storage engine layer, and use where query conditions in the Mysql service layer Filter data. #rangeIndex tree range scanUsing where; Using indexIndex range scan. Index scans are similar to full table scans, but they occur at different levels. refNon-unique index scan, such as non-unique index and non-unique prefix of unique indexUsing index conditionUse index pushdown to make full use of query index fields to filter data at the storage engine layereq_refUnique index scan, such as unique index and primary key indexUsing temporaryTemporary table stores results, used for sorting and grouping queriesconstConvert queries into constantsUsing filesortFile sorting, used for sortingNULLNo need to access tables or indexesNULLReply to the table
Type

For more programming-related knowledge, please visit:

Introduction to Programming! !

The above is the detailed content of What should I do if the Mysql index fails? Brief analysis of failure reasons. For more information, please follow other related articles on the PHP Chinese website!

Statement
This article is reproduced at:掘金社区. If there is any infringement, please contact admin@php.cn delete
图文详解mysql架构原理图文详解mysql架构原理May 17, 2022 pm 05:54 PM

本篇文章给大家带来了关于mysql的相关知识,其中主要介绍了关于架构原理的相关内容,MySQL Server架构自顶向下大致可以分网络连接层、服务层、存储引擎层和系统文件层,下面一起来看一下,希望对大家有帮助。

mysql怎么替换换行符mysql怎么替换换行符Apr 18, 2022 pm 03:14 PM

在mysql中,可以利用char()和REPLACE()函数来替换换行符;REPLACE()函数可以用新字符串替换列中的换行符,而换行符可使用“char(13)”来表示,语法为“replace(字段名,char(13),'新字符串') ”。

mysql怎么去掉第一个字符mysql怎么去掉第一个字符May 19, 2022 am 10:21 AM

方法:1、利用right函数,语法为“update 表名 set 指定字段 = right(指定字段, length(指定字段)-1)...”;2、利用substring函数,语法为“select substring(指定字段,2)..”。

mysql的msi与zip版本有什么区别mysql的msi与zip版本有什么区别May 16, 2022 pm 04:33 PM

mysql的msi与zip版本的区别:1、zip包含的安装程序是一种主动安装,而msi包含的是被installer所用的安装文件以提交请求的方式安装;2、zip是一种数据压缩和文档存储的文件格式,msi是微软格式的安装包。

mysql怎么将varchar转换为int类型mysql怎么将varchar转换为int类型May 12, 2022 pm 04:51 PM

转换方法:1、利用cast函数,语法“select * from 表名 order by cast(字段名 as SIGNED)”;2、利用“select * from 表名 order by CONVERT(字段名,SIGNED)”语句。

MySQL复制技术之异步复制和半同步复制MySQL复制技术之异步复制和半同步复制Apr 25, 2022 pm 07:21 PM

本篇文章给大家带来了关于mysql的相关知识,其中主要介绍了关于MySQL复制技术的相关问题,包括了异步复制、半同步复制等等内容,下面一起来看一下,希望对大家有帮助。

带你把MySQL索引吃透了带你把MySQL索引吃透了Apr 22, 2022 am 11:48 AM

本篇文章给大家带来了关于mysql的相关知识,其中主要介绍了mysql高级篇的一些问题,包括了索引是什么、索引底层实现等等问题,下面一起来看一下,希望对大家有帮助。

mysql怎么判断是否是数字类型mysql怎么判断是否是数字类型May 16, 2022 am 10:09 AM

在mysql中,可以利用REGEXP运算符判断数据是否是数字类型,语法为“String REGEXP '[^0-9.]'”;该运算符是正则表达式的缩写,若数据字符中含有数字时,返回的结果是true,反之返回的结果是false。

See all articles

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

AI Hentai Generator

AI Hentai Generator

Generate AI Hentai for free.

Hot Article

R.E.P.O. Energy Crystals Explained and What They Do (Yellow Crystal)
2 weeks agoBy尊渡假赌尊渡假赌尊渡假赌
Repo: How To Revive Teammates
1 months agoBy尊渡假赌尊渡假赌尊渡假赌
Hello Kitty Island Adventure: How To Get Giant Seeds
4 weeks agoBy尊渡假赌尊渡假赌尊渡假赌

Hot Tools

WebStorm Mac version

WebStorm Mac version

Useful JavaScript development tools

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

DVWA

DVWA

Damn Vulnerable Web App (DVWA) is a PHP/MySQL web application that is very vulnerable. Its main goals are to be an aid for security professionals to test their skills and tools in a legal environment, to help web developers better understand the process of securing web applications, and to help teachers/students teach/learn in a classroom environment Web application security. The goal of DVWA is to practice some of the most common web vulnerabilities through a simple and straightforward interface, with varying degrees of difficulty. Please note that this software

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment