This article summarizes single-column indexes and joint (composite) indexes in MySQL. Adding columns to an index can narrow the scope of a search, but one index on two columns behaves differently from two separate single-column indexes. I hope it will be helpful to everyone.

MySQL single column index and joint index summary

1. Introduction

Adding columns to an index can narrow a search, but using one index with two columns is different from using two separate indexes.

The structure of a joint index is similar to that of a phone book. A person's name consists of a surname and a given name. The phone book is sorted first by surname, and then by given name for people with the same surname. A phone book is very useful if you know the surname, even more useful if you know both the surname and the given name, but useless if you know only the given name.

So when creating a joint index, consider the order of the columns carefully. A joint index is useful when searching on all of its columns or on only the first few; it is not useful when searching only on later columns.
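The phone book analogy can be sketched in a few lines of Python. This is only a model under stated assumptions: the names and the helper functions `find_by_surname`, `find_by_full_name` and `find_by_given_name` are hypothetical, and a sorted list of tuples stands in for the ordered structure of a joint index on (surname, given_name).

```python
from bisect import bisect_left, bisect_right

# Hypothetical phone book, kept sorted by (surname, given_name), the same
# ordering a joint index on (surname, given_name) would maintain.
phone_book = sorted([
    ("Li", "Hua"),
    ("Li", "Ming"),
    ("Wang", "Fang"),
    ("Wang", "Lei"),
    ("Zhang", "Hua"),
])
# The leading column on its own is also in sorted order.
surnames = [surname for surname, _ in phone_book]


def find_by_surname(surname):
    """Leading column only: binary search finds the matching slice."""
    lo = bisect_left(surnames, surname)
    hi = bisect_right(surnames, surname)
    return phone_book[lo:hi]


def find_by_full_name(surname, given_name):
    """Both columns: binary search on the full (surname, given_name) key."""
    i = bisect_left(phone_book, (surname, given_name))
    if i < len(phone_book) and phone_book[i] == (surname, given_name):
        return [phone_book[i]]
    return []


def find_by_given_name(given_name):
    """Second column only: the ordering does not help, so scan everything."""
    return [entry for entry in phone_book if entry[1] == given_name]


print(find_by_surname("Wang"))          # [('Wang', 'Fang'), ('Wang', 'Lei')]
print(find_by_full_name("Li", "Ming"))  # [('Li', 'Ming')]
print(find_by_given_name("Hua"))        # [('Li', 'Hua'), ('Zhang', 'Hua')]
```

The first two lookups narrow the search with binary search, while the lookup by given name alone has to read every entry, which is the phone book's "useless" case.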

2. Single-column index

When a multi-condition query has several single-column indexes available, the optimizer chooses the index strategy it estimates to be best. It may use only one of the indexes, or it may use several of them. However, each single-column index is a separate B+ tree underneath, which takes up space and costs some search efficiency. Therefore, if the columns are only ever queried together in multi-condition queries, it is better to build one joint index.

3. Leftmost Prefix Principle

As the name implies, the leftmost columns take priority: a query can use any consecutive run of index columns that starts from the leftmost one. If the first field is used in a range query, it needs a separate index of its own. When creating a joint index, place the column used most frequently in WHERE clauses on the far left, according to business needs; this makes the index usable by more queries. For example, if username is often used as a query condition but age is not, username should be placed first in the joint index, that is, on the far left.

1. Create a composite index

ALTER TABLE employee ADD INDEX idx_name_salary (name, salary);

2. The leftmost-prefix rule is satisfied; even though only part of the composite index is used, the index takes effect

SELECT * FROM employee WHERE name = '哪吒编程';

3. The leftmost column does not appear, so the leftmost-prefix rule is not satisfied and the index is not used

SELECT * FROM employee WHERE salary = 5000;

4. The whole composite index is used: name and salary appear in index order, and the index takes effect

SELECT * FROM employee WHERE name = '哪吒编程' AND salary = 5000;

5. The conditions are written in the opposite order to the index, but the MySQL optimizer reorders them when executing the SQL, so the index still takes effect

SELECT * FROM employee WHERE salary = 5000 AND name = '哪吒编程';

6. Reason

A compound index is also called a joint index. Creating a joint index such as (k1, k2, k3) is, for lookup purposes, equivalent to creating three indexes: (k1), (k1, k2) and (k1, k2, k3). This is the leftmost matching principle.

If a query does not satisfy the leftmost principle, the joint index generally cannot be used.
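The five cases above can be summarized with a small Python model. It is a deliberately simplified sketch, not MySQL's actual optimizer logic: the function `usable_prefix` is hypothetical, it considers equality conditions only, and it ignores other strategies a real optimizer may choose.

```python
def usable_prefix(index_columns, equality_columns):
    """Return the leading index columns that the equality conditions cover.

    Simplified model: the conditions are treated as a set, so the order
    in which they are written in the WHERE clause makes no difference.
    """
    conditions = set(equality_columns)
    prefix = []
    for column in index_columns:
        if column not in conditions:
            break  # the first missing column ends the usable prefix
        prefix.append(column)
    return prefix


index = ["name", "salary"]  # models idx_name_salary (name, salary)
print(usable_prefix(index, ["name"]))              # ['name']
print(usable_prefix(index, ["salary"]))            # []
print(usable_prefix(index, ["name", "salary"]))    # ['name', 'salary']
print(usable_prefix(index, ["salary", "name"]))    # ['name', 'salary']
print(usable_prefix(["a", "b", "c"], ["a", "c"]))  # ['a']
```

An empty result corresponds to the case where the leftmost column is missing, and the last line shows that with conditions on the first and third columns only the first column of the index is used.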

4. A joint index and single-column indexes exist at the same time (with overlapping fields). Which index will MySQL use for the query?

This depends on the strategy of MySQL's own query optimizer. When a table has multiple indexes, MySQL chooses which index to use based on the estimated cost of the query statement.

Some people say that WHERE conditions are evaluated from left to right, so the most selective condition should be written first. This claim does circulate online (for example on Baidu), but I have tested it personally: the MySQL optimizer reorders the conditions. Leaving indexes aside, the order of WHERE conditions has no impact on efficiency. What really matters is whether an index is used!

5. The essence of joint index

Creating an (a, b, c) joint index is equivalent to creating an (a) single-column index, an (a, b) joint index and an (a, b, c) joint index. For the index to be fully effective, a query can use only these three combinations. Of course, a query that combines a and c can also use the index, but in fact only the a part of the index is used; c is not.

6. Index failure

1. A LIKE pattern with % at the front;

2. An IS NOT NULL check (whether the index is still used depends on the optimizer's cost estimate);

3. An OR condition in which the columns on the two sides are not both indexed. When only one of the columns on either side of OR is indexed, the index is not used; it takes effect only when the columns on both sides are indexed. When optimizing SQL, avoid writing OR conditions where possible;

4. Implicit data type conversion. If a value compared with a varchar column is not enclosed in single quotes, it is treated as a number and an implicit conversion takes place, which invalidates the index and causes a full table scan.

7. Other knowledge points

1. Fields worth indexing are those that appear in WHERE conditions

2. Fields with a small amount of data do not need to be indexed, because building an index has a certain overhead. If the amount of data is small, an index is unnecessary and may even make things slower.

3. A joint index has advantages over building an index on each column, because the more indexes are created, the more disk space is occupied and the slower data updates become. In addition, when creating a multi-column index, the order of the columns matters: the most selective column should be placed first, so that filtering is stronger and more efficient.

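8. Introduction to MySQL storage engines

1. InnoDB

InnoDB supports transactions, foreign keys, crash recovery and concurrency control. If transaction integrity requirements are high (for example in banking) or concurrency control is required (for example in ticket sales), InnoDB has a great advantage. For a database with frequent update and delete operations, InnoDB is also a good choice, because it supports transaction commit and rollback.

2. MyISAM

MyISAM offers fast inserts and relatively low space and memory usage. If a table is mainly used for inserting new records and reading records, choosing MyISAM gives efficient processing. It can also be used when the application's integrity and concurrency requirements are relatively low.

Note that a single database can contain tables that use different storage engines. If a table has high transaction-processing requirements, choose InnoDB. Tables in the same database with heavy query requirements can be stored with MyISAM. If the database needs a temporary table for queries, the MEMORY storage engine can be chosen.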

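9. Index structures (methods, algorithms)

MySQL commonly uses two index structures (algorithms), BTree and Hash. The two retrieve data in different ways and serve queries differently.

1. Hash

A Hash index is implemented with a hash table underneath. It is very well suited to key-value style lookups, that is, lookups by a single key, or equality queries.

A Hash index handles equality queries conveniently. Because it locates the data in a single step, unlike a BTree index, which needs several I/O accesses going from the root node through the branch nodes before finally reaching a leaf node, its retrieval can be much faster than a BTree index. For range queries, however, a full table scan is needed.

But why is BTree used more often than Hash? Mainly because the particular nature of Hash also brings many limitations and drawbacks:

  • A Hash index can only satisfy "=", "IN" and "<=>" queries; it cannot be used for range queries.

  • In a joint index, a Hash index cannot be used with only part of the index key. For the columns of a joint index, Hash uses either all of them or none; it does not support the leftmost-prefix matching that BTree supports, so when a query uses only the first one or few index key columns, the Hash index cannot be used.

  • A Hash index cannot avoid sort operations. A Hash index stores hash values computed from the keys, and the ordering of the hash values is not necessarily the same as the ordering of the original key values, so the database cannot use the index to avoid any sort operation.

  • A Hash index can never avoid accessing the table. A Hash index stores the hash value of the index key together with the corresponding row pointer in a hash table. Because different index keys can have the same hash value, even when fetching the rows that match a given hash value, the query cannot be completed from the Hash index alone; the actual table data must still be accessed and compared to obtain the result.

  • When many hash values are equal, a Hash index does not necessarily perform better than BTree. For an index key with low selectivity, a Hash index ends up with a large number of record pointers associated with the same hash value. Locating a particular record then becomes cumbersome and wastes many accesses to table data, resulting in poor overall performance.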
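The difference between equality and range lookups can be illustrated with a small Python model. This is a sketch under stated assumptions, not MySQL's implementation: the sample rows and function names are hypothetical, a `dict` stands in for a Hash index and a sorted list searched with `bisect` stands in for an ordered BTree-style index.

```python
from bisect import bisect_left, bisect_right

# Hypothetical table rows: (name, age)
rows = [("alice", 30), ("bob", 25), ("carol", 35), ("dave", 25)]

# Hash-style index on age: key -> positions of matching rows.
hash_index = {}
for pos, (_, age) in enumerate(rows):
    hash_index.setdefault(age, []).append(pos)

# Ordered index on age: (age, position) pairs kept in key order.
ordered_index = sorted((age, pos) for pos, (_, age) in enumerate(rows))
ordered_keys = [age for age, _ in ordered_index]


def hash_equal(age):
    """Equality lookup: one hash probe finds the matching positions."""
    return [rows[pos][0] for pos in hash_index.get(age, [])]


def hash_range(low, high):
    """Range lookup: hash keys have no order, so every key is examined."""
    return sorted(
        rows[pos][0]
        for age, positions in hash_index.items()
        if low <= age <= high
        for pos in positions
    )


def ordered_range(low, high):
    """Range lookup: binary search to the start, then read in key order."""
    lo = bisect_left(ordered_keys, low)
    hi = bisect_right(ordered_keys, high)
    return [rows[pos][0] for _, pos in ordered_index[lo:hi]]


print(hash_equal(25))         # ['bob', 'dave']
print(hash_range(26, 40))     # ['alice', 'carol']
print(ordered_range(26, 40))  # ['alice', 'carol']
```

Both range functions return the same rows, but `hash_range` has to visit every key and sort the result itself, while `ordered_range` reads a contiguous, already ordered slice.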

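2. B+ Tree

The B+Tree index is the most commonly used index algorithm in MySQL, because it can be used not only with comparison operators such as =, >, >=, <, <= and BETWEEN, but also with the LIKE operator, as long as the pattern is a constant that does not begin with a wildcard,

for example:

select * from user where name like 'jack%';
select * from user where name like 'jac%k%';

If the pattern begins with a wildcard, or is not a constant, the index is not used,

for example:

select * from user where name like '%jack';
select * from user where name like simply_name;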
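Why a leading wildcard defeats an ordered index can be shown with a short Python sketch. It is only a model: the list of names and the functions `like_prefix` and `like_suffix` are hypothetical, and a sorted list stands in for the ordered keys of a B+Tree index on name.

```python
from bisect import bisect_left

# Hypothetical indexed values, kept in sorted order like B+Tree keys.
names = sorted(["jack", "jackson", "jacob", "jill", "mjack", "zack"])


def like_prefix(prefix):
    """Model of name LIKE 'prefix%'.

    Seek to the first key >= prefix, then read forward while the keys
    still start with the prefix. Only the matching range is touched.
    """
    i = bisect_left(names, prefix)
    matches = []
    while i < len(names) and names[i].startswith(prefix):
        matches.append(names[i])
        i += 1
    return matches


def like_suffix(suffix):
    """Model of name LIKE '%suffix'.

    Matching keys can sit anywhere in the sorted order, so there is no
    starting point to seek to and every key must be checked.
    """
    return [name for name in names if name.endswith(suffix)]


print(like_prefix("jack"))  # ['jack', 'jackson']
print(like_prefix("jac"))   # ['jack', 'jackson', 'jacob']
print(like_suffix("jack"))  # ['jack', 'mjack']
```

Keys that share a prefix are neighbours in sorted order, which is what lets the prefix search stop early; keys that share a suffix are scattered across the whole list.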

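3. How B-Trees and B+Trees work

In databases the volume of data is relatively large, so multi-way search trees are clearly better suited to database workloads. Next we introduce these two kinds of multi-way search tree; after all, what programmer can get by without some idea of B-trees?

B-tree: the B-tree is also written B-Tree, and it has the following properties:

  • Unlike a binary tree, a B-tree node can store multiple keys and multiple subtree pointers; B+ trees share this feature;

  • In a B-tree of order m, every non-leaf node other than the root must have between ⌈m/2⌉ and m subtrees;

  • The root node must have at least two subtrees, unless the root is the only node in the tree;

  • A B-tree is a search tree, much like a binary search tree: subtrees further to the left hold smaller keys, and within a node the keys are sorted by value;

  • A non-leaf B-tree node has one more subtree than it has keys;

The B+ tree is the "plus" version of the B-tree:

  • A B+ tree keeps all lookup results in its leaf nodes, which means a search in a B+ tree must reach a leaf node before it can return a result;

  • In a B+ tree, each node has as many keys as subtree pointers;

  • In a B+ tree, each key in a non-leaf node corresponds to one pointer, and the key is the maximum or minimum value of that subtree;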

Optimizing the B-Tree from the previous section: because the non-leaf nodes of a B+Tree store only key information, and assuming each disk block can hold 4 keys and pointers, the tree becomes a B+Tree with the structure shown below:

[Figure: B+Tree structure]

A B+Tree usually has two head pointers: one points to the root node and the other points to the leaf node with the smallest key, and all leaf nodes (that is, data nodes) are linked together in a chain. Therefore two kinds of search can be performed on a B+Tree: a range search and paged search on the primary key, and a random search starting from the root node.

With only 22 data records in the example above, the advantages of the B+Tree may not be apparent. Here is a calculation:

The page size in the InnoDB storage engine is 16KB. The primary key of a typical table is INT (4 bytes) or BIGINT (8 bytes), and a pointer is generally 4 or 8 bytes, so one page (one node in the B+Tree) stores roughly 16KB/(8B+8B) = 1K keys (because this is an estimate, K is taken as 10^3 for convenience).
In other words, a B+Tree index with a depth of 3 can maintain 10^3 * 10^3 * 10^3 = 1 billion records.
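The estimate can be reproduced with a few lines of Python. The constants come from the figures in the text (a 16KB page, an 8-byte key and an 8-byte pointer); the helper `max_entries` is a hypothetical name, and like the text's estimate it treats every level alike and ignores page headers and the larger size of the rows stored in clustered-index leaf pages.

```python
PAGE_SIZE = 16 * 1024  # InnoDB page size in bytes, as given in the text
KEY_SIZE = 8           # BIGINT primary key
POINTER_SIZE = 8       # pointer size assumed in the text's estimate

# Number of (key, pointer) pairs that fit in one page.
fanout = PAGE_SIZE // (KEY_SIZE + POINTER_SIZE)


def max_entries(depth, fanout):
    """Upper bound on entries reachable through `depth` levels."""
    return fanout ** depth


print(fanout)                  # 1024
print(max_entries(3, fanout))  # 1073741824
print(max_entries(3, 1000))    # 1000000000, the text's rounded figure
```

The exact fan-out is 1024 rather than 1000, so the unrounded bound for three levels is slightly above one billion.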

In practice, nodes may not be completely full, so in a database the height of a B+Tree is generally 2 to 4 levels. MySQL's InnoDB storage engine is designed to keep the root node resident in memory, so only 1 to 3 disk I/O operations are needed to find the row record for a given key.

B+Tree indexes in a database can be divided into clustered indexes and auxiliary (secondary) indexes. The B+Tree example diagram above is implemented in the database as a clustered index: the leaf nodes of the clustered index's B+Tree store the row data of the entire table. An auxiliary index differs in that its leaf nodes do not contain all the data of the row; instead they store the clustered index key of the corresponding row, that is, the primary key. When querying through an auxiliary index, the InnoDB storage engine traverses the auxiliary index to find the primary key, and then uses the primary key to find the complete row in the clustered index.
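The two-step lookup through an auxiliary index can be modelled in Python. This is a simplification under stated assumptions: the sample rows and the function `find_by_salary` are hypothetical, and plain dictionaries stand in for the two B+Trees, so only the relationship between the indexes is shown, not their on-disk structure.

```python
# Clustered index: primary key -> complete row (hypothetical sample data).
clustered_index = {
    1: {"id": 1, "name": "alice", "salary": 5000},
    2: {"id": 2, "name": "bob", "salary": 6000},
    3: {"id": 3, "name": "carol", "salary": 5000},
}

# Auxiliary index on salary: indexed value -> primary keys only.
secondary_index = {}
for pk, row in clustered_index.items():
    secondary_index.setdefault(row["salary"], []).append(pk)


def find_by_salary(salary):
    """Look up rows by salary in two steps, as described in the text."""
    # Step 1: the auxiliary index yields primary keys, not rows.
    primary_keys = secondary_index.get(salary, [])
    # Step 2: each primary key is looked up in the clustered index.
    return [clustered_index[pk] for pk in primary_keys]


print([row["name"] for row in find_by_salary(5000)])  # ['alice', 'carol']
print(secondary_index[6000])                          # [2]
```

The auxiliary index holds only primary keys, so every row it finds costs one further lookup in the clustered index.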

Statement
This article is reproduced from CSDN. If there is any infringement, please contact admin@php.cn for removal.