Let's talk about how to optimize the order By statement in SQL-Mysql Tutorial-php.cn

Home

Database

Mysql Tutorial

Let's talk about how to optimize the order By statement in SQL

青灯夜游

Sep 27, 2022 pm 01:45 PM

sqlorder by

How to optimize the orderBy statement in sql? The following article will introduce to you the method of optimizing the orderBy statement in SQL. It has a good reference value and I hope it will be helpful to you.

Let's talk about how to optimize the order By statement in SQL

#When using a database for data query, you will inevitably encounter the need to sort the query result set based on certain fields. In sql, the orderby statement is usually used to achieve this. Place the fields that need to be sorted after the keyword. If there are multiple fields, use "," to separate them.

select * from table t order by t.column1,t.column2;

The above sql indicates querying the data in the table, and then sorting by column1 first. If column1 is the same, then sort by column2. The default sorting method is descending order. Of course, the sorting method can also be specified. Add DESC and ASE after the sorted field to indicate descending and ascending order respectively.

Using this orderby can easily implement daily sorting operations. I have used it a lot, and I don’t know if you have ever encountered this scenario: Sometimes after using orderby, the SQL execution efficiency is very slow, and sometimes it is faster. Since I am obsessed with curd all day long, I don’t have time to study it. Anyway, I just feel It's amazing. While I'm free this weekend, let's study how orderby is implemented in mysql.

In order to facilitate the description, we first create a data table t1, as follows:

CREATE TABLE `t1` (
  `id` int(11) NOT NULL not null auto_increment,
  `a` int(11)  DEFAULT NULL,
  `b` int(11)  DEFAULT NULL,
  `c` int(11)  DEFAULT NULL,
  PRIMARY KEY (`id`) ,
  KEY `a` (`a`) USING BTREE
) ENGINE=InnoDB;

And insert the data:

insert into t1 (a,b,c) values (1,1,3);
insert into t1 (a,b,c) values (1,4,5);
insert into t1 (a,b,c) values (1,3,3);
insert into t1 (a,b,c) values (1,3,4);
insert into t1 (a,b,c) values (1,2,5);
insert into t1 (a,b,c) values (1,3,6);

In order to make the index effective, insert 10,000 rows 7,7, 7. If the data is irrelevant and the amount of data is small, the entire table will be scanned directly

insert into t1 (a,b,c) values (7,7,7);

We now need to find all the records with a=1, and then sort them according to the b field.

The query sql is

select a,b,c from t1 where a = 1 order by b limit 2;

In order to prevent the full table scan during the query process, we added an index on field a.

First we check the sql execution plan through the statement

explain select a,b,c from t1 where a = 1 order by b lmit 2;

, as shown below:

We can see in extra Using filesort appears, which means that the sorting operation is performed during the execution of the sql. The sorting operation is completed in sort_buffer. sort_buffer is a memory buffer allocated by mysql to each thread. This buffer is specially used to complete sorting. The size is default It is 1M, and its size is controlled by the variable sort_buffer_size.

When mysql implements orderby, it implements two different implementation methods according to the different field contents put into sort_buffer: full field sorting and rowid sorting.

Full field sorting

First of all, let’s take a look at the sql execution process through a picture:

mysql is first determined based on the query conditions The data set that needs to be sorted is the data set with a=1 in the table, that is, the records with primary key IDs from 1 to 6.

The entire sql execution process is as follows:

1. Create and initialize sort_buffer, and determine the fields that need to be placed in the buffer, that is, a, b ,c these three fields.

2. Find the first primary key id that satisfies a=1 from index tree a, that is, id=1.

3. Return to the table to the id index, take out the entire row of data, and then take out the values of a, b, and c from the entire row of data and put them into sort_buffer.

4. Find the next primary key id of a=1 in order from index a.

5. Repeat steps 3 and 4 until the last record with a=1 is obtained, that is, the primary key id=5.

6. At this time, the a, b, and c fields of all records that meet the condition a=1 are all read and placed in the sort_buffer. Then, these data are sorted according to the value of b. The sorting method is Quick sort. It is the quick sort that is often encountered in interviews, and the time complexity of quick sort is log2n.

7. Then take out the first 2 rows of data from the sorted result set.

The above is the execution process of orderby in msql. Because the data put into sort_buffer is all the fields that need to be output, this sorting is called full sorting.

I wonder if you have any questions after seeing this? What should I do if the amount of data that needs to be sorted is large and the sort_buffer cannot fit in it?

Indeed, if there are a lot of data rows with a=1, and there are many fields that need to be stored in sort_buffer, there may be more than three fields a, b, and c. Some businesses may need to output more fields. Then the sort_buffer with a default size of only 1M may not be able to accommodate it.

When sort_buffer cannot accommodate it, mysql will create a batch of temporary disk files to assist sorting. By default, 12 temporary files will be created, and the data to be sorted will be divided into 12 parts. Each part will be sorted separately to form 12 internal data ordered files, and then these 12 ordered files will be merged into an ordered file. Large files, and finally complete the sorting of the data.

File-based sorting is much less efficient than memory-based sorting. In order to improve the efficiency of sorting, you should try to avoid file-based sorting. If you want to avoid file-based sorting, you need to make sort_buffer Accommodates the amount of data that needs to be sorted.

So mysql has been optimized for situations where sort_buffer cannot accommodate it. It is to reduce the number of fields stored in sort_buffer during sorting.

The specific optimization method is the following rowId sorting

RowId 排序

在全字段排序实现中，排序的过程中，要把需要输出的字段全部放到sort_buffer中，当输出的字段比较多的时候，可以放到sort_buffer中的数据行就会变少。也就增大了sort_buffer无法容纳数据的风险，直至出现基于文件的排序。

rowId排序对全字段排序的优化手段，主要是减少了放到sort_buffer中字段个数。

在rowId排序中，只会将需要排序的字段和主键Id放到sort_buffer中。

select a,b,c from t1 where a = 1 order by b limit 2;

在rowId的排序中的执行流程如下：

1.初始化并创建sort_buffer，并确认要放入的的字段，id和b。

2.从索引树a中找到第一个满足a=1的主键id，也就是id=1。

3.回表主键索引id，取出整行数据，从整行数据中取出id和b，存入sort_buffer中。

4.从索引a中取出下一条满足a=1的记录的主键id。

5.重复步骤3和4，直到最后一个满足a=1的主键id，也就是a=6。

6.对sort_buffer中的数据，按照字段b排序。

7.从sort_buffer中的有序数据集中，取出前2个，因为此时取出的数据只有id和b，要想获取a和c字段，需要根据id字段，回表到主键索引中取出整行数据，从整行数据中获取需要的数据。

根据rowId排序的执行步骤，可以发现：相比全字段排序，rowId排序的实现方式，减少了存放到sort_buffer中的数据量，降低了基于文件的外部排序的可能性。

那rowid排序有不足的地方吗？肯定有的，要不然全字段排序就没有存在的意义了。rowid排序不足之处在于，在最后的步骤7中，增加了回表的次数，不过这个回表的次数，取决于limit后的值，如果返回的结果集比较小的话，回表的次数还是比较小的。

mysql是如何在全字段排序和rowId排序的呢？其实是根据存放的sort_buffer中每行字段的长度决定的，如果mysql认为每次放到sort_buffer中的数据量很大的话，那么就用rowId排序实现，否则使用全字段排序。那么多大算大呢？这个大小的阈值有一个变量的值来决定，这个变量就是 max_length_for_sort_data。如果每次放到sort_buffer中的数据大小大于该字段值的话，就使用rowId排序，否则使用全字段排序。

orderby的优化

上面讲述了orderby的两种排序的方式，以及一些优化策略，优化的目的主要就是避免基于磁盘文件的外部排序。因为基于磁盘文件的排序效率要远低于基于sort_buffer的内存排序。

但是当数据量比较大的时候，即使sort_buffer比较大，所有数据全部放在内存中排序，sql的整体执行效率也不高，因为排序这个操作，本身就是比较消耗性能的。

试想，如果基于索引a获取到所有a=1的数据，按照字段b，天然就是有序的，那么就不用执行排序操作，直接取出来的数据，就是符合结果的数据集，那么sql的执行效率就会大幅度增长。

其实要实现整个sql执行过程中，避免排序操作也不难，只需要创建一个a和b的联合索引即可。

alter table t1 add index a_b (a,b);

添加a和b的联合索引后，sql执行流程就变成了：

1.从索引树(a,b)中找到第一个满足a=1的主键id，也就是id=1。

2.回表到主键索引树，取出整行数据，并从中取出a，b，c，直接作为结果集的一部分返回。

3.从索引树(a,b)上取出下一个满足a=1的主键id。

4.重复步骤2和3，直到找到第二个满足a=1的主键id,并回表获取字段a，b，c。

此时我们可以通过查看sql的执行计划，来判断sql的执行过程中是否执行了排序操作。

explain select a,b from t1 where a = 1 order by b lmit 2;

通过查看执行计划，我们发现extra中已经没有了using filesort了，也就是没有执行排序操作了。

其实还可以通过覆盖索引，对该sql进一步优化，通过在索引中覆盖字段c，来避免回表的操作。

alter table t1 add index a_b_c (a,b,c);

添加索引a_b_c后，sql的执行过程如下：

1.从索引树(a,b,c)中找到第一个满足a=1的索引，从中取出a，b，c。直接作为结果集的一部分直接返回。

2.从索引(a,b,c)中取出下一个，满足a=1的记录作为结果集的一部分。

3.重复执行步骤2，直到查到第二个a=1或者不满足a=1的记录。

此时通过查看执行sql的的还行计划可以发现 extra中只有 Using index。

explain select a,b from t1 where a = 1 order by b lmit 2;

Summary

Through multiple optimizations of this SQL, the final execution efficiency of SQL is basically the same as the query efficiency of ordinary SQL without sorting. The reason why the orderby sorting operation can be avoided is to take advantage of the naturally ordered characteristics of the index.

But we all know that indexes can speed up query efficiency, but the maintenance cost of indexes is relatively high. Adding and modifying data in the data table will involve changes in the indexes, so the more indexes, the better. , Sometimes, it is not worth adding too many indexes just because of some uncommon queries and sorting.

[Related recommendations: mysql video tutorial]

The above is the detailed content of Let's talk about how to optimize the order By statement in SQL. For more information, please follow other related articles on the PHP Chinese website!

Statement

This article is reproduced at:脚本之家. If there is any infringement, please contact admin@php.cn delete

MySQL's Place: Databases and ProgrammingApr 13, 2025 am 12:18 AM

MySQL's position in databases and programming is very important. It is an open source relational database management system that is widely used in various application scenarios. 1) MySQL provides efficient data storage, organization and retrieval functions, supporting Web, mobile and enterprise-level systems. 2) It uses a client-server architecture, supports multiple storage engines and index optimization. 3) Basic usages include creating tables and inserting data, and advanced usages involve multi-table JOINs and complex queries. 4) Frequently asked questions such as SQL syntax errors and performance issues can be debugged through the EXPLAIN command and slow query log. 5) Performance optimization methods include rational use of indexes, optimized query and use of caches. Best practices include using transactions and PreparedStatemen

MySQL: From Small Businesses to Large EnterprisesApr 13, 2025 am 12:17 AM

MySQL is suitable for small and large enterprises. 1) Small businesses can use MySQL for basic data management, such as storing customer information. 2) Large enterprises can use MySQL to process massive data and complex business logic to optimize query performance and transaction processing.

What are phantom reads and how does InnoDB prevent them (Next-Key Locking)?Apr 13, 2025 am 12:16 AM

InnoDB effectively prevents phantom reading through Next-KeyLocking mechanism. 1) Next-KeyLocking combines row lock and gap lock to lock records and their gaps to prevent new records from being inserted. 2) In practical applications, by optimizing query and adjusting isolation levels, lock competition can be reduced and concurrency performance can be improved.

MySQL: Not a Programming Language, But...Apr 13, 2025 am 12:03 AM

MySQL is not a programming language, but its query language SQL has the characteristics of a programming language: 1. SQL supports conditional judgment, loops and variable operations; 2. Through stored procedures, triggers and functions, users can perform complex logical operations in the database.

MySQL: An Introduction to the World's Most Popular DatabaseApr 12, 2025 am 12:18 AM

MySQL is an open source relational database management system, mainly used to store and retrieve data quickly and reliably. Its working principle includes client requests, query resolution, execution of queries and return results. Examples of usage include creating tables, inserting and querying data, and advanced features such as JOIN operations. Common errors involve SQL syntax, data types, and permissions, and optimization suggestions include the use of indexes, optimized queries, and partitioning of tables.

The Importance of MySQL: Data Storage and ManagementApr 12, 2025 am 12:18 AM

MySQL is an open source relational database management system suitable for data storage, management, query and security. 1. It supports a variety of operating systems and is widely used in Web applications and other fields. 2. Through the client-server architecture and different storage engines, MySQL processes data efficiently. 3. Basic usage includes creating databases and tables, inserting, querying and updating data. 4. Advanced usage involves complex queries and stored procedures. 5. Common errors can be debugged through the EXPLAIN statement. 6. Performance optimization includes the rational use of indexes and optimized query statements.

Why Use MySQL? Benefits and AdvantagesApr 12, 2025 am 12:17 AM

MySQL is chosen for its performance, reliability, ease of use, and community support. 1.MySQL provides efficient data storage and retrieval functions, supporting multiple data types and advanced query operations. 2. Adopt client-server architecture and multiple storage engines to support transaction and query optimization. 3. Easy to use, supports a variety of operating systems and programming languages. 4. Have strong community support and provide rich resources and solutions.

Describe InnoDB locking mechanisms (shared locks, exclusive locks, intention locks, record locks, gap locks, next-key locks).Apr 12, 2025 am 12:16 AM

InnoDB's lock mechanisms include shared locks, exclusive locks, intention locks, record locks, gap locks and next key locks. 1. Shared lock allows transactions to read data without preventing other transactions from reading. 2. Exclusive lock prevents other transactions from reading and modifying data. 3. Intention lock optimizes lock efficiency. 4. Record lock lock index record. 5. Gap lock locks index recording gap. 6. The next key lock is a combination of record lock and gap lock to ensure data consistency.

See all articles

Hot AI Tools

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress images for free

Clothoff.io

AI clothes remover

AI Hentai Generator

Generate AI Hentai for free.

Hot Article

R.E.P.O. Energy Crystals Explained and What They Do (Yellow Crystal)

3 weeks agoBy尊渡假赌尊渡假赌尊渡假赌

R.E.P.O. Best Graphic Settings

3 weeks agoBy尊渡假赌尊渡假赌尊渡假赌

Assassin's Creed Shadows: Seashell Riddle Solution

2 weeks agoByDDD

R.E.P.O. How to Fix Audio if You Can't Hear Anyone

3 weeks agoBy尊渡假赌尊渡假赌尊渡假赌

WWE 2K25: How To Unlock Everything In MyRise

4 weeks agoBy尊渡假赌尊渡假赌尊渡假赌

Hot Tools

SublimeText3 Chinese version

Chinese version, very easy to use

mPDF

mPDF is a PHP library that can generate PDF files from UTF-8 encoded HTML. The original author, Ian Back, wrote mPDF to output PDF files "on the fly" from his website and handle different languages. It is slower than original scripts like HTML2FPDF and produces larger files when using Unicode fonts, but supports CSS styles etc. and has a lot of enhancements. Supports almost all languages, including RTL (Arabic and Hebrew) and CJK (Chinese, Japanese and Korean). Supports nested block-level elements (such as P, DIV),

DVWA

Damn Vulnerable Web App (DVWA) is a PHP/MySQL web application that is very vulnerable. Its main goals are to be an aid for security professionals to test their skills and tools in a legal environment, to help web developers better understand the process of securing web applications, and to help teachers/students teach/learn in a classroom environment Web application security. The goal of DVWA is to practice some of the most common web vulnerabilities through a simple and straightforward interface, with varying degrees of difficulty. Please note that this software

Dreamweaver Mac version

Visual web development tools

SecLists

SecLists is the ultimate security tester's companion. It is a collection of various types of lists that are frequently used during security assessments, all in one place. SecLists helps make security testing more efficient and productive by conveniently providing all the lists a security tester might need. List types include usernames, passwords, URLs, fuzzing payloads, sensitive data patterns, web shells, and more. The tester can simply pull this repository onto a new test machine and he will have access to every type of list he needs.

Hot Topics

Where is the login entrance for gmail email?

7486

CakePHP Tutorial

1377

What is the format of the account name of steam

win11 activation key permanent

nyt connections hints and answers