1. Why not use UUID
So, if we use the UUID string as the primary key, then every time data is inserted, it will need to find its own position in the B Tree. After finding it, it is possible to move the subsequent node (just like inserting a record into an array). Moving the subsequent node may involve page splitting, and the insertion efficiency will be reduced.
On the other hand, in a non-clustered index, the leaf node stores the primary key value. If the primary key is a long UUID string, it will occupy a larger storage space (relative to int and In other words), then the number of primary key values that can be saved by the same leaf node will be reduced, which may cause the tree to become taller, which means that the number of IOs during query increases and query efficiency decreases.
Based on the above analysis, we try not to use UUID as the primary key in MySQL. Without UUID, some friends may think, can I use the primary key to auto-increment?
Auto-increment of the primary key can obviously solve the two problems encountered when using UUID as the primary key. The primary key is auto-incremented. You only need to add it to the end of the tree each time. Basically, it will not involve the problem of page splitting. The primary key auto-increment means that the primary key is a number and the storage space occupied is relatively small. For non-clustered The impact of indexing will also be smaller.
So is auto-increment of the primary key the best solution? Are there any issues that need to be paid attention to when the primary key is automatically incremented?
2. The problem of primary key auto-increment
The following content has a common premise, which is that our table has a primary key auto-increment.
Generally speaking, there is no problem with auto-increment of the primary key. However, if you are in a high-concurrency environment, there will be problems.
First of all, the easiest thing to think of is the tail hotspot problem that occurs during high concurrent insertion. During concurrent insertion, everyone needs to query this value and then calculate their own primary key value. Then the upper bound of the primary key is It will become hot data, and lock competition will occur here during concurrent insertion.
In order to solve this problem, we need to choose the innodb_autoinc_lock_mode
that suits us.
2.1 Three forms of data insertion
First of all, when we insert data into the data table, there are generally three different forms, as follows:
insert into user(name) values('javaboy')
orreplace into user(name) values('javaboy')
, there is no nesting Query and can determine how many rows to insertThe insertion is calledsimple insert
, but it should be noted thatINSERT ... ON DUPLICATE KEY UPDATE
does not count assimple insert
.load data
orinsert into user select ... from ....
, these are batch inserts, calledbulk insert
, one feature of this bulk insert is that the number of pieces of data to be inserted is unknown at the beginning.insert into user(id,name) values(null,'javaboy'),(null,'Jiangnan Yiyidian')
, this is also a batch insert , but it is different from the second one. This type contains some automatically generated values (the primary key in this case is auto-incremented), and can determine how many rows are inserted in total. This type is calledmixed insert
, for theINSERT ... ON DUPLICATE KEY UPDATE
mentioned in the first point above, it can also be regarded as amixed insert
.
Data insertion is divided into these three categories, mainly because when the primary key is auto-incremented, the lock processing scheme is different. Let’s continue to look down.
2.2 innodb_autoinc_lock_mode
We can control the MySQL lock processing idea when the primary key is auto-incremented by controlling the value of the innodb_autoinc_lock_mode variable.
innodb_autoinc_lock_mode variable has three different values:
0: This represents traditional. In this mode, the three different values we mentioned above When inserting SQL, the solution for auto-increment locks is the same. At the beginning of the inserted SQL statement, a table-level AUTO-INC lock is obtained, and then the lock is released after the execution of the inserted SQL is completed. The advantage of this is that it can ensure that the auto-incrementing primary key is continuous during batch insertion.
1: This means consecutive. In this mode,
simple insert
(which can determine the specific number of inserted rows, corresponds to the two situations 1 and 3 above) ) has made some optimizations. Sincesimple insert
is easy to calculate how many rows to insert, several consecutive values can be generated at one time and used in the corresponding insert SQL statements, so that AUTO- can be released in advance. INC lock can reduce lock waiting and improve concurrent insertion efficiency.2: This means interleaved. In this case, there is no AUTO-INC lock. Each one is processed one by one. When inserting in batches, it is possible that although the primary key is incremented, it does not exist. Continuous questions.
As you can see from the above introduction, in fact, the third type, that is, when the value of innodb_autoinc_lock_mode is 2, the concurrency efficiency is the strongest, so should we What about setting innodb_autoinc_lock_mode=2?
It depends on the situation.
Songge has written an article before and introduced the three formats of MySQL binlog log files to friends:
row: What is recorded in the binlog is the specific value. Instead of the original SQL, to give a simple example, assume that a field in the table is UUID, and the SQL executed by the user is
insert into user(username,uuid) values('javaboy',uuid())
, Then the SQL finally recorded in the binlog isinsert into user(username,uuid) values('javaboy',‘0212cfa0-de06-11ed-a026-0242ac110004’)
.statement: What is recorded in the binlog is the original SQL. Taking the one in row as an example, what is finally recorded in the binlog is
insert into user(username,uuid) values( 'javaboy',uuid())
.mixed: In this mode, MySQL will determine the log format based on the specific SQL statement, that is, choose one between statement and row.
For these three different modes, it is obvious that the statement mode may cause inconsistency in the master-slave data during master-slave replication, so now MySQL’s default binlog format is row. .
Back to our question:
If the binlog format is row, then we can set the value of innodb_autoinc_lock_mode to 2, so as to ensure data concurrency to the greatest extent The ability to insert does not cause the problem of master-slave data inconsistency.
If the binlog format is statement, then we'd better set the value of innodb_autoinc_lock_mode to 1, so that the concurrent insertion capability of
simple insert
is improved, and batch insertion is still Acquire the AUTO-INC lock first, and then release it after the insertion is successful. This can also avoid master-slave data inconsistency and ensure the security of data replication.The above two points are mainly for the InnoDB storage engine. If it is the MyISAM storage engine, the AUTO-INC lock is obtained first, and then released after the insertion is completed, which is equivalent to the value pair of the innodb_autoinc_lock_mode variable. MyISAM does not work.
2.3 Practice
Next, let’s use a simple SQL to demonstrate to our friends how different values of innodb_autoinc_lock_mode correspond to different results.
We can use the following SQL query to view the current innodb_autoinc_lock_mode setting:
As you can see, the current default value of the version 8.0.32 I am using is 2.
I first change it to 0. The modification method is to add a line innodb_autoinc_lock_mode=0
: in the
file.
After making the changes, restart and check, as follows:
You can see that it has been changed now.
Now suppose I have the following table:
CREATE TABLE `user` ( `id` int unsigned NOT NULL AUTO_INCREMENT, `username` varchar(255) CHARACTER SET utf8mb4 COLLATE utf8mb4_unicode_ci DEFAULT NULL, PRIMARY KEY (`id`) ) ENGINE=InnoDB AUTO_INCREMENT=100 DEFAULT CHARSET=utf8mb4 COLLATE=utf8mb4_unicode_ci;
This increment starts from 100. Now suppose I have the following SQL insertion:
insert into user(id,username) values(1,'javaboy'),(null,'江南一点雨'),(3,'www.javaboy.org'),(null,'lisi');
After the insertion is completed, we Let’s look at the query results:
According to our previous introduction, this situation should be explainable, so I won’t go into details here.
Next, I changed the value of innodb_autoinc_lock_mode to 1, as follows:
Still the same SQL above, let’s execute it again. After the execution is completed, the result is the same as above.
but! ! ! **After the above SQL is executed, if we want to insert data again, and the newly inserted ID does not specify a value, we find that the automatically generated ID value is 104. **This is because we set innodb_autoinc_lock_mode=1. At this time, when executing simple insert
to insert, the system saw that I wanted to insert 4 records and directly took out 4 IDs for me in advance. They are 100, 101, 102 and 103 respectively. As a result, the SQL actually only uses two IDs, and the remaining two are useless, but the next insertion will still start from 104.
The above is the detailed content of How to solve the pitfalls encountered by MySQL primary key auto-increment. For more information, please follow other related articles on the PHP Chinese website!

本篇文章给大家带来了关于mysql的相关知识,其中主要介绍了关于架构原理的相关内容,MySQL Server架构自顶向下大致可以分网络连接层、服务层、存储引擎层和系统文件层,下面一起来看一下,希望对大家有帮助。

mysql的msi与zip版本的区别:1、zip包含的安装程序是一种主动安装,而msi包含的是被installer所用的安装文件以提交请求的方式安装;2、zip是一种数据压缩和文档存储的文件格式,msi是微软格式的安装包。

方法:1、利用right函数,语法为“update 表名 set 指定字段 = right(指定字段, length(指定字段)-1)...”;2、利用substring函数,语法为“select substring(指定字段,2)..”。

在mysql中,可以利用char()和REPLACE()函数来替换换行符;REPLACE()函数可以用新字符串替换列中的换行符,而换行符可使用“char(13)”来表示,语法为“replace(字段名,char(13),'新字符串') ”。

转换方法:1、利用cast函数,语法“select * from 表名 order by cast(字段名 as SIGNED)”;2、利用“select * from 表名 order by CONVERT(字段名,SIGNED)”语句。

本篇文章给大家带来了关于mysql的相关知识,其中主要介绍了关于MySQL复制技术的相关问题,包括了异步复制、半同步复制等等内容,下面一起来看一下,希望对大家有帮助。

在mysql中,可以利用REGEXP运算符判断数据是否是数字类型,语法为“String REGEXP '[^0-9.]'”;该运算符是正则表达式的缩写,若数据字符中含有数字时,返回的结果是true,反之返回的结果是false。

本篇文章给大家带来了关于mysql的相关知识,其中主要介绍了mysql高级篇的一些问题,包括了索引是什么、索引底层实现等等问题,下面一起来看一下,希望对大家有帮助。


Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

AI Hentai Generator
Generate AI Hentai for free.

Hot Article

Hot Tools

EditPlus Chinese cracked version
Small size, syntax highlighting, does not support code prompt function

VSCode Windows 64-bit Download
A free and powerful IDE editor launched by Microsoft

Dreamweaver Mac version
Visual web development tools

MinGW - Minimalist GNU for Windows
This project is in the process of being migrated to osdn.net/projects/mingw, you can continue to follow us there. MinGW: A native Windows port of the GNU Compiler Collection (GCC), freely distributable import libraries and header files for building native Windows applications; includes extensions to the MSVC runtime to support C99 functionality. All MinGW software can run on 64-bit Windows platforms.

PhpStorm Mac version
The latest (2018.2.1) professional PHP integrated development tool