Home  >  Article  >  Database  >  Efficiency comparison of four insertion methods in Mysql

Efficiency comparison of four insertion methods in Mysql

零下一度
零下一度Original
2017-05-03 11:18:011615browse

This article introduces to you the efficiency comparison of several insertion methods in Mysql through examples, including four methods: item-by-item insertion, transaction-based batch insertion, single statement inserting multiple sets of data at a time, and importing data files. In comparison, the article introduces it in detail through example code. Friends who need it can come down and take a look together.

Preface

Recently, due to work needs, a large amount of data of about 10 million has to be inserted into Mysql, and visual inspection will be time-consuming. So now it's like testing which method to insert data is faster and more efficient.

The following will test the insertion efficiency under different data amounts for each method.

The basics and operations of the test database are as follows:

mysql> create database test;
Query OK, 1 row affected (0.02 sec)
mysql> use test;
Database changed
mysql> create table mytable(id int primary key auto_increment ,value varchar(50));
Query OK, 0 rows affected (0.35 sec)
mysql> desc mytable;
+-------+-------------+------+-----+---------+----------------+
| Field | Type | Null | Key | Default | Extra  |
+-------+-------------+------+-----+---------+----------------+
| id | int(11) | NO | PRI | NULL | auto_increment |
| value | varchar(50) | YES | | NULL |  |
+-------+-------------+------+-----+---------+----------------+
2 rows in set (0.02 sec)

To facilitate testing, a table is built here with two fields, one is the auto-incremented id, and the other is The string represents the content.

When testing, you must mysql> truncate mytable at the end of each experiment to clear the existing table.

Method 1: Insert one by one

Test code: (There are 1000 insert statements in the middle. It is more convenient to copy and paste with vim. After writing, save it to a.sql, and then enter source a.sql in the mysql prompt)

set @start=(select current_timestamp(6));
insert into mytable values(null,"value");
......
insert into mytable values(null,"value");
set @end=(select current_timestamp(6));
select @start;
select @end;

Output result:

Query OK, 1 row affected (0.03 sec)
......
Query OK, 1 row affected (0.03 sec)
Query OK, 0 rows affected (0.00 sec)
+----------------------------+
| @start   |
+----------------------------+
| 2016-05-05 23:06:51.267029 |
+----------------------------+
1 row in set (0.00 sec)
+----------------------------+
| @end   |
+----------------------------+
| 2016-05-05 23:07:22.831889 |
+----------------------------+
1 row in set (0.00 sec)

It takes a total of 31.56486s, in fact almost every statement The time it takes is about the same, basically 30ms.

In this way, 1000w of data will take 87h.

As for the larger amount of data, I will not try it. This method is definitely not advisable.

Method 2: Transaction-based batch insertion

In fact, it means putting so many queries in one transaction. In fact, every statement in method one opens a transaction, so it is particularly slow.

Test code: (Basically similar to method 1, mainly adding two lines. Because it is faster, a variety of data volumes are tested here)

set @start=(select current_timestamp(6));
start transaction;
insert into mytable values(null,"value");
......
insert into mytable values(null,"value");
commit;
set @end=(select current_timestamp(6));
select @start;
select @end;

Test results:

数据量 时间(s)
1k  0.1458
1w  1.0793
10w 5.546006
100w 38.930997

It can be seen that it is basically logarithmic time, and the efficiency is relatively high.

Method 3: A single statement inserts multiple sets of data at once

means one insert inserts multiple values ​​at once.

Test code:

insert into mytable values (null,"value"),
    (null,"value"),
    ......
    (null,"value");

Test result:

数据量 时间(s)
1k  0.15
1w  0.80
10w 2.14
100w *

It also seems to be logarithmic time, and it is slightly faster than method 2. However, the problem is that there is a buffer size limit for a single SQL statement. Although the configuration can be modified to make it larger, it cannot be too large. Therefore, it cannot be used when inserting large amounts of data.

Method 4: Import data file

Write the numerical data into a data file and import it directly (refer to the previous section).

Data file (a.dat):

null value
null value
.....
null value
null value

Test code:

mysql> load data local infile "a.dat" into table mytable;

Test result:

数据量 时间(s)
1k  0.13
1w  0.75
10w 1.97
100w 6.75
1000w 58.18

The one with the fastest time is him. . . .

The above is the detailed content of Efficiency comparison of four insertion methods in Mysql. For more information, please follow other related articles on the PHP Chinese website!

Statement:
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn