How to expand the fields of large tables under MySQL cascade replication-Mysql Tutorial-php.cn

Home

Database

Mysql Tutorial

How to expand the fields of large tables under MySQL cascade replication

WBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWB

May 29, 2023 pm 11:13 PM

mysql

Field expansion of large tables under MySQL cascade replication

1. Background

A customer’s business has a table with about 400 million rows. Due to business expansion, the open_id in the table varchar(50) needs to be expanded to varchar(500).
During the change, try to minimize the impact on the main library (it is best not to have any impact -> In the end, we strived for a 4-hour window period).

2. Database table information

Environment: Mysql 8.0.22
1 Master 1 slave replication based on Gtid

1. The first question, this is a big picture Table? Yes, please look at the ibd file of this table. 280G count has not returned for a long time. Use the standby database to check and confirm the number of rows > 400 million.

以下语句也可以查看：
show table status from dbname like &#39;tablename&#39;\G # Rows 的值不准，有时误差有2倍

SELECT a.table_schema,a.table_name,concat(round(sum(DATA_LENGTH/1024/1024)+sum(INDEX_LENGTH/1024/1024),2) ,&#39;MB&#39;)total_size,concat(round(sum(DATA_LENGTH/1024/1024),2),&#39;MB&#39;) AS data_size,concat(round(sum(INDEX_LENGTH/1024/1024),2),&#39;MB&#39;) AS index_size FROM information_schema.TABLES a WHERE a.table_schema = &#39;dbname&#39; AND a.table_name = &#39;tablename&#39;; #看下此表的数据量

Since it is Big table, what method should we use to make changes?

3. Solution selection

M in the following represents the main database, S1 is slave 1, and S2 is slave 2

MethodOnlineDDLGh-ostPt-osc##M-S1-S2Time can be estimatedCascade replication, manual operation1 star #Why didn’t we choose the first 3 options?

Advantages	Disadvantages	Feasibility
Native, when using intermediate temporary table	ALGORITHM=COPY, DML will be blocked. Recommended version>MySQL5.7	5 stars
Use binlog playback thread instead of trigger	Third-party tools, which lead to longer execution time depending on different parameters	4 stars
Good version compatibility, use triggers to keep the main and secondary tables consistent	Third-party tools, and there are many restrictions on use	3 stars

Based on the actual situation assessment, the demand on the business side this time is
This table has business traffic 24 hours a day, and does not accept business unavailability for more than 4 hours

OnlineDDL method, when ALGORITHM=COPY, DML will be blocked during the period (read-only). Finally, during the rename operation of the main and secondary tables (not readable and writable), until DDL is completed (the time required is uncertain).

It is recommended to connect to the slave library in Gh-ost mode for conversion. This mode has the least impact on the main library, and the flow control can be set through parameters. The fatal weakness of this tool is that the change takes too long. For a table with 400 million pieces of data, it took 70 hours in the test environment. Finally, we also need to issue a switching command and manually delete the intermediate table *_del. The advantage of this solution is that there is a slave database to ensure data security, so if you use a 1 master 2 slave architecture, it is more recommended.

Pt-osc and Gh-ost both belong to third parties. Pt-osc's operation of large tables and OnlineDDL have a common disadvantage, which is that the cost of rollback on failure is very high.

If it is a lower version such as MySQL Finally we chose the DBA’s favorite (xin ku) method, which is performed under M-S1-S2 cascade replication.

4. How to operate

Create a new S1 slave library and build M-S1-S2 cascade replication

Use OnlineDDL to perform field expansion on S2 (the advantage is that the master of M-S1 is never affected during the period)
After the expansion is completed, wait for delayed synchronization of M-S1-S2 (reduce S2 Data difference with M, and perform data verification)
Remove S1 and establish a master-slave relationship between M-S2 (allowing S2 to continue synchronizing M's data)
Backup S2 and restore S1, establish M-S2-S1 cascade replication
Stop the application and wait for the master-slave data to be consistent (the advantage is the synchronization time of different data amounts Very short)
Finally S2 becomes the main library and S1 is the slave library (the application needs to modify the front-end connection information)
Apply for regression verification

环境装备:开启Gtid,注意M,S1 binlog保存时长,磁盘剩余空间大于待变更表的2倍
show global variables like &#39;binlog_expire_logs_seconds&#39;; # 默认604800
set global binlog_expire_logs_seconds=1209600; # 主库和级联主库都需要设置
1.搭建 1主2从的级联复制，M -> S1 -> S2 ，安装MySQL注意本次环境lower_case_table_names = 0 
2.在S2 上做字段扩容。 预估 10个小时
`参数设置:`
set global slave_type_conversions=&#39;ALL_NON_LOSSY&#39;; # 防止复制报错SQL_Errno: 13146，属于字段类型长度不一致无法回放
set global interactive_timeout=144000;set global wait_timeout =144000;
`磁盘IO参数设置:`
set global innodb_buffer_pool_size=32*1024*1024*1024;# 增加buffer_pool 防止Error1206The total number of locks exceeds the lock table size 资源不足
set global sync_binlog=20000;set global innodb_flush_log_at_trx_commit=2;
set global innodb_io_capacity=600000;set global innodb_io_capacity_max=1200000; # innodb_io_capacity需要设置两次
show variables like &#39;%innodb_io%&#39;; # 验证以上设置
screen 下执行:
time mysql -S /data/mysql/3306/data/mysqld.sock -p&#39;&#39; dbname -NBe "ALTER TABLE tablename MODIFY COLUMN open_id VARCHAR(500) NULL DEFAULT NULL COMMENT &#39;Id&#39; COLLATE &#39;utf8mb4_bin&#39;;"
查看DDL进度:
SELECT EVENT_NAME, WORK_COMPLETED, WORK_ESTIMATED  FROM performance_schema.events_stages_current;
3.扩容完成后，等待延迟同步M-S1-S2 
数据同步至主从一致,对比主从Gtid
4.移除S1,建立M-S2的主从关系
S1 (可选)
stop slave;
reset slave all;
systemctl stop mysql_3306
S2
stop slave;
reset slave all;
# MASTER_HOST=&#39;M主机IP&#39;  
CHANGE MASTER TO
  MASTER_HOST=&#39;&#39;,
  MASTER_USER=&#39;&#39;,
  MASTER_PASSWORD=&#39;,
  MASTER_PORT=3306,
  MASTER_AUTO_POSITION=1,
  MASTER_CONNECT_RETRY=10;
start slave; (flush privileges;# 验证数据可正常同步)
5.备份S2恢复S1，建立M-S2-S1级联复制
物理备份S2,重做S2->S1 级联主从
rm -rf binlog/*
rm -rf redolog/*
xtrabackup --defaults-file=/data/mysql/3306/my.cnf.3306 --move-back --target-dir=/data/actionsky/xtrabackup_recovery/data
chown -R mysql. data/
chown -R mysql. binlog/*
chown -R mysql. redolog/*
systemctl start mysql_3306
set global gtid_purged=&#39;&#39;;
reset slave all;
# MASTER_HOST=&#39;S2主机IP&#39;  ,已扩容变更完的主机
CHANGE MASTER TO
  MASTER_HOST=&#39;&#39;,
  MASTER_USER=&#39;&#39;,
  MASTER_PASSWORD=&#39;&#39;,
  MASTER_PORT=3306,
  MASTER_AUTO_POSITION=1,
  MASTER_CONNECT_RETRY=10;
`MySQL8.0版本需要在上面语句中添加 GET_MASTER_PUBLIC_KEY=1; #防止 Last_IO_Errno: 2061 message: Authentication plugin &#39;caching_sha2_password&#39; reported error: Authentication requires secure connection.`
start slave;
6.应用停服，等待主从数据一致
主库停服+可设置read_only+flush privileges，对比主从Gtid
7.最终S2成为主库，S1为从库
应用更改配置连接新主库。
S2上:
stop slave;reset slave all;
set global read_only=0;set global super_read_only=0;
`show master status\G 观察是否有新事务写入`

收尾：还原第2步的参数设置。
set global interactive_timeout=28800;set global wait_timeout =28800;
set global innodb_buffer_pool_size=8*1024*1024*1024;
set global slave_type_conversions=&#39;&#39;;
set global sync_binlog=1;set global innodb_flush_log_at_trx_commit=1;
set global innodb_io_capacity=2000;set global innodb_io_capacity_max=4000;

Supplementary scenario: Test based on disk IO capability

Modify directly on the main database without traffic Case:

Scenario 1, the disk is a physical machine of NVME, and it takes about 5 hours to process 400 million data (disk performance 1G/s).

Scenario 2, the virtual machine whose disk is a mechanical disk, this amount of data takes about 40 hours (disk performance 100M/s).

The above is the detailed content of How to expand the fields of large tables under MySQL cascade replication. For more information, please follow other related articles on the PHP Chinese website!

Statement

This article is reproduced at:亿速云. If there is any infringement, please contact admin@php.cn delete

What Are the Limitations of Using Views in MySQL?May 14, 2025 am 12:10 AM

MySQLviewshavelimitations:1)Theydon'tsupportallSQLoperations,restrictingdatamanipulationthroughviewswithjoinsorsubqueries.2)Theycanimpactperformance,especiallywithcomplexqueriesorlargedatasets.3)Viewsdon'tstoredata,potentiallyleadingtooutdatedinforma

Securing Your MySQL Database: Adding Users and Granting PrivilegesMay 14, 2025 am 12:09 AM

ProperusermanagementinMySQLiscrucialforenhancingsecurityandensuringefficientdatabaseoperation.1)UseCREATEUSERtoaddusers,specifyingconnectionsourcewith@'localhost'or@'%'.2)GrantspecificprivilegeswithGRANT,usingleastprivilegeprincipletominimizerisks.3)

What Factors Influence the Number of Triggers I Can Use in MySQL?May 14, 2025 am 12:08 AM

MySQLdoesn'timposeahardlimitontriggers,butpracticalfactorsdeterminetheireffectiveuse:1)Serverconfigurationimpactstriggermanagement;2)Complextriggersincreasesystemload;3)Largertablesslowtriggerperformance;4)Highconcurrencycancausetriggercontention;5)M

MySQL: Is it safe to store BLOB?May 14, 2025 am 12:07 AM

Yes,it'ssafetostoreBLOBdatainMySQL,butconsiderthesefactors:1)StorageSpace:BLOBscanconsumesignificantspace,potentiallyincreasingcostsandslowingperformance.2)Performance:LargerrowsizesduetoBLOBsmayslowdownqueries.3)BackupandRecovery:Theseprocessescanbe

MySQL: Adding a user through a PHP web interfaceMay 14, 2025 am 12:04 AM

Adding MySQL users through the PHP web interface can use MySQLi extensions. The steps are as follows: 1. Connect to the MySQL database and use the MySQLi extension. 2. Create a user, use the CREATEUSER statement, and use the PASSWORD() function to encrypt the password. 3. Prevent SQL injection and use the mysqli_real_escape_string() function to process user input. 4. Assign permissions to new users and use the GRANT statement.

MySQL: BLOB and other no-sql storage, what are the differences?May 13, 2025 am 12:14 AM

MySQL'sBLOBissuitableforstoringbinarydatawithinarelationaldatabase,whileNoSQLoptionslikeMongoDB,Redis,andCassandraofferflexible,scalablesolutionsforunstructureddata.BLOBissimplerbutcanslowdownperformancewithlargedata;NoSQLprovidesbetterscalabilityand

MySQL Add User: Syntax, Options, and Security Best PracticesMay 13, 2025 am 12:12 AM

ToaddauserinMySQL,use:CREATEUSER'username'@'host'IDENTIFIEDBY'password';Here'showtodoitsecurely:1)Choosethehostcarefullytocontrolaccess.2)SetresourcelimitswithoptionslikeMAX_QUERIES_PER_HOUR.3)Usestrong,uniquepasswords.4)EnforceSSL/TLSconnectionswith

MySQL: How to avoid String Data Types common mistakes?May 13, 2025 am 12:09 AM

ToavoidcommonmistakeswithstringdatatypesinMySQL,understandstringtypenuances,choosetherighttype,andmanageencodingandcollationsettingseffectively.1)UseCHARforfixed-lengthstrings,VARCHARforvariable-length,andTEXT/BLOBforlargerdata.2)Setcorrectcharacters

See all articles

Hot AI Tools

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress images for free

Clothoff.io

AI clothes remover

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

How to fix KB5055612 fails to install in Windows 10?

4 weeks agoByDDD

Roblox: Grow A Garden - Complete Mutation Guide

3 weeks agoByDDD

Roblox: Bubble Gum Simulator Infinity - How To Get And Use Royal Keys

3 weeks agoBy尊渡假赌尊渡假赌尊渡假赌

Nordhold: Fusion System, Explained

3 weeks agoBy尊渡假赌尊渡假赌尊渡假赌

Mandragora: Whispers Of The Witch Tree - How To Unlock The Grappling Hook

3 weeks agoBy尊渡假赌尊渡假赌尊渡假赌

Hot Tools

mPDF

mPDF is a PHP library that can generate PDF files from UTF-8 encoded HTML. The original author, Ian Back, wrote mPDF to output PDF files "on the fly" from his website and handle different languages. It is slower than original scripts like HTML2FPDF and produces larger files when using Unicode fonts, but supports CSS styles etc. and has a lot of enhancements. Supports almost all languages, including RTL (Arabic and Hebrew) and CJK (Chinese, Japanese and Korean). Supports nested block-level elements (such as P, DIV),