Detailed analysis of the MyCAT (3) rule of MySQL distributed cluster (picture and text)


黄舟

Mar 11, 2017 pm 02:22 PM

The previous article introduced SCHEMA. This article will introduce Rule and Server together~
First up is Rule. This file holds the sharding rules; this time we only pick out the methods that are used most often. Let's first look at the contents of the configuration file.
The upper part of the screenshot holds the rule definitions, and the lower part shows the actual sharding function that each rule maps to. Here I introduce the following four sharding methods (murmur did not make the cut~).
-------------------- hash-int --------------------
Let's look at hash-int first. This sharding rule has a mapfile, which means the sharding is decided by the content of partition-hash-int, so let's take a look at that text file.
The content is very simple: for the column used as the sharding basis, a row whose value is 10000 is placed in the first DN (dn1), and a row whose value is 10010 is placed in the second DN (dn2).
You can take a look at the actual effect.

In the debug log, the two statements are routed to dn1 and dn2 respectively, and the corresponding rows are also inserted into the databases.
What happens if the value of the sharding column in an inserted row is not one of the values listed in this file?
This method can roughly be understood as an enumeration partition. It suits columns whose set of values is fixed, such as gender (0, 1), province (a fixed set that will not change in the short term), channel dealer, or the IDs of various platforms. With comma separation, multiple values can be placed in one partition, so you can shape the sharding strategy around the actual data volume, traffic, and access volume. It is not a one-size-fits-all solution, though ╮(╯_╰)╭
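The enumeration lookup described above can be sketched roughly as follows. This is only an illustration, not MyCAT's own code: the `value=nodeIndex` line syntax, the zero-based node index, and the function names `parse_mapfile` and `hash_int` are all assumptions made for the example. Because the article does not show what MyCAT does with a value missing from the file, the sketch simply returns `None` in that case.

```python
def parse_mapfile(text):
    """Parse lines like '10000=0' or '1,2,3=1' into {value: node_index}.

    The file syntax is an assumption for this sketch.
    """
    mapping = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        values, node = line.split("=")
        # Several comma-separated values may share one partition.
        for value in values.split(","):
            mapping[int(value)] = int(node)
    return mapping


def hash_int(value, mapping):
    """Return the node index for a value, or None if it is not listed."""
    return mapping.get(value)


mapping = parse_mapfile("10000=0\n10010=1\n")
dn_names = ["dn1", "dn2"]  # node index 0 is dn1, index 1 is dn2

first = dn_names[hash_int(10000, mapping)]
second = dn_names[hash_int(10010, mapping)]
unlisted = hash_int(12345, mapping)
```

Here `first` is `dn1` and `second` is `dn2`, matching the two inserts described in the text, while `unlisted` is `None`.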

-------------------- range-long --------------------
The second sharding method is range-long. If you look closely, it is quite similar to hash-int. From the content of the file you can see that it divides by range: you define ranges over the sharding column, and all data in a given range goes to one DN. The usage is basically the same as hash-int, so there is no screenshot. (Yes, pure laziness!)
Personally, I feel this strategy has fewer use cases in a business database, because the overall data volume has to be decided in advance, which means it cannot be used for data that grows without bound. After all, changing the sharding strategy later is very troublesome. It fits data that can be divided evenly by a known quantity, such as a business that produces a fixed X rows a day (temperature collection? data collection? and so on), where multiple DNs (databases) can be built in advance.
There is a second problem. Suppose rows arrive in key order and each DN is set to store 10 million rows (1000W). At any given time one DN (sub-database) will then be under very high IO pressure, while the other DNs (sub-databases) see no IO at all. This is similar to the familiar hot block/hot disk phenomenon in databases, and since MySQL tables often use auto-incrementing primary keys, large runs of "sequential" inserts are all the more likely.
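The hot-spot effect can be seen in a small sketch. The range table below is hypothetical (three DNs of 10 million ids each, following the 1000W figure in the text), and `range_long` is an illustrative name rather than MyCAT's implementation. Values outside every range return `None` here because the article does not show how MyCAT handles them.

```python
from collections import Counter

# Hypothetical range table: (low, high, node_index), 10 million ids per DN.
RANGES = [
    (0, 9_999_999, 0),
    (10_000_000, 19_999_999, 1),
    (20_000_000, 29_999_999, 2),
]


def range_long(value):
    """Return the node index whose range contains value, else None."""
    for low, high, node in RANGES:
        if low <= value <= high:
            return node
    return None


# 1000 sequential auto-increment ids all land on the same DN.
load = Counter(range_long(row_id) for row_id in range(1, 1001))

# An id past the last pre-built range has nowhere to go.
overflow = range_long(30_000_000)
```

`load` ends up with every insert on node 0, and `overflow` is `None`, which illustrates why the total volume has to be planned before the ranges are written.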
-------------------- mod-long --------------------
mod-long: judging from "mod", this should be a remainder (modulo) method. Let's take a look at the specific configuration. With it, the data is spread evenly across four DNs (of course, a count smaller than the number of DNs is not a problem). Look at the actual effect.
Look at MyCAT's debug log to see how MyCAT handles it.
The rows are spread across the multiple DNs (databases) above.
Compared with the range method above, this sharding strategy disperses database write pressure better, but the problem is also obvious. Once a range query occurs, MyCAT has to merge the results, and when the data volume is high, the time consumed by this kind of cross-database query plus result merging may increase a lot, especially when ORDER BY is involved.
For example, when querying personal account information, some tables holding user information can be kept redundantly and sharded with this method to provide more efficient queries (after all, a bank has a large number of users, eh~)
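A minimal sketch of the trade-off, assuming four DNs. The names `mod_long` and `DN_COUNT` are illustrative, and `heapq.merge` only stands in for the result merging step; the article does not show how MyCAT merges internally.

```python
import heapq
from collections import Counter

DN_COUNT = 4  # assumed number of DNs for this sketch


def mod_long(value, count=DN_COUNT):
    """Route a row by the remainder of its sharding column."""
    return value % count


# Writes: 1000 sequential ids are spread evenly over the four DNs.
write_load = Counter(mod_long(row_id) for row_id in range(1, 1001))

# Range query for ids 100..110: every DN holds part of the answer.
per_node = {}
for row_id in range(100, 111):
    per_node.setdefault(mod_long(row_id), []).append(row_id)

# Each DN returns its own sorted slice; an ordered result needs a merge.
merged = list(heapq.merge(*per_node.values()))
```

Each node receives 250 of the 1000 inserts, but the eleven-row range query touches all four nodes before `merged` can be produced in order.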

-------------------- partition-by-long --------------------
partition-by-long is a sharding strategy that sits somewhere between range-long and mod-long. The partitioning works as follows: with 1024 as the unit, each DN stores partitionLength worth of data, and partitionCount x partitionLength = 1024.
That seems a bit hard to understand. To make it concrete, take partitionCount(4) x partitionLength(256) as an example: rows with sid % 1024 = 0-255 are placed in DN1, 256-511 in DN2, and so on. I tried inserting eight rows with an offset of 128 between values; look directly at the MyCAT log.
Rows whose values fall in the same 256-wide slot end up inside the same DN~

These two pictures basically also explain the sharding strategy for non-uniform distribution. This strategy is a compromise between range-long and mod-long. At the same time it is relatively flexible and can be divided non-uniformly to suit different situations. It should have somewhat more practical use cases; put another way, it fits many scenarios: it relatively reduces cross-DN access, spreads the data evenly, and single-point queries will not be too slow.
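The 1024-slot arithmetic can be sketched like this. The helper names `build_slots` and `partition_by_long` are made up for the example, and passing the counts and lengths as lists (so that non-uniform layouts can be expressed) is an assumption about how such a configuration might be modelled, not a copy of MyCAT's code.

```python
def build_slots(counts, lengths, total=1024):
    """Build a slot table: slots[i] is the node index that owns slot i.

    counts[k] nodes each receive lengths[k] consecutive slots.
    """
    slots = []
    node = 0
    for count, length in zip(counts, lengths):
        for _ in range(count):
            slots.extend([node] * length)
            node += 1
    if len(slots) != total:
        raise ValueError("sum of count x length must equal %d" % total)
    return slots


def partition_by_long(value, slots):
    """Route a row by value % 1024 through the slot table."""
    return slots[value % len(slots)]


# Uniform case from the text: partitionCount(4) x partitionLength(256).
uniform = build_slots([4], [256])
rows = [i * 128 for i in range(8)]  # eight values, 128 apart
placement = [partition_by_long(sid, uniform) for sid in rows]

# A hypothetical non-uniform layout: two nodes of 256 slots, one of 512.
uneven = build_slots([2, 1], [256, 512])
big_node = partition_by_long(600, uneven)
```

In the uniform case `placement` is `[0, 0, 1, 1, 2, 2, 3, 3]` (node index 0 corresponds to DN1), so the eight rows land two to a DN. In the uneven layout, slot 600 belongs to the third, larger node.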

-------------------- Closing remarks --------------------

In fact, MyCAT supports many sharding methods. For example, time-based strategies can shard by month, by day, and so on. There is no way to cover every strategy here, sorry o(￣ヘ￣o#). From a personal point of view, for time-based division the database's own partitioning works without any problem; half-yearly and quarterly data will still need to be queried anyway... PS: _(:з　∠)_ I am really not lazy... It is fair to say that the key points of MyCAT's database and table sharding are basically all reflected in Rule. Whether a table should be split, and how its data should be split, has to be decided based on the actual business; the most appropriate sharding strategy follows from the characteristics of the business~
