search
HomeDatabaseMysql TutorialDetails of the evolution of MySQL architecture from small to large

Assume that a website (discuz) has a small number of visits from the beginning to tens of millions of daily PVs. Let's speculate on the evolution of its mysql server architecture.

The first stage
The daily pv level of website visits is below 10,000. A single machine runs web and db, and there is no need to do architectural layer tuning (for example, there is no need to increase the memcached cache). At this time, data is often cold backed up daily, but sometimes if data security is considered, a mysql master-slave will be set up.

Second stage
The number of daily visits to the website has reached tens of thousands. At this time, the single machine is already somewhat loaded. We need to separate web and db, and we need to build a memcached service as a cache. In other words, at this stage, we can also use a single machine to run mysql to undertake the data storage and query of the entire website. If you do MySQL master-slave, the purpose is also for data security.

The third stage
The number of daily visits to the website has reached hundreds of thousands. Although a single machine can also support it, the machine configuration required is much better than the previous machine. If funds allow, you can purchase a machine with high configuration to run the MySQL service. However, this does not mean that doubling the configuration will double the performance. At a certain stage, increasing the configuration will no longer bring about an increase in performance. Therefore, at this stage, we will think of clustering mysql services, which means that we can use multiple machines to run MySQL. However, MySQL clusters are different from web clusters. We need to consider data consistency, so we cannot simply apply the methods of web clustering (lvs, nginx proxy). The possible architecture is mysql master-slave, one master and multiple slaves. In order to ensure the robustness of the architecture and data integrity, there can only be one master and multiple slaves.

There is another question that we need to think about, that is, in the front-end web layer, our program specifies the IP of the MySQL machine. So when there are multiple mysql machines, what happens in the program? To configure? discuz actually has a function that supports MySQL read and write separation. That is, we can use multiple machines to run MySQL, one of which is for writing, and the others are for reading. We only need to configure the reading and writing IPs into the program, and the program will automatically distinguish the machines. Of course, if we do not use the configuration that comes with discuz, we can also refer to a software called mysql-proxy and use it to achieve read-write separation. It supports one master and multiple slaves mode.

The fourth stage
The number of website visits reaches several million per day. The previous one-master-multiple-slave model has encountered bottlenecks, because when the number of website visits increases, the amount of database reading will also increase. We need to add more slaves, but when the number of slaves increases to dozens, due to the main All bin-logs need to be distributed to all slaves, so this process itself is a very cumbersome matter. Coupled with frequent reading, it will inevitably cause a large delay in the data synchronized from the slaves. Therefore, we can make an optimization,change the original one master and multiple slaves of mysql to one master and one slave, and then the slave will serve as the master of other slaves. The former master is only responsible for writing website business, and the subsequent slaves It is not responsible for any business of the website, only responsible for synchronizing the bin-log to other slaves. In this way, you can continue to stack several more slave libraries.

The fifth stage
When the daily pv of website visits reached 10 million, we found that the number of writes to the website It is very large. There was only one master in our previous architecture, and the master here has become a bottleneck. Therefore, further adjustments are needed. For example, we can divide the business into modules, separate user-related ones, separate permissions, points, etc., and run a separate library, and then use it as a master-slave, which is the so-called sub-library. Of course, you can also change the latitude, separate tables with large access or write volume, and run them on a server. You can also divide a table into multiple small tables. This step of operation involves some procedural changes, so it is necessary to communicate and design with development colleagues in advance. In short, what we need to do in this step is sub-database and sub-table.

Write at the back
Going forward, just continue to divide the large table into small tables. The domestic Alibaba Taobao website has a huge amount of data. All their databases are MySQL. Their MySQL architecture follows the principle of sub-database and sub-table. However, their division rules will have many latitudes, such as division by region. , can be divided according to buyers and sellers, can be divided according to time, etc.

The above is the detailed content of the evolution process of MySQL architecture from small to large. For more related content, please pay attention to the PHP Chinese website (www.php.cn)!


Statement
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn
What are stored procedures in MySQL?What are stored procedures in MySQL?May 01, 2025 am 12:27 AM

Stored procedures are precompiled SQL statements in MySQL for improving performance and simplifying complex operations. 1. Improve performance: After the first compilation, subsequent calls do not need to be recompiled. 2. Improve security: Restrict data table access through permission control. 3. Simplify complex operations: combine multiple SQL statements to simplify application layer logic.

How does query caching work in MySQL?How does query caching work in MySQL?May 01, 2025 am 12:26 AM

The working principle of MySQL query cache is to store the results of SELECT query, and when the same query is executed again, the cached results are directly returned. 1) Query cache improves database reading performance and finds cached results through hash values. 2) Simple configuration, set query_cache_type and query_cache_size in MySQL configuration file. 3) Use the SQL_NO_CACHE keyword to disable the cache of specific queries. 4) In high-frequency update environments, query cache may cause performance bottlenecks and needs to be optimized for use through monitoring and adjustment of parameters.

What are the advantages of using MySQL over other relational databases?What are the advantages of using MySQL over other relational databases?May 01, 2025 am 12:18 AM

The reasons why MySQL is widely used in various projects include: 1. High performance and scalability, supporting multiple storage engines; 2. Easy to use and maintain, simple configuration and rich tools; 3. Rich ecosystem, attracting a large number of community and third-party tool support; 4. Cross-platform support, suitable for multiple operating systems.

How do you handle database upgrades in MySQL?How do you handle database upgrades in MySQL?Apr 30, 2025 am 12:28 AM

The steps for upgrading MySQL database include: 1. Backup the database, 2. Stop the current MySQL service, 3. Install the new version of MySQL, 4. Start the new version of MySQL service, 5. Recover the database. Compatibility issues are required during the upgrade process, and advanced tools such as PerconaToolkit can be used for testing and optimization.

What are the different backup strategies you can use for MySQL?What are the different backup strategies you can use for MySQL?Apr 30, 2025 am 12:28 AM

MySQL backup policies include logical backup, physical backup, incremental backup, replication-based backup, and cloud backup. 1. Logical backup uses mysqldump to export database structure and data, which is suitable for small databases and version migrations. 2. Physical backups are fast and comprehensive by copying data files, but require database consistency. 3. Incremental backup uses binary logging to record changes, which is suitable for large databases. 4. Replication-based backup reduces the impact on the production system by backing up from the server. 5. Cloud backups such as AmazonRDS provide automation solutions, but costs and control need to be considered. When selecting a policy, database size, downtime tolerance, recovery time, and recovery point goals should be considered.

What is MySQL clustering?What is MySQL clustering?Apr 30, 2025 am 12:28 AM

MySQLclusteringenhancesdatabaserobustnessandscalabilitybydistributingdataacrossmultiplenodes.ItusestheNDBenginefordatareplicationandfaulttolerance,ensuringhighavailability.Setupinvolvesconfiguringmanagement,data,andSQLnodes,withcarefulmonitoringandpe

How do you optimize database schema design for performance in MySQL?How do you optimize database schema design for performance in MySQL?Apr 30, 2025 am 12:27 AM

Optimizing database schema design in MySQL can improve performance through the following steps: 1. Index optimization: Create indexes on common query columns, balancing the overhead of query and inserting updates. 2. Table structure optimization: Reduce data redundancy through normalization or anti-normalization and improve access efficiency. 3. Data type selection: Use appropriate data types, such as INT instead of VARCHAR, to reduce storage space. 4. Partitioning and sub-table: For large data volumes, use partitioning and sub-table to disperse data to improve query and maintenance efficiency.

How can you optimize MySQL performance?How can you optimize MySQL performance?Apr 30, 2025 am 12:26 AM

TooptimizeMySQLperformance,followthesesteps:1)Implementproperindexingtospeedupqueries,2)UseEXPLAINtoanalyzeandoptimizequeryperformance,3)Adjustserverconfigurationsettingslikeinnodb_buffer_pool_sizeandmax_connections,4)Usepartitioningforlargetablestoi

See all articles

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

Video Face Swap

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Tools

SAP NetWeaver Server Adapter for Eclipse

SAP NetWeaver Server Adapter for Eclipse

Integrate Eclipse with SAP NetWeaver application server.

MinGW - Minimalist GNU for Windows

MinGW - Minimalist GNU for Windows

This project is in the process of being migrated to osdn.net/projects/mingw, you can continue to follow us there. MinGW: A native Windows port of the GNU Compiler Collection (GCC), freely distributable import libraries and header files for building native Windows applications; includes extensions to the MSVC runtime to support C99 functionality. All MinGW software can run on 64-bit Windows platforms.

SecLists

SecLists

SecLists is the ultimate security tester's companion. It is a collection of various types of lists that are frequently used during security assessments, all in one place. SecLists helps make security testing more efficient and productive by conveniently providing all the lists a security tester might need. List types include usernames, passwords, URLs, fuzzing payloads, sensitive data patterns, web shells, and more. The tester can simply pull this repository onto a new test machine and he will have access to every type of list he needs.

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor

ZendStudio 13.5.1 Mac

ZendStudio 13.5.1 Mac

Powerful PHP integrated development environment