
Optimization Practice of a PHP Application

By WBOY (original), 2016-07-13 17:53:32

I recently revisited an optimization exercise I did a while back and found that many of its common techniques are still reusable. As a system runs over time, problems and bottlenecks of one kind or another will always appear. Problems are nothing to be afraid of; we have a tiger-fighting routine for them: locate the problem -> analyze the problem -> propose a solution -> put it into practice -> evaluate the results -> summarize, then optimize again.
Problem description: The system was built with PHP 5 and the Zend Framework. After the data volume and traffic grew into the tens of millions, the load on the back-end Apache servers became too high. During peak hours (roughly from the end of the workday until 10 pm, and especially on Fridays), the machines' load average would soar past 170. The high CPU load in turn slowed down request processing, so the problem needed solving urgently.
Problem analysis: After several consecutive days of observation, when CPU usage hit 100%, system (kernel) CPU time accounted for a large share while user CPU time was not particularly high. Meanwhile, the CPU load on the front-end HAProxy and Squid cache machines was very low, and the hit ratios of memcached and Squid were both generally around 60%.
Analyzing the back end's access log showed that a large share of the requests carried search-crawler User-Agents.
At the same time, Xdebug was configured on Apache and a set of profiling data was collected for the main pages during off-peak hours. Analyzing the data with KCachegrind (how to configure Xdebug is easy to find with a quick search on soso) showed that:
the performance numbers were unstable, with identical requests producing very different timings;
the slow spots were scattered;
in most cases access to memcached was relatively slow (more than 100 ms).
Solution: Based on the preliminary analysis above, a series of adjustments were made to the existing code, step by step.
The first thing to consider was whether the hit ratio of the front-end Squid cache could be raised, reducing the number of requests that penetrate Squid and reach the back-end Apache.
A considerable share of the requests came from crawlers, and Squid previously cached only requests that carried a language cookie, which crawler requests never have. So the idea was to default every crawler request to zh_CN, and to modify the HAProxy configuration so that requests whose User-Agent matches a common crawler are routed through the Squid cache.
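A minimal sketch of the language-defaulting part, assuming a cookie named lang and a hand-picked crawler list (both are illustrative, not the production values):

```php
<?php
// Sketch of the idea, not the production code: the cookie name and the
// crawler list are illustrative assumptions.
$ua = isset($_SERVER['HTTP_USER_AGENT']) ? $_SERVER['HTTP_USER_AGENT'] : '';
$fromCrawler = (bool) preg_match('/Googlebot|Baiduspider|Slurp|Sogou/i', $ua);

if ($fromCrawler || !isset($_COOKIE['lang'])) {
    $lang = 'zh_CN';          // crawlers send no cookies: pin them to zh_CN
} else {
    $lang = $_COOKIE['lang']; // regular visitors keep their chosen language
}
```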
The PHP code was also modified to give some pages a longer cache lifetime.
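One way to do this (assuming the lifetime is driven by standard HTTP caching headers that Squid honors; the 24-hour value is illustrative):

```php
<?php
// Sketch: mark a page as cacheable for longer so Squid can keep serving it
// without bothering Apache. The 24-hour lifetime is an illustrative value.
$lifetime = 86400;
header('Cache-Control: public, max-age=' . $lifetime);
header('Expires: ' . gmdate('D, d M Y H:i:s', time() + $lifetime) . ' GMT');
```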
These two steps did reduce the number of requests reaching Apache, but they did little for the excessive CPU load, so another approach was needed.
Second, the Xdebug profiling results showed that interaction with memcached took a long time, which raised the question of whether memcached could be made to respond faster, so that each request would finish sooner and concurrency would drop.
Code analysis revealed that the production memcached was using poll(); during busy periods it held around 1,000 connections, and its CPU usage hovered around 30%. Clearly poll() is very inefficient at handling that many concurrent connections, so memcached was recompiled to process requests with epoll(). After the switch, memcached's CPU usage dropped from about 30% to about 3%, a tenfold improvement!
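As a side note, connection counts and hit ratios like those above can be watched from PHP itself; a sketch using the pecl Memcache extension's stats call, with an illustrative hostname:

```php
<?php
// Sketch: read memcached's own counters through the pecl Memcache
// extension. The hostname is illustrative.
$mc = new Memcache();
$mc->addServer('cache1.example.com', 11211);

$stats = $mc->getExtendedStats(); // one stats array per server
foreach ($stats as $server => $s) {
    if ($s === false) {
        continue; // server unreachable
    }
    echo $server, ': ', $s['curr_connections'], " connections, ",
        $s['get_hits'], ' hits / ', $s['get_misses'], " misses\n";
}
```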
In addition, memcached's hit ratio was not particularly high and the number of evicted items was also high, so the next step was to partition the cached content. The original plan was to do the partitioning by hand, but it turned out that the latest PHP memcache extension can partition the cache by key automatically, so new memcached instances can be added without modifying the program code (only the configuration file needs to change :-)). So the PHP memcache extension on each Apache box was upgraded and a new memcached instance was added to the configuration, completing the partitioning. The effect was significant: page load times became much shorter than before.
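Roughly what the partitioned setup looks like from the PHP side (a sketch; hostnames and the sample key are made up):

```php
<?php
// Sketch: the pecl Memcache extension hashes each key to one of the
// registered servers, so partitioning needs no application code changes.
// With memcache.hash_strategy = consistent in php.ini, adding a server
// remaps only a fraction of the keys.
$mc = new Memcache();
$mc->addServer('cache1.example.com', 11211); // existing instance
$mc->addServer('cache2.example.com', 11211); // newly added instance

$mc->set('greeting', 'hello', 0, 300); // key is hashed to one server
echo $mc->get('greeting');             // same key -> same server
```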
After these two adjustments memcached was running more efficiently than before, but Apache's load was still high; time to think of something else!
Digging deeper: as mentioned earlier, system CPU time was the big consumer, and finding out why meant going down into the kernel :) From here the strace journey begins. To borrow a Nike slogan: Just strace it!
During peak hours, strace was attached to the httpd processes in the following ways:
strace -p PID -c (print a summary of syscall counts and times)
strace -p PID -o output.log (write the full trace to a file to study at leisure)
strace -p PID -e trace=file (trace only syscalls that touch the filesystem)
strace -p PID -e trace=lstat64,stat64,open,getcwd (trace only these specific syscalls)

The strace output supports the following conclusions:
there are huge numbers of lstat64, stat64, open and similar syscalls;
those syscalls really do eat the time: more than 60% of it went to them, orz;
and the vast majority of them fail, the same lookups failing again and again.
With this data in hand, the direction was clear: find out where these meaningless system calls were coming from.
Analysis showed that when PHP needs to load a class, it searches for the class file in each directory listed in include_path, trying one directory after another until the file is found. That is obviously inefficient. Is there a better way? Yes, and more than one:
when calling require_once(), pass an absolute path as the argument (the Zend Framework authors didn't appreciate this at first; later versions fixed it);
use __autoload() to load classes lazily, i.e. only when a class is actually needed, instead of require_once-ing every class file that might ever be used.
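A sketch of both approaches, with an assumed directory layout and class-to-file naming convention:

```php
<?php
// Way 1: require_once with an absolute path. PHP opens the file directly
// instead of stat()ing every directory in include_path.
require_once '/var/www/app/lib/My/Service.php';

// Way 2: lazy loading. __autoload() runs only when an unknown class is
// first used, so files that are never needed are never touched.
function __autoload($class)
{
    // illustrative convention: My_Report -> /var/www/app/lib/My/Report.php
    require '/var/www/app/lib/' . str_replace('_', '/', $class) . '.php';
}

$report = new My_Report(); // triggers __autoload() on first use
```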
The problem was found, but one wrinkle remained. Our own code had been written with absolute paths from the start, so the only improvement available there was switching to lazy loading. The Zend Framework, however, contained a large number of require_once calls with relative paths, and that turned out to be the root cause of the excessive CPU load this article is about.
OK, the cause was identified, so on to the fix: write a script that automatically generates the Class -> File Path mapping for every class in our code and in the Zend Framework, then comment out every require_once in both. After thorough testing before going live, the result was astonishing: the load dropped to below 3!! Problem solved.
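A minimal sketch of such a map-based loader, assuming the generator writes out a PHP file that returns an array (the file name and paths are illustrative):

```php
<?php
// Sketch: load the generated Class -> File Path map once, then resolve
// every class through it; no include_path scanning, no failed stat()s.
function classmap_autoload($class)
{
    static $map = null;
    if ($map === null) {
        // produced offline by a script that scans our code and the Zend
        // Framework, e.g. array('Zend_Db' => '/usr/share/php/Zend/Db.php', ...)
        $map = include '/var/www/app/classmap.php';
    }
    if (isset($map[$class])) {
        require $map[$class];
    }
}
spl_autoload_register('classmap_autoload');
```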
Summary:
Anyone who writes code knows that wherever a problem can occur, sooner or later it will, and every problem has a cause (even if it hasn't been found yet); fixing it at the root is the best way. The particular problem solved here matters less than the approach and the good use of tools, which I hope everyone takes away. OK, that's it for this case.
