Home >Backend Development >PHP Tutorial >Analyze the application of curl_multi in php_PHP tutorial

Analyze the application of curl_multi in php_PHP tutorial

WBOY
WBOYOriginal
2016-07-21 15:00:50747browse

I believe that many people have a headache with the curl_multi family of functions that are unclear in the PHP manual. They have few documents and the examples given are so simple that you cannot learn from them. I have also searched many web pages, but I have not found a complete application. example.
•curl_multi_add_handle
•curl_multi_close
•curl_multi_exec
•curl_multi_getcontent
•curl_multi_info_read
•curl_multi_init
•curl_multi_remove_handle
•curl_multi_select
Generally speaking, when you think of using When using these functions, the purpose should obviously be to request multiple URLs at the same time, rather than requesting them one by one. Otherwise, it is better to loop and adjust curl_exec by yourself.

The steps are summarized as follows:
Step 1: Call curl_multi_init
Step 2: Call curl_multi_add_handle in a loop
What needs to be noted in this step is that curl_multi_add_handle The second parameter is the subhandle from curl_init.
Step 3: Continue to call curl_multi_exec
Step 4: Call curl_multi_getcontent in a loop to obtain the results as needed
Step 5: Call curl_multi_remove_handle, and call curl_close for each word handle
Step 6: Call curl_multi_close

Here is a simple example found online, which the author calls a dirty example (I will explain why dirty later):

Copy codeThe code is as follows:

*
Here's a quick and dirty example for curl-multi from PHP, tested on PHP 5.0.0RC1 CLI / FreeBSD 5.2.1
*/

$connomains = array(
"http://www.cnn.com/",
"http://www.canada.com/",
"http://www. yahoo.com/"
);

$mh = curl_multi_init();

foreach ($connomains as $i => $url) {
$conn[$i]=curl_init($url);
curl_setopt($conn[$i],CURLOPT_RETURNTRANSFER,1);
curl_multi_add_handle ($mh,$conn[$i]);
}

do { $n=curl_multi_exec($mh,$active); } while ($active);

foreach ($connomains as $i => $url) {
$res[$i]=curl_multi_getcontent($conn[$i]);
curl_close($conn[$i]);
}

print_r($res);


The whole usage process is almost like this. However, this simple code has a fatal weakness, that is, in the do loop, it is dead during the entire url request. Loop, it can easily cause the CPU to occupy 100%.

Now let’s improve it. Here we need to use a function curl_multi_select that has almost no documentation. Although C’s curl library has instructions for select, the interface and usage in PHP are indeed different from those in C. .

Change the do paragraph above to the following:
Copy the code The code is as follows:

do {
                                                                                       ​while ($active and $mrc ​​== CURLM_OK) {
                                                                                                                                                                                                                                                                      $mrc ​​= curl_multi_exec($mh, $active);
                                                                                                                      🎜>                                                                                                                                                                                                                 
Because $active has to wait until all url data is received before it becomes false, so the return value of curl_multi_exec is used here to determine whether there is still data. When there is data, curl_multi_exec will be called continuously. If there is no data temporarily, it will enter the select stage. , it can be awakened to continue execution as soon as new data comes. The advantage here is that there is no unnecessary consumption of CPU.

In addition: There are some details that may sometimes be encountered:
Control the timeout of each request, do it through curl_setopt before curl_multi_add_handle:
curl_setopt ($ch, CURLOPT_TIMEOUT, $timeout);

To determine whether there is a timeout or other errors, use: curl_error($conn[$i]);

before curl_multi_getcontent

Note: Use the multi_curl function of php with caution, because there are bugs in the combination of curl and php in some versions. So the code that you debugged without problems is likely to be incorrect on other machines.

For example, today I discovered that if the CURLOPT_USERAGENT attribute is set to certain values ​​in php5.2.2 with curl/7.16.2, the actual HTTP header sent will become a string of binary values.

It just so happened that the strip_tags function in this version of php did not handle binary data well, so I discovered this bug

www.bkjia.comtruehttp: //www.bkjia.com/PHPjc/328037.htmlTechArticleI believe many people have headaches about the curl_multi family of functions that are unclear in the PHP manual. They have less documentation and more examples. It is simply that you have nothing to learn from. I have also found many web pages...
Statement:
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn