Home >Backend Development >PHP Tutorial >Implementation code for PHP multi-threaded batch collection and downloading of images

Implementation code for PHP multi-threaded batch collection and downloading of images

WBOY
WBOYOriginal
2016-07-25 08:44:171110browse
Use curl's multi-threading. In addition, curl can set the request time. If you encounter a very slow URL resource, you can give up decisively, so there is no blocking. In addition, with multi-threaded requests, the efficiency should be relatively high.
  1. /**
  2. * curl multi-threading
  3. * @param array $array parallel URL
  4. * @param int $timeout timeout time
  5. * @return mix
  6. */
  7. public function Curl_http($array,$timeout='15'){
  8. $res = array();
  9. $mh = curl_multi_init();//Create multiple curl handle
  10. foreach($array as $k=>$url){
  11. $conn[$k]=curl_init($url);//Initialization
  12. curl_setopt($conn[$k], CURLOPT_TIMEOUT, $timeout);//Set the timeout period
  13. curl_setopt($conn[$k], CURLOPT_USERAGENT, 'Mozilla/5.0 (compatible; MSIE 5.01; Windows NT 5.0)');
  14. curl_setopt($conn[$k], CURLOPT_MAXREDIRS, 7);//HTTp orientation level, 7 is the highest
  15. curl_setopt($conn[$k], CURLOPT_HEADER, false);//No header here, increase block efficiency
  16. curl_setopt($conn[$k], CURLOPT_FOLLOWLOCATION, 1); // 302 redirect
  17. curl_setopt($conn[$k], CURLOPT_RETURNTRANSFER,1); //Require the result to be a string and output it to the screen
  18. curl_setopt($conn[$k], CURLOPT_HTTPGET, true);
  19. curl_multi_add_handle ( $mh,$conn[$k]);
  20. }
  21. //Prevent the endless loop from consuming the CPU. This paragraph is based on the writing method on the Internet
  22. do {
  23. $mrc ​​= curl_multi_exec($mh,$active);//When there is no data , active=true
  24. } while ($mrc == CURLM_CALL_MULTI_PERFORM);//When data is being received
  25. while ($active and $mrc ​​== CURLM_OK) {//When there is no data or the request is suspended, active=true
  26. if (curl_multi_select($mh) != -1) {
  27. do {
  28. $mrc ​​= curl_multi_exec($mh, $active);
  29. } while ($mrc == CURLM_CALL_MULTI_PERFORM);
  30. }
  31. }
  32. foreach ($array as $k => $url) {
  33. if(!curl_errno($conn[$k])){
  34. $data[$k]=curl_multi_getcontent($conn[$k]);//Convert data to array
  35. $header[$k]=curl_getinfo($conn[$k]);//Return http header information
  36. curl_close($conn[$k]);//Close the handle
  37. curl_multi_remove_handle($mh, $conn[$k] ]); //Release resources
  38. }else{
  39. unset($k,$url);
  40. }
  41. }
  42. curl_multi_close($mh);
  43. return $data;
  44. }
  45. //Parameter reception
  46. $ callback = $_GET['callback'];
  47. $hrefs = $_GET['hrefs'];
  48. $urlarray = explode(',',trim($hrefs,','));
  49. $date = date(' Ymd',time());
  50. //Instantiation
  51. $img = new HttpImg();
  52. $stime = $img->getMicrotime();//Start time
  53. $data = $img->Curl_http ($urlarray,'20');//List data
  54. mkdir('./img/'.$date,0777);
  55. foreach ((array)$data as $k=>$v){
  56. preg_match_all( "/(href|src)=(["|']?)([^ "'>]+.(jpg|png|PNG|JPG|gif))\2/i", $v, $matches[ $k]);
  57. if(count($matches[$k][3])>0){
  58. $dataimg = $img->Curl_http($matches[$k][3],'20' );//All image data binary
  59. $j = 0;
  60. foreach ((array)$dataimg as $kk=>$vv){
  61. if($vv !=''){
  62. $rand = rand(1000 ,9999);
  63. $basename = time()."_".$rand.".".jpg;//Save as a jpg format file
  64. $fname = './img/'.$date."/" ."$basename";
  65. file_put_contents($fname, $vv);
  66. $j++;
  67. echo "Create the ".$j."th picture"."$fname"."
    ";
  68. }else{
  69. unset($kk,$vv);
  70. }
  71. }
  72. }else{
  73. unset($matches);
  74. }
  75. }
  76. $etime = $img->getMicrotime();//End time
  77. echo "Time".($etime-$stime)."Seconds";
  78. exit;
Copy code

Multi-threading, download pictures, PHP


Statement:
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn