Home  >  Article  >  Backend Development  >  What to do if php collects article pictures but does not display them

What to do if php collects article pictures but does not display them

藏色散人
藏色散人Original
2021-10-19 09:29:142679browse

Solutions to the problem that php collected article images are not displayed: 1. Collect Baidu search results based on keywords; 2. Convert the collected HTML in batches based on the defined domain name.

What to do if php collects article pictures but does not display them

The operating environment of this article: windows7 system, PHP7.1 version, DELL G3 computer

What should I do if the pictures collected by php are not displayed? ?

Solution to the problem that PHP CURL collects Baidu search result images and does not display them

1. Collect Baidu search results based on keywords

Based on Keyword collection Baidu search results can be implemented using curl. The code is as follows:

<p><?php <br/>function doCurl($url, $data=array(), $header=array(), $timeout=30){<br>  $ch = curl_init();<br>  curl_setopt($ch, CURLOPT_URL, $url);<br>  curl_setopt($ch, CURLOPT_HTTPHEADER, $header);<br>  curl_setopt($ch, CURLOPT_RETURNTRANSFER, true);<br>  curl_setopt($ch, CURLOPT_FOLLOWLOCATION, true);<br>  curl_setopt($ch, CURLOPT_TIMEOUT, $timeout);<br>  $response = curl_exec($ch);<br>  if($error=curl_error($ch)){<br>    die($error);<br>  }<br>  curl_close($ch);<br>  return $response;<br>}<br>$wd = '仙剑奇侠传';<br>$url = 'http://www.baidu.com/s?wd='.urlencode($wd);<br>$data = array();<br>$header = array();<br>$response = doCurl($url, $data, $header, 5);<br>echo $response;<br>?><br></p>

After output, it is found that some pictures cannot be displayed

What to do if php collects article pictures but does not display them

2. After collection Analysis of the reasons why pictures are not displayed

Search directly in Baidu, and the page can display pictures. Use firebug to check the image path and find that the domain name of the collected image is different from the domain name of the image searched on Baidu.

Collected returned image domain name t11.baidu.com

What to do if php collects article pictures but does not display them

Normal search image domain name ss1.baidu.com

What to do if php collects article pictures but does not display them

Check the collected and normal search html, and found that the js for domain name conversion is different

Collection

<p>var list = {<br>    "graph.baidu.com": "http://graph.baidu.com",<br>    "t1.baidu.com":"http://t1.baidu.com",<br>    "t2.baidu.com":"http://t2.baidu.com",<br>    "t3.baidu.com":"http://t3.baidu.com",<br>    "t10.baidu.com":"http://t10.baidu.com",<br>    "t11.baidu.com":"http://t11.baidu.com",<br>    "t12.baidu.com":"http://t12.baidu.com",<br>    "i7.baidu.com":"http://i7.baidu.com",<br>    "i8.baidu.com":"http://i8.baidu.com",<br>    "i9.baidu.com":"http://i9.baidu.com",<br>};<br></p>

Normal search

<p>var list = {<br>    "graph.baidu.com": "https://sp0.baidu.com/-aYHfD0a2gU2pMbgoY3K",<br>    "t1.baidu.com":"https://ss0.baidu.com/6ON1bjeh1BF3odCf",<br>    "t2.baidu.com":"https://ss1.baidu.com/6OZ1bjeh1BF3odCf",<br>    "t3.baidu.com":"https://ss2.baidu.com/6OV1bjeh1BF3odCf",<br>    "t10.baidu.com":"https://ss0.baidu.com/6ONWsjip0QIZ8tyhnq",<br>    "t11.baidu.com":"https://ss1.baidu.com/6ONXsjip0QIZ8tyhnq",<br>    "t12.baidu.com":"https://ss2.baidu.com/6ONYsjip0QIZ8tyhnq",<br>    "i7.baidu.com":"https://ss0.baidu.com/73F1bjeh1BF3odCf",<br>    "i8.baidu.com":"https://ss0.baidu.com/73x1bjeh1BF3odCf",<br>    "i9.baidu.com":"https://ss0.baidu.com/73t1bjeh1BF3odCf",<br>};<br></p>

So it can The conclusion is that Baidu determines if it is collected based on the source address, IP, header and other parameters, and returns different js.

3. Solution to the problem that images are not displayed after collection

Just do a batch conversion of the collected HTML according to the defined domain name.

<p><?php <br/>function doCurl($url, $data=array(), $header=array(), $timeout=30){<br>  $ch = curl_init();<br>  curl_setopt($ch, CURLOPT_URL, $url);<br>  curl_setopt($ch, CURLOPT_HTTPHEADER, $header);<br>  curl_setopt($ch, CURLOPT_RETURNTRANSFER, true);<br>  curl_setopt($ch, CURLOPT_FOLLOWLOCATION, true);<br>  curl_setopt($ch, CURLOPT_TIMEOUT, $timeout);<br>  $response = curl_exec($ch);<br>  if($error=curl_error($ch)){<br>    die($error);<br>  }<br>  curl_close($ch);<br>  return $response;<br>}<br>// 域名转换<br>function cdomain($str){<br>  $baidu_domain = '{<br>    "http://graph.baidu.com": "https://sp0.baidu.com/-aYHfD0a2gU2pMbgoY3K",<br>    "http://p.qiao.baidu.com":"https://sp0.baidu.com/5PoXdTebKgQFm2e88IuM_a",<br>    "http://vse.baidu.com":"https://sp3.baidu.com/6qUDsjip0QIZ8tyhnq",<br>    "http://hdpreload.baidu.com":"https://sp3.baidu.com/7LAWfjuc_wUI8t7jm9iCKT-xh_",<br>    "http://lcr.open.baidu.com":"https://sp2.baidu.com/8LUYsjW91Qh3otqbppnN2DJv",<br>    "http://kankan.baidu.com":"https://sp3.baidu.com/7bM1dzeaKgQFm2e88IuM_a",<br>    "http://xapp.baidu.com":"https://sp2.baidu.com/yLMWfHSm2Q5IlBGlnYG",<br>    "http://dr.dh.baidu.com":"https://sp0.baidu.com/-KZ1aD0a2gU2pMbgoY3K",<br>    "http://xiaodu.baidu.com":"https://sp0.baidu.com/yLsHczq6KgQFm2e88IuM_a",<br>    "http://sensearch.baidu.com":"https://sp1.baidu.com/5b11fzupBgM18t7jm9iCKT-xh_",<br>    "http://s1.bdstatic.com":"https://ss1.bdstatic.com/5eN1bjq8AAUYm2zgoY3K",<br>    "http://olime.baidu.com":"https://sp0.baidu.com/8bg4cTva2gU2pMbgoY3K",<br>    "http://app.baidu.com":"https://sp2.baidu.com/9_QWsjip0QIZ8tyhnq",<br>    "http://i.baidu.com":"https://sp0.baidu.com/74oIbT3kAMgDnd_",<br>    "http://c.baidu.com":"https://sp0.baidu.com/9foIbT3kAMgDnd_",<br>    "http://sclick.baidu.com":"https://sp0.baidu.com/5bU_dTmfKgQFm2e88IuM_a",<br>    "http://nsclick.baidu.com":"https://sp1.baidu.com/8qUJcD3n0sgCo2Kml5_Y_D3",<br>    "http://sestat.baidu.com":"https://sp1.baidu.com/5b1ZeDe5KgQFm2e88IuM_a",<br>    "http://eclick.baidu.com":"https://sp3.baidu.com/-0U_dTmfKgQFm2e88IuM_a",<br>    "http://api.map.baidu.com":"https://sp2.baidu.com/9_Q4sjOpB1gCo2Kml5_Y_D3",<br>    "http://ecma.bdimg.com":"https://ss1.bdstatic.com/-0U0bXSm1A5BphGlnYG",<br>    "http://ecmb.bdimg.com":"https://ss0.bdstatic.com/-0U0bnSm1A5BphGlnYG",<br>    "http://t1.baidu.com":"https://ss0.baidu.com/6ON1bjeh1BF3odCf",<br>    "http://t2.baidu.com":"https://ss1.baidu.com/6OZ1bjeh1BF3odCf",<br>    "http://t3.baidu.com":"https://ss2.baidu.com/6OV1bjeh1BF3odCf",<br>    "http://t10.baidu.com":"https://ss0.baidu.com/6ONWsjip0QIZ8tyhnq",<br>    "http://t11.baidu.com":"https://ss1.baidu.com/6ONXsjip0QIZ8tyhnq",<br>    "http://t12.baidu.com":"https://ss2.baidu.com/6ONYsjip0QIZ8tyhnq",<br>    "http://i7.baidu.com":"https://ss0.baidu.com/73F1bjeh1BF3odCf",<br>    "http://i8.baidu.com":"https://ss0.baidu.com/73x1bjeh1BF3odCf",<br>    "http://i9.baidu.com":"https://ss0.baidu.com/73t1bjeh1BF3odCf",<br>    "http://b1.bdstatic.com":"https://ss0.bdstatic.com/9uN1bjq8AAUYm2zgoY3K",<br>    "http://ss.bdimg.com":"https://ss1.bdstatic.com/5aV1bjqh_Q23odCf",<br>    "http://opendata.baidu.com":"https://sp0.baidu.com/8aQDcjqpAAV3otqbppnN2DJv",<br>    "http://api.open.baidu.com":"https://sp0.baidu.com/9_Q4sjW91Qh3otqbppnN2DJv",<br>    "http://tag.baidu.com":"https://sp1.baidu.com/6LMFsjip0QIZ8tyhnq",<br>    "http://f3.baidu.com":"https://sp2.baidu.com/-uV1bjeh1BF3odCf",<br>    "http://s.share.baidu.com":"https://sp0.baidu.com/5foZdDe71MgCo2Kml5_Y_D3",  <br>    "http://bdimg.share.baidu.com":"https://ss1.baidu.com/9rA4cT8aBw9FktbgoI7O1ygwehsv",<br>    "http://1.su.bdimg.com":"https://ss0.bdstatic.com/k4oZeXSm1A5BphGlnYG",<br>    "http://2.su.bdimg.com":"https://ss1.bdstatic.com/kvoZeXSm1A5BphGlnYG",<br>    "http://3.su.bdimg.com":"https://ss2.bdstatic.com/kfoZeXSm1A5BphGlnYG",<br>    "http://4.su.bdimg.com":"https://ss3.bdstatic.com/lPoZeXSm1A5BphGlnYG",<br>    "http://5.su.bdimg.com":"https://ss0.bdstatic.com/l4oZeXSm1A5BphGlnYG",<br>    "http://6.su.bdimg.com":"https://ss1.bdstatic.com/lvoZeXSm1A5BphGlnYG",<br>    "http://7.su.bdimg.com":"https://ss2.bdstatic.com/lfoZeXSm1A5BphGlnYG",<br>    "http://8.su.bdimg.com":"https://ss3.bdstatic.com/iPoZeXSm1A5BphGlnYG"<br>  }';<br>  $domain = json_decode($baidu_domain, true);<br>  foreach($domain as $k=>$v){<br>    $str = str_replace($k, $v, $str);<br>  }<br>  return $str;<br>}<br>$wd = '仙剑奇侠传';<br>$url = 'http://www.baidu.com/s?wd='.urlencode($wd);<br>$data = array();<br>$header = array();<br>$response = doCurl($url, $data, $header, 5);<br>echo cdomain($response); // 调用域名转换<br>?><br></p>

After adding domain name conversion, all pictures can be displayed normally.

What to do if php collects article pictures but does not display them

Recommended study: "PHP Video Tutorial"

The above is the detailed content of What to do if php collects article pictures but does not display them. For more information, please follow other related articles on the PHP Chinese website!

Statement:
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn