Home  >  Article  >  Backend Development  >  php curl detects whether a webpage is indexed by Baidu_PHP tutorial

php curl detects whether a webpage is indexed by Baidu_PHP tutorial

WBOY
WBOYOriginal
2016-07-20 11:15:19648browse

<?<span php
</span><span /*</span><span 
* 检测百度是否收录网页 curl模式
* @ param string $url传入的url
* powed by Jin Gao<br />* return int (1 收录 0 不收录)
</span><span */</span>
<span function</span> checkBaidu(<span $url</span><span ){
</span><span $url</span>='http://www.baidu.com/s?wd='.<span $url</span><span ;

</span><span $curl</span>=<span curl_init();
curl_setopt(</span><span $curl</span>,CURLOPT_URL,<span $url</span><span );
curl_setopt(</span><span $curl</span>,CURLOPT_RETURNTRANSFER,1<span );
</span><span $rs</span>=curl_exec(<span $curl</span><span );
curl_close(</span><span $curl</span><span );

</span><span $arr</span>=<span parse_url</span>(<span $url</span><span );

</span><span if</span>(<span strpos</span>(<span $arr</span>['query'],'http://'<span )){
</span><span $arr</span>['query']=<span str_replace</span>('http://','',<span str_replace</span>('wd=','',<span $arr</span>['query'<span ]));
}</span><span else</span><span {
</span><span $arr</span>['query']=<span str_replace</span>('wd=','',<span $arr</span>['query'<span ]);
}

</span><span if</span>(<span strpos</span>(<span $arr</span>['query'],'?'<span )){
</span><span $str</span>=<span strstr</span>(<span $arr</span>['query'],'?'<span );
</span><span $arr</span>['query']=<span str_replace</span>(<span $str</span>,'',<span $arr</span>['query'<span ]);
}

</span><span if</span>(<span strpos</span>(<span $arr</span>['query'],'/'<span )){
</span><span $narr</span>=<span explode</span>('/',<span $arr</span>['query'<span ]);
</span><span $arr</span>['query']=<span $narr</span>[0<span ];
}

</span><span if</span>(<span strpos</span>(<span $rs</span>,'<b>'.<span $arr</span>['query'].'</b>'<span )){
</span><span return</span> 1<span ;
}</span><span else</span><span {
</span><span return</span> 0<span ;
} 
}

</span><span echo</span> checkBaidu('http://www.ahjk.cn/'<span );
</span>?>

The other is the most painful method as shown in the picture

You can use the preg_match function to detect whether there is the text "" to determine whether to include

<span function</span> checkbaidu(<span $key</span><span ){
        </span><span $url</span>="http://www.baidu.com/s?wd=".<span $key</span><span ;
        </span><span $html</span> = @<span file_get_contents</span>(<span $url</span><span );
        </span><span $html</span> = <span iconv</span>('utf-8','gbk//TRANSLIT',<span $html</span><span );
        </span><span if</span>(<span preg_match</span>('/(抱歉,没有找到与)/', <span $html</span><span )) {
          </span><span return</span> 0<span ;
        } </span><span else</span><span  {
                 </span><span return</span> 1<span ;
        }
}            </span>

I have tested it and it is completely ok.

www.bkjia.comtruehttp: //www.bkjia.com/PHPjc/440181.htmlTechArticle? * return int (1 included 0 not included) checkBaidu( ='http://www.baidu.com /s?wd='. = ,CURLOPT_URL, ,CURLOPT_RETURNTRANSFER,1 =curl_exec( = ( ( ( ['query'],'http://' ['query']= ('http...
Statement:
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn