Home  >  Article  >  Backend Development  >  php抓取百度快照、百度收录、百度暖词程序代码

php抓取百度快照、百度收录、百度暖词程序代码

WBOY
WBOYOriginal
2016-06-13 13:16:12890browse

php抓取百度快照、百度收录、百度热词程序代码

代码如下 复制代码


/*
抓取百度收录代码
*/
function baidu($s){
? $baidu="http://www.baidu.com/s?wd=site%3A".$s;
? $site=file_get_contents($baidu);
? //$site=iconv("gb2312", "UTF-8", $site);
? ereg("找到相关网页(.*)篇,", $site,$count);
? $count=str_replace("找到相关网页","",$count);
? $count=str_replace("篇,","",$count);
? $count=str_replace("约","",$count);
? $count=str_replace(",","",$count);
? return $count[0];
}

echo baidu(www.hzhuti.com); //获取好主题在百度中的收录数量

?>
?

获取百度的热词

?代码如下 复制代码
/**
* * @user 小杰
* @return array 返回百度的热词数据(数组返回)
*/
function getBaiduHotKeyWord()
{
$templateRss = file_get_contents('http://top.baidu.com/rss_xml.php?p=top10');
If (preg_match('/

(.*)
/is', $templateRss, $_description)) {
$templateRss = $_description [0];
$templateRss = str_replace("&", "&", $templateRss);
}
$templateRss = "" . $templateRss;
$xml = simplexml_load_String($templateRss);
foreach ($xml->tbody->tr as $temp) {
if (!empty ($temp->td->a)) {
$keyArray [] = trim(($temp->td->a));
}
}
return $keyArray;
}
print_r(getBaiduHotKeyWord());
?


这是在网上找的 稍微修改了下 将下面代码写入php文件
百度收录和百度快照时间

?代码如下 复制代码
??? $domain = “http://www.hzhuti.com/nokia/5230/ *欲查询的域名*/
??? $site_url = ‘http://www.baidu.com/s?wd=site%3A’;
??? $all = $site_url.$domain; /*域名所有收录的网址*/
??? $today = $all.’&lm=1′;??? /*域名今日收录的网址*/
??? $utf_pattern = “/找到相关结果数(.*)个/”;
??? $kz_pattern = “/(.*)/”; /*用以匹配快照日期的字符串*/
??? $times = “/d{4}-d{1,2}-d{1,2}/”; /*匹配快照日期的正则表达式,如:2011-8-4*/
??? $s0 = @file_get_contents($all);??? /*将site:www.ninthday.net的网页置入$s0字符串中*/
??? $s1 = @file_get_contents($today);
??? preg_match($utf_pattern,$s0,$all_num); /*匹配”找到相关结果数*个”*/
??? preg_match($utf_pattern,$s1,$today_num);
??? preg_match($kz_pattern,$s0,$temp);
??? preg_match($times,$temp[0],$screenshot);
??? if($all_num[1] == “”)
??????? $all_num[1] = 0;
??? if($today_num[1] == “”)
??????? $today_num[1] = 0;
??? if($screenshot[0] == “”)
??????? $screenshot[0] = “暂无快照”;
?>

???


??? Test
???

?
???
?????
???
???
?????
???

?

日期 百度收录 百度今日收录 百度快照日期

???

百度收录:” target=”_blank”>


???

百度今日收录:” target=”_blank”>


???

百度快照日期:”>




?

上面的方法未经过严格考虑,如果服务器不支持file_get_contents函数我们就无法操作了,所以还可以利用curl操作,这个更方便可以模仿用户哦。

?

?

Statement:
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn