Fetching page content with PHP

2016-06-23 14:11 · 1242 views · Original

I want to extract part of a page's content. The code is below, but the result is empty and I don't know why.

$con = file_get_contents("http://www.meishij.net/zuofa/gouqiboluoyinertang_5.html");
$preg = '#<div>(.*)</div>#';
preg_match_all($preg, $con, $arr); // store the matches in $arr
print_r($arr);

I just want to extract the content between an opening tag and its matching closing tag.

Replies (solutions)

In $preg, make the capture group non-greedy: (.*?)
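To make the difference concrete, here is a minimal sketch comparing the greedy and non-greedy forms of the pattern. The sample markup is invented for illustration and is not taken from the real page. The `s` modifier is an addition so that `.` also matches newlines, which real pages almost always contain.

```php
<?php
// Made-up sample markup, only for demonstration.
$html = '<div>first</div><p>gap</p><div>second</div>';

// Greedy: (.*) runs to the LAST </div>, so both blocks collapse into one match.
preg_match_all('#<div>(.*)</div>#s', $html, $greedy);

// Non-greedy: (.*?) stops at the FIRST </div> after each <div>.
preg_match_all('#<div>(.*?)</div>#s', $html, $lazy);

print_r($greedy[1]);
print_r($lazy[1]);
```

The non-greedy form still knows nothing about nesting, so it only helps when the target block contains no inner tag of the same kind.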

After the fetch, $con is empty, so of course there is nothing to match.

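This approach doesn't look right to me: the match starts at the opening tag and ends at the last closing tag, but those two are not necessarily a matching open/close pair.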
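One way around the pairing problem is to let an HTML parser work out which tags belong together. The sketch below uses PHP's DOM extension. The helper name `divTextsByClass`, the class name `materials`, and the sample markup are all hypothetical, and the class name is assumed to be a plain word with no quotes in it.

```php
<?php
// Hypothetical helper: return the text of every <div> that carries the given class.
function divTextsByClass(string $html, string $class): array
{
    $doc = new DOMDocument();
    // Real-world HTML is rarely valid; collect parser warnings instead of printing them.
    $previous = libxml_use_internal_errors(true);
    $doc->loadHTML($html);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    $xpath = new DOMXPath($doc);
    // Match the class as a whole word inside the class attribute.
    $query = sprintf(
        '//div[contains(concat(" ", normalize-space(@class), " "), " %s ")]',
        $class
    );

    $texts = [];
    foreach ($xpath->query($query) as $node) {
        $texts[] = trim($node->textContent);
    }
    return $texts;
}

// Made-up markup with a nested <div>, the case a regex pairs incorrectly.
$sample = '<div class="outer"><div class="materials">tremella <div>goji</div></div></div>';
print_r(divTextsByClass($sample, 'materials'));
```

Because the parser builds a tree, the inner `<div>` stays inside its parent and the closing tags are paired correctly. For a page in a non-ASCII encoding, the character set may also need to be declared before parsing.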

Indeed, $con comes back empty. I don't know why this page's content can't be fetched.

$con= file_get_contents("http://www.meishij.net/zuofa/gouqiboluoyinertang_5.html");
echo $con;

Can anyone explain why $con is empty?
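When a fetch comes back empty, it can help to look at what the server actually answered before guessing. The following is a rough sketch, assuming the request goes through PHP's http stream wrapper, which fills `$http_response_header` in the calling scope. The helper name `lastHttpStatus` and the sample header lines are made up for illustration.

```php
<?php
// Hypothetical helper: pull the numeric status out of the header lines that
// PHP's http wrapper exposes (after redirects, the last status line wins).
function lastHttpStatus(array $headerLines): ?int
{
    $status = null;
    foreach ($headerLines as $line) {
        if (preg_match('#^HTTP/\S+\s+(\d{3})#', $line, $m)) {
            $status = (int) $m[1];
        }
    }
    return $status;
}

// Usage sketch (needs network access, so it is left commented out):
// $con = @file_get_contents($url);
// if ($con === false || $con === '') {
//     var_dump(lastHttpStatus($http_response_header ?? []), error_get_last());
// }

// Invented header lines, only to show the helper at work.
echo lastHttpStatus([
    'HTTP/1.1 301 Moved Permanently',
    'Location: /x',
    'HTTP/1.1 403 Forbidden',
]), "\n";
```

Note that `file_get_contents()` returns `false` on failure, and `echo false` prints nothing, so a failed request and an empty body look identical unless the return value is compared with `===`.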

$ch = curl_init();
curl_setopt($ch, CURLOPT_URL, "http://www.meishij.net/zuofa/gouqiboluoyinertang_5.html");
curl_setopt($ch, CURLOPT_USERAGENT, "Mozilla/4.0 (compatible; MSIE 5.01; Windows NT 5.0)");
curl_setopt($ch, CURLOPT_RETURNTRANSFER, 1);
$con = curl_exec($ch);
curl_close($ch);
echo $con;

The reply above sets a request header and gets a normal response, so why doesn't file_get_contents() work?!

It works just the same. The site checks whether the request comes from a browser, so you have to spoof the User-Agent.

$url = 'http://www.meishij.net/zuofa/gouqiboluoyinertang_5.html';
$opts = array(
    'http' => array(
        'user_agent' => "Mozilla/4.0 (compatible; MSIE 5.01; Windows NT 5.0)",
    )
);
$context = stream_context_create($opts);
$con = file_get_contents($url, false, $context);
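The same idea can be wrapped in a small reusable helper. This is only a sketch: the function name `browserContext`, the User-Agent string, the 10-second timeout, and the `ignore_errors` flag are additions for illustration and are not part of the thread.

```php
<?php
// Hypothetical helper: build a stream context that sends a browser-like User-Agent.
function browserContext(string $userAgent, float $timeout = 10.0)
{
    return stream_context_create([
        'http' => [
            'user_agent'    => $userAgent,
            'timeout'       => $timeout, // seconds; assumed value
            'ignore_errors' => true,     // return the body even on 4xx/5xx so it can be inspected
        ],
    ]);
}

$context = browserContext('Mozilla/5.0 (compatible; ExampleFetcher/1.0)');

// Show the options the context will apply to the request.
print_r(stream_context_get_options($context));

// The actual fetch needs network access, so it is left commented out:
// $con = file_get_contents($url, false, $context);
```

Passing the context as the third argument affects only that one call. `stream_context_set_default()` would apply the options to every later http request instead.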