
Fetching page content with PHP

WBOY
2016-06-23 14:11:45

I want to extract part of a page's content. The code is below, but the result is empty and I don't know why.

$con = file_get_contents("http://www.meishij.net/zuofa/gouqiboluoyinertang_5.html");
$preg = '#<div class="edit">(.*)</div>#';
preg_match_all($preg, $con, $arr); // store the matches in the array
print_r($arr);


I just want to extract the content between <div class="edit"> and </div>.

Replies (solutions)

$preg = '#<div class="edit">(.*?)</div>#';
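To see how the quantifier changes what the pattern captures, here is a minimal sketch run against a hypothetical HTML string (the markup in `$html` is invented for illustration and is not the real page). It also uses the `s` modifier, which the thread does not mention: without it, `.` does not match newlines, so a block that spans several lines is never matched.

```php
<?php
// Hypothetical sample markup standing in for the fetched page.
$html = "<div class=\"edit\">\n  <p>Step 1</p>\n</div>\n<div class=\"other\">footer</div>";

// Without the s modifier, "." cannot cross a newline, so this finds nothing.
$noDotAll = preg_match('#<div class="edit">(.*?)</div>#', $html, $m1);

// With s and a lazy quantifier, the capture stops at the first </div>.
$lazy = preg_match('#<div class="edit">(.*?)</div>#s', $html, $m2);

// With s and a greedy quantifier, the capture runs on to the last </div>.
$greedy = preg_match('#<div class="edit">(.*)</div>#s', $html, $m3);

var_dump($noDotAll);   // number of matches without the s modifier
echo trim($m2[1]), "\n";
echo trim($m3[1]), "\n";
```

The lazy form is only safe when the target block contains no nested `<div>`, because it stops at the first closing tag it meets.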

After the fetch, $con is empty, so of course there is nothing to match.

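This approach doesn't look right to me: the pattern starts at <div class="edit"> and ends at the last </div>, which is not necessarily the closing tag that matches the opening one.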
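One way around the tag-matching problem is to parse the markup instead of using a regular expression. The sketch below is not from the thread; it uses PHP's DOM extension on a hypothetical HTML string, and it assumes the target element's class attribute is exactly `edit`.

```php
<?php
// Hypothetical markup with a nested <div> inside the target block.
$html = '<html><body><div class="edit"><div class="inner">Soak the fungus.</div>'
      . '<p>Simmer.</p></div><div>footer</div></body></html>';

$doc = new DOMDocument();
// Real pages are rarely valid HTML, so collect parser warnings silently.
libxml_use_internal_errors(true);
$doc->loadHTML($html);
libxml_clear_errors();

$xpath = new DOMXPath($doc);
// Exact attribute match; assumes class is exactly "edit".
$nodes = $xpath->query('//div[@class="edit"]');

$inner = '';
if ($nodes !== false && $nodes->length > 0) {
    // Serialise each child node to rebuild the block's inner HTML.
    foreach ($nodes->item(0)->childNodes as $child) {
        $inner .= $doc->saveHTML($child);
    }
}
echo $inner, "\n";
```

Because the parser tracks nesting, the nested `<div class="inner">` is kept inside the result and the trailing footer block is left out.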

Indeed, $con comes back empty. I don't know why this page's content can't be fetched.

$con= file_get_contents("http://www.meishij.net/zuofa/gouqiboluoyinertang_5.html");
echo $con;


Can anyone explain why $con is empty?

$ch = curl_init();
curl_setopt($ch, CURLOPT_URL, "http://www.meishij.net/zuofa/gouqiboluoyinertang_5.html");
// Send a browser-like User-Agent header.
curl_setopt($ch, CURLOPT_USERAGENT, "Mozilla/4.0 (compatible; MSIE 5.01; Windows NT 5.0)");
// Return the response as a string instead of printing it directly.
curl_setopt($ch, CURLOPT_RETURNTRANSFER, 1);
$con = curl_exec($ch);
curl_close($ch);
echo $con;

The reply above sets a request header and gets a normal response, but why doesn't file_get_contents() work?!

It works just the same. The site checks whether the request comes from a browser, so you have to spoof the User-Agent.

$url = 'http://www.meishij.net/zuofa/gouqiboluoyinertang_5.html';
$opts = array(
    'http' => array(
        'user_agent' => "Mozilla/4.0 (compatible; MSIE 5.01; Windows NT 5.0)",
    )
);
$context = stream_context_create($opts);
$con = file_get_contents($url, false, $context);
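The same idea can be wrapped in small helpers so that a failed fetch is detected rather than silently producing an empty string. This is only a sketch: `buildBrowserContext()` and `fetchPage()` are hypothetical names, and the User-Agent string and the 10-second timeout are arbitrary assumptions.

```php
<?php
// Hypothetical helper: build a stream context with a browser-like User-Agent.
function buildBrowserContext(string $userAgent, int $timeoutSeconds = 10)
{
    return stream_context_create([
        'http' => [
            'user_agent' => $userAgent,
            'timeout'    => $timeoutSeconds, // assumed value, tune as needed
        ],
    ]);
}

// Hypothetical helper: return the body, or null when the fetch fails.
function fetchPage(string $url, $context): ?string
{
    // @ hides the warning; failure is reported through the return value.
    $body = @file_get_contents($url, false, $context);
    return $body === false ? null : $body;
}

$context = buildBrowserContext('Mozilla/5.0 (compatible; ExampleFetcher/1.0)');
// Inspect the options that will be sent, without touching the network.
$options = stream_context_get_options($context);
print_r($options);
```

A call such as `fetchPage($url, $context)` would then return `null` on failure. Note that `file_get_contents()` can only open URLs when the `allow_url_fopen` setting is enabled, which is another possible cause of an empty result.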
