Scraping with curl

WBOY
2016-06-13 10:46:31

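Scraping with curl, help needed
I am scraping $url="http://s.click.taobao.com/t_8?e=7HZ6jHSTbIWZXoQ43%2BGOC2vGvlT28S6knArdy2HkZM%2BsVw%3D%3D&p=mm_32165125_0_0";
Taobao automatically redirects $url to this address: http://item.taobao.com/item.htm?id=15711538808&ali_trackid=2:mm_32165125_0_0:1346999816_310_551572511

How can I get the $url after the redirect? Please post detailed code.
Please test with the URL I gave. Thanks.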

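------Solution--------------------
I really admire the OP's persistence!
I took a look and personally think this is hard, but given the OP's persistence it should be achievable before long.

Fetching $url directly returns the following redirect response: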

HTTP/1.1 302 Moved Temporarily
Server: Tengine
Date: Sun, 09 Sep 2012 13:16:39 GMT
Content-Type: text/html
Connection: close
Location: http://s.click.taobao.com/t_js?tu=http%3A%2F%2Fs.click.taobao.com%2Ft_8%3Fe%3D7HZ6jHSTbIWZXoQ43%252BGOC2vGvlT28S6knArdy2HkZM%252BsVw%253D%253D%26p%3Dmm_32165125_0_0%26ref%3D

Expires: Sun, 09 Sep 2012 13:16:39 GMT
Cache-Control: max-age=0
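
A minimal sketch of how a response like the one above could be captured and its Location header read with PHP's curl extension. The function names `fetch_raw_response` and `extract_location` are hypothetical helpers introduced here for illustration, not part of the original post.

```php
<?php
// Fetch $url without following redirects and return headers plus body as one string.
function fetch_raw_response(string $url): string
{
    $ch = curl_init($url);
    curl_setopt_array($ch, [
        CURLOPT_RETURNTRANSFER => true,  // return the response instead of printing it
        CURLOPT_HEADER         => true,  // include the response headers in the output
        CURLOPT_FOLLOWLOCATION => false, // stop at the first 302 so it can be inspected
        CURLOPT_TIMEOUT        => 10,
    ]);
    $response = curl_exec($ch);
    return is_string($response) ? $response : '';
}

// Return the value of the Location header from a raw response, or null if absent.
function extract_location(string $rawResponse): ?string
{
    foreach (preg_split('/\r\n|\n|\r/', $rawResponse) as $line) {
        if ($line === '') {
            break; // a blank line ends the header block
        }
        if (stripos($line, 'Location:') === 0) {
            return trim(substr($line, strlen('Location:')));
        }
    }
    return null;
}
```

Calling `extract_location(fetch_raw_response($url))` would then give the first hop of the redirect chain. With redirects disabled, `curl_getinfo($ch, CURLINFO_REDIRECT_URL)` is another way to read the same value.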

This http://s.click.taobao.com/t_js is a JS file, which contains
var inf = (top.location!=self.location); // true when the page is loaded inside a frame
followed by
if(inf){
var oi = document.createElement("iframe");
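This creates an inline frame (iframe) that loads http://js.tongji.linezing.com/1023331/tongji.js
The script encodes and decodes the URL parameters, sets a cookie, and then redirects to the address given by tu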

Because curl cannot execute JS, you need to simulate this process in PHP
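
One possible starting point for that simulation is sketched below. It recovers the tu address from the t_js URL and requests it again without following redirects. The helper names `extract_tu` and `redirect_target` are hypothetical. Sending the t_js URL as the Referer is an assumption about what the server checks. The thread does not say which cookie the script sets, so that part is left out.

```php
<?php
// Return the decoded `tu` query parameter of a t_js URL, or null if it is missing.
function extract_tu(string $tjsUrl): ?string
{
    $query = parse_url($tjsUrl, PHP_URL_QUERY);
    if (!is_string($query)) {
        return null;
    }
    parse_str($query, $params); // parse_str() percent-decodes each value once
    return isset($params['tu']) && is_string($params['tu']) ? $params['tu'] : null;
}

// Hypothetical helper: request $target with a Referer header (an assumption, see
// above) and return the URL the server redirects to, or null if it does not redirect.
function redirect_target(string $target, string $referer): ?string
{
    $ch = curl_init($target);
    curl_setopt_array($ch, [
        CURLOPT_RETURNTRANSFER => true,
        CURLOPT_FOLLOWLOCATION => false,   // inspect the redirect instead of following it
        CURLOPT_REFERER        => $referer,
        CURLOPT_TIMEOUT        => 10,
    ]);
    curl_exec($ch);
    $next = curl_getinfo($ch, CURLINFO_REDIRECT_URL);
    return is_string($next) && $next !== '' ? $next : null;
}
```

Given the Location value from the 302 response as `$tjsUrl`, `redirect_target(extract_tu($tjsUrl), $tjsUrl)` would be the call to try. Whether it returns the item.taobao.com address depends on server-side checks that are not described in this thread.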
