现在需要抓取天猫详情页里面的月销量
如https://detail.tmall.com/item...
已分析得到是异步js返回的数据,接口如下
https://mdskip.taobao.com/cor...
这个地址访问几次就需要登录,登录之后多访问几次就需要输入验证码
用代理换IP也一样
各位大神有啥好办法
ringa_lee2017-04-18 10:31:07
No login, change agent directly
Do not keep session when switching proxy
巴扎黑2017-04-18 10:31:07
To obtain the entrance, you can choose the mobile terminal entrance, for example: entrance
The data is directly on the page, it is easy to get the data, keywords"sellCount"
.
阿神2017-04-18 10:31:07
No matter how many times you visit the browser, you will not be asked to log in. Use code to adjust the browser plug-in to access htmlunit
After using Postman 50 times, I still won’t be allowed to log in or a verification code will appear. I’ll try it on my next post. Taobao’s anti-crawling is still average.
https://www.endclothing.com is really damned if it uses anti-crawling