
python - Scraping the monthly sales figure from Tmall item detail pages; the anti-scraping measures are very aggressive

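I need to scrape the monthly sales figure shown on Tmall item detail pages,
for example https://detail.tmall.com/item...

From my analysis, the figure is returned asynchronously by JavaScript from the following endpoint:
https://mdskip.taobao.com/cor...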


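After a few requests this URL starts requiring a login, and after a few more requests while logged in it asks for a CAPTCHA.
Switching IPs through proxies makes no difference.
Does anyone have a good way around this?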

天蓬老师 · 2741 days ago · 889

All replies (4)

  • ringa_lee

    ringa_lee 2017-04-18 10:31:07

    Don't log in; just switch proxies.

    Don't carry the session (cookies) over when you switch to a new proxy.
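
The advice above could be sketched roughly as follows: each proxy gets its own opener with a brand-new, empty cookie jar, so no session state follows you from one IP to the next. This is only a minimal sketch using the Python standard library. The proxy addresses in `PROXIES` and the helper names `make_opener`, `rotating_openers`, and `fetch` are assumptions made up for illustration, not anything from the original posts, and whether this is enough to avoid the login prompt is untested here.

```python
import itertools
import urllib.request
from http.cookiejar import CookieJar

# Hypothetical proxy addresses (assumption); replace with proxies you control.
PROXIES = ["http://127.0.0.1:8001", "http://127.0.0.1:8002"]


def make_opener(proxy_url):
    """Build an opener bound to one proxy, with a new, empty cookie jar."""
    jar = CookieJar()
    opener = urllib.request.build_opener(
        urllib.request.ProxyHandler({"http": proxy_url, "https": proxy_url}),
        urllib.request.HTTPCookieProcessor(jar),
    )
    return opener, jar


def rotating_openers(proxies):
    """Yield (proxy, opener, jar) forever; cookies are never shared between turns."""
    for proxy in itertools.cycle(proxies):
        opener, jar = make_opener(proxy)
        yield proxy, opener, jar


def fetch(opener, url, timeout=10):
    """Fetch one URL through the given opener and return the raw response body."""
    with opener.open(url, timeout=timeout) as resp:
        return resp.read()
```

To use it, take the next `(proxy, opener, jar)` from `rotating_openers(PROXIES)` before each request and pass the opener to `fetch`. Because a new jar is created on every turn, even a proxy that comes around again starts without the cookies it collected earlier.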

  • 巴扎黑

    巴扎黑 2017-04-18 10:31:07

    For the entry point, you can use the mobile version of the page instead, for example: entry link

    The data is embedded directly in the page, so it is easy to extract; search for the keyword "sellCount".
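
As a rough illustration of pulling the value out once the page text is in hand, here is a minimal sketch. It assumes the figure appears in the page as a JSON-style pair such as `"sellCount":1234` (optionally with the number in quotes). That format is a guess rather than something confirmed in this thread, and `extract_sell_count` is a hypothetical helper name.

```python
import re

# Assumed format: "sellCount":1234 or "sellCount": "1234" somewhere in the page text.
SELL_COUNT_RE = re.compile(r'"sellCount"\s*:\s*"?(\d+)"?')


def extract_sell_count(page_text):
    """Return the first sellCount value found as an int, or None if absent."""
    match = SELL_COUNT_RE.search(page_text)
    return int(match.group(1)) if match else None
```

If the real page uses a different layout for this field, only the regular expression needs to change; returning `None` when nothing matches makes it easy to tell a missing field from a count of zero.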

  • 阿神

    阿神 2017-04-18 10:31:07

    A real browser is never asked to log in, no matter how many times it visits. So drive a browser from code to make the requests, for example with HtmlUnit.

    I sent the request 50 times with Postman and was never asked to log in or shown a CAPTCHA. I'll try it again in my next post. Taobao's anti-scraping is only average.
    https://www.endclothing.com is the one whose anti-scraping is truly brutal.

  • 天蓬老师

    天蓬老师 2017-04-18 10:31:07

    Does anyone have experience with this?
