java - QQ space crawler is always banned

I'm crawling status posts ("shuoshuo") from QQ space (Qzone), but my account keeps getting banned. Can you suggest some solutions? How can I crawl quickly without getting the account banned? Thank you!

The code is on GitHub:
https://github.com/20100507/Q...

黄舟 · 2722 days ago · 1506

All replies (1)

  • 我想大声告诉你  2017-07-07 10:36:12

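    Common anti-crawler strategies:

    1. Inspect the request headers to judge whether the request comes from a crawler.
    2. Record request frequency, request paths, and client IP to judge whether the visitor is a crawler.
    3. Encrypt or heavily obfuscate request parameters to make crawlers harder to write (for example, Taobao's ua algorithm).
    4. Complex CAPTCHAs.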
    

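    Ways to cope with anti-crawler strategies:

    1. Switch proxy IPs as needed while crawling.
    2. Lower the request frequency to a reasonable level.
    3. Make the request headers look like a browser's, i.e. like a request from a normal user visit.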
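
Points 2 and 3 can be sketched in Java roughly as follows. This is only a minimal sketch using the JDK 11+ `java.net.http` client, not the code from the question's repository. The class name `PoliteFetcher`, the delay values, and the User-Agent string are all assumptions made for illustration; copy the real header values from your own browser.

```java
import java.io.IOException;
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.time.Duration;
import java.util.concurrent.ThreadLocalRandom;

// Hypothetical helper: sends browser-like headers and waits between requests.
final class PoliteFetcher {
    // Example desktop User-Agent (an assumption); replace with one from a real browser.
    static final String USER_AGENT =
        "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 "
        + "(KHTML, like Gecko) Chrome/120.0.0.0 Safari/537.36";

    private final HttpClient client = HttpClient.newBuilder()
        .connectTimeout(Duration.ofSeconds(10))
        .build();
    private final long minDelayMillis;
    private final long jitterMillis;

    PoliteFetcher(long minDelayMillis, long jitterMillis) {
        this.minDelayMillis = minDelayMillis;
        this.jitterMillis = jitterMillis;
    }

    // Build a GET request whose headers resemble a normal browser visit.
    HttpRequest buildRequest(String url) {
        return HttpRequest.newBuilder(URI.create(url))
            .timeout(Duration.ofSeconds(15))
            .header("User-Agent", USER_AGENT)
            .header("Accept", "text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8")
            .header("Accept-Language", "zh-CN,zh;q=0.9,en;q=0.8")
            .GET()
            .build();
    }

    // Delay in [minDelayMillis, minDelayMillis + jitterMillis], randomized so
    // the requests do not arrive at a perfectly regular interval.
    long nextDelayMillis() {
        return minDelayMillis + ThreadLocalRandom.current().nextLong(jitterMillis + 1);
    }

    // Wait, then fetch the page body as a string.
    String fetch(String url) throws IOException, InterruptedException {
        Thread.sleep(nextDelayMillis());
        HttpResponse<String> response =
            client.send(buildRequest(url), HttpResponse.BodyHandlers.ofString());
        return response.body();
    }
}
```

For example, `new PoliteFetcher(2000, 1000).fetch(url)` would wait between 2 and 3 seconds before each request. What delay is slow enough to avoid a ban is something you have to find by experiment.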
    
    

    Your problem can mostly be solved by switching IPs regularly, or by switching as soon as an IP gets blocked. You can pay for proxies from a proxy IP website such as "Zhan Da Ye" (站大爷), or get several telecom broadband accounts and redial over ADSL to obtain a new IP.
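
Switching to the next proxy once the current one is blocked could look something like the sketch below. It again assumes the JDK 11+ `java.net.http` client. `ProxyRotator` is a hypothetical class, the proxy addresses are whatever your provider gives you, and treating HTTP 403 or 429 as "blocked" is only an assumption; check what the site actually returns when it blocks you.

```java
import java.net.InetSocketAddress;
import java.net.ProxySelector;
import java.net.http.HttpClient;
import java.time.Duration;
import java.util.List;

// Hypothetical helper: cycles through a fixed list of HTTP proxies.
final class ProxyRotator {
    private final List<InetSocketAddress> proxies;
    private int index = 0;

    ProxyRotator(List<InetSocketAddress> proxies) {
        if (proxies.isEmpty()) {
            throw new IllegalArgumentException("at least one proxy is required");
        }
        this.proxies = List.copyOf(proxies);
    }

    // The proxy currently in use.
    InetSocketAddress current() {
        return proxies.get(index);
    }

    // Move to the next proxy, wrapping around at the end of the list.
    InetSocketAddress rotate() {
        index = (index + 1) % proxies.size();
        return current();
    }

    // Assumption: 403 (forbidden) or 429 (too many requests) means this IP is blocked.
    static boolean looksBlocked(int statusCode) {
        return statusCode == 403 || statusCode == 429;
    }

    // Build a client that routes all HTTP and HTTPS requests through the current proxy.
    HttpClient newClient() {
        return HttpClient.newBuilder()
            .proxy(ProxySelector.of(current()))
            .connectTimeout(Duration.ofSeconds(10))
            .build();
    }
}
```

In the crawl loop you would check `ProxyRotator.looksBlocked(response.statusCode())` after each request and, when it is true, call `rotate()` followed by `newClient()` before retrying. To switch on a schedule rather than on failure, call `rotate()` every N requests instead.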
