Home  >  Q&A  >  body text

Dear python crawler experts, take a look and see how to deal with anti-crawling on this website.

https://www.everysaving.co.uk
Crawling the data of this website through python, but the data cannot be returned! I added the header and proxy IP to crawl, but it didn't work. I hope you guys can give it a try. . .

曾经蜡笔没有小新曾经蜡笔没有小新2711 days ago727

reply all(4)I'll reply

  • 迷茫

    迷茫2017-05-18 11:03:00

    The proxy access website can be seen in the picture below:

    Through https://www.17ce.com/, I found that almost all mainland China is blocked, and the HTTP status returns 403.
    The security policy level of this website is relatively high. It is recommended to use a high-anonymity proxy VPN or server in Europe and the United States to reduce the frequency of crawling.

    reply
    0
  • 为情所困

    为情所困2017-05-18 11:03:00

    Fiddler captures packets, and you can send whatever the browser sends

    reply
    0
  • 迷茫

    迷茫2017-05-18 11:03:00

    Your address cannot be accessed directly through the browser. Is it blocked?

    reply
    0
  • 过去多啦不再A梦

    过去多啦不再A梦2017-05-18 11:03:00

    I can’t access it if I click on it directly. I tested it using a proxy in Singapore and it can be opened

    reply
    0
  • Cancelreply