Home  >  Q&A  >  body text

python - Use scrapy framework to crawl Baidu pictures and get blocked

The request address url is the address of json obtained through firefox. It can be opened with a browser, but it was banned when crawling with scrapy. Please solve it.

https://image.baidu.com/searc...

给我你的怀抱给我你的怀抱2705 days ago633

reply all(3)I'll reply

  • 黄舟

    黄舟2017-05-24 11:36:48

    Try it at settings.pyROBOTSTXT_OBEY = False.

    reply
    0
  • 某草草

    某草草2017-05-24 11:36:48

    Try without adding hearingers

    reply
    0
  • 为情所困

    为情所困2017-05-24 11:36:48

    I agree with the upstairs, if there will still be a wall. The method of scrapy+selenium+phantomjs can be used.

    reply
    0
  • Cancelreply