Home  >  Q&A  >  body text

python - Please tell me how to crawl Jianshu to post in a certain period of time.

I want to crawl Jianshu’s posts in a certain period of time, such as April 13, 2013 - May 13, 2013

The ideas I want to try are as follows:

Implementation: Use Python to directly request the search results, then redirect the obtained URL to get the real short book URL, and then request the real URL

question

  1. Are the results obtained using this method reliable? Is there a more reliable method?

  2. Should I use Google or Baidu?

黄舟黄舟2712 days ago667

reply all(1)I'll reply

  • 过去多啦不再A梦

    过去多啦不再A梦2017-05-18 10:57:49

    In order to be comprehensive, you can use all mainstream search engine interfaces, and it does not have to be limited to a certain search engine interface. This is how some of our teammates search for certain topics, because some websites do not provide a search method that meets the needs. At this time, we can only use search engines. However, the information searched through search engines may not be comprehensive. The robots agreement stipulates that if you cannot search, the search engine will not include it

    reply
    0
  • Cancelreply