search

Home  >  Q&A  >  body text

java - 知乎爬虫问题

爬虫新手求助。

知乎关注的人的列表,是一个下拉加载方式的分页。

不明白怎么获取下一页的。用post和get带上右下脚的FormData请求,都是403。
“next”是什么方法

PHPzPHPz2892 days ago564

reply all(3)I'll reply

  • 伊谢尔伦

    伊谢尔伦2017-04-18 09:30:31

    next is not a method, next is just the value of the method field in Form Data. If the request returns 403, see if you need to bring User-Agent, Referer or Cookie information

    reply
    0
  • 迷茫

    迷茫2017-04-18 09:30:31

    You can scroll down a few times to capture the packet to see. In my opinion, next is the method to get the next page, and offset is the number that has been loaded

    reply
    0
  • 高洛峰

    高洛峰2017-04-18 09:30:31

    You can refer to this Zhihu to collect crawlers
    Crawler link: http://www.shenjianshou.cn/in...

    reply
    0
  • Cancelreply