python - 关于scrapy爬虫AJAX页面

Question

问题：爬取信息页面为：知乎话题广场 当点击加载的时候，用Chrome 开发者工具，可以看到Network中，实际请求的链接为：FormData为：urlencode： 然后我的代码为： {代码...} 执行爬虫之后，返回的是： {代码...} ...

伊谢尔伦 · Answer

DEBUG: Retrying

伊谢尔伦 · Answer

Writing a crawler should be done step by step, not in one step, otherwise you won’t know what went wrong. Generally, you need to get the data you want first, and then parse and filter.
Send a request first to see if you can get the data you want. If not, the URL may be wrong or blocked

大家讲道理 · Answer

#coding=utf-8

import requests

headers = {'Content-Type':'application/x-www-form-urlencoded; charset=UTF-8'}
url = 'https://www.zhihu.com/node/TopicsPlazzaListV2'
data = 'method=next¶ms=%7B%22topic_id%22%3A833%2C%22offset%22%3A0%2C%22hash_id%22%3A%22%22%7D'

r = requests.post(url, data, headers=headers)
print r.text

PHP中文网 · Answer

The young man teaches you a big move.

python - 关于scrapy爬虫AJAX页面

reply all(4)I'll reply