python - Scrapy 爬虫的时候只能抓取到页面的一条数据，请教不知道是不是网站做了反爬虫的手段？

Question

我爬虫的目标网址是http://jobs.monster.com/search/software_5想要保存这个网站上每一条工作的标题、链接、公司和发布时间 我自己检查的时候用sites = hxs.select('//div')获取所有的div结果发现本来只能得到一...

PHPz · Answer

Resolved

The data is all in js, and the data in js is obtained directly through response.body and regular expressions. The method is not very good. Students who have the same problem can study Python-webkit.

python - Scrapy 爬虫的时候只能抓取到页面的一条数据，请教不知道是不是网站做了反爬虫的手段？

reply all(1)I'll reply