python - scrapy抓取天猫重定向（302）问题

Question

spider.py代码 {代码...} 结果： {代码...} 好像是网址转移了，需要重定向的问题，请问我要怎么改代码获得我想要的信息。

天蓬老师 · Answer

被跳转到登录页面了，天猫有防爬装置。你仔细研究下天猫detail域下的cookie，把cookie带上去访问吧。

PHPz · Answer

嗯，应该是防爬虫，你可以cookie带上试试。

阿神 · Answer

解决了吗？我也遇到了同样的问题，不知道怎么添加cookie，
看了视频，是这样添加的

#-*- coding:utf-8 -*-
import scrapy

class StackOverflowSpider(scrapy.Spider):
    name = 'stackoverflow'
    start_urls = ['http://stackoverflow.com/questions?sort=votes']
    
    def start_requests(self):
        url = "http://db.bioon.com/list.php?channelid=1016&classid=951"
        cookies = {
            'dz_username':'wst_today',
            'dz_uid':'1322052',
            'buc_key':'ofR1I78RBaCHkGp8MdBBRjMx7ustawtY',
            'buc_token':'a91b8fef55c66846d3975a9fd8883455'
        }
        return [
            scrapy.Request(url,cookies=cookies),
        ]
    
    def parse(self, response):
        ele = response.xpath(
            '//table[@class="table table-striped"]/thead/tr/th[1]/text()'
            ).extract()
        if ele:
            print "success"

但是换了天猫网站还是报错，不知道怎么写cookie变量

python - scrapy抓取天猫重定向（302）问题

membalas semua(3)saya akan balas