Scrapy how to get original start_url

Question

When Scrapy crawls, the original start_url will change due to redirection or other reasons. How can I get the original start_url? {Code...}

为情所困 · Answer

Reference article: Summary of common problems with Scrapy crawlers

Use the meta parameter in Request to transfer information

def start_requests(self):
    start_url = 'your_scrapy_start_url'
    yield Request(start_url, self.parse, meta={'start_url':start_url})
    
def parse(self, response):
    item = YourItem()
    item['start_url'] = response.meta['start_url']
    yield item

Scrapy how to get original start_url

reply all(1)I'll reply