Web crawler - The proxy IP address of a Python requests.get crawler does not change

Question

My work requires crawling information from Amazon, but Amazon's anti-crawler protection is strong, and requests coming from the same IP address get blocked. Python version: 3.6, IDE: PyCharm 2017.1. I have looked through a lot of material online and read the requests library documentation, but they all describe the same method. The code is as follows: {code...}

阿神 · Answer

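requests uses the http entry of proxies when you access an http URL, and the https entry when you access an https URL.
So your proxy needs to contain both the http and the https settings for it to take effect.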

proxy = {
    'http': 'http://117.85.105.170:808',
    'https': 'https://117.85.105.170:808'
}
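To make the point above concrete, here is a minimal sketch of why a proxies dict with only an http entry leaves https requests unproxied. The helper names proxy_for and fetch are hypothetical and only for illustration; proxy_for is a simplified stand-in for the lookup by URL scheme, not the actual code inside requests. The proxy address is the one from the answer and may no longer be reachable.

```python
from urllib.parse import urlparse

# Proxy address copied from the answer above; it may no longer work.
PROXIES = {
    "http": "http://117.85.105.170:808",
    "https": "https://117.85.105.170:808",
}


def proxy_for(url, proxies):
    """Simplified illustration: pick the entry whose key matches the URL scheme."""
    scheme = urlparse(url).scheme
    # None means no entry matched, so the request would not use this proxy.
    return proxies.get(scheme)


def fetch(url, proxies=PROXIES, timeout=10):
    """Send a GET request with the given proxies mapping (hypothetical helper)."""
    import requests  # third-party; imported here so proxy_for needs only the stdlib

    return requests.get(url, proxies=proxies, timeout=timeout)


if __name__ == "__main__":
    only_http = {"http": PROXIES["http"]}
    # An https URL finds no matching entry in a dict that only has "http".
    print(proxy_for("https://www.amazon.com/", only_http))
    # With both entries present, the https entry is selected.
    print(proxy_for("https://www.amazon.com/", PROXIES))
```

If the IP address seen by the site still does not change after adding both entries, it is worth checking whether the proxy itself is alive, since free proxies often stop responding.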
