How to Set Custom User Agents in Python urllib2 for Web Scraping?

Patricia Arquette
2024-11-01 08:54:02

Setting User Agents in urllib2

When you scrape with urllib2.urlopen, requests go out with the library's default user agent, a string of the form Python-urllib/x.y (for example, Python-urllib/2.7). That is often undesirable, because websites can block or restrict requests based on the user agent. To send a different one, use a custom urllib2 opener.

In Python 2.x, urllib2.build_opener creates a new opener whose addheaders list holds the headers sent with every request, including the user agent. The following snippet sets the user agent to Mozilla/5.0:

import urllib2

opener = urllib2.build_opener()
# Replace the default header list with a custom User-Agent
opener.addheaders = [('User-Agent', 'Mozilla/5.0')]
response = opener.open('http://www.stackoverflow.com')

In Python 3, the urllib2 module no longer exists; the equivalent of urllib2.build_opener is urllib.request.build_opener. The same code structure sets a custom user agent in Python 3 as well.
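
As a rough illustration of the Python 3 variant, the sketch below builds an opener with a custom user agent and, as an alternative, attaches the header to a single Request object. The helper names make_opener and make_request and the constant CUSTOM_UA are hypothetical, introduced only for this example; the URL and the Mozilla/5.0 string are reused from the Python 2 snippet. The network call itself is left commented out so the block runs offline.

```python
import urllib.request

# Example value, the same string used in the Python 2 snippet.
CUSTOM_UA = "Mozilla/5.0"


def make_opener(user_agent=CUSTOM_UA):
    """Return an opener that sends user_agent with every request (hypothetical helper)."""
    opener = urllib.request.build_opener()
    # Overwrite the default header list, which holds only 'Python-urllib/x.y'.
    opener.addheaders = [("User-Agent", user_agent)]
    return opener


def make_request(url, user_agent=CUSTOM_UA):
    """Alternative: set the header on one Request object (hypothetical helper)."""
    return urllib.request.Request(url, headers={"User-Agent": user_agent})


# Headers a freshly built opener would send if left unchanged.
default_headers = urllib.request.build_opener().addheaders

opener = make_opener()
request = make_request("http://www.stackoverflow.com")

# To actually fetch the page (requires network access):
# with opener.open("http://www.stackoverflow.com") as response:
#     html = response.read()
```

Setting addheaders on the opener affects every request made through it, while passing headers to Request affects only that one request; which fits better depends on whether the scraper reuses a single opener.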
