
What software do you use to write crawlers in Python?

(*-*)浩 · Original · 2019-07-05 10:04:26

If you are learning to program, crawlers are a topic you cannot ignore. So what tools should you have ready before you start learning to write crawlers in Python?

Code editor and runtime environment

Personally, I recommend PyCharm Community Edition. The IDLE editor that ships with Python also works, and so does Notepad, as long as you are comfortable with it.

F12 Developer Tools

View the page source to locate elements quickly. For XPath analysis, a Google Chrome-based browser is recommended: you can right-click directly in the source view to see an element's XPath.

Packet Capture Tool

HttpFox, a plug-in for the Firefox browser, is recommended. It is better than the built-in F12 tools in both Chrome and Firefox, and it makes it easy to inspect the packets a website sends and receives.
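Once a request has been captured, the usual next step is to reproduce it from Python. The following is only a minimal sketch using the standard library: the URL and header values are hypothetical stand-ins for whatever the capture tool shows, and the request is built but deliberately not sent.

```python
from urllib.request import Request

# Hypothetical values, standing in for what a packet capture tool displays.
CAPTURED_URL = "https://example.com/list?page=2"
CAPTURED_HEADERS = {
    "User-Agent": "Mozilla/5.0 (X11; Linux x86_64)",
    "Referer": "https://example.com/list?page=1",
    "Accept": "text/html",
}


def build_request(url, headers):
    """Build (but do not send) a GET request that mirrors the captured one."""
    return Request(url, headers=headers, method="GET")


req = build_request(CAPTURED_URL, CAPTURED_HEADERS)
print(req.get_method(), req.full_url)
for name, value in req.header_items():
    # urllib normalizes header names, e.g. "User-Agent" becomes "User-agent".
    print(f"{name}: {value}")

# To actually send it, pass `req` to urllib.request.urlopen(req).
```

Copying the captured headers one by one makes it easier to find out which of them the site actually checks.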

The generated XPath is an absolute path. With dynamically generated icons (commonly the pagination buttons on list pages), an absolute path that keeps shifting is likely to cause errors, so in real analysis treat it only as a reference. Also remember to remove the "x:" in the XPath box. This appears to be syntax from an early version of XPath and is currently incompatible with some modules (such as Scrapy), so it is best deleted to avoid errors.
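Both points can be illustrated with a small sketch. It assumes a made-up page and a hypothetical helper, `strip_x_prefix`, and it uses the standard library's `xml.etree.ElementTree`, which supports only a subset of XPath and parses well-formed markup only. In a real crawler, lxml or Scrapy selectors would be the more likely choice.

```python
import re
import xml.etree.ElementTree as ET


def strip_x_prefix(xpath):
    """Remove the "x:" prefix from every step of an XPath expression."""
    return re.sub(r"(^|/)x:", r"\1", xpath)


print(strip_x_prefix("/x:html/x:body/x:div[2]/x:a"))  # /html/body/div[2]/a

# Two made-up versions of the same page; the second has an extra block on top.
PAGE_V1 = """<html><body>
  <div><a class="item" href="/a">A</a></div>
  <div><a class="next" href="/list?page=2">Next</a></div>
</body></html>"""

PAGE_V2 = """<html><body>
  <div><a class="ad" href="/ad">Ad</a></div>
  <div><a class="item" href="/a">A</a></div>
  <div><a class="next" href="/list?page=2">Next</a></div>
</body></html>"""

for page in (PAGE_V1, PAGE_V2):
    root = ET.fromstring(page)  # root is the <html> element
    # Positional path (relative to <html>): depends on the exact layout.
    by_position = root.find("body/div[2]/a").get("href")
    # Attribute-based path: survives elements being added before the target.
    by_attribute = root.find(".//a[@class='next']").get("href")
    print(by_position, by_attribute)
```

On the second page the positional path silently picks a different link, while the attribute-based path still finds the "next" button. That is why a generated absolute path is best treated as a starting point only.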

Regular Expression Test Tool

Use an online regular expression tester to practice, and to help with analysis. Many ready-made regular expressions are available that you can use directly or adapt as a reference.
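A pattern worked out in an online tester can be carried over to Python's `re` module largely unchanged. The sketch below is illustrative only: the HTML snippet and the pattern are made up for this example, and a regular expression like this suits simple, predictable markup, not HTML parsing in general.

```python
import re

# A made-up HTML fragment, standing in for a downloaded page.
HTML = """
<a href="/post/1">First</a>
<a href='/post/2'>Second</a>
<a class="ext" href="https://example.com/">External</a>
"""

# Capture the href value of each <a> tag, with single or double quotes.
HREF_RE = re.compile(
    r"""<a\b[^>]*?\bhref\s*=\s*["']([^"']+)["']""",
    re.IGNORECASE,
)

links = HREF_RE.findall(HTML)
print(links)  # ['/post/1', '/post/2', 'https://example.com/']
```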
