
Web crawler: how to use requests with Python's multiprocessing

This is the single-process version that downloads the images sequentially:

import requests, time, os, random

def img_down(url):
    # Prefix the file name with a random number to avoid name collisions
    with open(str(random.random()) + os.path.basename(url), "wb") as fob:
        fob.write(requests.get(url).content)

urllist = []
with open("urllist.txt", "r+") as u:
    for line in u.readlines():
        urllist.append(line.strip())

s = time.clock()  # note: time.clock() is deprecated since Python 3.3 and removed in 3.8
for url in urllist:
    img_down(url)
e = time.clock()

print("time: %d" % (e - s))

This is the multi-process version using multiprocessing.Pool:

from multiprocessing import Pool
import requests, os, time, random

def img_down(url):
    # Prefix the file name with a random number to avoid name collisions
    with open(str(random.random()) + os.path.basename(url), "wb") as fob:
        fob.write(requests.get(url).content)

if __name__ == "__main__":
    urllist = []
    with open("urllist.txt", "r+") as urlfob:
        for line in urlfob.readlines():
            urllist.append(line.strip())

    s = time.clock()
    p = Pool()                  # defaults to os.cpu_count() worker processes
    for url in urllist:
        p.apply_async(img_down, args=(url,))
    p.close()
    p.join()                    # wait for all downloads to finish
    e = time.clock()

    print("time: {}".format(e - s))

But there is almost no difference between the time taken by the single-process and the multi-process version. I suspect the problem is that requests blocks on IO. Is my understanding correct? How should the code be modified so that multiple processes actually speed things up?
Thanks!
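For reference, since requests blocks on network IO, a pool of threads (or an explicit number of worker processes) is a common way to overlap the waits, and time.perf_counter() gives wall-clock timing. Below is a minimal, hypothetical sketch along those lines; the worker count and timeout are illustrative and not part of the original post:

from concurrent.futures import ThreadPoolExecutor
import requests, os, time, random

def img_down(url):
    # Same download logic as above; the random prefix avoids file-name collisions
    with open(str(random.random()) + os.path.basename(url), "wb") as fob:
        fob.write(requests.get(url, timeout=30).content)

if __name__ == "__main__":
    with open("urllist.txt") as f:
        urllist = [line.strip() for line in f if line.strip()]

    start = time.perf_counter()              # wall-clock time, unlike time.clock()
    # 16 threads is an arbitrary choice for an IO-bound workload
    with ThreadPoolExecutor(max_workers=16) as pool:
        list(pool.map(img_down, urllist))    # consume results so worker exceptions surface
    print("time: {:.2f}s".format(time.perf_counter() - start))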

Asked by 阿神, 2716 days ago

2 replies

phpcn_u1582 · 2017-06-22 11:54:30

The bottleneck when writing files is disk IO, not CPU, so parallelism does not help much. You can try skipping the file write and then comparing the times.

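A minimal sketch of the comparison this answer suggests, assuming the same urllist.txt as in the question; it times the network fetches and the file writes separately so the disk-IO share is visible (variable names are illustrative):

import requests, os, time, random

with open("urllist.txt") as f:
    urllist = [line.strip() for line in f if line.strip()]

# Fetch only: network time
start = time.perf_counter()
payloads = [requests.get(u, timeout=30).content for u in urllist]
fetch_time = time.perf_counter() - start

# Write only: disk time for the same payloads
start = time.perf_counter()
for u, data in zip(urllist, payloads):
    with open(str(random.random()) + os.path.basename(u), "wb") as fob:
        fob.write(data)
write_time = time.perf_counter() - start

print("fetch: {:.2f}s  write: {:.2f}s".format(fetch_time, write_time))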
怪我咯 · 2017-06-22 11:54:30

Pool() without arguments uses os.cpu_count() or 1 as the number of worker processes. If the CPU is single-core, or the core count cannot be determined, there is only one process.

That should be the reason.

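To check what this answer describes, one can print os.cpu_count() and pass an explicit worker count to Pool; a minimal sketch with a placeholder task (the count of 4 is arbitrary):

from multiprocessing import Pool
import os

def work(x):
    return x * x   # placeholder task instead of the download

if __name__ == "__main__":
    print("os.cpu_count() =", os.cpu_count())   # what Pool() defaults to
    with Pool(processes=4) as p:                # explicit number of worker processes
        print(p.map(work, range(8)))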