Home > Article > Backend Development > Detailed explanation of the neglected performance optimization tool in PHP: generator
If you are working in Python or other languages, you should be familiar with generators. But many PHP developers may not know the generator function. It may be because the generator is a function introduced in PHP 5.5.0, or it may be that the function of the generator is not very obvious. However, the generator function is really useful.
I guess you will still be confused after listening to the concept directly, so let’s talk about the advantages first, maybe it can arouse your interest. So what are the advantages of the generator, as follows:
So, how are these magical functions achieved? Let's give an example first.
First of all, let’s put down the burden of the generator concept and look at a simple PHP function:
function createRange($number){ $data = []; for($i=0;$i<$number;$i++){ $data[] = time(); } return $data; }复制代码
This is a very common PHP function that we are dealing with It is often used when working with some arrays. The code here is also very simple:
for
loop. We loop the current time into $data
for
loop After execution is completed, $data
is returned. It’s not over yet, let’s continue. Let’s write another function and print out the return value of this function in a loop:
$result = createRange(10); // 这里调用上面我们创建的函数 foreach($result as $value){ sleep(1);//这里停顿1秒,我们后续有用 echo $value.'<br />'; }复制代码
Let’s take a look at the running results in the browser:
![](https://picb.zhimg .com/80/v2-3620f3d46bb5b49e53ca47891ea1fd5a_720w.jpg)
It’s perfect here, no issues whatsoever. (Of course sleep(1)
you can’t see the effect)
We noticed that when calling the function createRange
The passed value of $number
is 10, a very small number. Suppose, now pass a value 10000000
(10 million).
Then, in function createRange
, the for
loop needs to be executed 1000
times. And 1000
million values are placed in $data
, and the $data
array is placed in memory. Therefore, a lot of memory will be occupied when calling functions.
Here, the generator can show its talents.
We modify the code directly, please pay attention:
function createRange($number){ for($i=0;$i<$number;$i++){ yield time(); } }复制代码
Look at this code that is very similar to just now, we deleted the array$ data
, and nothing is returned, but a keyword is used before time()
yield
Let’s run the second piece of code again:
$result = createRange(10); // 这里调用上面我们创建的函数 foreach($result as $value){ sleep(1); echo $value.'<br />'; }复制代码
![](https://pic2.zhimg.com/80/v2-9c08f2d01f7a68736f45ffa3d3934cc5_720w.jpg)
We miraculously Found that the output value is different from the first time without using the generator. The values (timestamps) here are separated by 1 second.
The one second interval here is actually the consequence of sleep(1)
. But why is there no gap the first time? That's because:
for
loop result in the createRange
function is quickly placed in $data
and return immediately. Therefore, foreach
loops through a fixed array. createRange
is not generated quickly at once, but relies on the foreach
loop. foreach
loops once and for
executes once. At this point, you should have some idea about the generator.
Let’s analyze the code just now.
function createRange($number){ for($i=0;$i<$number;$i++){ yield time(); } } $result = createRange(10); // 这里调用上面我们创建的函数 foreach($result as $value){ sleep(1); echo $value.'<br />'; }复制代码
Let’s restore the code execution process.
createRange
function, passing in the parameter 10
, but the for
value is executed once and then stops, and tells foreach
Values that can be used in the first loop. foreach
Start the loop on $result
, come in first sleep(1)
, and then start using for
A value performs output. foreach
prepares for the second loop. Before starting the second loop, it requests the for
loop again. for
The loop is executed again, and the generated timestamp is told to foreach
.foreach
to get the Two values and output. Since sleep(1)
in foreach
, the for
loop is delayed by 1 second to generate the current time So, the entire code During execution, there is always only one record value participating in the loop, and there is only one piece of information in the memory.
No matter how big the $number
is initially passed in, since not all result sets are generated immediately, the memory is always a loop of values.
At this point, you should have a rough understanding of what a generator is. Let’s talk about the generator principle below.
首先明确一个概念:生成器yield关键字不是返回值,他的专业术语叫产出值,只是生成一个值
那么代码中foreach
循环的是什么?其实是PHP在使用生成器的时候,会返回一个Generator
类的对象。foreach
可以对该对象进行迭代,每一次迭代,PHP会通过Generator
实例计算出下一次需要迭代的值。这样foreach
就知道下一次需要迭代的值了。
而且,在运行中for
循环执行后,会立即停止。等待foreach
下次循环时候再次和for
索要下次的值的时候,for
循环才会再执行一次,然后立即再次停止。直到不满足条件不执行结束。
很多PHP开发者不了解生成器,其实主要是不了解应用领域。那么,生成器在实际开发中有哪些应用?
PHP开发很多时候都要读取大文件,比如csv文件、text文件,或者一些日志文件。这些文件如果很大,比如5个G。这时,直接一次性把所有的内容读取到内存中计算不太现实。
这里生成器就可以派上用场啦。简单看个例子:读取text文件
![](https://pic1.zhimg.com/80/v2-2d6fb1058b39008fbe4e5363e485e9a2_720w.jpg)
我们创建一个text文本文档,并在其中输入几行文字,示范读取。
<?php header("content-type:text/html;charset=utf-8"); function readTxt() { # code... $handle = fopen("./test.txt", 'rb'); while (feof($handle)===false) { # code... yield fgets($handle); } fclose($handle); } foreach (readTxt() as $key => $value) { # code... echo $value.'<br />'; }复制代码
![](https://pic4.zhimg.com/80/v2-2bbdc40ebf8f1aeedf2aa0c472811c77_720w.jpg)
通过上图的输出结果我们可以看出代码完全正常。
但是,背后的代码执行规则却一点儿也不一样。使用生成器读取文件,第一次读取了第一行,第二次读取了第二行,以此类推,每次被加载到内存中的文字只有一行,大大的减小了内存的使用。
这样,即使读取上G的文本也不用担心,完全可以像读取很小文件一样编写代码。
想了解更多编程学习,敬请关注php培训栏目!
The above is the detailed content of Detailed explanation of the neglected performance optimization tool in PHP: generator. For more information, please follow other related articles on the PHP Chinese website!