Home  >  Article  >  Backend Development  >  Detailed explanation of the neglected performance optimization tool in PHP: generator

Detailed explanation of the neglected performance optimization tool in PHP: generator

coldplay.xixi
coldplay.xixiforward
2020-09-30 16:48:374304browse

Detailed explanation of the neglected performance optimization tool in PHP: generator

If you are working in Python or other languages, you should be familiar with generators. But many PHP developers may not know the generator function. It may be because the generator is a function introduced in PHP 5.5.0, or it may be that the function of the generator is not very obvious. However, the generator function is really useful.

Advantages

I guess you will still be confused after listening to the concept directly, so let’s talk about the advantages first, maybe it can arouse your interest. So what are the advantages of the generator, as follows:

  • The generator will have a great impact on the performance of the PHP application
  • Save a lot of memory when the PHP code is running
  • More suitable for calculating large amounts of data

So, how are these magical functions achieved? Let's give an example first.

Introduction to the concept

First of all, let’s put down the burden of the generator concept and look at a simple PHP function:

function createRange($number){
    $data = [];
    for($i=0;$i<$number;$i++){
        $data[] = time();
    }
    return $data;
}复制代码

This is a very common PHP function that we are dealing with It is often used when working with some arrays. The code here is also very simple:

  1. We create a function. The
  2. function contains a for loop. We loop the current time into $data
  3. for loop After execution is completed, $data is returned.

It’s not over yet, let’s continue. Let’s write another function and print out the return value of this function in a loop:

$result = createRange(10); // 这里调用上面我们创建的函数
foreach($result as $value){
    sleep(1);//这里停顿1秒,我们后续有用
    echo $value.&#39;<br />&#39;;
}复制代码

Let’s take a look at the running results in the browser:

![](https://picb.zhimg .com/80/v2-3620f3d46bb5b49e53ca47891ea1fd5a_720w.jpg)

It’s perfect here, no issues whatsoever. (Of course sleep(1)you can’t see the effect)

Think about a question

We noticed that when calling the function createRange The passed value of $number is 10, a very small number. Suppose, now pass a value 10000000 (10 million).

Then, in function createRange, the for loop needs to be executed 1000 times. And 1000 million values ​​are placed in $data, and the $data array is placed in memory. Therefore, a lot of memory will be occupied when calling functions.

Here, the generator can show its talents.

Create generator

We modify the code directly, please pay attention:

function createRange($number){
    for($i=0;$i<$number;$i++){
        yield time();
    }
}复制代码

Look at this code that is very similar to just now, we deleted the array$ data, and nothing is returned, but a keyword is used before time()yield

Use generator

Let’s run the second piece of code again:

$result = createRange(10); // 这里调用上面我们创建的函数
foreach($result as $value){
    sleep(1);
    echo $value.&#39;<br />&#39;;
}复制代码

![](https://pic2.zhimg.com/80/v2-9c08f2d01f7a68736f45ffa3d3934cc5_720w.jpg)

We miraculously Found that the output value is different from the first time without using the generator. The values ​​(timestamps) here are separated by 1 second.

The one second interval here is actually the consequence of sleep(1). But why is there no gap the first time? That's because:

  • When the generator is not used: The for loop result in the createRange function is quickly placed in $data and return immediately. Therefore, foreach loops through a fixed array.
  • When using a generator: The value of createRange is not generated quickly at once, but relies on the foreach loop. foreach loops once and for executes once.

At this point, you should have some idea about the generator.

In-depth understanding of the generator

Code analysis

Let’s analyze the code just now.

function createRange($number){
    for($i=0;$i<$number;$i++){
        yield time();
    }
}

$result = createRange(10); // 这里调用上面我们创建的函数
foreach($result as $value){
    sleep(1);
    echo $value.&#39;<br />&#39;;
}复制代码

Let’s restore the code execution process.

  1. First call the createRange function, passing in the parameter 10, but the for value is executed once and then stops, and tells foreachValues ​​that can be used in the first loop.
  2. foreachStart the loop on $result, come in first sleep(1), and then start using for A value performs output.
  3. foreach prepares for the second loop. Before starting the second loop, it requests the for loop again.
  4. forThe loop is executed again, and the generated timestamp is told to foreach.
  5. foreach to get the Two values ​​and output. Since sleep(1) in foreach, the for loop is delayed by 1 second to generate the current time

So, the entire code During execution, there is always only one record value participating in the loop, and there is only one piece of information in the memory.

No matter how big the $number is initially passed in, since not all result sets are generated immediately, the memory is always a loop of values.

Conceptual understanding

At this point, you should have a rough understanding of what a generator is. Let’s talk about the generator principle below.

首先明确一个概念:生成器yield关键字不是返回值,他的专业术语叫产出值,只是生成一个值

那么代码中foreach循环的是什么?其实是PHP在使用生成器的时候,会返回一个Generator类的对象。foreach可以对该对象进行迭代,每一次迭代,PHP会通过Generator实例计算出下一次需要迭代的值。这样foreach就知道下一次需要迭代的值了。

而且,在运行中for循环执行后,会立即停止。等待foreach下次循环时候再次和for索要下次的值的时候,for循环才会再执行一次,然后立即再次停止。直到不满足条件不执行结束。

实际开发应用

很多PHP开发者不了解生成器,其实主要是不了解应用领域。那么,生成器在实际开发中有哪些应用?

读取超大文件

PHP开发很多时候都要读取大文件,比如csv文件、text文件,或者一些日志文件。这些文件如果很大,比如5个G。这时,直接一次性把所有的内容读取到内存中计算不太现实。

这里生成器就可以派上用场啦。简单看个例子:读取text文件

![](https://pic1.zhimg.com/80/v2-2d6fb1058b39008fbe4e5363e485e9a2_720w.jpg)

我们创建一个text文本文档,并在其中输入几行文字,示范读取。

<?php
header("content-type:text/html;charset=utf-8");
function readTxt()
{
    # code...
    $handle = fopen("./test.txt", &#39;rb&#39;);

    while (feof($handle)===false) {
        # code...
        yield fgets($handle);
    }

    fclose($handle);
}

foreach (readTxt() as $key => $value) {
    # code...
    echo $value.&#39;<br />&#39;;
}复制代码

![](https://pic4.zhimg.com/80/v2-2bbdc40ebf8f1aeedf2aa0c472811c77_720w.jpg)

通过上图的输出结果我们可以看出代码完全正常。

但是,背后的代码执行规则却一点儿也不一样。使用生成器读取文件,第一次读取了第一行,第二次读取了第二行,以此类推,每次被加载到内存中的文字只有一行,大大的减小了内存的使用。

这样,即使读取上G的文本也不用担心,完全可以像读取很小文件一样编写代码。

想了解更多编程学习,敬请关注php培训栏目!

The above is the detailed content of Detailed explanation of the neglected performance optimization tool in PHP: generator. For more information, please follow other related articles on the PHP Chinese website!

Statement:
This article is reproduced at:juejin.im. If there is any infringement, please contact admin@php.cn delete