Home >Backend Development >PHP Tutorial >In-depth understanding of PHP arrays (traversal order) Laruence Original_PHP Tutorial

In-depth understanding of PHP arrays (traversal order) Laruence Original_PHP Tutorial

WBOY
WBOYOriginal
2016-07-21 15:18:12911browse

People often ask me, if a PHP array is accessed using foreach, is the order of traversal fixed? In what order should it be traversed?
For example:

Copy code The code is as follows:

$arr['laruence'] = 'huixinchen';
$arr['yahoo'] = 2007;
$arr['baidu'] = 2008;
foreach ($arr as $key => $val) {
//What is the result?
}

Another example:
Copy code The code is as follows:

$arr[2] = ' huixinchen';
$arr[1] = 2007;
$arr[0] = 2008;
foreach ($arr as $key => $val) {
//Now the result is again What is it?
}

To fully understand this issue, I think you should first understand the internal implementation structure of PHP arrays...

PHP arrays
In PHP, arrays are implemented using a HASH structure (HashTable). PHP uses some mechanisms to enable the addition and deletion of arrays in O(1) time complexity, and supports both linear traversal and random access. .

We have also discussed PHP’s HASH algorithm in previous articles. Based on this, we will make a further extension.

Before getting to know HashTable, let us first take a look at the structure definition of HashTable. I Comments are added for everyone to understand:
Copy code The code is as follows:

typedef struct _hashtable {
uint nTableSize; / * Hash table size, range of Hash values*/
uint nTableMask; /* Equal to nTableSize -1, used for quick positioning*/
uint nNumOfElements; /* The actual number of elements in the HashTable*/
ulong nNextFreeElement; /* Numeric index of the next free available position*/
Bucket *pInternalPointer; /* Internal position pointer, which will be reset, current and these traversal functions use */
Bucket *pListHead; /* Head element , used for linear traversal*/
Bucket *pListTail; /* tail element, used for linear traversal*/
Bucket **arBuckets; /* actual storage container*/
dtor_func_t pDestructor;/* element Destructor (pointer) */
zend_bool persistent;
unsigned char nApplyCount; /* Loop traversal protection*/
zend_bool bApplyProtection;
#if ZEND_DEBUG
int inconsistent;
#endif
} HashTable;

Regarding the meaning of nApplyCount, we can understand it through an example:
Copy code The code is as follows:

$arr = array(1,2,3,4,5,);
$arr[] = &$arr;

var_export($arr); //Fatal error: Nesting level too deep - recursive dependency?

This field is set up to prevent infinite loops caused by circular references.

Looking at the above structure, we can see that for HashTable, the key element is arBuckets, which is the actual storage container. Let us take a look at its structure definition:
Copy the code The code is as follows:

typedef struct bucket {
ulong h; /* Numeric index/hash value*/
uint nKeyLength; /* Character index Length*/
void *pData; /* data*/
void *pDataPtr; /* data pointer*/
struct bucket *pListNext; /* next element, for linear traversal*/
struct bucket *pListLast; /* Previous element, used for linear traversal*/
struct bucket *pNext; /* Next element in the same zipper*/
struct bucket *pLast; /* In The previous element in the same zipper*/
char arKey[1]; /* Tips to save memory and facilitate initialization*/
} Bucket;


We noticed that the last element is a flexible array technique, which can save memory and facilitate initialization. Friends who are interested can google flexible array.
h is the Hash value of the element , for numerically indexed elements, h is the direct index value (numeric index represented by nKeyLength=0). For string indexes, the index value is stored in arKey, and the length of the index is stored in nKeyLength.
In the Bucket, the actual data is stored in the memory block pointed to by the pData pointer. Usually this memory block is allocated separately by the system. But there is an exception, that is, when the data saved by the Bucket is a pointer, the HashTable will not request the system to allocate space to save the pointer, but directly save the pointer to pDataPtr, and then point pData to the member of this structure. address. This improves efficiency and reduces memory fragmentation. From this we can see the subtleties of PHP HashTable design. If the data in the Bucket is not a pointer, pDataPtr is NULL (this paragraph comes from Altair's "Zend HashTable Detailed Explanation")
Combined with the above HashTable structure, let's illustrate the overall structure diagram of the HashTable:

The pListhHead of HashTable points to the first element in the linear list form. In the above picture, it is element 1. pListTail points to the last element 0. For each element, pListNext is the next element of the linear structure drawn by the red line. And pListLast is the previous element.

pInternalPointer points to the position of the current internal pointer. When sequentially traversing the array, this pointer indicates the current element.

When linear (sequential) ) when traversing, it will start from pListHead, follow pListNext/pListLast in Bucket, and move pInternalPointer according to the movement of pInternalPointer to achieve a linear traversal of all elements.

For example, for foreach, if we look at the generated opcode sequence, we can find that before foreach, there will first be a FE_RESET to reset the internal pointer of the array, which is pInternalPointer (for foreach, please refer to the in-depth understanding of PHP principles: foreach), and then increment pInternalPointer through each FE_FETCH, thus Implement sequential traversal.

Similarly, when we use the each/next series of functions to traverse, we also implement sequential traversal by moving the internal pointer of the array. There is a problem here, such as:
Copy code The code is as follows:

$arr = array(1,2,3,4,5 );
foreach ($arr as $v) {
//can get
}

while (list($key, $v) = each($arr)) {
//Cannot obtain
}
?>

Understand the knowledge I just introduced, then this problem will be very clear, because foreach will automatically reset, and The while block will not be reset, so after foreach ends, pInternalPointer points to the end of the array, and of course the while statement block cannot access it. The solution is to reset the internal pointer of the array before each.

And in During random access, the head pointer position in the hash array will be determined by the hash value, and then the characteristic element will be found through pNext/pLast.

When adding elements, the elements will be inserted into the same Hash element chain The head and the tail of the linear list. In other words, the elements are traversed according to the order of insertion during linear traversal. This special design makes it so that in PHP, when using numerical indexes, the order of elements is determined by adding is determined by the order, not the index order.

In other words, the order in which arrays are traversed in PHP is related to the order in which elements are added. So, now we clearly know the problem at the beginning of the article The output of >
So, if you want to iterate by index size in a numerically indexed array, then you should use for, not foreach
Copy code
The code is as follows:


for($i=0,$l=count($arr); $i<$l; $i++) {
//At this time, it cannot be considered a sequence Traversal (linear traversal)
}


Original text: http://www.laruence.com/2009/08/23/1065.html
http://www.bkjia.com/PHPjc/325561.html
www.bkjia.com

true

http: //www.bkjia.com/PHPjc/325561.html

TechArticle
People often ask me, if a PHP array is accessed using foreach, is the order of traversal fixed? In what order should it be traversed? For example: Copy the code as follows: ?php $arr['laruence'] =...

Statement:
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn