


How to get the length of mixed Chinese and English strings in PHP_PHP tutorial
When I was writing the form validation class of the framework tonight, I needed to determine whether the length of a certain string was within a specified range. Naturally, I thought of the strlen function in PHP.
$str = 'Hello world!';
echo strlen($str) ; // Output 12
However, in the functions that come with PHP, strlen and mb_strlen both calculate the length by calculating the number of bytes occupied by the string. Under different encoding conditions, the length occupied by Chinese The number of bytes is different. Under GBK/GB2312, Chinese characters occupy 2 bytes, while under UTF-8, Chinese characters occupy 3 bytes.
$str = 'Hello, world! ';
echo strlen($str); // Output 12 under GBK or GB2312, and 18 under UTF-8
When we judge the length of a string, we often need to judge the number of characters. Rather than the number of bytes occupied by the string, such as this PHP code under UTF-8:
$name = 'Zhang Gengchang';
$len = strlen($name);
// Output FALSE, because three Chinese characters occupy 9 bytes under UTF-8
if ($len >= 3 && $len echo 'TRUE';
}else{
echo 'FALSE';
}
Then there is What convenient and practical method can be used to obtain the length of a string containing Chinese characters? You can use regular rules to calculate the number of Chinese characters, divide by 2 under GBK/GB2312 encoding, and divide by 3 under UTF-8 encoding, and finally add the length of the non-Chinese string, but this is too troublesome.
WordPress such a piece of code, refer to the following:
$str = 'Hello, world ! ';
preg_match_all('/./us', $str, $match);
echo count($match[0]); // Output 9
The idea is to use regular expressions Split the string into individual characters and directly use count to calculate the number of matching characters, which is the result we want.
But the above code cannot handle GBK/GB2312 Chinese strings under UTF-8 encoding, because the Chinese characters of GBK/GB2312 will be recognized as two characters and the calculated number of Chinese characters will double, so I I thought of such a way:
$tmp = @iconv('gbk', 'utf -8', $str);
if(!empty($tmp)){
$str = $tmp;
}
preg_match_all('/./us', $str, $ match);
echo count($match[0]);
is compatible with GBK/GB2312 and UTF-8 encoding. It passed the test with a small amount of data, but it is not yet confirmed whether it is completely correct. Hope it will be The expert gives some advice.
The above intention is to make the framework compatible with multiple encoding formats, but generally in daily development, a project can already determine which encoding it is, so you can use the following function to easily obtain the string length:
int iconv_strlen ( string $str [, string $charset = ini_get("iconv.internal_encoding") ] )

PHPsessionstrackuserdataacrossmultiplepagerequestsusingauniqueIDstoredinacookie.Here'showtomanagethemeffectively:1)Startasessionwithsession_start()andstoredatain$_SESSION.2)RegeneratethesessionIDafterloginwithsession_regenerate_id(true)topreventsessi

In PHP, iterating through session data can be achieved through the following steps: 1. Start the session using session_start(). 2. Iterate through foreach loop through all key-value pairs in the $_SESSION array. 3. When processing complex data structures, use is_array() or is_object() functions and use print_r() to output detailed information. 4. When optimizing traversal, paging can be used to avoid processing large amounts of data at one time. This will help you manage and use PHP session data more efficiently in your actual project.

The session realizes user authentication through the server-side state management mechanism. 1) Session creation and generation of unique IDs, 2) IDs are passed through cookies, 3) Server stores and accesses session data through IDs, 4) User authentication and status management are realized, improving application security and user experience.

Tostoreauser'snameinaPHPsession,startthesessionwithsession_start(),thenassignthenameto$_SESSION['username'].1)Usesession_start()toinitializethesession.2)Assigntheuser'snameto$_SESSION['username'].Thisallowsyoutoaccessthenameacrossmultiplepages,enhanc

Reasons for PHPSession failure include configuration errors, cookie issues, and session expiration. 1. Configuration error: Check and set the correct session.save_path. 2.Cookie problem: Make sure the cookie is set correctly. 3.Session expires: Adjust session.gc_maxlifetime value to extend session time.

Methods to debug session problems in PHP include: 1. Check whether the session is started correctly; 2. Verify the delivery of the session ID; 3. Check the storage and reading of session data; 4. Check the server configuration. By outputting session ID and data, viewing session file content, etc., you can effectively diagnose and solve session-related problems.

Multiple calls to session_start() will result in warning messages and possible data overwrites. 1) PHP will issue a warning, prompting that the session has been started. 2) It may cause unexpected overwriting of session data. 3) Use session_status() to check the session status to avoid repeated calls.

Configuring the session lifecycle in PHP can be achieved by setting session.gc_maxlifetime and session.cookie_lifetime. 1) session.gc_maxlifetime controls the survival time of server-side session data, 2) session.cookie_lifetime controls the life cycle of client cookies. When set to 0, the cookie expires when the browser is closed.


Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

Video Face Swap
Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Hot Tools

mPDF
mPDF is a PHP library that can generate PDF files from UTF-8 encoded HTML. The original author, Ian Back, wrote mPDF to output PDF files "on the fly" from his website and handle different languages. It is slower than original scripts like HTML2FPDF and produces larger files when using Unicode fonts, but supports CSS styles etc. and has a lot of enhancements. Supports almost all languages, including RTL (Arabic and Hebrew) and CJK (Chinese, Japanese and Korean). Supports nested block-level elements (such as P, DIV),

SublimeText3 Linux new version
SublimeText3 Linux latest version

VSCode Windows 64-bit Download
A free and powerful IDE editor launched by Microsoft

SAP NetWeaver Server Adapter for Eclipse
Integrate Eclipse with SAP NetWeaver application server.

Safe Exam Browser
Safe Exam Browser is a secure browser environment for taking online exams securely. This software turns any computer into a secure workstation. It controls access to any utility and prevents students from using unauthorized resources.
