search
HomeBackend DevelopmentPHP TutorialPHP regular expression matching Chinese_PHP tutorial

PHP regular expression matching Chinese_PHP tutorial

Jul 13, 2016 am 10:44 AM
phpChineselearnusematchexiststringusregularChinese characterexpressionwantneed

To use regular expressions to match Chinese characters in php, we need to understand the string encoding and the internal code of the Chinese characters. This way we can quickly and easily accurately match Chinese characters. Let me introduce it to you.


To determine whether a string is Chinese in php, you will follow this idea:

The code is as follows
 代码如下 复制代码
$str = "php编程";
if (preg_match("/^[u4e00-u9fa5]+$/",$str)) {
print("该字符串全部是中文");
} else {
print("该字符串不全部是中文");
}
?>
Copy code


$str = "php programming";

if (preg_match("/^[u4e00-u9fa5]+$/",$str)) {

print("This string is all in Chinese");

} else {

print("This string is not all Chinese");

}

?>

 代码如下 复制代码
$str = "php编程";
if (preg_match("/^[x4e00-x9fa5]+$/",$str)) {
print("该字符串全部是中文");
} else {
print("该字符串不全部是中文");
}

However, you will soon find that php does not support such expressions and an error message is reported:

Warning: preg_match() [function.preg-match]: Compilation failed: PCRE does not support L, l, N, U,

or u at offset 3 in test.php on line 3


I checked it many times on Google at the beginning and wanted to use PHP regular expressions for hexadecimal data

I made a breakthrough in the way of expression and found that in php, x is used to represent hexadecimal data. So,

is transformed into the following code:

The code is as follows
 代码如下 复制代码

(1)     ANSI编程环境下:

$strtest = “yyg中文字符yyg”;

$pregstr = "/([".chr(0xb0)."-".chr(0xf7)."][".chr(0xa1)."-".chr(0xfe)."])+/i";

if(preg_match($pregstr,$strtest,$matchArray)){

echo $matchArray[0];

}

//output:中文字符

(2)     Utf-8编程环境下:

$strtest = “yyg中文字符yyg”;

$pregstr = "/[x{4e00}-x{9fa5}]+/u";

if(preg_match($pregstr,$strtest,$matchArray)){

echo $matchArray[0];

}

//output:中文字符

Copy code
$str = "php programming";
if (preg_match("/^[x4e00-x9fa5]+$/",$str)) { print("This string is all in Chinese");

} else {

print("This string is not all Chinese"); It seems that no error is reported, and the judgment result is correct. However, if $str is replaced with the word "programming", the result still displays "The string is not all in Chinese", see This judgment is still not accurate enough. If you want to accurately match Chinese, that is, match pure Chinese characters, or match Chinese characters plus full-width punctuation, you need to use different methods according to different encoding environments. The following uses two commonly used encodings (gb2312, utf-8)
Here are two examples:
The code is as follows
Copy code
(1) In ANSI programming environment: $strtest = “yyg Chinese character yyg”; $pregstr = "/([".chr(0xb0)."-".chr(0xf7)."][".chr(0xa1)."-".chr(0xfe)."])+/ i"; if(preg_match($pregstr,$strtest,$matchArray)){ echo $matchArray[0]; } //output: Chinese characters (2) In Utf-8 programming environment: $strtest = “yyg Chinese character yyg”; $pregstr = "/[x{4e00}-x{9fa5}]+/u"; if(preg_match($pregstr,$strtest,$matchArray)){ echo $matchArray[0]; } //output: Chinese characters http://www.bkjia.com/PHPjc/633077.htmlwww.bkjia.comtruehttp: //www.bkjia.com/PHPjc/633077.htmlTechArticleIf we want to use regular expressions to match Chinese characters in php, we need to understand the string encoding and the internal code of the Chinese characters. In this way, accurate matching of Chinese characters can be achieved conveniently and quickly...
Statement
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn
What is the difference between the unset() and unlink() functions ?What is the difference between the unset() and unlink() functions ?Apr 30, 2025 pm 03:33 PM

The article discusses the differences between unset() and unlink() functions in programming, focusing on their purposes and use cases. Unset() removes variables from memory, while unlink() deletes files from the filesystem. Both are crucial for effec

What are Traits in PHP ?What are Traits in PHP ?Apr 30, 2025 pm 03:31 PM

PHP traits enable code reuse in single inheritance contexts, offering benefits like reusability and simplified inheritance. They can be effectively combined with traditional inheritance to enhance class flexibility and modularity.

Is PHP supports multiple inheritance ?Is PHP supports multiple inheritance ?Apr 30, 2025 pm 03:30 PM

PHP does not support multiple inheritance but uses interfaces and traits as alternatives to achieve similar functionality, avoiding issues like the diamond problem.

What is inheritance in PHP ?What is inheritance in PHP ?Apr 30, 2025 pm 03:29 PM

Inheritance in PHP allows classes to inherit properties and methods, promoting code reuse and hierarchical organization. Key benefits include reusability, abstraction, and polymorphism. Common mistakes to avoid are overuse of inheritance and ignoring

What are the main error types, and how do they differ?What are the main error types, and how do they differ?Apr 30, 2025 pm 03:28 PM

The article discusses three main error types in programming: syntax, runtime, and logical errors. It explains their causes, prevention strategies, impacts on performance and user experience, and methods for diagnosis and resolution.

How can PHP and HTML interact?How can PHP and HTML interact?Apr 30, 2025 pm 03:27 PM

Article discusses PHP and HTML interaction, best practices for embedding PHP in HTML, dynamic HTML content generation, and recommended development tools.

What is the difference between for and foreach loop in PHP?What is the difference between for and foreach loop in PHP?Apr 30, 2025 pm 03:26 PM

The article discusses the differences between for and foreach loops in PHP, focusing on syntax, usage, control, and performance. Foreach is preferred for array iteration due to simplicity and efficiency, but for loops are better for index-based opera

Explain the importance of Parser in PHP.for eachExplain the importance of Parser in PHP.for eachApr 30, 2025 pm 03:25 PM

The article discusses the crucial role of the PHP parser in script execution, focusing on its tasks in syntax analysis, error handling, and code optimization, and how its efficiency impacts web application performance.

See all articles

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

Video Face Swap

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Tools

MantisBT

MantisBT

Mantis is an easy-to-deploy web-based defect tracking tool designed to aid in product defect tracking. It requires PHP, MySQL and a web server. Check out our demo and hosting services.

EditPlus Chinese cracked version

EditPlus Chinese cracked version

Small size, syntax highlighting, does not support code prompt function

SublimeText3 English version

SublimeText3 English version

Recommended: Win version, supports code prompts!

SublimeText3 Linux new version

SublimeText3 Linux new version

SublimeText3 Linux latest version

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor