PHP regular expression capturing group and non-capturing group
熟练掌握正则表达式是每个程序员的基础要求,对于每个初学者来说会被正则表达式一连串字符弄得头晕眼花。博主便会如此,一直对正则表达式有种莫名的恐惧。近来看到另一位博友写的 《php正则表达式》一文获益良多,对其通配符以及捕获数据两个章节颇感兴趣。这两个章节正好涉及到的是正则表达式的捕获组与非捕获组的知识,因而本文来细细探讨下这部分知识。
我们知道,在正则表达式下(x) 表示匹配'x'并记录匹配的值。这只是比较通俗的说法,甚至说这是不严谨的说法,只有()捕获组形式才会记录匹配的值。非捕获组则只匹配,不记录。
捕获组:
(pattern)
这种形式是我们见到最多的一种形式,匹配并返回捕获结果,可以嵌套,组号顺序从左到右依次排列‘。
<span>$regex</span> = '/(ab(c)+)+d(e)?/'<span>; </span><span>$str</span> = 'abccde'<span>; </span><span>$matches</span> = <span>array</span><span>(); </span><span>if</span>(<span>preg_match</span>(<span>$regex</span>, <span>$str</span>, <span>$matches</span><span>)){ </span><span>print_r</span>(<span>$matches</span><span>); }</span>
匹配结果:
<span>Array</span> ( [0] => abccde [1] => abcc [2] => c [3] => e )
(?Pname>pattern)
这种方式虽然看起来在构造正则表达式的时候略微复杂一点,但实质上与(pattern)一样。最大的优势体现在对结果处理上,程序员可以直接根据自己设置的
<span>$regex</span> = '/(?P<group1>\w(?P<group2>\w))abc(?P<group3>\w)45/'<span>; </span><span>$str</span> = 'fsabcd45'<span>; </span><span>$matches</span> = <span>array</span><span>(); </span><span>if</span>(<span>preg_match</span>(<span>$regex</span>, <span>$str</span>, <span>$matches</span><span>)){ </span><span>print_r</span>(<span>$matches</span><span>); } </span></group3></group2></group1>
匹配结果:
<span>Array</span> ( [0] => fsabcd45 [group1] => fs [1] => fs [group2] => s [2] => s [group3] => d [3] => d )
\num
num是一个整数,是对捕获组的反向引用。 例如\2表示第二个子组匹配值,\表示第一个子组匹配值
<span>$regex</span> = '/(\w)(\w)\2\1/'<span>; </span><span>$str</span> = 'abba'<span>; </span><span>$matches</span> = <span>array</span><span>(); </span><span>if</span>(<span>preg_match</span>(<span>$regex</span>, <span>$str</span>, <span>$matches</span><span>)){ </span><span>print_r</span>(<span>$matches</span><span>); }</span>
匹配结果:
<span>Array</span> ( [0] => abba [1] => a [2] => b )
注意,这里我疏忽了一个小细节,一开始我第一样代码是 $regex = “/(\w)(\w)\2\1/”; 结果返回无匹配结果,经过调试后,发现这里只能用' '。'与" 用法差别大家还是需要注意下。
\k name >
了解了(?Pname>pattern)与\num,这个就不难理解了。\k是对命名捕获组的反向引用。其中 name 是捕获组名。
<span>$regex</span>='/(?P<name>\w)abc\k<name>/'<span>; </span><span>$str</span>="fabcf"<span>; </span><span>echo</span><span>preg_match_all</span>(<span>$regex</span>, <span>$str</span>,<span>$matches</span><span>); </span><span>print_r</span>(<span>$matches</span>);</name></name>
匹配结果:
Array ( [0] => Array ( [0] => fabcf ) [name] => Array ( [0] => f ) [1] => Array ( [0] => f ) )
非捕获组:
(?:pattern)
与(pattern)的唯一区别是,匹配pattern但不捕获匹配结果。这里便不再举例。
还有四种方式实际上讲的是一个事情:预查。
预查分为正向预查与反向预查。根据字面理解,正向预查是判断匹配字符串后面某些字符存在与否,而反向预查则是判断匹配字符串前面某些字符存在与否。
正向预查判断存在使用(?=pattern),判断不存在使用(?!pattern)。
反向预查判断存在使用(?pattern),判断不存在使用(?pattern)。
<span>$regx</span>='/(?; <span>$str</span>="abcd ebcd abce ebca"<span>; </span><span>if</span>(<span>preg_match_all</span>(<span>$regx</span>, <span>$str</span>, <span>$matches</span><span>)){ </span><span>print_r</span>(<span>$matches</span><span>); }</span>
匹配结果:
<span>Array</span> ( [0] => <span>Array</span> ( [0] => bc) )
这四种形式使用的是否只要注意好相对匹配字符串的位置和断言肯定还是否定,就会很快掌握。
另外,预查的四种形式是零宽度的,匹配的时候只做一个判断,本身是不占位置的。/HE(?=L)LLO/ 与HELLO匹配,而/HE(?=L)LO/与HELLO是不匹配的。毕竟但从字节数上两者就是不匹配的,前者只有4个,而后者有5个。
以上就介绍了php 正则表达式捕获组与非捕获组,包括了方面的内容,希望对PHP教程有兴趣的朋友有所帮助。

PHPidentifiesauser'ssessionusingsessioncookiesandsessionIDs.1)Whensession_start()iscalled,PHPgeneratesauniquesessionIDstoredinacookienamedPHPSESSIDontheuser'sbrowser.2)ThisIDallowsPHPtoretrievesessiondatafromtheserver.

The security of PHP sessions can be achieved through the following measures: 1. Use session_regenerate_id() to regenerate the session ID when the user logs in or is an important operation. 2. Encrypt the transmission session ID through the HTTPS protocol. 3. Use session_save_path() to specify the secure directory to store session data and set permissions correctly.

PHPsessionfilesarestoredinthedirectoryspecifiedbysession.save_path,typically/tmponUnix-likesystemsorC:\Windows\TemponWindows.Tocustomizethis:1)Usesession_save_path()tosetacustomdirectory,ensuringit'swritable;2)Verifythecustomdirectoryexistsandiswrita

ToretrievedatafromaPHPsession,startthesessionwithsession_start()andaccessvariablesinthe$_SESSIONarray.Forexample:1)Startthesession:session_start().2)Retrievedata:$username=$_SESSION['username'];echo"Welcome,".$username;.Sessionsareserver-si

The steps to build an efficient shopping cart system using sessions include: 1) Understand the definition and function of the session. The session is a server-side storage mechanism used to maintain user status across requests; 2) Implement basic session management, such as adding products to the shopping cart; 3) Expand to advanced usage, supporting product quantity management and deletion; 4) Optimize performance and security, by persisting session data and using secure session identifiers.

The article explains how to create, implement, and use interfaces in PHP, focusing on their benefits for code organization and maintainability.

The article discusses the differences between crypt() and password_hash() in PHP for password hashing, focusing on their implementation, security, and suitability for modern web applications.

Article discusses preventing Cross-Site Scripting (XSS) in PHP through input validation, output encoding, and using tools like OWASP ESAPI and HTML Purifier.


Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

Video Face Swap
Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Hot Tools

SublimeText3 English version
Recommended: Win version, supports code prompts!

ZendStudio 13.5.1 Mac
Powerful PHP integrated development environment

MantisBT
Mantis is an easy-to-deploy web-based defect tracking tool designed to aid in product defect tracking. It requires PHP, MySQL and a web server. Check out our demo and hosting services.

Dreamweaver CS6
Visual web development tools

mPDF
mPDF is a PHP library that can generate PDF files from UTF-8 encoded HTML. The original author, Ian Back, wrote mPDF to output PDF files "on the fly" from his website and handle different languages. It is slower than original scripts like HTML2FPDF and produces larger files when using Unicode fonts, but supports CSS styles etc. and has a lot of enhancements. Supports almost all languages, including RTL (Arabic and Hebrew) and CJK (Chinese, Japanese and Korean). Supports nested block-level elements (such as P, DIV),
