The concept of regular expression
Regular expression is a logical formula for string operations, which uses some predefined Specific characters and combinations of these specific characters form a "rule string". This "rule string" is used to express a filtering logic for strings.
The syntax rules of regular expressions
The rough matching process of regular expressions is:
1. Take out the expressions and Comparison of characters in the text,
2. If every character can be matched, the match is successful; once there are characters that are not matched successfully, the match fails.
3. If there are quantifiers or boundaries in the expression, the process will be slightly different.
The following are some matching rules for regular expressions in Python
## Pattern |
##Description |
|
^ |
# Matches the beginning of the string
|
|
## Matches the end of the string. |
||
(\n)# Any characters of ##, when the re.DOTALL tag is specified, can match any characters including newlines. ##[...] |
||
is used to represent a group of characters, listed separately: [amk] matches 'a', 'm' or 'k' |
[^...] |
|
Characters not in []: [^abc] matches characters other than a, b, c. | ||
re* |
##matched 0 or multiple expressions. |
|
re |
Matches 1 or more expressions. |
|
##re? |
## Matches 0 or 1 fragments defined by the previous regular expression, non-greedy way
|
|
##Exactly matches n previous expressions. |
##re{ n, m} |
|
Match n to m times the fragment defined by the previous regular expression, greedy way |
##a| b |
|
##matches a or b |
(re) |
|
##G matches the expression within the brackets , also represents a group |
##(?imx) |
Regular expressions contain three optional flags: i, m, or x. Only affects the area in brackets. |
##(?-imx) | Regular expression turns off the i, m, or x optional flags. Only affects the area in brackets. |
|
(?: re) | Similar to (...), but does not represent a group |
|
# #(?imx: re) |
Use i, m, or x optional flags in parentheses |
|
##(?-imx: re)
|
Do not use i, m, or x optional flags in parentheses
|
|
|
||
##Forward positive delimiter. Succeeds if the contained regular expression, represented by ... , successfully matches the current position, otherwise it fails. But once the contained expression has been tried, the matching engine doesn't improve at all; the remainder of the pattern still has to try the right side of the delimiter. |
||
Forward negative delimiter. Opposite of positive delimiter; succeeds when the contained expression cannot be matched at the current position in the string |
||
Matching independent pattern, eliminating backtracking. |
||
##\w |
Matches letters, numbers and underscores, equivalent to '[A-Za-z0-9_]'. |
|
##\W | Matches non-alphanumeric characters and underscores, equivalent to '[^A-Za-z0-9_]'. |
|
##\s
|
Matches any whitespace character, equivalent to [\t\n\r\f].
|
|
\S
|
matches any non-empty character, equivalent to [^ \f\n\r\t \v].
|
|
matches any non-number, equivalent to [^0-9]. |
||
Matches the beginning of the string |
##\Z |
|
# Matches the end of the string. If there is a newline, only the end of the string before the newline is matched. c |
||
##\z |
##match string End |
|
\G | Match the position where the last match is completed. |
|
#\b
|
Matches a word boundary, which refers to the position between a word and a space. For example, 'er\b' matches 'er' in "never" but not in "verb".
|
|
## Match non-word boundaries. 'er\B' matches 'er' in "verb", but not in "never". |
||
Matches a newline character. Matches a tab character. Wait |
##\1...\9 |
|
Matches the content of the nth group. |
#\10 |
|
Match the content of the nth group if it is matched. Otherwise it refers to the expression of the octal character code. |
The above is the detailed content of Detailed explanation of python regular expressions. For more information, please follow other related articles on the PHP Chinese website!

两种去除方法:1、利用preg_replace()执行正则表达式搜索所有大写字母并将其替换为空字符即可,语法“preg_replace('/[A-Z]/','',$str)”。2、利用preg_filter()执行正则表达式搜索所有大写字母并将其替换为空字符即可,语法“preg_filter('/[A-Z]/','',$str)”。

方法:1、用“str_replace(" ","其他字符",$str)”语句,可将nbsp符替换为其他字符;2、用“preg_replace("/(\s|\ \;||\xc2\xa0)/","其他字符",$str)”语句。

随着数据量的不断增大,正则表达式匹配成为了程序中常用的操作之一。而在Go语言中,由于其天然的并行ism,以及与底层系统的交互性和高效性,使得Go语言的正则表达式匹配极具优势。那么如何使用Go语言编写高性能的正则表达式匹配呢?一、了解正则表达式在使用正则表达式前,我们首先需要了解正则表达式,了解其基本语法规则以及常用的匹配字符,使我们能够在编写正则表达式时更加

两种方法:1、用preg_replace(),可执行正则表达式的搜索和替换,只需将字符串中匹配的字符替换为空字符即可,语法“preg_replace(正则, "", $str)”。2、用preg_match_all(),可搜索字符串中所有和正则表达式匹配的结果,会将每次的匹配结果放在一个数组$array中,语法“preg_match_all(正则,$str,$array);”。

php中可用preg_match_all()配合正则表达式过滤字符串,只获取中文字符;语法“preg_match_all("/[\x{4e00}-\x{9fff}]+/u","$str",$arr);”,会将匹配字符存入“$arr”数组中。

在javascript中,可以使用replace()函数配合正则表达式“/[u4e00-u9fa5|,]+/ig”来查找字符串中的所有非汉字字符,并将其替换为其他指定值,语法“字符串对象.replace(/[u4e00-u9fa5|,]+/ig,'指定替换值')”。

Java语言正则表达式的使用方法正则表达式是一种强大的文本处理工具,可以用来匹配和验证文本。在Java语言中,也可以使用正则表达式来实现字符串的匹配和处理。本文将介绍Java语言正则表达式的使用方法,涵盖正则表达式的基础知识,常用的正则表达式语法,以及在Java程序中使用正则表达式的方法。一、基础知识正则表达式是什么?正则表达式是一种文本模式,用来描述一组字

在PHP开发中,正则表达式是非常重要的工具,用于匹配、查找和替换文本中的特定字符串。然而,编写高效的正则表达式并不是一件易事,需要开发者具备一定的技巧和经验。下面是一些可以帮助您编写高效正则表达式的技巧:1.尽可能使用非贪婪匹配默认情况下,正则表达式是贪婪的,即它们将尽可能匹配更多的文本。在某些情况下,可能需要使用非贪婪匹配来避免这种情况。非贪婪匹配使用"


Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

AI Hentai Generator
Generate AI Hentai for free.

Hot Article

Hot Tools

Notepad++7.3.1
Easy-to-use and free code editor

SAP NetWeaver Server Adapter for Eclipse
Integrate Eclipse with SAP NetWeaver application server.

EditPlus Chinese cracked version
Small size, syntax highlighting, does not support code prompt function

PhpStorm Mac version
The latest (2018.2.1) professional PHP integrated development tool

SublimeText3 Chinese version
Chinese version, very easy to use
