How to write regex to match a group of characters

This article shows how to write a regular expression that matches one character out of a group of characters, and what to watch out for when doing so. It works through practical examples, shared here for your reference.

Note: In all examples, the text matched by the regular expression is enclosed between 【 and 】 in the source text. Some examples are implemented in Java; where a usage is specific to regular expressions in Java, this is pointed out in the corresponding place. All Java examples were tested under JDK 1.6.0_13.

1. Match one of multiple characters

In the previous article, "Regular Expression Tutorial: Detailed Explanation of Matching a Single Character", the example that matched text files starting with na or sa used the regular expression .a.\.txt. If there is another file called cal.txt, it is matched as well. What if we only want to match files starting with na or sa?

Since we only want to find n or s, using ., which matches any character, is obviously not an option. In regular expressions, we can use [ and ] to define a character set. All characters between these two metacharacters are members of the set, and the set matches any single character that is one of its members. Let's look at an example similar to the previous one:

Text:

sales.txt

na1.txt

na2.txt

sa1.txt

sanatxt.txt

cal.txt

Regular expression:

[ns]a.\.txt

Result:

sales.txt

【na1.txt】

【na2.txt】

【sa1.txt】

sanatxt.txt

cal.txt

Analysis: The regular expression used here starts with [ns]. This set matches the character n or s and no other character. [ and ] themselves do not match any characters; they only delimit the set. Next, a matches the character a, . matches any single character, \. matches a literal . character, and txt matches the text txt. The results are consistent with our expectations.

However, if one of the files were named usa1.txt, it would also be matched, because the pattern can match in the middle of a name (here, the sa1.txt part). This is a matter of positional matching, which will be discussed later.
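The behaviour described above can be checked with a small Java sketch. The class and method names (CharSetDemo, matching) are made up for illustration and are not part of the original tutorial; usa1.txt is added to the list to show the positional problem.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Pattern;

public class CharSetDemo {
    // Pattern from the example: n or s, then a, then any character, then ".txt".
    static final Pattern NS_FILE = Pattern.compile("[ns]a.\\.txt");

    // Returns the names that contain a match anywhere in the string.
    static List<String> matching(String... names) {
        List<String> hits = new ArrayList<String>();
        for (String name : names) {
            if (NS_FILE.matcher(name).find()) {
                hits.add(name);
            }
        }
        return hits;
    }

    public static void main(String[] args) {
        // prints [na1.txt, na2.txt, sa1.txt, usa1.txt]
        System.out.println(matching("sales.txt", "na1.txt", "na2.txt",
                "sa1.txt", "sanatxt.txt", "cal.txt", "usa1.txt"));
    }
}
```

find() looks for the pattern anywhere in the input, which is why usa1.txt is reported as well.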

2. Use the character set interval

In the above example, what if we only want to match files that start with na or sa and are followed by a digit? In the regular expression [ns]a.\.txt, . matches any character, not only digits. This problem can also be solved with a character set:

Text:

sales.txt

na1.txt

na2.txt

sa1.txt

san.txt

sanatxt.txt

cal.txt

Regular expression: [ns]a[0123456789]\.txt

Result:

sales.txt

【na1.txt】

【na2.txt】

【sa1.txt】

san.txt

sanatxt.txt

cal.txt

Analysis: As the results show, we only match file names that start with na or sa followed by a digit. san.txt is not matched because the character set [0123456789] restricts the third character to a digit.

In regular expressions, some character intervals are used very frequently, such as 0-9 and a-z. To simplify their definition, regular expressions provide a special metacharacter, -, to define a character range. For the example above, we can use the regular expression [ns]a[0-9]\.txt, and the result is exactly the same.
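As a rough check of that equivalence, the following sketch compares the listed form with the range form on the sample file names. The class name DigitRangeDemo and the helper sameResult are illustrative assumptions.

```java
import java.util.regex.Pattern;

public class DigitRangeDemo {
    // The same set written twice: once listing every digit, once as a range.
    static final Pattern LISTED = Pattern.compile("[ns]a[0123456789]\\.txt");
    static final Pattern RANGED = Pattern.compile("[ns]a[0-9]\\.txt");

    // True when both patterns agree on whether the name contains a match.
    static boolean sameResult(String name) {
        return LISTED.matcher(name).find() == RANGED.matcher(name).find();
    }

    public static void main(String[] args) {
        String[] names = {"sales.txt", "na1.txt", "na2.txt", "sa1.txt",
                          "san.txt", "sanatxt.txt", "cal.txt"};
        for (String name : names) {
            System.out.println(name + " matched=" + RANGED.matcher(name).find()
                    + " agrees=" + sameResult(name));
        }
    }
}
```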

The character range is not limited to numbers. The following are legal character ranges:

[A-F]: Matches all uppercase letters from A to F.

[A-Z]: Matches all uppercase letters from A to Z.

[A-z]: Matches all characters from ASCII character A to ASCII character z. This interval is generally not used and is shown only as an example, because it also contains characters such as [ and ^, which lie between Z and a in ASCII.

The first and last characters of a character interval can be any characters in the ASCII table. In practice, however, the most commonly used ranges are digits and letters.
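The pitfall of [A-z] can be seen in a short sketch. The class WideRangeDemo and its two helpers are hypothetical names used only for this illustration.

```java
import java.util.regex.Pattern;

public class WideRangeDemo {
    // True when the single character c is a member of [A-z].
    static boolean inWideRange(char c) {
        return Pattern.matches("[A-z]", String.valueOf(c));
    }

    // True when c is a member of [A-Za-z], i.e. letters only.
    static boolean inLetterRange(char c) {
        return Pattern.matches("[A-Za-z]", String.valueOf(c));
    }

    public static void main(String[] args) {
        // '[' and '^' sit between 'Z' and 'a' in ASCII, so [A-z] accepts them.
        System.out.println(inWideRange('['));   // prints true
        System.out.println(inWideRange('^'));   // prints true
        System.out.println(inLetterRange('[')); // prints false
    }
}
```

Writing two separate ranges, [A-Za-z], avoids the extra punctuation characters.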

Note: When defining a character interval, the last character of the interval must not be smaller than the first character; an interval such as [9-0] is not allowed. In addition, - acts as a metacharacter only between [ and ]. Anywhere outside [ and ], it is an ordinary character and matches only - itself.
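In Java, both points of this note can be observed directly: compiling a reversed interval throws a PatternSyntaxException, and a - outside a set is matched literally. The class RangeRulesDemo and the helper compiles are illustrative names, not part of the original article.

```java
import java.util.regex.Pattern;
import java.util.regex.PatternSyntaxException;

public class RangeRulesDemo {
    // Returns true when the regex compiles, false when Java rejects it.
    static boolean compiles(String regex) {
        try {
            Pattern.compile(regex);
            return true;
        } catch (PatternSyntaxException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        System.out.println(compiles("[0-9]")); // prints true
        System.out.println(compiles("[9-0]")); // prints false: reversed interval

        // Outside [ and ], - is an ordinary character.
        System.out.println(Pattern.matches("a-z", "a-z")); // prints true
        System.out.println(Pattern.matches("a-z", "b"));   // prints false
    }
}
```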

Multiple character ranges can be given in the same character set. For example, [0-9a-zA-Z] matches any single digit or any uppercase or lowercase letter.

Let’s look at an example of matching colors in a web page:

Text:

<span style="background-color:#3636FF;height:30px; width:60px;">Test</span>

Regular expression: #[0-9A-Fa-f][0-9A-Fa-f][0-9A-Fa-f][0-9A-Fa-f][0-9A-Fa-f][0-9A-Fa-f]

Result: <span style="background-color:【#3636FF】;height:30px; width:60px;">Test</span>

Analysis: In web pages, a color is generally expressed as an RGB value starting with #, where R stands for red, G for green, and B for blue. Any color can be produced by combining different amounts of red, green, and blue. RGB values are written in hexadecimal, such as #000000 for black, #FFFFFF for white, and #FF0000 for red. Therefore, the regular expression for matching colors in web pages starts with #, followed by the same character set [0-9A-Fa-f] six times (this can be abbreviated as #[0-9A-Fa-f]{6}, which will be discussed later under repeat matching).
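A minimal Java sketch of this color match follows, using the abbreviated {6} form mentioned above. The class HexColorDemo, the helper firstColor, and the sample markup in main are assumptions made for illustration.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class HexColorDemo {
    // # followed by exactly six hexadecimal digits.
    static final Pattern HEX_COLOR = Pattern.compile("#[0-9A-Fa-f]{6}");

    // Returns the first color found in the text, or null if there is none.
    static String firstColor(String text) {
        Matcher m = HEX_COLOR.matcher(text);
        return m.find() ? m.group() : null;
    }

    public static void main(String[] args) {
        // Hypothetical sample markup, used only for this sketch.
        String html = "<span style=\"background-color:#3636FF;height:30px;\">Test</span>";
        System.out.println(firstColor(html));         // prints #3636FF
        System.out.println(firstColor("color:#GGG")); // prints null
    }
}
```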

3. Negated matching

Character sets are usually used to specify a group of characters, one of which must be matched. In some cases, however, we need the opposite: a group of characters that must not be matched. In other words, any character except those in the set can be matched.

For example, to match files that begin with na or sa and are not followed by numbers:

Text:

sales.txt

na1.txt

na2.txt

sa1.txt

sanatxt.txt

san.txt

Regular expression: [ns]a[^0-9]\.txt

Result:

sales.txt

na1.txt

na2.txt

sa1.txt

sanatxt.txt

【san.txt】

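Analysis: The pattern used in this example is exactly the opposite of the previous one. The previous [0-9] matched only digits, while [^0-9] here matches any single character that is not a digit. A character must still be present at that position, which is why only san.txt is matched.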

Note: ^ means negation only when it is the first character between [ and ]. If it appears at the beginning of the regular expression, outside a character set, it is a positional match for the start of the text, which will be discussed later. The effect of ^ applies to all characters and character intervals in the given set, not just the one immediately following the ^. For example, [^0-9a-z] matches any character that is neither a digit nor a lowercase letter.
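The two points about negation can be tried out with the sketch below. The class NegatedSetDemo and the helper excluded are hypothetical names chosen for this example.

```java
import java.util.regex.Pattern;

public class NegatedSetDemo {
    // n or s, then a, then one character that is NOT a digit, then ".txt".
    static final Pattern NON_DIGIT_FILE = Pattern.compile("[ns]a[^0-9]\\.txt");

    // True when the single character c is neither a digit nor a lowercase letter.
    static boolean excluded(char c) {
        return Pattern.matches("[^0-9a-z]", String.valueOf(c));
    }

    public static void main(String[] args) {
        System.out.println(NON_DIGIT_FILE.matcher("san.txt").find()); // prints true
        System.out.println(NON_DIGIT_FILE.matcher("na1.txt").find()); // prints false

        // ^ negates the whole set, so both the digits and a-z are rejected.
        System.out.println(excluded('5')); // prints false
        System.out.println(excluded('q')); // prints false
        System.out.println(excluded('Q')); // prints true
    }
}
```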

4. Summary

The metacharacters [ and ] define a set of characters, meaning that one of the characters in the set must be matched. There are two ways to define a character set: list all of its characters, or use the metacharacter - to give them as character intervals. A character set can be negated with the metacharacter ^, which excludes the given characters from the match, so that any character not in the set can be matched.

In the next article, we will discuss the use of some metacharacters in regular expressions.
