Summary of the usage of regular expressions in Java programming-javaTutorial-php.cn

Home

Java

javaTutorial

Summary of the usage of regular expressions in Java programming

黄舟

Jan 20, 2017 am 11:08 AM

This article mainly introduces a summary of the usage of regular expressions in Java programming. Regular expressions are a powerful string processing tool. Java’s support for regular expressions is still very good. Let’s sort out the regular expressions first. Some basic knowledge of expressions:

　1. Regular expressions in strings

Regular expressions can be used to search, extract, split, replace and other operations on strings. The String class provides the following special methods:

boolean matches(String regex): Determine whether the string matches the specified regular expression.

String replaceAll(String regex, String replacement): Replace all substrings matching regex in the string with replacement.

String[] split(String regex): Use regex as the separator to split the string into multiple substrings.

The above special methods all rely on the regular expressions provided by Java.

　2. Create a regular expression

　x: Character x (x can represent any legal character);

　\0mnn: The character represented by the octal number Omnn;

\xhh: The character represented by hexadecimal 0xhh;

\uhhhh: The UNICODE character represented by hexadecimal 0xhhhh;

\t: Tab character ('\u0009');

\n: New line (line feed) character ('\u000A');

\r: Carriage return character ('\u000D');

　\f: Form feed character ('\u000C');

　\a: Alarm (bell) character ('\u0007');

　\e: Escape character ( '\u001B');

　\cx: The control character corresponding to x. For example, \cM matches Ctrl-M. The x value must be one of A~Z or a~z;

　3. Special characters in regular expressions

$: Matches the end of a line. To match the $ character itself, use \$;

　^: to match the beginning of a line. To match the ^ character itself, use \^;

　(): to mark the beginning and end of a subexpression. To match these characters, use $and $;

　[]: Used to determine the start and end position of the bracket expression. To match these characters, use \[ and \];

　{}: used to mark the frequency of occurrence of the previous subexpression. To match these characters, use \{ and \};

*: Specifies that the preceding subexpression may appear zero or more times. To match the * character itself, use \*;

　+: Specifies that the preceding subexpression can appear one or more times. To match the + character itself, use \+;

　?: to specify that the preceding subexpression can appear zero or once. To match the ? character itself, use \?;

　.: matches any unit character except the newline character \n. To match the character itself, use \.;

\: used to escape the next character, or specify octal or hexadecimal characters. To match the \ character, use \\;

|: to specify one of the two items. To match the | character itself, use \|;

　4. Predefined characters

　.: Can match any character;

　\d: Match all 0~9 Numbers;

\D: Match non-digits;

\s: Match all whitespace characters, including spaces, tabs, carriage returns, form feeds, line feeds, etc.;

\S: Matches all non-whitespace characters;

\w: Matches all word characters, including all numbers from 0 to 9, 26 English letters and underscores (_);

　\W: Match all non-word characters;

　5. Boundary matching character

　^: Beginning of line

　$: End of line

　\b: Word boundary

　\B: Non-word boundary

　\A: Beginning of input

　\G: End of previous match

　\Z: The end of the input, only used for the last terminator

　\z: The end of the input

　6. The symbol indicating the number of matches

　The figure shows the symbols representing the number of matches, which are used to determine the number of occurrences of the symbol immediately to the left of the symbol:

Summary of the usage of regular expressions in Java programming

　(1) Suppose we want to in a text file Search for US Social Security numbers. The format of this number is 999-99-9999. The regular expression used to match it is shown in Figure 1. In regular expressions, the hyphen ("-") has a special meaning. It represents a range, such as from 0 to 9. Therefore, when matching a hyphen in a Social Security number, it is preceded by an escape character "\".

Summary of the usage of regular expressions in Java programming

　(2) Assume that when searching, you want the hyphen to appear or not appear - that is, 999-99- 9999 and 999999999 are both correct formats. At this time, you can add the "?" quantity limit symbol after the hyphen, as shown in the figure:

Summary of the usage of regular expressions in Java programming

　(3 ) Let’s look at another example below. One format for U.S. car license plates is four numbers plus two letters. Its regular expression is preceded by the numeric part "[0-9]{4}", plus the letter part "[A-Z]{2}". The image below shows the complete regular expression.

Summary of the usage of regular expressions in Java programming

　7.一些实例

　　例子1　

function replace(content){
 
　var reg = &#39;\\[(\\w+)\\]&#39;,
 
　pattern = new RegExp(reg, &#39;g&#39;);
 
　return content.replace(pattern, &#39;&#39;);
 
　}
 
　//或
 
　function replace(content){
 
　return content.replace(/\[(\w+)\/g, &#39;&#39;);
 
　}

　　例子2　　

//zero-width look behind的替换方案
 
　　//(?<=...)和(?
　　//方法一：反转字符串,用lookahead进行搜索,替换以后再倒回来,例如:
 
　　String.prototype.reverse = function () {
 
　　return this.split(&#39;&#39;).reverse().join(&#39;&#39;);
 
　　}
 
　　//模拟&#39;foo.bar|baz&#39;.replace(/(?<=\.)b/, &#39;c&#39;) 即将前面有&#39;.&#39;的b换成c
 
　　&#39;foo.bar|baz&#39;.reverse().replace(/b(?=\.)/g, &#39;c&#39;).reverse() //foo.car|baz
 
　　//方法二：不用零宽断言,自己判断
 
　　//模拟&#39;foo.bar|baz&#39;.replace(/(?<=\.)b/, &#39;c&#39;) 即将前面有&#39;.&#39;的b换成c
 
　　&#39;foo.bar|baz&#39;.replace(/(\.)?b/, function ($0, $1) {
 
　　return $1 ? $1 + &#39;c&#39; : $0;
 
　　}) //foo.car|baz
 
　　//模拟&#39;foo.bar|baz&#39;.replace(/(?
　　&#39;foo.bar|baz&#39;.replace(/(\.)?b/, function ($0, $1) {
 
　　return $1 ? $0 : &#39;c&#39;;
 
　　}) //foo.bar|caz
 
　　//这个方法在一些比较简单的场景下有用,并且可以和lookahead一起用
 
　　//但也有很多场景无效,例如:
 
　　//&#39;tttt&#39;.replace(/(?<=t)t/g, &#39;x&#39;) 结果应该是&#39;txxx&#39;
 
　　&#39;tttt&#39;.replace(/(t)?t/g, function ($0, $1) {
 
　　return $1 ? $1 + &#39;x&#39; : $0;
 
　　}) // txtx

　例子3

$&符号的使用
 
　function escapeRegExp(str) {
 
　return str.replace(/[abc]/g, "($&)");
 
　}
 
　var str = &#39;a12b34c&#39;;
 
　console.log(escapeRegExp(str)); //(a)12(b)34(c)

以上就是Java编程中正则表达式的用法总结的内容，更多相关内容请关注PHP中文网（www.php.cn）！

Statement

The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Advanced garbage collection techniques and best practicesApr 19, 2025 pm 01:48 PM

See all articles

Hot AI Tools

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress images for free

Clothoff.io

AI clothes remover

AI Hentai Generator

Generate AI Hentai for free.

Hot Article

Assassin's Creed Shadows: Seashell Riddle Solution

3 weeks agoByDDD

What's New in Windows 11 KB5054979 & How to Fix Update Issues

2 weeks agoByDDD

Where to find the Crane Control Keycard in Atomfall

3 weeks agoByDDD

Saving in R.E.P.O. Explained (And Save Files)

1 months agoBy尊渡假赌尊渡假赌尊渡假赌

Assassin's Creed Shadows - How To Find The Blacksmith And Unlock Weapon And Armour Customisation

4 weeks agoByDDD

Hot Tools

SecLists

SecLists is the ultimate security tester's companion. It is a collection of various types of lists that are frequently used during security assessments, all in one place. SecLists helps make security testing more efficient and productive by conveniently providing all the lists a security tester might need. List types include usernames, passwords, URLs, fuzzing payloads, sensitive data patterns, web shells, and more. The tester can simply pull this repository onto a new test machine and he will have access to every type of list he needs.

WebStorm Mac version

Useful JavaScript development tools

ZendStudio 13.5.1 Mac

Powerful PHP integrated development environment

Safe Exam Browser

Safe Exam Browser is a secure browser environment for taking online exams securely. This software turns any computer into a secure workstation. It controls access to any utility and prevents students from using unauthorized resources.

MinGW - Minimalist GNU for Windows

This project is in the process of being migrated to osdn.net/projects/mingw, you can continue to follow us there. MinGW: A native Windows port of the GNU Compiler Collection (GCC), freely distributable import libraries and header files for building native Windows applications; includes extensions to the MSVC runtime to support C99 functionality. All MinGW software can run on 64-bit Windows platforms.

Hot Topics

Where is the login entrance for gmail email?

7569

CakePHP Tutorial

1386

What is the format of the account name of steam

win11 activation key permanent

nyt connections hints and answers

107