How do we find the exact position of each match in Python's regular expression?
Aug 31, 2023, 12:13 PM

Introduction

The re module is Python's regular expression library. Regular expressions are used for text searches and more complex text operations. Tools like grep and sed, text editors like vi and emacs, and computer languages like Tcl, Perl, and Python all have built-in regular expression support.

The re module in Python provides functions for matching regular expressions.

The text we want to find or modify is described by a regular expression called a pattern. A pattern is a string made up of text literals and metacharacters. The re.compile() function is used to create pattern objects. It is recommended to write patterns as raw strings because regular expressions often contain special characters such as backslashes. (The r prefix is used to indicate a raw string.) In a raw string, Python does not interpret these characters, so they reach the regular expression engine unchanged.

Once compiled, a pattern can be applied to a text string using one of several functions. Available functions include match(), search(), findall(), and finditer().
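
To illustrate why raw strings are recommended, here is a minimal sketch comparing a normal string literal with a raw one. The sample text and the variable names are assumptions made up for this illustration.

```python
import re

# In a normal string literal, Python itself turns "\b" into a backspace
# character before the regex engine ever sees it.
normal = "\bcat\b"
# In a raw string the backslashes are kept, so the regex engine receives
# the two-character sequence \b, which means "word boundary".
raw = r"\bcat\b"

text = "a cat sat"  # hypothetical sample text

normal_match = re.search(normal, text)
raw_match = re.search(raw, text)

print(len(normal), len(raw))  # the raw string is two characters longer
print(normal_match)           # None: the text contains no backspace characters
print(raw_match.start())      # offset at which "cat" starts
```

Only the raw-string version finds the word, because its \b sequences are still intact when the pattern is compiled.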

Syntax used

The following regular expression functions and methods are used here to find matches.

re.match(): Determines if the RE matches at the beginning of the string. If zero or more characters at the beginning of the string match the regular expression pattern, match() returns a match object; otherwise it returns None.

p.finditer(): Finds all substrings where the RE matches and returns an iterator that yields a match object for each non-overlapping match of the pattern in the string.

re.compile(): Compiles a regular expression pattern into a regular expression object, which can be used for matching using its match(), search(), and other methods. The expression's behavior can be modified by passing flags. Several flags, such as re.IGNORECASE and re.MULTILINE, can be combined using bitwise OR (the | operator).

m.start(): Returns the offset in the string at which the match starts.

m.group(): Returns the text that was matched. The related mo.groups() method returns a tuple containing the text of every captured group, so you may use multiple assignment to store each value in a different variable, as in areaCode, mainNumber = mo.groups().

search(): Comparable to re.match(), but it does not require the match to be at the beginning of the text. The search() function can locate a pattern at any position in the string, but it only returns the first occurrence of the pattern.
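
As a small sketch of combining flags, the example below passes re.IGNORECASE and re.MULTILINE to re.compile(). The log text and variable names are assumptions chosen only for illustration.

```python
import re

# Two flags combined with bitwise OR: ignore case, and let ^ match at the
# start of every line rather than only at the start of the whole string.
line_start_word = re.compile(r"^error", re.IGNORECASE | re.MULTILINE)

log_text = "Error: disk full\nok\nERROR: retry failed"  # hypothetical sample

flag_positions = [m.start() for m in line_start_word.finditer(log_text)]
print(flag_positions)  # offsets of the lines that begin with "error" in any case
```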
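
The difference between matching at the start and searching anywhere can be sketched as follows. The sample sentence is a made-up assumption.

```python
import re

digits = re.compile(r"\d+")
sample = "order 66 shipped, order 77 pending"  # hypothetical sample text

at_start = digits.match(sample)    # None: the string starts with "order"
first_hit = digits.search(sample)  # first run of digits anywhere in the string

print(at_start)
print(first_hit.start(), first_hit.group())
```

Note that search() stops at the first run of digits. The second number is only found by methods such as finditer().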

Algorithm

  • Use import re to import the regular expression module.

  • Use the re.compile() function to create a regular expression object. (Remember to use a raw string.)

  • Pass the string to be searched to the finditer() method of the Regex object. This will return an iterator that yields one Match object per match.

  • Calling the group() method of the Match object returns the actual matched text string.

  • We can also use the span() method to get the starting and ending indexes in a tuple.
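
The steps above can be sketched as follows, using span() to collect the start and end index of every match. The pattern and the sample sentence are assumptions picked for illustration.

```python
import re

word_pattern = re.compile(r"[a-z]+")  # compile a raw-string pattern
sentence = "one two  three"           # hypothetical sample text

found = []
for m in word_pattern.finditer(sentence):  # one Match object per match
    start, end = m.span()                  # (start, end) as a tuple
    # slicing the original string with the span gives back the matched text
    found.append((m.group(), start, end, sentence[start:end]))

for item in found:
    print(item)
```

The end index is exclusive, which is why sentence[start:end] reproduces exactly what group() returns.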

Example

# import the re module
import re
# compile the pattern [A-Z0-9] and store it in the variable p
p = re.compile(r"[A-Z0-9]")
# loop over every match m yielded by p.finditer()
for m in p.finditer('A5B6C7D8'):
    # print the start position and the matched text
    print(m.start(), m.group())

Output

This will produce the output −

0 A
1 5
2 B
3 6
4 C
5 7
6 D
7 8

Code explanation

Use import re to import the regular expression module. Use the re.compile() function to create a regular expression object from the pattern "[A-Z0-9]" and assign it to the variable p. Pass the string you want to search to the finditer() method of the regular expression object, and use a for loop to iterate over the result, which yields one Match object m per match. Call m.start() to get the position at which each match begins and m.group() to get the text that actually matched.

Example

# Python program to illustrate
# Matching regex objects
# with groups
import re
phoneNumRegex = re.compile(r'(\d\d\d)-(\d\d\d-\d\d\d\d)')
mo = phoneNumRegex.search('My number is 415-555-4242.')
print(mo.groups())

Output

This will produce the output −

('415', '555-4242')

Code explanation

Use import re to import the regular expression module. Use the re.compile() function to create a regular expression object from the pattern r'(\d\d\d)-(\d\d\d-\d\d\d\d)' and assign it to the variable phoneNumRegex. Pass the string to be searched to the search() method of the Regex object and store the result in the variable mo. This will be a Match object. Call the Match object's mo.groups() method to get a tuple containing the text captured by each parenthesized group.

Conclusion

The search(), match() and finditer() methods provided by the Python re module allow us to match regular expression patterns, and if the match is successful, it will provide a Match object instance. Use this Match object's start(), end(), and span() methods to obtain detailed information about the matched string.
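
Because search() and match() return None when nothing matches, it is usually worth checking the result before calling start(), end(), or span(). The sketch below assumes a hypothetical helper named locate() that is not part of the re module.

```python
import re

def locate(pattern, text):
    """Hypothetical helper: (start, end) of the first match, or None."""
    m = re.search(pattern, text)
    if m is None:          # no match, so there is no Match object to query
        return None
    return m.start(), m.end()   # the same pair that m.span() returns

hit = locate(r"\d{4}", "released in 2023")
miss = locate(r"\d{4}", "no year here")
print(hit)
print(miss)
```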

When there are many matches, you may run the risk of memory overload if you use findall() to load them all. You can get an iterator object of all potential matches by using the finditer() method, which will improve efficiency.

This means that finditer() provides an iterator that produces Match objects one at a time as you loop over it, instead of building the complete list of results in memory up front.
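
A minimal sketch of the difference between findall() and finditer(): the input text is an artificial assumption, and itertools.islice is used here only to take the first few matches without consuming the rest.

```python
import re
from itertools import islice

vowel = re.compile(r"[aeiou]")
long_text = "regular expression " * 1000  # hypothetical large input

all_hits = vowel.findall(long_text)    # a list holding every matched string
lazy_hits = vowel.finditer(long_text)  # an iterator, matches produced on demand

# Take only the first three matches, with their positions.
first_three = [(m.start(), m.group()) for m in islice(lazy_hits, 3)]

print(type(all_hits).__name__, len(all_hits))
print(first_three)
```

findall() also returns only the matched strings, so finditer() is the one to use when the positions are needed.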

Statement
This article is reproduced from tutorialspoint. If there is any infringement, please contact admin@php.cn to have it removed.