


How to match @usernames outside of the [url] tag without using assertions?
Clever matching: accurately extract @usernames outside [url] tags without assertions
In text processing, it is often necessary to extract strings that follow a specific pattern. For example, from text that contains both usernames and [url] tags, extract only the @usernames that are not inside a [url]...[/url] tag. This article provides a solution that does not use regular expression assertions.
Assume the text is as follows:
<code>[url=/space/4]@张三[/url] [url=/space/5]@李四[/url] @张三@张三[url=/space/6]@王五[/url] [url=/space/7]@赵六[/url] [url=/space/8]@wolegequ[/url]@sweet @haha</code>
The goal is to extract @张三, @sweet, and @haha, the usernames that appear outside the [url] tags.
A conventional solution combines regular expressions with lookaround assertions. This article takes a different approach that avoids assertions:
Step 1: Roughly match all @usernames
First, use a simple regular expression to match every username that follows an @ symbol:
import re

text = '[url=/space/4]@张三[/url] [url=/space/5]@李四[/url] @张三@张三[url=/space/6]@王五[/url] [url=/space/7]@赵六[/url] [url=/space/8]@wolegequ[/url]@sweet @haha'

# After the @ symbol, match until @, [, ] or whitespace is encountered
matches = re.findall(r'@([^@\[\]\s]+)', text)
print(matches)  # Output: ['张三', '李四', '张三', '张三', '王五', '赵六', 'wolegequ', 'sweet', 'haha']
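To see how the character class in this pattern behaves, the sketch below wraps it in a small helper and tries it on a few short inputs. The helper name `find_mentions` is hypothetical (it does not come from the article), and the pattern assumes a username never contains whitespace, `@`, `[` or `]`.

```python
import re

# Hypothetical helper for illustration; the name is not from the article.
# Assumes a username is a run of characters other than @, [, ] and whitespace.
MENTION = re.compile(r'@([^@\[\]\s]+)')

def find_mentions(text):
    """Return every username that follows an @, in order of appearance."""
    return MENTION.findall(text)

print(find_mentions('@sweet @haha'))               # ['sweet', 'haha']
print(find_mentions('@张三@张三'))                  # ['张三', '张三']
print(find_mentions('[url=/space/4]@张三[/url]'))   # ['张三']
```

Because `@` is itself excluded from the class, adjacent mentions such as `@张三@张三` are split into separate matches, and the `[` of a closing `[/url]` ends a username that sits inside a tag.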
Step 2: Filter precisely and remove the usernames inside the tags
Next, the key is to filter out the usernames that sit inside [url]...[/url] tags. This takes two steps:
- Remove the [url] and [/url] tags: first, delete every [url]...[/url] block from the text, including the content between the opening and closing tag.
- Check whether each match still exists: traverse all usernames matched in the first step and determine whether they still exist in the processed text. Only usernames that still appear outside the tags are retained.
# Continues from Step 1: uses re, text and matches
# Remove every [url...]...[/url] block together with its content
temp_text = re.sub(r'\[url[^\]]*\].*?\[/url\]', '', text)

filtered_matches = []
for match in matches:
    # Keep the username only if it still appears in the processed text
    if f"@{match}" in temp_text:
        filtered_matches.append(match)

print(filtered_matches)  # Output: ['张三', '张三', '张三', 'sweet', 'haha']
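The two steps can also be packaged as one function. The sketch below is one possible arrangement rather than the article's exact code: `mentions_outside_url`, `URL_BLOCK` and `MENTION` are hypothetical names, the sample inputs are made up, and the tag pattern assumes [url] tags are never nested.

```python
import re

# Hypothetical names for illustration; not from the article.
URL_BLOCK = re.compile(r'\[url[^\]]*\].*?\[/url\]')  # assumes no nested [url] tags
MENTION = re.compile(r'@([^@\[\]\s]+)')

def mentions_outside_url(text):
    """Two-step filter: rough match, then keep names still present
    once every [url]...[/url] block has been removed."""
    candidates = MENTION.findall(text)   # Step 1: all @usernames
    stripped = URL_BLOCK.sub('', text)   # Step 2a: drop tagged blocks
    # Step 2b: substring check against the stripped text
    return [name for name in candidates if f'@{name}' in stripped]

print(mentions_outside_url('[url=/u/1]@amy[/url] @bob'))
# ['bob']
print(mentions_outside_url('[url=/u/1]@amy[/url] @bob @amy'))
# ['amy', 'bob', 'amy']
```

The second call shows a limit of the substring check: a name that appears both inside and outside a tag is kept once per occurrence, the tagged occurrence included.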
Final result:
@张三 appears three times in the final result, although it occurs only twice outside the tags: the occurrence inside the first [url] tag also passes the check, because the same name still exists in the processed text. If each name is needed only once, add a deduplication step at the end. The method avoids regular expression assertions while still excluding usernames that appear only inside [url] tags.
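For the deduplication step, one option is `dict.fromkeys`, which keeps the first occurrence of each name in order. The sketch below also shows a variation that is not part of the article's method: running the Step 1 pattern on the tag-stripped text instead of the original, so that only occurrences outside the tags are counted. It assumes [url] tags are not nested, and the variable names are illustrative.

```python
import re

text = ('[url=/space/4]@张三[/url] [url=/space/5]@李四[/url] @张三@张三'
        '[url=/space/6]@王五[/url] [url=/space/7]@赵六[/url] '
        '[url=/space/8]@wolegequ[/url]@sweet @haha')

# Assumption: [url] blocks are not nested, so a lazy match up to the
# first [/url] removes each block together with its content.
stripped = re.sub(r'\[url[^\]]*\].*?\[/url\]', '', text)

# Matching on the stripped text only sees usernames outside the tags.
outside = re.findall(r'@([^@\[\]\s]+)', stripped)
print(outside)  # ['张三', '张三', 'sweet', 'haha']

# dict.fromkeys keeps the first occurrence of each name, in order.
unique = list(dict.fromkeys(outside))
print(unique)   # ['张三', 'sweet', 'haha']
```

Neither pattern uses a lookahead or lookbehind; the lazy quantifier `.*?` is enough to stop at the nearest closing tag.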