PHP Regular Expression: How to match all image links in HTML
In HTML pages, we often need to extract image links for use on other occasions, or do some image downloading, batch processing, etc. At this time, PHP regular expressions can help us quickly and accurately match all image links.
1. Analysis of image links in HTML
In HTML, image links usually appear in the form of tags, and their format is as follows:
<img src="/static/imghwm/default1.png" data-src="image.jpg" class="lazy" alt="图片">
Among them, The src attribute specifies the link address of the image. Generally, the formats of image links are as follows:
- Relative link: /images/picture.jpg
- Absolute link: https://www.example.com/ images/picture.jpg
- Link with parameters: https://www.example.com/images/picture.jpg?size=large
- Relative path link: ../images/picture .jpg
We need to write regular expressions to match these four link formats.
2. PHP regular expression matching image link
There are many kinds of regular expression functions in PHP, among which preg_match() is the most commonly used one and can be used to match from text The specified string. The following is a regular expression that can match the above four image link formats:
$pattern = '/<img .+?src=['"](.+?)['"].*? alt="PHP Regular Expression: How to match all image links in HTML" >/';
This regular expression consists of multiple parts. Let’s explain them one by one:
- < ;img. ?src= matches the
tag and is positioned before the src attribute. Among them, . ? means non-greedy matching of any character until src is encountered.
- ['"] means quotation marks, which can match double quotation marks or single quotation marks.
- (. ?) means matching any character until the next quotation mark is encountered. A capturing group is used here, which can Use the $matches array call in subsequent code.
- .*? means non-greedy matching of any character until the > symbol.
Next, we use the preg_match() function to Extract all image links in HTML:
$html = file_get_contents('example.html'); // 读取 HTML 文件 preg_match_all($pattern, $html, $matches); // 匹配链接 $imgUrls = $matches[1]; // 获取匹配到的链接地址
In this way, we can get an array $imgUrls containing all image links. If you want to only match image links in a certain format, you can do it in a regular expression Some modifications, such as matching only absolute links:
$pattern = '/<img .+?src=['"](https?://.+?)['"].*? alt="PHP Regular Expression: How to match all image links in HTML" >/';
This regular expression increases the restriction of http or https protocol headers and only matches absolute links starting with these two protocols.
Summary
Using PHP regular expressions to match image links in HTML is not a complicated matter. You only need to write the corresponding regular expression according to the link format, and then use the preg_match() function to quickly and accurately extract the All links. If you often need to extract other content from HTML, you can also achieve it through a similar method.
The above is the detailed content of PHP Regular Expression: How to match all image links in HTML. For more information, please follow other related articles on the PHP Chinese website!

The article explains how to create, implement, and use interfaces in PHP, focusing on their benefits for code organization and maintainability.

The article discusses the differences between crypt() and password_hash() in PHP for password hashing, focusing on their implementation, security, and suitability for modern web applications.

Article discusses preventing Cross-Site Scripting (XSS) in PHP through input validation, output encoding, and using tools like OWASP ESAPI and HTML Purifier.

Autoloading in PHP automatically loads class files when needed, improving performance by reducing memory use and enhancing code organization. Best practices include using PSR-4 and organizing code effectively.

PHP streams unify handling of resources like files, network sockets, and compression formats via a consistent API, abstracting complexity and enhancing code flexibility and efficiency.

The article discusses managing file upload sizes in PHP, focusing on the default limit of 2MB and how to increase it by modifying php.ini settings.

The article discusses nullable types in PHP, introduced in PHP 7.1, allowing variables or parameters to be either a specified type or null. It highlights benefits like improved readability, type safety, and explicit intent, and explains how to declar

The article discusses the differences between unset() and unlink() functions in programming, focusing on their purposes and use cases. Unset() removes variables from memory, while unlink() deletes files from the filesystem. Both are crucial for effec


Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

Video Face Swap
Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Hot Tools

MantisBT
Mantis is an easy-to-deploy web-based defect tracking tool designed to aid in product defect tracking. It requires PHP, MySQL and a web server. Check out our demo and hosting services.

EditPlus Chinese cracked version
Small size, syntax highlighting, does not support code prompt function

SublimeText3 English version
Recommended: Win version, supports code prompts!

SublimeText3 Linux new version
SublimeText3 Linux latest version

Notepad++7.3.1
Easy-to-use and free code editor
