PHP Regular Expression: How to match all image links in HTML

Jun 23, 2023 am 11:17 AM

When working with HTML pages, we often need to extract image links, for example to reuse them elsewhere, download the images, or process them in batches. PHP regular expressions can help us match all image links quickly and accurately.

1. Analysis of image links in HTML

In HTML, image links usually appear in <img> tags, in the following format:

<img src="image.jpg" alt="Picture">

The src attribute specifies the link address of the image. Image links generally take one of the following forms:

  1. Root-relative link: /images/picture.jpg
  2. Absolute link: https://www.example.com/images/picture.jpg
  3. Link with parameters: https://www.example.com/images/picture.jpg?size=large
  4. Path-relative link: ../images/picture.jpg

We need to write regular expressions to match these four link formats.

2. Matching image links with PHP regular expressions

PHP has several regular expression functions. preg_match() is the most commonly used and returns the first match of a pattern in a text; to collect every match, use preg_match_all(). The following regular expression matches all four image link formats above:

$pattern = '/<img.+?src=[\'"](.+?)[\'"].*?>/';

This regular expression consists of multiple parts. Let’s explain them one by one:

  1. <img.+?src= matches the start of the <img> tag and everything up to the src attribute. Here .+? is a non-greedy match of any characters, stopping as soon as src= is encountered.
  2. ['"] matches a quotation mark, either double or single.
  3. (.+?) matches any characters up to the next quotation mark. It is a capturing group, so the matched value is available through the $matches array in later code.
  4. .*?> is a non-greedy match of any characters up to the closing > symbol.
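
To check the parts described above, here is a minimal sketch that runs the pattern against a small hand-written HTML string. The sample markup and the variable name $sampleHtml are illustrative assumptions, not taken from a real page.

```php
<?php
// Hypothetical sample markup covering the four link formats listed earlier.
$sampleHtml = <<<'HTML'
<p><img src="/images/picture.jpg" alt="a"></p>
<p><img class="hero" src='https://www.example.com/images/picture.jpg'></p>
<p><img src="https://www.example.com/images/picture.jpg?size=large" alt="c"></p>
<p><img src="../images/picture.jpg" alt="d"></p>
HTML;

// The quotes inside the character class are escaped because the
// PHP string itself is single-quoted.
$pattern = '/<img.+?src=[\'"](.+?)[\'"].*?>/';

// preg_match_all() returns the number of full matches it found.
$count = preg_match_all($pattern, $sampleHtml, $matches);

// $matches[0] holds the full <img ...> tags, $matches[1] the captured src values.
print_r($matches[1]);
```

Note that .+?src= also matches attribute names that merely end in src, such as data-src, so markup that uses lazy-loading attributes may need a stricter pattern.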

Next, we use the preg_match_all() function to extract all image links in the HTML:

$html = file_get_contents('example.html'); // Read the HTML file
preg_match_all($pattern, $html, $matches); // Match the links
$imgUrls = $matches[1]; // Get the matched link addresses
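
The three lines above assume that the file can be read and that the pattern compiles. A slightly more defensive sketch might look like the following; the helper name extractImageUrls() is hypothetical and introduced only for illustration.

```php
<?php
/**
 * Hypothetical helper: returns every src value found in <img> tags.
 */
function extractImageUrls(string $html): array
{
    $pattern = '/<img.+?src=[\'"](.+?)[\'"].*?>/';

    // preg_match_all() returns false on a regex error; treat that as "no links".
    if (preg_match_all($pattern, $html, $matches) === false) {
        return [];
    }

    // Index 1 holds the contents of the first capturing group.
    return $matches[1];
}

// Only read the file if it exists and is readable; file_get_contents()
// returns false on failure, which is handled as an empty result here.
$html = is_readable('example.html') ? file_get_contents('example.html') : false;
$imgUrls = ($html === false) ? [] : extractImageUrls($html);
```

Returning an empty array on failure keeps the calling code simple; whether a missing file should instead raise an error depends on the application.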

This gives us an array, $imgUrls, containing all image links. To match only links in a particular format, modify the regular expression accordingly. For example, to match only absolute links:

$pattern = '/<img.+?src=[\'"](https?:\/\/.+?)[\'"].*?>/';

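This regular expression adds the requirement that the link begin with the http or https protocol prefix, so it matches only absolute links that use one of these two protocols. Because / is the pattern delimiter, the two slashes after the colon are escaped as \/\/.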
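
As a minimal sketch of the absolute-only variant, the following runs it on a hand-written sample in which each tag sits on its own line. The sample markup and variable names are assumptions made for illustration.

```php
<?php
// Hypothetical sample: two relative links and two absolute links.
$sampleHtml = <<<'HTML'
<img src="/images/a.jpg" alt="a">
<img src="https://www.example.com/images/b.jpg" alt="b">
<img src="http://www.example.com/c.jpg?size=large" alt="c">
<img src="../images/d.jpg" alt="d">
HTML;

// The slashes after "https?:" are escaped because "/" is the delimiter.
$absolutePattern = '/<img.+?src=[\'"](https?:\/\/.+?)[\'"].*?>/';

preg_match_all($absolutePattern, $sampleHtml, $matches);

// Only the links starting with http:// or https:// are captured.
$absoluteUrls = $matches[1];
print_r($absoluteUrls);
```

Since . does not match a newline by default, each match here stays within one line. If several <img> tags share a line, .+? can run past a relative src to a later absolute one, so the full match in $matches[0] may span more than one tag.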

Summary

Matching image links in HTML with PHP regular expressions is not complicated. Write a regular expression that fits the link format, then use the preg_match_all() function to extract all the links quickly and accurately. If you often need to extract other content from HTML, a similar approach works.
