Home  >  Article  >  Backend Development  >  Use phpQuery to parse html code like jquery, phpqueryjquery_PHP tutorial

Use phpQuery to parse html code like jquery, phpqueryjquery_PHP tutorial

WBOY
WBOYOriginal
2016-07-13 10:10:121020browse

Use phpQuery to parse html code like jquery, phpqueryjquery

Introduction

How to easily parse html code in php is probably a problem that every phper will encounter. Using phpQuery, you can make php process html code as conveniently as jQuery.

Project address: https://code.google.com/p/phpquery/

github address: https://github.com/TobiaszCudnik/phpquery

DEMO

Download library files: https://code.google.com/p/phpquery/downloads/list

I downloaded the onefile version: phpQuery-0.9.5.386-onefile.zip

Official demo: https://code.google.com/p/phpquery/source/browse/branches/dev/demo.php

Then reference it in the project.

html file test.html:

<span><</span><span>div </span><span>class</span><span>="thumb"</span><span> id</span><span>="Thumb-13164-3640"</span><span> style</span><span>="position: absolute; left: 0px; top: 0px;"</span><span>></span>
    <span><</span><span>a </span><span>href</span><span>="/Spiderman-City-Drive"</span><span>></span>
        <span><</span><span>img </span><span>src</span><span>="/thumb/12/Spiderman-City-Drive.jpg"</span><span> alt</span><span>=""</span><span>></span>
        <span><</span><span>span </span><span>class</span><span>="GameName"</span><span> id</span><span>="GameName-13164-3640"</span><span> style</span><span>="display: none;"</span><span>></span>Spiderman City Drive<span></</span><span>span</span><span>></span>
        <span><</span><span>span </span><span>class</span><span>="GameRating"</span><span> id</span><span>="GameRating-13164-3640"</span><span> style</span><span>="display: none;"</span><span>></span>
            <span><</span><span>span </span><span>style</span><span>="width: 68.14816px;"</span><span>></</span><span>span</span><span>></span>
        <span></</span><span>span</span><span>></span>
    <span></</span><span>a</span><span>></span>
<span></</span><span>div</span><span>></span>
<span><</span><span>div </span><span>class</span><span>="thumb"</span><span> id</span><span>="Thumb-13169-5946"</span><span> style</span><span>="position: absolute; left: 190px; top: 0px;"</span><span>></span>
    <span><</span><span>a </span><span>href</span><span>="/Spiderman-City-Raid"</span><span>></span>
        <span><</span><span>img </span><span>src</span><span>="/thumb/12/Spiderman-City-Raid.jpg"</span><span> alt</span><span>=""</span><span>></span>
        <span><</span><span>span </span><span>class</span><span>="GameName"</span><span> id</span><span>="GameName-13169-5946"</span><span> style</span><span>="display: none;"</span><span>></span>Spiderman - City Raid<span></</span><span>span</span><span>></span>
        <span><</span><span>span </span><span>class</span><span>="GameRating"</span><span> id</span><span>="GameRating-13169-5946"</span><span> style</span><span>="display: none;"</span><span>></span>
            <span><</span><span>span </span><span>style</span><span>="width: 67.01152px;"</span><span>></</span><span>span</span><span>></span>
        <span></</span><span>span</span><span>></span>
    <span></</span><span>a</span><span>></span>
<span></</span><span>div</span><span>></span>

php processing:

<?<span>php
    </span><span>include</span>('phpQuery-onefile.php'<span>);
    
    </span><span>$filePath</span> = 'test.html'<span>;
    </span><span>$fileContent</span> = <span>file_get_contents</span>(<span>$filePath</span><span>);
    </span><span>$doc</span> = phpQuery::newDocumentHTML(<span>$fileContent</span><span>);
    phpQuery</span>::selectDocument(<span>$doc</span><span>);
    </span><span>$data</span> = <span>array</span><span>(
        </span>'name' => <span>array</span>(),
        'href' => <span>array</span>(),
        'img' => <span>array</span><span>()
    );
    </span><span>foreach</span> (pq('a') <span>as</span> <span>$t</span><span>) {
        </span><span>$href</span> = <span>$t</span> -> getAttribute('href'<span>);
        </span><span>$data</span>['href'][] = <span>$href</span><span>;
    }
    </span><span>foreach</span> (pq('img') <span>as</span> <span>$img</span><span>) {
        </span><span>$data</span>['img'][] = <span>$domain</span> . <span>$img</span> -> getAttribute('src'<span>);
    }
    </span><span>foreach</span> (pq('.GameName') <span>as</span> <span>$name</span><span>) {
        </span><span>$data</span>['name'][] = <span>$name</span> -><span> nodeValue;
    }
    </span><span>var_dump</span>(<span>$data</span><span>);
</span>?>

The above code includes fetching attributes and innerText content (fetching through nodeValue).

Output:

<span>array</span> (size=3<span>)
  </span>'name' => 
    <span>array</span> (size=2<span>)
      </span>0 => <span>string</span> 'Spiderman City Drive' (length=20<span>)
      </span>1 => <span>string</span> 'Spiderman - City Raid' (length=21<span>)
  </span>'href' => 
    <span>array</span> (size=2<span>)
      </span>0 => <span>string</span> 'http://www.gahe.com/Spiderman-City-Drive' (length=40<span>)
      </span>1 => <span>string</span> 'http://www.gahe.com/Spiderman-City-Raid' (length=39<span>)
  </span>'img' => 
    <span>array</span> (size=2<span>)
      </span>0 => <span>string</span> 'http://www.gahe.com/thumb/12/Spiderman-City-Drive.jpg' (length=53<span>)
      </span>1 => <span>string</span> 'http://www.gahe.com/thumb/12/Spiderman-City-Raid.jpg' (length=52)

The most powerful one is the pq selector. Its syntax is similar to jQuery, which is very convenient.

www.bkjia.comtruehttp: //www.bkjia.com/PHPjc/938095.htmlTechArticleUse phpQuery to parse html code like jquery. phpqueryjquery introduces how to parse html code easily in php. It is estimated that every A problem that every PHPer will encounter. Use phpQuery to make php...
Statement:
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn