Home >php教程 >php手册 >PHP获取页面中指定内容的类

PHP获取页面中指定内容的类

WBOY
WBOYOriginal
2016-06-07 11:34:41919browse

本节内容:
一个获取页面中指定内容的php类。
实现的功能:
1,获取内容中的url,email,image。
2,替换内容中的url,email,image。

形如这样的内容:
url:xxx
email:admin#admincom
image:PHP获取页面中指定内容的类
该类都可以获取到。
1,类文件 Grep.class.php
代码示例:
/** grep class
* Date: 2013-06-15
* Author: fdipzone
* Ver: 1.0
* Edit: 黑帽联盟www.heimaolianmeng.com
* Func:
*
* set: 设置内容
* get: 返回指定的内容
* replace: 返回替换后的内容
* get_pattern 根据type返回pattern
*/

class Grep{ // class start
private $_pattern = array(
'url' => '/ 'email' => '/([\w\-\.]+@[\w\-\.]+(\.\w+))/',
'image' => '/PHP获取页面中指定内容的类/i'
);

private $_content = ''; // 源内容

/* 設置搜尋的內容
* @param String $content
*/
public function set($content=''){
$this->_content = $content;
}

/* 获取指定内容
* @param String $type
* @param int $unique 0:all 1:unique
* @return Array
*/
public function get($type='', $unique=0){
$type = strtolower($type);

if($this->_content=='' || !in_array($type, array_keys($this->_pattern))){
return array();
}

$pattern = $this->get_pattern($type); // 获取pattern
preg_match_all($pattern, $this->_content, $matches);
return isset($matches[1])? ( $unique==0? $matches[1] : array_unique($matches[1]) ) : array();
}

/* 获取替换后的内容
* @param String $type
* @param String $callback
* @return String
*/
public function replace($type='', $callback=''){
$type = strtolower($type);
if($this->_content=='' || !in_array($type, array_keys($this->_pattern)) || $callback==''){
return $this->_content;
}
$pattern = $this->get_pattern($type);
return preg_replace_callback($pattern, $callback, $this->_content);
}

/* 根据type获取pattern
* @param String $type
* @return String
*/
private function get_pattern($type){
return $this->_pattern[$type];
}

} // class end
?>


2,测试示例 demo.php
代码示例:
/**
* 获取页面中的指定内容
* by 黑帽联盟www.heimaolianmeng.com
*/
header('content-type:text/htm;charset=utf8');

require('Grep.class.php');
$content = file_get_contents('http://www.heimaolianmeng.com/sitemap/');

$obj = new Grep();
$obj->set($content);

$url = $obj->get('url', 0);
$email = $obj->get('email', 1);
$image = $obj->get('image', 1);

print_r($url);
print_r($email);
print_r($image);

$url_new = $obj->replace('url', 'replace_url');
echo $url_new;

function replace_url($matches){
return isset($matches[1])? ''.$matches[1].'' : '';
}
?>

AD:真正免费,域名+虚机+企业邮箱=0元

Statement:
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn