Home >php教程 >php手册 >php过滤HTML标签、属性等正则表达式汇总

php过滤HTML标签、属性等正则表达式汇总

WBOY
WBOYOriginal
2016-06-06 20:19:011609browse

这篇文章主要介绍了php过滤HTML标签、属性等正则表达式汇总,本文使用代码实例给出了过滤HTML内容的正则表达式,具体说明请参阅代码中的注释,本文对使用PHP做采集

$str=preg_replace("/\s+/", " ", $str); //过滤多余回车 $str=preg_replace("//si","",$str); //注释 $str=preg_replace("//si","",$str); //过滤DOCTYPE $str=preg_replace("//si","",$str); //过滤html标签 $str=preg_replace("//si","",$str); //过滤head标签 $str=preg_replace("//si","",$str); //过滤meta标签 $str=preg_replace("//si","",$str); //过滤body标签 $str=preg_replace("//si","",$str); //过滤link标签 $str=preg_replace("//si","",$str); //过滤form标签 $str=preg_replace("/cookie/si","COOKIE",$str); //过滤COOKIE标签   $str=preg_replace("/(.*?)/si","",$str); //过滤applet标签 $str=preg_replace("//si","",$str); //过滤applet标签   $str=preg_replace("/(.*?)/si","",$str); //过滤style标签 $str=preg_replace("//si","",$str); //过滤style标签   $str=preg_replace("/(.*?)/si","",$str); //过滤title标签 $str=preg_replace("//si","",$str); //过滤title标签   $str=preg_replace("/(.*?)/si","",$str); //过滤object标签 $str=preg_replace("//si","",$str); //过滤object标签   $str=preg_replace("/(.*?)/si","",$str); //过滤noframes标签 $str=preg_replace("//si","",$str); //过滤noframes标签   $str=preg_replace("/(.*?)/si","",$str); //过滤frame标签 $str=preg_replace("//si","",$str); //过滤frame标签   $str=preg_replace("/(.*?)/si","",$str); //过滤script标签 $str=preg_replace("//si","",$str); //过滤script标签 $str=preg_replace("/javascript/si","Javascript",$str); //过滤script标签 $str=preg_replace("/vbscript/si","Vbscript",$str); //过滤script标签 $str=preg_replace("/on([a-z]+)\s*=/si","On\\1=",$str); //过滤script标签 $str=preg_replace("//si","&#",$str); //过滤script标签,如javAsCript:alert(

清除空格,换行

function DeleteHtml($str) { $str = trim($str); $str = strip_tags($str,""); $str = ereg_replace("\t","",$str); $str = ereg_replace("\r\n","",$str); $str = ereg_replace("\r","",$str); $str = ereg_replace("\n","",$str); $str = ereg_replace(" "," ",$str); return trim($str); }

过滤HTML属性

1,过滤所有html标签的正则表达式:

复制代码 代码如下:

 
?[^>]+>
 
//过滤所有html标签的属性的正则表达式:
 
$html = preg_replace("/]*>/","",$html);


3,过滤部分html标签的正则表达式的排除式(比如排除

,即不过滤

):

复制代码 代码如下:


?[^pP/>]+>


4,过滤部分html标签的正则表达式的枚举式(比如需要过滤

等):

复制代码 代码如下:


?[aApPbB][^>]*>


5,过滤部分html标签的属性的正则表达式的排除式(比如排除alt属性,即不过滤alt属性):

复制代码 代码如下:


\s(?!alt)[a-zA-Z]+=[^\s]*


6,,过滤部分html标签的属性的正则表达式的枚举式(比如alt属性):

复制代码 代码如下:


(\s)alt=[^\s]*

Statement:
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn