Home  >  Article  >  Backend Development  >  php的汉字转换:Unicode(UTF8)至GBK_PHP

php的汉字转换:Unicode(UTF8)至GBK_PHP

WBOY
WBOYOriginal
2016-06-01 12:34:581047browse

P>秋水无恨 GBK Unicode UTF8 汉字 转换

php的汉字转换一直是比较麻烦的事

该类内置了四个过滤"[dec];","[hex];","%u[hex]","utf8转换"
方便用户的使用,同时也可自定义过滤进行自己喜欢的操作


qswhU.php 从这里下载
http://www.blueidea.com/user/qswh/qswhU.zip

class qswhU{
 var $qswhData;
 function qswhU($filename="qswhU.php"){
  $this->qswhData=file($filename);
 }
 
 function decode($str,$pattern=0){
  $arr=array("/(\w+);/iU","/((%\w\w)+)/i","/%u(\w{4,5})/iU");
  if(is_integer($pattern)){
    if($pattern>=count($arr))die("Invalid Function");
    $pattern=$arr[$pattern];
  }
  return preg_replace_callback($pattern,array($this,"u2gb"),$str);
 }
 
 function u2gb($arr){
  /******(qiushuiwuhen 2002-8-15)******/
  $ret="";$str=$arr[1];
  if(preg_match_all("/%\w{2}/",$str,$matches)){
   for($i=0;$i    $chr1=hexdec(substr($matches[0][$i],1));
    $arr=array("f0","e0","c0","0");
    for($j=0;$jhexdec($arr[$j]))break;
    $chr=hexdec(substr($matches[0][$i],1))-hexdec($arr[$j]);
    while(++$j    $str=dechex($chr);
    if(strlen($str)==4){
     $p=hexdec(substr($str,0,2))-0x4d;
     $q=hexdec(substr($str,2))*4;
     $ret.=chr(hexdec(substr($this->qswhData[$p],$q,2)));
     $ret.=chr(hexdec(substr($this->qswhData[$p],$q+2,2)));
    }else
     $ret.=chr(hexdec($str));
   }
  }
  else{
   if(strtolower($str[0])=="x")
    $str=substr($str,1);
   else
    if(strlen($str)!=4)$str=dechex($str);
   
   if(strlen($str)==4){
    $p=hexdec(substr($str,0,2))-0x4d;
    $q=hexdec(substr($str,2))*4;
    $ret.=chr(hexdec(substr($this->qswhData[$p],$q,2)));
    $ret.=chr(hexdec(substr($this->qswhData[$p],$q+2,2)));
   }else
    $ret.=chr(hexdec($str));
  }
  return $ret;
 }
 
 
}


使用范例

$qswh=new qswhU("qswhU.php");//如果文件名是qswhU.php,可省参数
echo "

不带参数(默认过滤为:[num];):";<br>echo "\n".$qswh->decode("中文Abc");<br>echo "\n".$qswh->decode("中文Abc");<br>echo "\n调用内置过滤(UTF转码):".$qswh->decode("%E4%B8%AD%E6%96%87%20!%22%23%24%25%26'()*%2B%2C%2F%3A%3B%3C%3D%3E%3F%40%5B%5D%5E%60%7B%7C%7D~%25Abc",1);<br>echo "\n调用内置过滤unescape(%u[num]):".$qswh->decode("%u4E2D%u6587Abc",2);<br>echo "\n自定义过滤([x+num]):".$qswh->decode("[x4E2D][x6587][x41][x62][x63]","/\[(\w+)\]/"); <p>效果如下:<br>不带参数(默认过滤为:[num];):<br>中文Abc<br>中文Abc<br>调用内置过滤(UTF转码):中文 !"#$%&'()*+,/:;?@[]^`{|}~%Abc<br>调用内置过滤unescape(%u[num]):中文Abc<br>自定义过滤([x+num]):中文Abc<br></p>
Statement:
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn
Previous article:PHP&Java_PHPNext article:PHP作WAP开发时遇到的问题_PHP