Home  >  Article  >  Backend Development  >  Unicode encoding conversion uses PHP to convert GB encoding to UTF8

Unicode encoding conversion uses PHP to convert GB encoding to UTF8

WBOY
WBOYOriginal
2016-07-29 08:35:141105browse

The gb2utf8.php file is as follows:

Copy the code The code is as follows:


Class GB2UTF8
{
var $gb; // GB2312 string to be converted
var $utf8; // Conversion The resulting UTF8 string
var $CodeTable; // Array of GB2312 code files used during the conversion process
var $ErrorMsg; // Error message during the conversion process
function GB2UTF8($InStr="")
{
$this ->gb=$InStr;
$this->SetGb2312();
($this->gb=="")?0:$this->Convert();
}
function SetGb2312($ InStr="gb2312.txt")
{ // Set gb2312 code file, the default is gb2312.txt
$this->ErrorMsg="";
$tmp=@file($InStr);
if (!$tmp ) {
$this->ErrorMsg="No GB2312";
return false;
}
$this->CodeTable=array();
while(list($key,$value)=each($tmp) ) {
$this->CodeTable[hexdec(substr($value,0,6))]=substr($value,7,6);
}
}
function Convert()
{ // Convert GB2312 characters To string to a UTF8 string, $gb needs to be set in advance
$this->utf8="";
if(!trim($this->gb) || $this->ErrorMsg!="") {
return ($this->utf8=$this->ErrorMsg);
}
$str=$this->gb;
while($str) {
if (ord(substr($str,0,1 ))>127)
{
$tmp=substr($str,0,2);
$str=substr($str,2,strlen($str));
$tmp=$this->U2UTF8 (hexdec($this->CodeTable[hexdec(bin2hex($tmp))-0x8080]));
for($i=0;$i$this- >utf8.=chr(substr($tmp,$i,3));
}
else
{
$tmp=substr($str,0,1);
$str=substr($str,1, strlen($str));
$this->utf8.=$tmp;
}
}
return $this->utf8;
}
function U2UTF8($InStr)
{
for($i=0 ;$i$str="";
if ($InStr < 0x80) {
$str.=ord($InStr);
}
else if ($InStr < ; 0x800) {
$str.=(0xC0 | $InStr>>6);
$str.=(0x80 | $InStr & 0x3F);
}
else if ($InStr < 0x10000) {
$str .=(0xE0 | $InStr>>12);
$str.=(0x80 | $InStr>>6 & 0x3F);
$str.=(0x80 | $InStr & 0x3F);
}
else if ($InStr < 0x200000) {
$str.=(0xF0 | $InStr>>18);
$str.=(0x80 | $InStr>>12 & 0x3F);
$str.=(0x80 | $InStr>>6 & 0x3F);
$str.=(0x80 | $InStr & 0x3F);
}
return $str;
}
}
?>


The test file is as follows:

Copy the code The code is as follows:


Header("Content-type: image/png");
$im = imagecreate(400,300);
$black = ImageColorAllocate($im, 0,0, 0);
$white = ImageColorAllocate($im, 184,44,6);
include("gb2utf8.php");
$obj=new gb2utf8();
$obj->gb="123abc中国456def The test is correct";
$obj->Convert();
ImageTTFText($im, 20, 0, 5, 50, $white, "SIMKAI.TTF", $obj->utf8);
ImagePNG($im );
ImageDestroy($im);
?>


Note:
You need to set the font file correctly. Please first confirm that you can use font to output English directly (without using gb2utf8).​

The above introduces the unicode encoding conversion using PHP to convert GB encoding to UTF8, including the content of unicode encoding conversion. I hope it will be helpful to friends who are interested in PHP tutorials.

Statement:
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn