search
Homephp教程php手册PHP解码unicode编码中文字符代码示例,

PHP解码unicode编码中文字符代码示例,

Jun 13, 2016 am 09:27 AM
phpunicodeChinesecodecharacterExamplecodingdecoding

PHP解码unicode编码中文字符代码示例,

在抓取某网站数据,结果在数据包中发现了一串编码的数据:"......\u65b0\u6d6a\u5fae\u535a......", 这其实是中文被unicode编码后了的数据,想解码出中文来。

解决方案:

方案A(稳定版+推荐):

<span>function</span> replace_unicode_escape_sequence(<span>$match</span><span>) {
</span><span>return</span> mb_convert_encoding(<span>pack</span>('H*', <span>$match</span>[1]), 'UTF-8', 'UCS-2BE'<span>);
}
</span><span>$name</span> = '\u65b0\u6d6a\u5fae\u535a'<span>;
</span><span>$str</span> = <span>preg_replace_callback</span>('/\\\\u([0-9a-f]{4})/i', 'replace_unicode_escape_sequence', <span>$name</span><span>);
</span><span>echo</span> <span>$str</span>; <span>//</span><span>输出: 新浪微博
// www.jbxue.com 脚本学堂
//咱将上述方案A给封装起来~~~(方案A稳定版+升级+推荐)</span>
<span>class</span><span> Helper_Tool
{
</span><span>static</span> <span>function</span> unicodeDecode(<span>$data</span><span>)
{ 
</span><span>function</span> replace_unicode_escape_sequence(<span>$match</span><span>) {
</span><span>return</span> mb_convert_encoding(<span>pack</span>('H*', <span>$match</span>[1]), 'UTF-8', 'UCS-2BE'<span>);
} 

</span><span>$rs</span> = <span>preg_replace_callback</span>('/\\\\u([0-9a-f]{4})/i', 'replace_unicode_escape_sequence', <span>$data</span><span>);

</span><span>return</span> <span>$rs</span><span>;
} 
}
</span><span>//</span><span>调用</span>
<span>$name</span> = '\u65b0\u6d6a\u5fae\u535a'<span>;
</span><span>$data</span> = Helper_Tool::unicodeDecode(<span>$name</span>); <span>//</span><span>输出新浪微博</span>

小贴士:多翻翻国外的php教程,很有帮助哦。


方案B(次推荐):

<?<span>php
</span><span>function</span> unicodeDecode(<span>$name</span><span>){
</span><span>$json</span> = '{"str":"'.<span>$name</span>.'"}'<span>;
</span><span>$arr</span> = json_decode(<span>$json</span>,<span>true</span><span>);
</span><span>if</span>(<span>empty</span>(<span>$arr</span>)) <span>return</span> ''<span>; 
</span><span>return</span> <span>$arr</span>['str'<span>];
} // www.jbxue.com
</span><span>$name</span> = '\u65b0\u6d6a\u5fae\u535a'<span>;
</span><span>echo</span> unicodeDecode(<span>$name</span>); <span>//</span><span>输出: 新浪微博 </span>

对于方案B, 注意事项, 在好友 XAR (猛戳XAR博客) 的技术支持下,总结出要处理的字符串(即传递给函数unicodeDecode的参数$name的内容中一定不能包含单引号,否则就会导致解析失败, 所以有必要的话可以借助 str_replace()函数将非法字符格式化为合格字符)

Statement
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

Video Face Swap

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Tools

WebStorm Mac version

WebStorm Mac version

Useful JavaScript development tools

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

SAP NetWeaver Server Adapter for Eclipse

SAP NetWeaver Server Adapter for Eclipse

Integrate Eclipse with SAP NetWeaver application server.

MantisBT

MantisBT

Mantis is an easy-to-deploy web-based defect tracking tool designed to aid in product defect tracking. It requires PHP, MySQL and a web server. Check out our demo and hosting services.