
Fixing garbled Chinese characters when truncating strings with PHP substr

WBOY
Original
2016-07-13 10:43:14

PHP provides several functions for extracting part of a string, including substr, mb_substr, and mb_strcut. PHP beginners often use substr to truncate Chinese text and then find that the result is garbled. When that happens, mb_substr solves the problem.

The description on my article page used substr to take the first 220 characters, but the last Chinese character was always garbled and the truncated length was wrong.

A Google search turned up the cause: substr(string, start, length) counts bytes rather than characters, so it can cut a multibyte Chinese character in the middle and leave a garbled partial character at the end.
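The byte-versus-character mismatch is easy to see in a small sketch. It assumes the source file is saved as UTF-8 (so each Chinese character below occupies three bytes) and that the mbstring extension is loaded; the sample string and variable names are illustrative only.

```php
<?php
// Assumes this file is saved as UTF-8 and the mbstring extension is loaded.
$text = '汉字abc';                         // illustrative sample string

$byteLength = strlen($text);               // counts bytes: 3 + 3 + 3 = 9
$charLength = mb_strlen($text, 'UTF-8');   // counts characters: 5

// substr() cuts after 4 bytes, i.e. in the middle of the second character.
$broken  = substr($text, 0, 4);
$isValid = mb_check_encoding($broken, 'UTF-8');   // false: trailing partial character

// mb_substr() cuts after 2 whole characters.
$clean = mb_substr($text, 0, 2, 'UTF-8');         // '汉字'

echo $byteLength, ' bytes, ', $charLength, " characters\n";
echo 'substr result is valid UTF-8? ', var_export($isValid, true), "\n";
echo 'mb_substr result: ', $clean, "\n";
```

The stray byte left by substr is what browsers render as a garbled character.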

Solution:

Use the mb_substr function from PHP's mbstring extension.

Notes

1. Make sure the php_mbstring.dll file is present in Windows/system32. If it is not, copy it there from the extensions directory of your PHP installation.
2. Open php.ini in the Windows directory, search for mbstring.dll, and find the line
;extension=php_mbstring.dll
Remove the leading ; so that the mb_substr function becomes available.
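After editing php.ini, a quick check like the following can confirm whether the extension is actually active. This is only a sketch; it uses the standard extension_loaded(), function_exists(), and php_ini_loaded_file() functions, and the variable names are illustrative.

```php
<?php
// Quick check that the mbstring extension is active after editing php.ini.
$hasMbstring = extension_loaded('mbstring');
$hasMbSubstr = function_exists('mb_substr');

echo 'mbstring loaded: ', $hasMbstring ? 'yes' : 'no', "\n";
echo 'mb_substr available: ', $hasMbSubstr ? 'yes' : 'no', "\n";

// Shows which php.ini PHP actually read, in case the wrong file was edited.
echo 'php.ini in use: ', php_ini_loaded_file() ?: '(none)', "\n";
```

If the script is run through a web server, the server usually has to be restarted before a php.ini change takes effect.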


Function signature:

string mb_substr ( string str, int start [, int length [, string encoding]] )

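Note: when calling mb_substr() or mb_strcut(), pass the encoding of the string as the last argument. If it is omitted, the functions fall back to the internal encoding (see mb_internal_encoding()), which may not match your data.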

For example:

echo mb_substr('原本会出现乱码的汉字!', 0, 7, 'utf-8');
// Output: 原本会出现乱码 (the first 7 characters of "Chinese characters that would otherwise be garbled!")

Another example:
$description = mb_substr(strip_tags($post->post_content), 0, 220, 'utf-8');

mb_strcut function

The mb_strcut function also truncates a string, but its start and length arguments are measured in bytes rather than characters. The following example shows the difference:

<?php
// "This way my string will not be garbled ^_^"
$str = '这样一来我的字符串就不会有乱码^_^';

// Length 7 means 7 characters.
echo "mb_substr:" . mb_substr($str, 0, 7, 'utf-8');
// Result: 这样一来我的字
echo "\n";

// Length 6 means 6 bytes, i.e. two 3-byte UTF-8 characters.
echo "mb_strcut:" . mb_strcut($str, 0, 6, 'utf-8');
// Result: 这样
?>

As the example shows, mb_substr counts in characters while mb_strcut counts in bytes, but neither returns half a character.
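What happens when the requested byte length falls inside a character can be sketched as follows. The example assumes UTF-8 source text, where each of these Chinese characters takes three bytes; the sample string and variable names are illustrative.

```php
<?php
// Assumes UTF-8 source; each Chinese character below takes 3 bytes.
$str = '这样一来';

$cut6 = mb_strcut($str, 0, 6, 'UTF-8');   // exactly two characters
$cut7 = mb_strcut($str, 0, 7, 'UTF-8');   // the 7th byte would split the third character
$cut8 = mb_strcut($str, 0, 8, 'UTF-8');   // still not enough room for the third character

foreach ([6 => $cut6, 7 => $cut7, 8 => $cut8] as $bytes => $piece) {
    echo $bytes, ' bytes requested -> ', $piece, ' (', strlen($piece), " bytes returned)\n";
}
```

In each case mb_strcut backs off to the last complete character, so the result can be shorter than the requested length. That makes it a reasonable choice when the limit is a byte budget (for example a database column size) rather than a visible character count.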

Chinese version of the substr() function

The ordinary substr() function returns a substring of a given byte length, but with Chinese text it may leave a garbled partial character at the end of the new string. The following function truncates a string longer than $len bytes, appends "..." to it, and avoids the garbled trailing character. It assumes a double-byte encoding such as GB2312 or GBK; it does not handle UTF-8, where Chinese characters take three bytes.
Usage: $new = getsubstring($old, 20);

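function getsubstring($str, $len)
{
    $result_str = '';  // initialize to avoid an undefined-variable notice
    for ($i = 0; $i < $len; $i++) {
        // A byte above 0xa1 is treated as the first byte of a double-byte character.
        if (ord(substr($str, $i, 1)) > 0xa1) {
            $result_str .= substr($str, $i, 2);  // copy both bytes of the character
            $i++;                                // skip its second byte
        } else {
            $result_str .= substr($str, $i, 1);  // single-byte character
        }
    }
    // Append "..." only when the input was actually longer than $len bytes.
    if (strlen($str) <= $len) {
        return $result_str;
    }
    return $result_str . "...";
}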
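For UTF-8 text, the same idea as getsubstring() can be sketched with the mbstring functions discussed earlier. The helper name truncate_utf8 and its $suffix parameter are hypothetical, not part of the original article, and here $len counts characters rather than bytes.

```php
<?php
// Hypothetical helper (not from the original article): character-based
// truncation for UTF-8 strings, appending "..." only when text was cut.
function truncate_utf8($str, $len, $suffix = '...')
{
    if (mb_strlen($str, 'UTF-8') <= $len) {
        return $str;                       // short enough, return unchanged
    }
    return mb_substr($str, 0, $len, 'UTF-8') . $suffix;
}

echo truncate_utf8('原本会出现乱码的汉字!', 7), "\n";   // 原本会出现乱码...
echo truncate_utf8('short', 20), "\n";                   // short
```

Because mb_substr never splits a character, no byte inspection is needed here.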