Home  >  Article  >  Backend Development  >  The difference between mb_convert_encoding and iconv in php_PHP tutorial

The difference between mb_convert_encoding and iconv in php_PHP tutorial

WBOY
WBOYOriginal
2016-07-13 10:55:591110browse

mb_convert_encoding This function is used to convert encoding. I used to not understand the concept of program coding, but now I seem to understand a little bit. However, English generally does not have encoding problems, only Chinese data will have this problem.

For example, when you use Zend Studio or Editplus to write a program, you use gbk encoding. If the data needs to be entered into the database, and the database encoding is utf8, then the data must be encoded and converted, otherwise it will be lost when entering the database. Become gibberish.

For the usage of mb_convert_encoding, please see the official website:

mb_convert_encoding — Convert character encoding

Report a bug Description
string mb_convert_encoding ( string $str , string $to_encoding [, mixed $from_encoding ] )
Converts the character encoding of string str to to_encoding from optionally from_encoding.

Report a bug parameter

str
The string being encoded.

to_encoding
The type of encoding that str is being converted to.

from_encoding
Is specified by character code names before conversion. It is either an array, or a comma separated enumerated list. If from_encoding is not specified, the internal encoding will be used.

See supported encodings.


Report a bug return value
The encoded string.

Report a bug example

Example #1 mb_convert_encoding() example

The code is as follows Copy code
 代码如下 复制代码

/* Convert internal character encoding to SJIS */
$str = mb_convert_encoding($str, "SJIS");

/* Convert EUC-JP to UTF-7 */
$str = mb_convert_encoding($str, "UTF-7", "EUC-JP");

/* Auto detect encoding from JIS, eucjp-win, sjis-win, then convert str to UCS-2LE */
$str = mb_convert_encoding($str, "UCS-2LE", "JIS, eucjp-win, sjis-win");

/* "auto" is expanded to "ASCII,JIS,UTF-8,EUC-JP,SJIS" */
$str = mb_convert_encoding($str, "EUC-JP", "auto");
?>

/* Convert internal character encoding to SJIS */

$str = mb_convert_encoding($str, "SJIS");


/* Convert EUC-JP to UTF-7 */
$str = mb_convert_encoding($str, "UTF-7", "EUC-JP");

/* Auto detect encoding from JIS, eucjp-win, sjis-win, then convert str to UCS-2LE */

$str = mb_convert_encoding($str, "UCS-2LE", "JIS, eucjp-win, sjis-win");
代码如下 复制代码
$str='脚本之家:http://www.bKjia.c0m';
echo mb_convert_encoding($str, "UTF-8"); //编码转换为utf-8
?>

/* "auto" is expanded to "ASCII,JIS,UTF-8,EUC-JP,SJIS" */

$str = mb_convert_encoding($str, "EUC-JP", "auto");
 代码如下 复制代码
$str='脚本之家:http://www.bKjia.c0m';
echo mb_convert_encoding($str, "UTF-8", "GBK"); //已知原编码为GBK,转换为utf-8
?>
?>
 代码如下 复制代码
$str='脚本之家:http://www.bKjia.c0m';
echo mb_convert_encoding($str, "UTF-8", "auto"); //未知原编码,通过auto自动检测后,转换编码为utf-8
?>
mb_convert_encoding( $str, $encoding1,$encoding2 ) $str, the string to be encoded $encoding1, target encoding, such as utf-8, gbk, both upper and lower case $encoding2, original encoding, such as utf-8, gbk, both upper and lower case Example 1
The code is as follows Copy code
$str='Script Home: http://www.bKjia.c0m'; <🎜> echo mb_convert_encoding($str, "UTF-8"); //Encoding converted to utf-8 <🎜> ?>
The code is as follows Copy code
$str='Script Home: http://www.bKjia.c0m'; <🎜> echo mb_convert_encoding($str, "UTF-8", "GBK"); //It is known that the original encoding is GBK, convert to utf-8 <🎜> ?>
The code is as follows Copy code
$str='Script Home: http://www.bKjia.c0m'; <🎜> echo mb_convert_encoding($str, "UTF-8", "auto"); //Unknown original encoding, after automatic detection by auto, convert the encoding to utf-8 <🎜> ?>

Make a GBK To UTF-8

 代码如下 复制代码
< ?php
header("content-Type: text/html; charset=Utf-8");
echo mb_convert_encoding("???S我的友仔", "UTF-8", "GBK");
?>

Another GB2312 To Big5

 代码如下 复制代码
< ?php
header("content-Type: text/html; charset=big5");
echo mb_convert_encoding("你是我的朋友", "big5", "GB2312");
?>

However, to use the above function, you need to install and enable the mbstring extension library first.

Another function iconv in PHP is also used to convert string encoding, and its function is similar to the function above.

There are some detailed examples below:

iconv — Convert string to requested character encoding
(PHP 4 >= 4.0.5, PHP 5)
mb_convert_encoding — Convert character encoding
(PHP 4 >= 4.0.6, PHP 5)

Usage:
string mb_convert_encoding ( string str, string to_encoding [, mixed from_encoding] )
You need to enable the mbstring extension library first, and remove the ; in front of extension=php_mbstring.dll in php.ini
mb_convert_encoding can specify multiple input encodings. It will automatically identify based on the content, but the execution efficiency is much worse than iconv;


string iconv (string in_charset, string out_charset, string str)
Note: In addition to specifying the encoding to be converted to, the second parameter can also add two suffixes: //TRANSLIT and //IGNORE, where //TRANSLIT will automatically convert characters that cannot be directly converted into one or more Approximate characters, //IGNORE will ignore characters that cannot be converted, and the default effect is to truncate from the first illegal character.
Returns the converted string or FALSE on failure.


Use:

It was found that iconv would make an error when converting the character "-" to gb2312. Without the ignore parameter, all strings following this character cannot be saved. No matter what, this "—" cannot be converted successfully and cannot be output. In addition, mb_convert_encoding does not have this bug.

In general, use iconv. Only use the mb_convert_encoding function when you are unable to determine what the original encoding is, or when iconv cannot be displayed normally after conversion.

from_encoding is specified by character code name before conversion. it can be array or string - comma separated enumerated list. If it is not specified, the internal encoding will be used.

Example:
The code is as follows
 代码如下 复制代码
/* Auto detect encoding from JIS, eucjp-win, sjis-win, then convert str to UCS-2LE */
$str = mb_convert_encoding($str, “UCS-2LE”, “JIS, eucjp-win, sjis-win”);
/* “auto” is expanded to “ASCII,JIS,UTF-8,EUC-JP,SJIS” */
$str = mb_convert_encoding($str, “EUC-JP”, “auto”);
Copy code

/* Auto detect encoding from JIS, eucjp- win, sjis-win, then convert str to UCS-2LE */
 代码如下 复制代码
$content = iconv(”GBK”, “UTF-8//IGNORE″, $content);
$content = mb_convert_encoding($content, “UTF-8″, “GBK”);
$str = mb_convert_encoding($str, “UCS-2LE”, “JIS, eucjp-win, sjis-win”);

/* “auto” is expanded to “ASCII,JIS,UTF-8,EUC-JP,SJIS” */

$str = mb_convert_encoding($str, “EUC-JP”, “auto”);

The code is as follows Copy code
$content = iconv("GBK", " UTF-8//IGNORE″, $content); $content = mb_convert_encoding($content, “UTF-8″, “GBK”); In general, iconv is used. Only when the original encoding cannot be determined, or iconv cannot be displayed normally after conversion, use the mb_convert_encoding function

www.bkjia.comtruehttp: //www.bkjia.com/PHPjc/632197.htmlTechArticlemb_convert_encoding This function is used to convert encoding. I used to not understand the concept of program coding, but now I seem to understand a little bit. However, there are generally no encoding issues in English...
Statement:
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn