Summary of iconv function knowledge in PHP
The iconv function library converts strings between character sets and is a basic, frequently used library in PHP programming. This article is compiled from other resources on the Internet, combined with my own practice. Readers who need it can use it as a reference.
While revising my paper online today, I came across the iconv function and decided to study it:
header('Content-Type: application/vnd.ms-excel; charset=UTF-8');
$name = iconv('utf-8', 'gb2312', $data['year'] . 'Year, No.' . $data['period'] . 'Correspondence');
header('Content-Disposition: attachment;filename="' . $name . '.xls"');
header('Cache-Control: max-age=0');
This code converts the file name string from UTF-8 to GB2312 and assigns the result to $name. When the Excel file is exported, its file name is then the Chinese name held in $name.
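As a minimal sketch of the same idea, the helper below converts a UTF-8 file name to GB2312 and falls back to the unconverted name when the conversion fails. The function name attachmentHeader is hypothetical (it is not part of PHP), and the sketch assumes, as the snippet above does, that the downloading client expects a GB2312 file name.

```php
<?php
// Hypothetical helper: build a Content-Disposition header line whose
// file name has been converted from UTF-8 to GB2312.
function attachmentHeader(string $utf8Name, string $extension): string
{
    // Suppress the notice iconv raises on failure; the result is checked instead.
    $converted = @iconv('UTF-8', 'GB2312', $utf8Name);
    if ($converted === false) {
        // Conversion failed: fall back to the unconverted name.
        $converted = $utf8Name;
    }
    return 'Content-Disposition: attachment; filename="' . $converted . '.' . $extension . '"';
}

// Usage (must run before any output is sent):
// header(attachmentHeader('report', 'xls'));
```

Checking the return value against false keeps a failed conversion from producing an empty file name.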
The following is the detailed and extended usage of this function:
iconv("UTF-8", "GB2312//IGNORE", $data)
The //IGNORE suffix tells iconv to skip characters that cannot be converted to the target charset. Without it, the conversion stops at the first such character and everything after that character is lost.
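A small sketch for comparing the two behaviours follows. The sample string and the choice of ISO-8859-1 as the target are assumptions made for illustration. The exact results depend on the PHP version and on the underlying iconv implementation, so the script only dumps them instead of promising specific output.

```php
<?php
// "café" followed by U+2603 (snowman). ISO-8859-1 can encode "é"
// but has no code point for the snowman.
$utf8 = "caf\u{00E9} \u{2603}";

// No suffix: the unconvertible character makes the call fail or
// truncate, depending on the PHP version and iconv implementation.
$strict = @iconv('UTF-8', 'ISO-8859-1', $utf8);

// With //IGNORE: unconvertible characters are meant to be dropped.
$lossy = @iconv('UTF-8', 'ISO-8859-1//IGNORE', $utf8);

var_dump($strict);
var_dump($lossy);
```

Running this on the target server is a quick way to see how its iconv build treats characters that are missing from the target charset.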
In PHP 5 the iconv() function is built in, so no separate installation is needed.
Example:
<?php
echo $str = 'Hello, we sell coffee here!';
echo "\n";
echo iconv('GB2312', 'UTF-8', $str); // Convert the string encoding from GB2312 to UTF-8
echo "\n";
echo iconv_substr($str, 1, 1, 'UTF-8'); // Extract by character count rather than byte count
print_r(iconv_get_encoding()); // Get the current iconv encoding settings
echo iconv_strlen($str, 'UTF-8'); // Get the length in characters for the given encoding
// This form is also commonly used
$content = iconv("UTF-8", "gbk//TRANSLIT", $content);
?>
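The difference between counting bytes and counting characters is easier to see with a multibyte string. The sketch below uses a four-character Chinese string written with PHP 7+ Unicode escapes so that it does not depend on how the source file is saved. The sample text is an assumption chosen for illustration.

```php
<?php
// "你好咖啡" as UTF-8; each of the four characters takes three bytes.
$str = "\u{4F60}\u{597D}\u{5496}\u{5561}";

echo strlen($str), "\n";                      // 12 (bytes)
echo iconv_strlen($str, 'UTF-8'), "\n";       // 4 (characters)
echo iconv_substr($str, 1, 1, 'UTF-8'), "\n"; // 好 (the second character)
```

Using strlen or substr on such a string would count or cut in the middle of a character, which is why the iconv_* variants take an encoding argument.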
In older PHP versions such as PHP 4, iconv is not enabled by default. The module has to be installed or enabled before it can be used.
On Windows 2000, edit php.ini and remove the ";" in front of extension=php_iconv.dll. You also need to copy iconv.dll from the original PHP installation package into winnt/system32 (if your DLL path points to that directory).
On Linux, when building PHP statically, add the --with-iconv option when running configure. Afterwards the iconv entry is visible in the phpinfo() output. (Linux7.3 Apache4.06 php4.3.2)
Download: ftp://ftp.gnu.org/pub/gnu/libiconv/libiconv-1.8.tar.gz
Installation:
# cp libiconv-1.8.tar.gz /usr/local/src
# cd /usr/local/src
# tar zxvf libiconv-1.8.tar.gz
# cd libiconv-1.8
# ./configure --prefix=/usr/local/libiconv
# make
# make install
Compile PHP:
# ./configure --prefix=/usr/local/php4.3.2 --with-iconv=/usr/local/libiconv/
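Besides looking at phpinfo, a script can check at run time whether the extension is loaded. The sketch below is one way to do that. ICONV_IMPL and ICONV_VERSION are constants defined by the iconv extension itself, so they are only read after the extension check succeeds.

```php
<?php
if (!extension_loaded('iconv')) {
    echo "iconv is not available; enable or compile the extension first.\n";
} else {
    // These constants exist only when the iconv extension is loaded.
    echo 'iconv implementation: ', ICONV_IMPL, "\n";
    echo 'iconv version: ', ICONV_VERSION, "\n";
}
```

The implementation name is worth noting, because the handling of //TRANSLIT and //IGNORE can differ between iconv implementations.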
Simple example of usage:
<?php
echo iconv("gb2312", "ISO-8859-1", "we");
?>
Introduction to mb_convert_encoding and iconv functions in PHP
mb_convert_encoding converts a string from one character encoding to another. I used to not understand the concept of character encoding in programs, but now I am starting to get it.
Plain English (ASCII) text generally does not run into encoding problems. They show up with non-ASCII data such as Chinese. For example, suppose you write a program in Zend Studio or EditPlus with the file saved as GBK, and the data has to go into a database whose encoding is utf8. The data must then be converted first, otherwise it will be garbled in the database.
See the official usage of mb_convert_encoding:
http://cn.php.net/manual/zh/function.mb-convert-encoding.php
A GBK to UTF-8 example:
<?php
header("Content-Type: text/html; charset=UTF-8");
echo mb_convert_encoding("You are my friend", "UTF-8", "GBK");
?>
Another example, GB2312 to Big5:
<?php
header("Content-Type: text/html; charset=big5");
echo mb_convert_encoding("You are my friend", "big5", "GB2312");
?>
However, to use the function above, the mbstring extension library must be enabled first.
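Because mbstring may not be enabled on every server, code can check for it before calling it. The sketch below is a hypothetical helper (the name gbkToUtf8 is made up for illustration) that prefers mb_convert_encoding and falls back to iconv when mbstring is missing.

```php
<?php
// Hypothetical helper: convert GBK bytes to UTF-8 with whichever
// extension happens to be loaded.
function gbkToUtf8(string $gbk)
{
    if (function_exists('mb_convert_encoding')) {
        return mb_convert_encoding($gbk, 'UTF-8', 'GBK');
    }
    if (function_exists('iconv')) {
        return iconv('GBK', 'UTF-8', $gbk); // false on failure
    }
    throw new RuntimeException('Neither mbstring nor iconv is available.');
}

$gbk = "\xC4\xE3\xBA\xC3"; // "你好" encoded as GBK
echo gbkToUtf8($gbk), "\n"; // prints 你好 on a UTF-8 terminal
```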
iconv, another PHP function, also converts string encodings and does much the same job as the function above.
There are some detailed examples below:
iconv: Convert string to requested character encoding
(PHP 4 >= 4.0.5, PHP 5)
mb_convert_encoding: Convert character encoding
(PHP 4 >= 4.0.6, PHP 5)
Usage:
string mb_convert_encoding ( string str, string to_encoding [, mixed from_encoding] )
The mbstring extension library must be enabled first: remove the ";" in front of extension=php_mbstring.dll in php.ini.
mb_convert_encoding accepts several candidate input encodings and identifies the right one automatically from the content, but it is reported to run considerably slower than iconv.
string iconv (string in_charset, string out_charset, string str)
Note: besides the target encoding, the second parameter accepts two suffixes, //TRANSLIT and //IGNORE. //TRANSLIT replaces a character that cannot be represented in the target charset with one or more similar-looking characters. //IGNORE drops characters that cannot be converted. With no suffix, the output is cut off at the first illegal character (since PHP 5.4 the function returns FALSE in that case instead of a partial string).
Returns the converted string or FALSE on failure.
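Because a failed conversion yields FALSE, the result should be compared with === false before it is used. The sketch below is one possible pattern, not a recommendation from the manual: a hypothetical helper, convertWithFallback, tries a strict conversion first and only then the lossier //TRANSLIT and //IGNORE suffixes. It assumes a PHP version in which iconv returns FALSE on failure.

```php
<?php
// Hypothetical helper: try a strict conversion first, then
// progressively lossier ones.
function convertWithFallback(string $text, string $from, string $to): string
{
    foreach (['', '//TRANSLIT', '//IGNORE'] as $suffix) {
        // Suppress the notice; the return value is checked instead.
        $result = @iconv($from, $to . $suffix, $text);
        if ($result !== false) {
            return $result;
        }
    }
    throw new RuntimeException("Cannot convert from $from to $to");
}

echo convertWithFallback('plain text', 'UTF-8', 'GBK'), "\n"; // plain text
```

Trying the strict form first means data is only altered or dropped when there is no lossless conversion.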
Usage notes:
iconv was found to fail when converting the character "—" to gb2312. Without the //IGNORE suffix, everything after this character is lost. Whatever is tried, the "—" itself cannot be converted or output. mb_convert_encoding does not have this problem.
In general, use iconv. Fall back to mb_convert_encoding only when the original encoding cannot be determined, or when the result of the iconv conversion does not display correctly.
from_encoding names the character encoding of the string before conversion. It can be an array or a comma-separated string listing candidate encodings. If it is not specified, the internal encoding is used.
/* Auto detect encoding from JIS, eucjp-win, sjis-win, then convert str to UCS-2LE */
$str = mb_convert_encoding($str, "UCS-2LE", "JIS, eucjp-win, sjis-win");
/* "auto" is expanded to "ASCII,JIS,UTF-8,EUC-JP,SJIS" */
$str = mb_convert_encoding($str, "EUC-JP", "auto");
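When the input can only be one of two known encodings, an explicit validity check is an alternative to passing a candidate list. The sketch below swaps in that technique: it uses mb_check_encoding to test for valid UTF-8 and converts from GBK otherwise. The helper name ensureUtf8 and the UTF-8-or-GBK assumption are illustrative, and the check is a heuristic, since a short byte sequence can occasionally be valid in both encodings.

```php
<?php
// Hypothetical helper: return UTF-8 text from input that is assumed
// to be either UTF-8 or GBK.
function ensureUtf8(string $bytes): string
{
    // mb_check_encoding reports whether the bytes are valid in the given encoding.
    if (mb_check_encoding($bytes, 'UTF-8')) {
        return $bytes;
    }
    return mb_convert_encoding($bytes, 'UTF-8', 'GBK');
}

echo ensureUtf8("\xC4\xE3\xBA\xC3"), "\n"; // GBK input, prints 你好
echo ensureUtf8("\u{4F60}\u{597D}"), "\n"; // already UTF-8, prints 你好
```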
Example:
$content = iconv("GBK", "UTF-8", $content);
$content = mb_convert_encoding($content, "UTF-8", "GBK");
Parameters that are easily overlooked when using the iconv function in php
While processing scraped content today, I found that converting it with iconv cut the result short. I guessed that the character set was the problem and looked for a way to skip characters that do not exist in the target character set. The manual showed that iconv takes only three parameters, none of which seemed to help. Someone online said it could be done, but I could not see how. Finally I found that the English description says a flag such as "TRANSLIT" can be appended to the target encoding. That left me puzzled about how to append it, until I learned that it has to be prefixed with "//". It is a rather frustrating design.
Basic call: $txtContent = iconv("utf-8", 'GBK', $txtContent);
With a suffix: iconv("UTF-8", "GB2312//IGNORE", $data)
Two optional suffixes: //TRANSLIT and //IGNORE (IGNORE skips any character that cannot be converted).
That is the entire content of this article. I hope you all like it.