1. The concept of character set
Character set standardizes the mapping relationship between codes and characters, and specifies the encoding method of computer characters. The character set settings in different database systems may be different, and different character sets have unique characteristics and advantages. Compared with Latin1/Latin7, GB2312/GBK/GB18030 has greater advantages in processing Chinese characters.
In MySQL, character set refers to the way to encode character data, including databases, tables and columns. In layman's terms, a character set refers to the way used to store and process characters and strings.
The character sets supported in MySQL mainly include the following: ASCII, GBK, GB2312, UTF-8, UTF-16 and ISO-8859, etc. Among them, UTF-8 is the most commonly used character set in MySQL because It supports multiple languages, has flexible encoding formats, saves space and other benefits.
2. How to set the MySQL character set
Query the character set supported by MySQL
In the MySQL database, you can Query the character sets supported by the database through commands as follows:
SHOW CHARACTER SET;
This command can list the various character sets available for MySQL, for example:
Charset | Description | Default collation | Maxlen |
---|---|---|---|
big5 | Big5 Traditional Chinese | big5_chinese_ci | 2 |
DEC West European | dec8_swedish_ci | 1 | |
DOS West European | cp850_general_ci | 1 | |
HP West European | hp8_english_ci | 1 | |
KOI8-R Relcom Russian | koi8r_general_ci | 1 | |
iso-8859-1 West European | latin1_swedish_ci | 1 | |
iso-8859-2 Central European | latin2_general_ci | 1 | |
7bit Swedish | swe7_swedish_ci | 1 | |
US ASCII | ascii_general_ci | 1 | |
EUC-JP Japanese | ujis_japanese_ci | 3 | |
Shift-JIS Japanese | sjis_japanese_ci | 2 | |
iso-8859-8 Hebrew | hebrew_general_ci | 1 | |
TIS620 Thai | tis620_thai_ci | 1 | |
EUC-KR Korean | euckr_korean_ci | 2 | |
KOI8-U Ukrainian | koi8u_general_ci | 1 | |
GB2312 Simplified Chinese | gb2312_chinese_ci | 2 | ##greek |
greek_general_ci | 1 | cp1250 | |
cp1250_general_ci | 1 | ##gbk | |
gbk_chinese_ci | 2 | latin5 | |
latin5_turkish_ci | 1 | ##armscii8 | ARMSCII-8 Armenian |
1 | utf8 | UTF-8 Unicode | |
3 | ##ucs2 | UCS-2 Unicode | |
2 | cp866 | DOS Russian | |
1 | ##keybcs2 | DOS Kamenicky Czech-Slovak | keybcs2_general_ci |
macce | Mac Central European | macce_general_ci | |
macroman | Mac West European | macroman_general_ci | |
cp852 | DOS Central European | cp852_general_ci | |
##latin7 | iso-8859-13 Baltic | latin7_general_ci | |
utf8mb4 | UTF-8 Unicode | utf8mb4_general_ci | |
cp1251 | Windows Cyrillic | cp1251_general_ci | |
##utf16 | UTF -16 Unicode | utf16_general_ci | 4 |
utf16le | UTF-16LE Unicode | utf16le_general_ci | 4 |
cp1256 | Windows Arabic | cp1256_general_ci | 1 |
cp1257 | Windows Baltic | cp1257_general_ci | 1 |
##utf32 | UTF-32 Unicode | utf32_general_ci | 4 |
binary | Binary pseudo charset | #binary | 1 |
geostd8 | GEOSTD8 Georgian | geostd8_general_ci | 1 |
cp932 | SJIS for Windows Japanese | cp932_japanese_ci | 2 |
eucjpms | UJIS for Windows Japanese | eucjpms_japanese_ci | 3 |
When we create a new MySQL database, we can set the default character set of the database so that all tables in the database All character sets will use this character set. The specific steps are as follows: 2.1 First check the character sets supported by MySQL mysql> SHOW CHARACTER SET; 2.2 When creating a new database, add a character set setting Set CREATE DATABASE new_db CHARACTER SET utf8; or The following sentence is rewritten: "Create a new database called 'new_db' with a default character set of utf8." where utf8 is one of the commonly used character sets.
In order to ensure that the table can store and display data correctly, we need to set the character encoding for the table. In MySQL, the character set of a table can be set when creating the table. Already created tables can be modified using the ALTER command. 3.1 Set the character set when creating the table CREATE TABLE new_table ( The storage engine used to set the table is ENGINE=InnoDB, and the default character set used to set the table is CHARSET=utf8. 3.2 Modify the character set of the table ALTER TABLE old_table CONVERT TO CHARACTER SET utf8; or ALTER TABLE old_table MODIFY COLUMN name VARCHAR(50) CHARACTER SET utf8; Among them, the CONVERT TO command is used to modify the default character set of the table, and the MODIFY COLUMN command is used to modify the character set of a column in the table.
When connecting to the MySQL server, you can also set the character set for the operation. The character set setting, often called "client character set", refers to the character set used for data transmitted on the client side. 4.1 View the character set of the current connection mysql> SELECT @@character_set_connection; 4.2 Modify the character set of the connection SET character_set_connection = utf8; or mysql --default-character-set=utf8 -u root -p Among them, the SET command can modify the default character set of the connection, --default-character The -set command can specify the client character set.
In some cases, in order to handle some less common data storage and conversion requirements, it may be necessary to enable Additional character set settings for MySQL. In this case, you may need to modify the MySQL configuration file-my.cnf file with administrator privileges. The modification method is as follows: 5.1 Find the my.cnf file In Linux, the my.cnf file is usually stored in the /etc/my.cnf or /etc/mysql/my.cnf directory . In Windows, the my.cnf file is usually stored in the installation directory of the MySQL database. 5.2 Modify the my.cnf file Add the following statement in the my.cnf file: [mysqld] Among them, the character_set_server command is used to set the listed character set, and the init_connect command is used to automatically set the character set when creating a connection. |
The above is the detailed content of How to set character set in mysql. For more information, please follow other related articles on the PHP Chinese website!