Home  >  Article  >  For Chinese characters, what encoding is used to save the information in memory or disk?

For Chinese characters, what encoding is used to save the information in memory or disk?

青灯夜游
青灯夜游Original
2021-05-07 16:12:2010902browse

For Chinese characters, "GB2312-80" encoding is used to store and transmit information in memory or disk. GB2312-80 refers to the Chinese character encoding character set for information exchange. GB2312 encoding is suitable for information exchange between Chinese character processing, Chinese character communication and other systems. It is popular in mainland China; Singapore and other places also use this encoding.

For Chinese characters, what encoding is used to save the information in memory or disk?

The operating environment of this tutorial: Windows 7 system, Dell G3 computer.

For Chinese characters, "GB2312-80" encoding is used to store and transmit information in memory or disk.

GB2312-80 Chinese character internal code

"Chinese Character Coded Character Set for Information Exchange" was released by the State Administration of Standards of China in 1980, on May 1, 1981 A set of national standards that have been implemented, the standard number is GB 2312-1980.

GB2312 encoding is suitable for information exchange between Chinese character processing, Chinese character communication and other systems. It is popular in mainland China; Singapore and other places also use this encoding. Almost all Chinese systems and international software in mainland China support GB 2312.

The basic set includes a total of 6763 Chinese characters and 682 non-Chinese graphic characters. The entire character set is divided into 94 areas, each area has 94 bits. There is only one character in each location, so the location and location can be used to encode Chinese characters, which is called location code.

Add 2020H to the location code converted into hexadecimal to get the national standard code. Add the national standard code to 8080H to get the commonly used computer internal code. In 1995, the "Chinese Character Coding Extension Specification" (GBK) was promulgated. GBK is compatible with the internal code standard corresponding to the GB 2312-1980 national standard. At the same time, it supports all Chinese, Japanese, and Korean (CJK) Chinese characters of ISO/IEC10646-1 and GB 13000-1 at the vocabulary level, a total of 20,902 characters.

For more related knowledge, please visit the FAQ column!

The above is the detailed content of For Chinese characters, what encoding is used to save the information in memory or disk?. For more information, please follow other related articles on the PHP Chinese website!

Statement:
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn