The National Standard Code for Chinese Characters was created in 1980 with the purpose of promulgating the national standard for Chinese character encoding in order to have a nationally unified code for each Chinese character. Each Chinese character has a binary code, called the Chinese character national standard code. In my country's Chinese character code standard GB2312-80, there are 6763 commonly used Chinese characters that specify binary encoding.
National standard code, also known as Chinese character exchange code, is the code standard used when exchanging Chinese character information between computers. The national standard code is represented by two bytes, that is, each Chinese character is represented by 2 bytes.
GB2312-80 GB2312 divides the code table into 94 areas, corresponding to the first byte; each area has 94 bits, corresponding to the second byte, and the values of the two bytes are the area code value and the bit Add 32 (20H) to the number value. Areas 01-09 are symbols and numbers, areas 16-87 are Chinese characters, and areas 10-15 and 88-94 are blank areas that need further standardization. GB2312 divides the collected Chinese characters into two levels: the first level contains 3755 commonly used Chinese characters, which are placed in areas 16-55, and are arranged in the order of Chinese pinyin letters/stroke shapes; the second level of Chinese characters contains 3008 commonly used Chinese characters, placed in 56 -87 area, arranged in radical/stroke order. Therefore, GB2312 can represent up to 6763 Chinese characters.
The above is the detailed content of How many storage bytes does the national standard code of a Chinese character occupy?. For more information, please follow other related articles on the PHP Chinese website!