
What is Unicode

Jan 26, 2019 am 10:56 AM
unicode

Unicode is a character encoding standard that assigns a single, unique number to every character in every language, so that text can be converted and processed across languages and platforms.

Unicode meaning

Unicode provides a unique number for each character, regardless of platform, program, or language. It is an industry standard in computing that covers character sets, encoding schemes, and related specifications; the first version of the standard was published in 1991. Unicode was created to overcome the limitations of traditional character encoding schemes: by giving every character in every language one unified, unique code, it meets the need to convert and process text across languages and platforms.


The Development of Unicode Encoding

Computers use 8 bits as one byte, so one byte can represent at most 256 distinct values. In the early days that was enough for English-speaking Western countries: uppercase and lowercase letters, digits, and some symbols fit in a single byte, and the ASCII code table was built on that basis (ASCII itself defines only 128 characters). Later, computers spread to other countries, many of which use their own languages, such as Chinese, Japanese, and Korean. To handle these languages, each country formulated its own code table. China, for example, published the GB2312 Chinese character set in 1980. Because there are far more Chinese characters than English letters, one byte is clearly not enough, so GB2312 uses 2 bytes for each Chinese character.

Although each national encoding works on its own, the encodings are generally incompatible with one another. A computer that has to handle several languages may not be able to support them all at the same time, and text decoded with the wrong code table shows up as garbled characters. Unicode was created to unify all languages into one set of codes so that this garbling no longer occurs.
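The incompatibility described above can be reproduced in a few lines. The following is a minimal sketch, assuming Python 3 and its built-in `gb2312` and `latin-1` codecs; the sample character is chosen only for illustration.

```python
# Encode a Chinese character with GB2312, then decode the same bytes
# with a Western European code table to see how garbling arises.
text = "中"

gb_bytes = text.encode("gb2312")     # two bytes under GB2312
wrong = gb_bytes.decode("latin-1")   # wrong code table: each byte becomes one Latin character
right = gb_bytes.decode("gb2312")    # matching code table: the original character comes back

print(gb_bytes.hex())  # d6d0
print(wrong)           # ÖÐ
print(right)           # 中
```

The bytes never change; only the table used to interpret them does, which is why the same file can look correct on one system and garbled on another.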


Unicode encoding representation

A Unicode character is usually written as "U+" followed by a group of hexadecimal digits. The range U+0000 to U+FFFF, known as the Basic Multilingual Plane (BMP), supports more than 60,000 characters in total. Characters outside the BMP need 5 or 6 hexadecimal digits.
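As a rough illustration of this notation, the sketch below formats a character as "U+" plus hexadecimal digits and checks whether it lies in the BMP. The helper names `u_notation` and `in_bmp` are hypothetical and exist only for this example.

```python
def u_notation(ch: str) -> str:
    """Format one character as U+XXXX, padded to at least four hex digits."""
    return f"U+{ord(ch):04X}"


def in_bmp(ch: str) -> bool:
    """Return True if the character's code point is in U+0000..U+FFFF."""
    return ord(ch) <= 0xFFFF


for ch in ["A", "中", "😀"]:
    print(ch, u_notation(ch), "BMP" if in_bmp(ch) else "outside BMP")
# A U+0041 BMP
# 中 U+4E2D BMP
# 😀 U+1F600 outside BMP
```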

Unicode code points currently run from 0x0000 to 0x10FFFF and are divided into 17 groups, each called a plane. Each plane has 65,536 code points, for a total of 1,114,112.
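The plane arithmetic can be checked with a short sketch. The constant and function names here (`PLANE_SIZE`, `MAX_CODE_POINT`, `plane_of`) are assumptions made for illustration, not part of any standard API.

```python
PLANE_SIZE = 0x10000        # 65,536 code points per plane
MAX_CODE_POINT = 0x10FFFF   # highest Unicode code point


def plane_of(code_point: int) -> int:
    """Return the plane number (0..16) that contains the given code point."""
    if not 0 <= code_point <= MAX_CODE_POINT:
        raise ValueError("code point out of Unicode range")
    return code_point // PLANE_SIZE


plane_count = (MAX_CODE_POINT + 1) // PLANE_SIZE
total_code_points = plane_count * PLANE_SIZE

print(plane_count, total_code_points)  # 17 1114112
print(plane_of(ord("中")))              # 0 (the BMP)
print(plane_of(ord("😀")))              # 1
```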

Unicode is like a table in which every character is listed. Each character corresponds to a number, called a code point. This number is generally not stored directly; instead, it is converted to bytes using one of several encoding methods.
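The character-to-number table can be explored directly in Python. This is a small sketch that assumes the standard `unicodedata` module; `ord` looks up a character's code point and `chr` goes the other way.

```python
import unicodedata

for ch in ["A", "中"]:
    code_point = ord(ch)                 # character -> number
    name = unicodedata.name(ch)          # official character name from the Unicode database
    print(ch, code_point, hex(code_point), name)
# A 65 0x41 LATIN CAPITAL LETTER A
# 中 20013 0x4e2d CJK UNIFIED IDEOGRAPH-4E2D

# number -> character
print(chr(20013))  # 中
```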


UTF-8, UTF-16, and UTF-32 are encoding schemes for converting code points into program data. UTF stands for "Unicode Transformation Format", that is, a format that specifies how the numbers defined by Unicode are converted into program data.
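To make the difference between the three schemes concrete, the sketch below encodes one code point with each of them using Python's built-in codecs. The big-endian variants are used here only so that no byte order mark is added to the output.

```python
ch = "中"  # code point U+4E2D

utf8 = ch.encode("utf-8")
utf16 = ch.encode("utf-16-be")
utf32 = ch.encode("utf-32-be")

print(utf8.hex(), len(utf8))    # e4b8ad 3
print(utf16.hex(), len(utf16))  # 4e2d 2
print(utf32.hex(), len(utf32))  # 00004e2d 4
```

The code point is the same in all three cases; only the byte layout differs.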

| Decimal | Unicode code point | UTF-8 byte stream |
| --- | --- | --- |
| 0-127 | 0x000000-0x00007F | 0xxxxxxx (7 bits) |
| 128-2047 | 0x000080-0x0007FF | 110xxxxx 10xxxxxx (11 bits) |
| 2048-65535 | 0x000800-0x00FFFF | 1110xxxx 10xxxxxx 10xxxxxx (16 bits) |
| 65536-1114111 | 0x010000-0x10FFFF | 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx (21 bits) |
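The UTF-8 byte patterns for the four code point ranges can be turned into a small encoder. This is an illustrative sketch, not a replacement for Python's built-in codec; the function name `utf8_encode` is hypothetical, and the surrogate check is an addition that mirrors what the built-in codec rejects.

```python
def utf8_encode(cp: int) -> bytes:
    """Encode one code point by filling the x bits of the UTF-8 byte patterns."""
    if not 0 <= cp <= 0x10FFFF:
        raise ValueError("code point out of Unicode range")
    if 0xD800 <= cp <= 0xDFFF:
        raise ValueError("surrogate code points cannot be encoded")
    if cp <= 0x7F:        # 0xxxxxxx (7 bits)
        return bytes([cp])
    if cp <= 0x7FF:       # 110xxxxx 10xxxxxx (11 bits)
        return bytes([0xC0 | (cp >> 6),
                      0x80 | (cp & 0x3F)])
    if cp <= 0xFFFF:      # 1110xxxx 10xxxxxx 10xxxxxx (16 bits)
        return bytes([0xE0 | (cp >> 12),
                      0x80 | ((cp >> 6) & 0x3F),
                      0x80 | (cp & 0x3F)])
    # 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx (21 bits)
    return bytes([0xF0 | (cp >> 18),
                  0x80 | ((cp >> 12) & 0x3F),
                  0x80 | ((cp >> 6) & 0x3F),
                  0x80 | (cp & 0x3F)])


# Compare against the built-in codec, one sample per byte length.
for ch in ["A", "é", "中", "😀"]:
    assert utf8_encode(ord(ch)) == ch.encode("utf-8")
    print(ch, utf8_encode(ord(ch)).hex())
```

Each branch corresponds to one range of code points: the leading byte announces how many bytes follow, and every continuation byte starts with the bits 10.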

