Home >Web Front-end >H5 Tutorial >What is the character encoding? How to set character encoding in html5?

What is the character encoding? How to set character encoding in html5?

青灯夜游
青灯夜游Original
2018-11-12 17:36:2510127browse

The content of this article is to introduce what character encoding is and how to set character encoding in html5; let everyone understand the role of character encoding and how to set character encoding in html5. It has certain reference value. Friends in need can refer to it. I hope it will be helpful to you.

First of all, let’s understand What is the character encoding? What is the use?

Character encoding is a method of converting bytes into characters. To validate or display an HTML document, the program must select a character encoding. In the page, we can set the encoding of "utf-8" to prevent Chinese garbled characters. [Recommended related video tutorials: HTML5 tutorial]

There are three methods of setting character encoding in HTML 5, let’s introduce them respectively below:

HTTP Content-Type header

If you are writing a cgi or similar program then you can use the HTTP Content-Type header to set any character coding.

The following is a simple example:

print "Content-Type: text/html; charset=utf-8\r\n";

e8e496c15ba93d81f6ea4fe5f55a2244Element

You can use the attribute with charset The e8e496c15ba93d81f6ea4fe5f55a2244 element, this attribute can specify the encoding within the first 512 bytes of the HTML5 document.

The following is a simplified example:

<meta charset="UTF-8">

Let’s also take a look at the unsimplified one: it is still allowed to use

<meta http-equiv =“Content-Type”content =“text / html; charset = UTF-8”>

It can be clearly seen that the simplified e8e496c15ba93d81f6ea4fe5f55a2244 convenient.

Unicode Byte Order Mark (BOM)

The Byte Order Mark (BOM) consists of the character code U FEFF at the beginning of the data stream, Among them it can be used as a signature that defines the byte order and encoding form, mainly unmarked clear text files.

Many Windows programs (including Windows Notepad) add the bytes 0xEF, 0xBB, 0xBF to the beginning of any document saved as UTF-8. This is the UTF-8 encoding of the Unicode Byte Order Mark (BOM), often referred to as the UTF-8 BOM, even though it has nothing to do with byte order.

For HTML5 documents, you can use Unicode Byte Order Mark (BOM) characters at the beginning of the file. This character provides the signature for the encoding used.

Summary: The above is the entire content of this article. You can choose different methods to set character encoding according to different situations; I hope it will be helpful to your learning.

The above is the detailed content of What is the character encoding? How to set character encoding in html5?. For more information, please follow other related articles on the PHP Chinese website!

Statement:
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn