Home  >  Article  >  Backend Development  >  Detailed explanation of basic XML encoding examples

Detailed explanation of basic XML encoding examples

Y2J
Y2JOriginal
2017-05-08 10:02:451732browse

XML documents can contain foreign characters such as Norwegian or French (Chinese of course! This part still cannot be translated according to the original text, some of the content below is written by myself)
In order for your parser to understand these characters, you must A unified character encoding standard in XML documents.

Windows 95/98 Notepad
Windows 95/98 Notepad cannot save files in Unicode encoding format.
You can use Notepad to edit and save XML documents containing foreign characters (for example: Norwegian or French or Chinese)


Xiao Hui
Xiao Lin
Let’s go to hotpot together in the evening

But if you use a browser to open this XML document edited with Notepad, an error "open it with IE 5.0" will appear.

Using encoding in Windows 95/98 Notepad
Windows 95/98 Notepad must set the encoding attribute when editing XML files.
In order to avoid errors, you can add an encoding attribute to the XML document declaration to indicate the encoding type of this XML document, but do not use Unicode encoding.
The following encoding types will not cause errors, and Chinese characters are displayed normally:


The following encoding types will not cause errors Error, and Chinese characters are displayed normally:


The following encoding types will not cause errors, and Chinese characters are displayed abnormally (garbled characters) :


The following encoding types will not cause errors, and Chinese characters will not be displayed normally (garbled characters):


The following encoding types will not cause errors, and Chinese characters are displayed normally:


The following encoding types will cause errors:


Using Windows 2000 Notepad
Windows 2000 Notepad can save files in Unicode encoding format.
Windows 2000 Notepad supports Unicode character set. If you use Win2000 Notepad to save the XML document in Unicode encoding format (please note that there is no encoding information in the XML declaration):

< ;from>Xiao HuiXiao LinLet’s go to hotpot together in the evening

This file is below; note_encode_none_u .xml, there will be no errors in IE5.0+, but if you use Netscape 6.2, errors may occur. Readers compare the two files note_encode_none.xml and note_encode_none_u.xml. If they look at their respective source files separately, there is no difference, but why can one be displayed and the other cannot be displayed? The answer is the Unicode character set.

Windows 2000 Notepad encoding
Windows 2000 Notepad can also save files in "UTF-16" encoding format.
If you declare the encoding attribute in the XML document and save the file in Unicode encoding format, an error may occur.
The following code will cause an error:


The following code will cause an error:


The following code will cause an error:


The following file; note_encode_utf16_u.xml, will display normally in IE5.0+, but will cause an error in Netscape 6.2 browser.


Error message
When browsing XML documents using IE5.0 or higher, you may encounter Two different encoding errors were detected:
An invalid character was found in text content.
If your XML document does not match the encoding format of your XML document, an error may occur. Usually, the XML document contains some "non-English" characters and uses single-byte encoding Editor, and the encoding format of the XML document is not set in the declaration of the XML document.

Switch from current encoding to specified encoding is not supported.
If the XML document is saved in Unicode/UTF-16 encoding format, but the declaration of the XML document sets the encoding format to be some single-byte encoding (such as Windows-1252, ISO-8859-1 or UTF-8) ; Or the XML document is saved in a single-byte encoding format, but the declaration of the XML document sets the encoding format to be some Unicode/UTF-16 encoding form. In both cases, errors will occur.

Conclusion
Conclusion: Before saving the XML document, set the encoding format of the document in the declaration of the XML document. Some of my suggestions to avoid errors:
Use an encoding format that supports Unicode editor.
Make sure you know which encoding format you are using.
Use attribute declarations to set the encoding format in XML documents.

【Related recommendations】

1.XML free video tutorial

2.XML technical manual

3.Boolean tutorial Yan Shiba XML video tutorial

The above is the detailed content of Detailed explanation of basic XML encoding examples. For more information, please follow other related articles on the PHP Chinese website!

Statement:
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn