Crazy XML study notes (2): Format requirements

The syntax of XML

An XML file is divided into the following parts:

Document declaration

Elements

Attributes

Comments

CDATA sections and special characters

Processing instructions (PI)

The XML declaration

The simplest declaration: <?xml version="1.0"?>

The encoding attribute specifies the character encoding of the document. If it is not specified, the parser assumes UTF-8 (or UTF-16 when a byte order mark is present).

Common character sets:

Simplified Chinese: GBK or GB2312

Traditional Chinese: BIG5

Western European characters: ISO-8859-1

Universal international encoding: Unicode (UTF-8)

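Demonstration case: a document containing an element such as <中国>中国</中国> fails to parse.

The cause of the error is an encoding error: the bytes on disk are decoded with the wrong character set. The file was created as a new .txt document and then saved as .xml. The declaration says (or defaults to) UTF-8, but the editor's default encoding is ANSI, so the characters 中国 are encoded into binary with ANSI when the file is written to the hard disk. When the parser later decodes those bytes as UTF-8, the result is invalid.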
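The encoding mismatch described above can be reproduced with a short sketch. It uses Python's standard xml.etree.ElementTree parser. The element name country is an assumption made for illustration, and GBK stands in for the "ANSI" default of a Simplified Chinese system.

```python
import xml.etree.ElementTree as ET

# Hypothetical document; the element name "country" is an assumption.
DOC = '<?xml version="1.0" encoding="UTF-8"?><country>中国</country>'


def try_parse(raw: bytes):
    """Return the root element's text, or None if the bytes do not parse."""
    try:
        return ET.fromstring(raw).text
    except ET.ParseError:
        return None


# The bytes match the declared encoding, so parsing succeeds.
good = try_parse(DOC.encode("utf-8"))

# The bytes were written as GBK while the declaration still says UTF-8,
# so the parser meets byte sequences that are not valid UTF-8.
bad = try_parse(DOC.encode("gbk"))

print(good)
print(bad)
```

The fix is to make the two sides agree: either save the file as UTF-8, or declare the encoding the editor actually used.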


The standalone attribute specifies whether the document depends on external resources (such as an external DTD). Its value can only be yes or no: yes means the document is independent, no means it is not.

An XML file, like an HTML file, is just a text file.

So the most common tool for creating XML files is the same one used for HTML: Notepad.

Besides Notepad, there are more convenient tools, such as XML Notepad, XML Pro, and CLIP! XML Editor.

A major feature of these tools is that they can check whether the XML file you create complies with the XML specification. However, at the time of writing these tools are available only in English and require payment.

You can also use tools such as FrontPage and Dreamweaver, but they are not very convenient for this purpose.

As XML becomes more popular, better tools for creating XML files will likely appear. For now, let's use Notepad to create our XML file. First, look at an XML file:

Example 1

<?xml version="1.0" encoding="gb2312"?>

<References>
  <Book>
    <Name>Introduction to XML</Name>
    <Author>Zhang San</Author>
    <Price CurrencyUnit="RMB">20.00</Price>
  </Book>
  <Book>
    <Name>XML Syntax</Name>
    <!-- This book will be published soon -->
    <Author>Li Si</Author>
    <Price CurrencyUnit="RMB">18.00</Price>
  </Book>
</References>

This is a typical XML file. After editing, save it with the .xml suffix. We can divide this file into two major parts: the prolog and the body.


The first line in this file is the prolog. It contains the XML declaration, which tells the XML parser how to work. The declaration is not strictly mandatory in XML 1.0, but it is strongly recommended, and when present it must be the very first thing in the file.

In the declaration, version is the version of the XML standard that the file uses, and it is required. encoding specifies the character encoding used in the file and can be omitted. When you omit it, the file must be encoded in Unicode (UTF-8 or UTF-16), so it is better not to omit it. Because this example uses the GB2312 character encoding, the encoding statement cannot be omitted here. The prolog can contain other declarations as well, which we will introduce later.

The rest of the file is the body, where the content of the XML file is stored.


We can see that the body of the file is enclosed by the start tag <References> and the end tag </References>. This is called the "root element" of the XML file. <Book> is a "child element" directly under the root element, and under <Book> there are child elements such as <Name>, <Author>, and <Price>. The currency unit is an "attribute" of the <Price> element, and "RMB" is its "attribute value".
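As a rough illustration of these terms, the sketch below parses a copy of Example 1 with Python's xml.etree.ElementTree and reads back the root element, its child elements, and one attribute. The attribute name CurrencyUnit is an assumption (attribute names cannot contain spaces), and the encoding declaration is left out because the text is passed in as an in-memory string.

```python
import xml.etree.ElementTree as ET

# Copy of Example 1. "CurrencyUnit" is an assumed attribute name.
DOC = """<?xml version="1.0"?>
<References>
  <Book>
    <Name>Introduction to XML</Name>
    <Author>Zhang San</Author>
    <Price CurrencyUnit="RMB">20.00</Price>
  </Book>
  <Book>
    <Name>XML Syntax</Name>
    <!-- This book will be published soon -->
    <Author>Li Si</Author>
    <Price CurrencyUnit="RMB">18.00</Price>
  </Book>
</References>"""

root = ET.fromstring(DOC)            # the root element
books = root.findall("Book")         # child elements directly under the root
names = [book.findtext("Name") for book in books]
first_price = books[0].find("Price")

print(root.tag)
print(names)
print(first_price.get("CurrencyUnit"), first_price.text)
```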

First, the XML declaration, which states that the file is an XML file and which version of the XML specification it uses, must come at the very start of the file. No elements, comments, or even whitespace may appear before it.


Second, there can be only one root element in an XML file. In our first example, <References>...</References> is the root element of this XML file.


Third, tags in an XML file must be closed correctly: every start tag must have a corresponding end tag. For example, the start tag <Book> must have the end tag </Book>. This differs from HTML, where the end tag of some elements is optional. A self-contained tag that has no end tag, similar to <br> in HTML, is called an "empty element" in XML and must be written like this: <empty-element-name/>. If the element has attributes, it must be written like this: <empty-element-name attribute-name="attribute value"/>.


Fourth, tags must not cross. In an HTML file you could get away with writing:

<B><H>XXXXXXX</B></H>

Here the <B> and <H> tags overlap. In XML, such interleaving is strictly prohibited: an element opened inside another must be closed before the outer one is closed.


Fifth, attribute values must be enclosed in quotation marks (double quotes, as here, or single quotes), as with "1.0", "gb2312", and "RMB" in the first example. The quotation marks cannot be omitted.


Sixth, tag names, instructions, and attribute names are case-sensitive. In HTML, tags like <B> and <b> have the same meaning, while in XML, tags like <Name>, <name>, and <NAME> are all different.


Seventh, in HTML files, if we want the browser to display what we typed exactly as written, we can put it between <pre></pre> or <xmp></xmp> tags. This is essential for HTML teaching pages, because the HTML source code must be shown on the page itself. In XML, a CDATA section serves this purpose. The parser passes the content of a CDATA section to the application intact, and any tags inside it are not parsed. A CDATA section begins with "<![CDATA[" and ends with "]]>". Everything between those two markers is handed to the downstream application unchanged, including any blanks and line breaks at the beginning and end of the section (note that CDATA must be written in uppercase).
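Several of the rules above can be checked mechanically. The following is a minimal sketch, assuming Python's standard xml.etree.ElementTree parser as the checker. The helper name is_well_formed and the tiny sample documents are made up for illustration.

```python
import xml.etree.ElementTree as ET


def is_well_formed(text: str) -> bool:
    """Return True if the parser accepts the text as well-formed XML."""
    try:
        ET.fromstring(text)
        return True
    except ET.ParseError:
        return False


# Hypothetical sample documents, one per rule.
CASES = {
    "single root, properly nested": "<a><b>x</b></a>",
    "empty element with attribute": '<a><empty k="v"/></a>',
    "two root elements": "<a/><b/>",
    "missing end tag": "<a><b>x</a>",
    "crossing tags": "<B><H>x</B></H>",
    "unquoted attribute value": "<a k=v/>",
    "case mismatch in end tag": "<Name>x</name>",
}

results = {label: is_well_formed(doc) for label, doc in CASES.items()}
for label, ok in results.items():
    print(f"{label}: {'ok' if ok else 'rejected'}")
```

Only the first two samples are accepted; each of the others breaks exactly one of the rules listed above.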


Basic rules for XML elements

Elements are the basic unit of an XML document, which is composed of elements nested layer by layer. The whole document starts from the root element; the root element contains several child elements, and each child element can in turn contain child elements of its own. Together they form the XML document.

XML tag syntax:

1, XML tags are divided into two categories:

Single tag: <中国/>

Double tag: <中国>中国</中国>

2, XML tag name requirements:

1, Tag names can contain letters (including non-Western European characters), digits, underscores (_), hyphens (-), colons (:), and periods (.), but cannot start with a digit, hyphen, or period.

2, Tag names cannot contain <, >, commas (,), $, or similar symbols.

3, Avoid colons (:) in tag names unless you are using a namespace.

4, Tag names should not start with xml, XML, or any other upper and lower case combination of those letters, because such names are reserved.

5, Tag names cannot contain spaces.

6, Tag names are case-sensitive.
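One way to try out the naming rules is to let a parser judge each candidate name. The sketch below is an approximation rather than a full implementation of the XML name grammar: the helper is_valid_tag_name is hypothetical, and it simply asks Python's xml.etree.ElementTree to read an empty element with that name.

```python
import xml.etree.ElementTree as ET


def is_valid_tag_name(name: str) -> bool:
    """Check a candidate tag name by asking the parser to read <name/>."""
    try:
        # Comparing the parsed tag guards against input such as "a b",
        # which would otherwise be read as a tag plus something else.
        return ET.fromstring(f"<{name}/>").tag == name
    except ET.ParseError:
        return False


accepted = ["book_name", "a-b.c", "中国", "Book2"]
rejected = ["1book", "-book", ".book", "book name", "book$"]

for name in accepted + rejected:
    print(repr(name), is_valid_tag_name(name))
```

This only catches what the parser enforces. Names beginning with xml are reserved by convention rather than rejected by a parser, and a colon is interpreted as a namespace prefix, so those two rules need a separate check.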

3, Nested child elements: XML allows child elements to be nested deeply, as long as the nesting is well-formed. An element can also contain several child elements with the same name, and these children are ordered.

First XML:

<China>
    <Beijing>
        <Haidian/>
        <Chaoyang>Chaoyang</Chaoyang>
        <Chaoyang>Chaoyang</Chaoyang>
    </Beijing>
</China>

Second XML:

<China>
    <Beijing>
        <Chaoyang>Chaoyang</Chaoyang>
        <Haidian/>
    </Beijing>
</China>
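To make the point about ordering concrete, here is a small sketch using Python's xml.etree.ElementTree. The two documents are compact reconstructions of the examples in this section, so the exact element names are assumptions. The helper child_tags lists the children of Beijing in document order.

```python
import xml.etree.ElementTree as ET

# Reconstructed sample documents; element names are assumptions.
FIRST = (
    "<China><Beijing><Haidian/>"
    "<Chaoyang>Chaoyang</Chaoyang><Chaoyang>Chaoyang</Chaoyang>"
    "</Beijing></China>"
)
SECOND = "<China><Beijing><Chaoyang>Chaoyang</Chaoyang><Haidian/></Beijing></China>"


def child_tags(doc: str) -> list:
    """Return the tag names of Beijing's children in document order."""
    beijing = ET.fromstring(doc).find("Beijing")
    return [child.tag for child in beijing]


first_order = child_tags(FIRST)
second_order = child_tags(SECOND)
print(first_order)
print(second_order)
```

Children with the same name are kept as separate, ordered entries, so a program reading the document sees them in exactly the order they were written.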

4, Empty elements: an empty element cannot contain child elements or text content.

An empty element can, however, accept multiple attributes.

For example, an empty element with attributes:

<… …="建议" price="30"/>
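A quick sketch of what a parser reports for an empty element, using Python's xml.etree.ElementTree. The element name book and the attribute name name are hypothetical, chosen only so that the sample is complete.

```python
import xml.etree.ElementTree as ET

# "book" and "name" are hypothetical names used for illustration.
empty = ET.fromstring('<book name="Advice" price="30"/>')

has_text = empty.text is not None   # an empty element carries no text
child_count = len(empty)            # and no child elements
attributes = dict(empty.attrib)     # but it can carry several attributes

print(has_text, child_count, attributes)
```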

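5, Character data

Character data is the text between an element's start tag and end tag. The example here contains the same book title twice, once directly between the tags and once on its own indented line:

Struts2开发详解

    Struts2开发详解

The XML parser treats every space and line break that appears inside an element as part of the element's content.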
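The effect of whitespace inside an element can be seen in the sketch below, which uses Python's xml.etree.ElementTree. The element names book and name and the shortened title are assumptions made for illustration.

```python
import xml.etree.ElementTree as ET

# Hypothetical element names; the second <name> puts its text on its own line.
DOC = "<book><name>Struts2</name><name>\n    Struts2\n</name></book>"

compact, spaced = (node.text for node in ET.fromstring(DOC).findall("name"))

print(repr(compact))
print(repr(spaced))
print(compact == spaced, compact == spaced.strip())
```

The parser returns the line breaks and indentation as part of the second value, so an application that wants the bare text has to trim it itself.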

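If the text contains special characters such as < and &, which have special meaning in XML, using them directly in the document will confuse the parser.

<Expression>
    <Comparison>1+3<6</Comparison>
</Expression>

Here the < in 1+3<6 is a special character inside a text string.

There are two ways to solve this:

1, Use entity references: replace the special characters with another special notation.

2, Use a CDATA section: mark the whole passage as a plain string.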
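To see why the raw character is a problem and how escaping resolves it, here is a minimal sketch using Python's standard library. The element name Comparison is taken from the example, while the helper parse_text is hypothetical. xml.sax.saxutils.escape replaces &, < and > with entity references.

```python
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape

raw_text = "1+3<6"


def parse_text(doc: str):
    """Return the element text, or None if the document does not parse."""
    try:
        return ET.fromstring(doc).text
    except ET.ParseError:
        return None


escaped_text = escape(raw_text)   # turns < into an entity reference
broken = parse_text(f"<Comparison>{raw_text}</Comparison>")
fixed = parse_text(f"<Comparison>{escaped_text}</Comparison>")

print(escaped_text)
print(broken)
print(fixed)
```

The parser turns the entity reference back into the original character, so the application receives the text as it was meant to be read.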

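1, Using entity references

<Expression>
    <Comparison>1+3<6</Comparison>
</Expression>

The special character < above is fixed as follows:

<Expression>
    <Comparison>1+3&lt;6</Comparison>
</Expression>

XML predefines five entity references: &lt; (<), &gt; (>), &amp; (&), &apos; ('), and &quot; (").

2, Using the CDATA tag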

Inside the special CDATA tag, all special characters, and even valid elements, are treated as plain strings. Entity references also lose their effect and become literal text. The syntax of CDATA is as follows:

<![CDATA[Text content]]>

<Expression>
    <![CDATA[
        <Comparison>1+3<6</Comparison>
    ]]>
</Expression>

Note: because the XML parser does not parse the content of a CDATA section at all, almost any content can be placed inside it without causing an error. The one exception is the sequence ]]>, which always ends the section.
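The behaviour of a CDATA section can be checked with a short sketch, again using Python's xml.etree.ElementTree. The element names come from the example in this section; the extra &lt; inside the section is added here to show that entity references are left untouched.

```python
import xml.etree.ElementTree as ET

DOC = "<Expression><![CDATA[<Comparison>1+3<6 &lt;</Comparison>]]></Expression>"

root = ET.fromstring(DOC)
text = root.text          # everything inside the CDATA section, as plain text
child_count = len(root)   # the tags inside CDATA are not parsed as elements

print(text)
print(child_count)
```

The parser hands back the markup and the entity reference as literal characters, and the Expression element ends up with no child elements at all.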

1, Comments

An XML document can also contain character data that serves as explanation. This explanatory text is not processed by the XML parser and is called a comment. XML comments are written exactly the same way as in HTML. The syntax is as follows:

<!-- comment text -->

Comments may contain elements and tags; these are ignored along with the rest of the comment. Note:

1, Comments cannot be placed inside a tag.

2, Do not put a comment before the XML declaration; the declaration always comes first.

3, Do not use a double hyphen (--) inside a comment.
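The comment rules can be exercised with the sketch below. It relies on Python's xml.etree.ElementTree, whose default parser discards comments; the helper name parses and the sample documents are assumptions for illustration.

```python
import xml.etree.ElementTree as ET


def parses(text: str) -> bool:
    """Return True if the text is accepted as well-formed XML."""
    try:
        ET.fromstring(text)
        return True
    except ET.ParseError:
        return False


# A comment between elements is skipped by the default parser.
book = ET.fromstring(
    "<Book><!-- This book will be published soon --><Name>XML Syntax</Name></Book>"
)
child_tags = [child.tag for child in book]

double_hyphen_ok = parses("<a><!-- bad -- comment --></a>")
comment_first_ok = parses('<!-- note --><?xml version="1.0"?><a/>')

print(child_tags, double_hyphen_ok, comment_first_ok)
```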


2, Processing instructions

Processing instructions, abbreviated PI, provide information to the applications that process an XML document and tell them how to handle it. A complete XML processing program can read the document's processing instructions and act on them, or pass them on to the next program.

The format of a processing instruction is as follows:

<?processing-instruction-name processing instruction information?>
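As an illustration of how an application might read a processing instruction, the sketch below uses Python's xml.dom.minidom. The xml-stylesheet instruction and the file name style.css are assumptions chosen as a familiar example; they do not appear in the notes above.

```python
from xml.dom import minidom

# The stylesheet instruction and file name are illustrative assumptions.
DOC = (
    '<?xml version="1.0"?>'
    '<?xml-stylesheet type="text/css" href="style.css"?>'
    "<References/>"
)

doc = minidom.parseString(DOC)
instructions = [
    node for node in doc.childNodes
    if node.nodeType == node.PROCESSING_INSTRUCTION_NODE
]

target = instructions[0].target   # the processing instruction name
data = instructions[0].data       # the processing instruction information

print(target)
print(data)
```

The parser does not interpret the instruction itself. It only exposes the name and the information string, and the application decides what to do with them.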

3, Attributes

Characteristics of XML attributes:

1, An attribute of an XML element must have a value, and the value must be enclosed in quotation marks.

2, The same XML element cannot have multiple attributes with the same name.

3, The attributes of an XML element have no order.

<book>
     <isbn>123456780</isbn>
     <book_name>Java讲义</book_name>
      <price>99</price>
</book>


For example, isbn, book_name, and price here are all properties of the book itself, so they can also be written as attributes:

<book isbn="" book_name="" price=""/>
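The three attribute rules can be verified with a short sketch based on Python's xml.etree.ElementTree. The helper name parses and the sample values are assumptions for illustration.

```python
import xml.etree.ElementTree as ET


def parses(text: str) -> bool:
    """Return True if the text is accepted as well-formed XML."""
    try:
        ET.fromstring(text)
        return True
    except ET.ParseError:
        return False


# Rule 1: an attribute needs a quoted value (single or double quotes).
missing_value_ok = parses("<book isbn/>")
single_quotes_ok = parses("<book isbn='123456780'/>")

# Rule 2: the same attribute name may not appear twice on one element.
duplicate_ok = parses('<book isbn="1" isbn="2"/>')

# Rule 3: attribute order carries no meaning.
first = ET.fromstring('<book isbn="123456780" price="99"/>').attrib
second = ET.fromstring('<book price="99" isbn="123456780"/>').attrib
same_attributes = first == second

print(missing_value_ok, single_quotes_ok, duplicate_ok, same_attributes)
```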


The overall structure of an XML document

1, There must be a root element (one and only one).

2, Elements must be closed properly.

3, Elements must be nested properly.

4, Attributes of elements must have values.

A document that fails any of these conditions is a malformed XML document.

A document that meets all of them is a well-formed XML document. A well-formed document that also complies with a DTD or schema is a valid XML document.

An XML document is a standard structured document that can be converted into a DOM (Document Object Model) tree, in which the root element corresponds to the root node of the DOM tree.

<book>
    <computer>
       <name>网上介绍</name>
       <author>redarmy</author>
    </computer>
    <computer>
        <name>网上介绍</name>
       <author>redarmy</author>
    </computer>
</book>



[Figure: DOM tree of the document above. The root node book has several computer child nodes, and each computer node has name and author child nodes.]
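The DOM tree for a document shaped like the book example can be printed with the sketch below, which uses Python's xml.dom.minidom. The helper outline is hypothetical, and the element text is replaced with an English placeholder. Whitespace between tags becomes text nodes in the DOM, so the helper keeps only element nodes.

```python
from xml.dom import minidom

# Same shape as the <book> example; the text content is a placeholder.
DOC = """<book>
    <computer>
        <name>Online introduction</name>
        <author>redarmy</author>
    </computer>
    <computer>
        <name>Online introduction</name>
        <author>redarmy</author>
    </computer>
</book>"""


def outline(node, depth=0):
    """Return one indented line per element node, in document order."""
    lines = []
    if node.nodeType == node.ELEMENT_NODE:
        lines.append("  " * depth + node.nodeName)
        for child in node.childNodes:
            lines.extend(outline(child, depth + 1))
    return lines


tree = outline(minidom.parseString(DOC).documentElement)
print("\n".join(tree))
```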

Notice:

There must be a root tag, and it must be unique.

Start tags and end tags must be used in pairs.

Tags must not cross each other.

Empty tags can be written in the form "<tag-name/>".

Tags are case-sensitive.
