XML Syntax and Parsing Concepts

May 31, 2002

📄 Contents

␡

⎙ Print

< Back Page 4 of 8 Next >

This chapter is from the book 

XML Family of Specifications: A Practical Guide

Learn More Buy

XML Syntax Rules

In this section, we explain the various syntactical rules of XML. Documents that follow these rules are called well-formed, but not necessarily valid, as we'll see. If your document breaks any of these rules, it will be rejected by most, if not all, XML parsers.

Well-Formedness

The minimal requirement for an XML document is that it be well-formed, meaning that it adheres to a small number of syntax rules,⁶ which are summarized in Table 3-1 and explained in the following sections. However, a document can abide by all these rules and still be invalid. To be valid, a document must both be well-formed and adhere to the constraints imposed by a DTD or XML Schema.

TABLE 3-1 XML Syntax Rules (Well-Formedness Constraints)

The document must have a consistent, well-defined structure.

All attribute values must be quoted (single or double quotes).

White space in content, including line breaks, is significant by default.

All start tags must have corresponding end tags (exception: empty elements).

The root element must contain all others, which must nest properly by start/end tag pairing.

Elements must not overlap; they may be nested, however. (This is also technically true for HTML. Browsers ignore overlapping in HTML, but not in XML.)

Each element except the root element must have exactly one parent element that contains it.

Element and attribute names are case-sensitive: Price and PRICE are different elements.

Keywords such as DOCTYPE and ENTITY must always appear in uppercase; similarly for other DTD keywords such as ELEMENT and ATTLIST.

Tags without content are called empty elements and must end in "/>".

Legal XML Name Characters

An XML Name (sometimes called simply a Name) is a token that

begins with a letter, underscore, or colon (but not other punctuation)
continues with letters, digits, hyphens, underscores, colons, or full stops [periods], known as name characters.

Names beginning with the string "xml", or any string which would match (('X'|'x')('M'|'m')('L'|'l')), are reserved.

Element and attribute names must be valid XML Names. (Attribute values need not be.) An NMTOKEN (name token) is any mixture of name characters (letters, digits, hyphens, underscores, colons, and periods).

NOTE

The Namespaces in XML Recommendation assigns a meaning to names that contain colon characters. Therefore, authors should not use the colon in XML names except for namespace purposes (e.g., xsl:template).

Listing 3-2 illustrates a number of legal XML Names, followed by three that should be avoided but may or may not be identified as illegal, depending on the XML parser you use, and four that are definitely illegal. (This is file name-tests.xml on the CD; you can try this with your favorite parser, or with one of the ones provided on the CD.)

Listing 3-2 Legal, Illegal, and Questionable XML Names

<?xml version = "1.0" standalone = "yes" encoding = "UTF-8"?>
<Test>
<!-- legal -->
  <price />
  <Price />
  <pRice />
  <_price />
  <subtotal07 />
  <discounted-price />
  <discounted_price />
  <discounted.price />
  <discountedPrice />
  <DiscountedPrice />
  <DISCOUNTEDprice />
  <kbs:DiscountedPrice />
  <xlink:role />
  <xsl:apply-templates />
<!-- discouraged -->
  <xml-price />
  <xml:price />
  <discounted:price />
<!-- illegal -->
  <7price />
  <-price />
  <.price />
  <discounted price />
</Test>

From the legal examples, we see that any mixture of uppercase and lowercase is fine, as are numbers, and the punctuation characters that were in the definition.

Since the last three examples in the first group use a colon, they are assumed to be elements in the namespaces identified by the prefixes "kbs", "xlink", and "xsl". Of these, the last two refer to W3C-specified namespaces; xlink:role is an attribute defined by the XLink specification and xsl:apply-templates is an element defined by the XSLT specification. The "kbs" prefix refers to a hypothetical namespace, which I could have declared (but didn't), since namespaces do not come only from the W3C. (See chapter 5 for a thorough discussion of namespaces.)

The three debatable examples are xml-price, xml:price, and discounted: price. The first two use the reserved letters "xml"; you shouldn't use them, but most parsers won't reject them. The discounted:price example uses a colon, which is frowned upon if "discounted" is not meant to be a prefix associated with a declared namespace.

The four illegal cases are much more clear. The first three, 7price, -price, and .price, are illegal because the initial character is not a letter, underscore, or colon. The fourth example is illegal because a space character cannot occur in an XML Name. Most parsers will think this is supposed to be the element named discounted and the attribute named price, minus a required equal sign and value.

NOTE

XML Names and NMTOKENS apply to elements, attributes, processing instructions, and many other constructs where an identifier is required, so it's important to understand what is and what is not legal.

Elements and Attributes Are Case-Sensitive

Unlike HTML, which is case insensitive (as is the SGML metalanguage of which HTML is an appplication), XML is strictly case-sensitive, and so therefore is every application of XML (e.g., XSLT, MathML, SVG and so forth, plus any languages you create). Therefore, the following elements are all unique and are in no way related to one another in XML:

price
Price
PRICE

The case sensitivity nature of XML often confuses novices. Be sure to remember this when doing string comparisons in code.

The W3C's Extensible HyperText Markup Language (XHTML) recasts HTML in XML syntax. In XHTML, all elements and attributes have lowercase names, such as:

body
h1
img
href

Notice that this is not merely a convention; it is an absolute requirement. An XHTML document that contains capital letters in element or attribute names is simply invalid, even though uppercase or mixed-case names such as BODY, Body, or even bOdY would be perfectly acceptable in HTML.

Uppercase Keywords

Since XML is case-sensitive, it should not be surprising that certain special words must appear in a particular case. In general, the keywords that relate to DTDs (e.g., DOCTYPE, ENTITY, CDATA, ELEMENT, ATTLIST, PCDATA, IMPLIED, REQUIRED, and FIXED) must be all uppercase. On the other hand, the various strings used in the XML declaration (e.g., xml, version, standalone, and encoding) must appear in all lowercase.

Case Conventions or Guidelines

When creating your own XML vocabulary, it would be desirable if there were conventions to explain the use of uppercase, lowercase, mixed case, underscores, and hyphens. Unfortunately, no such conventions exist in XML 1.0. It is a good idea to adopt your own conventions and to apply them consistently, at least across your project, but ideally throughout your entire organization.

For example, for element names I prefer using what is often called CamelCase because the initial letter of each word in a multiword name is uppercase and all others are lowercase, creating humps like a camel's back. (It's also sometimes called TitleCase because it resembles the title of a book.) For example:

<DiscountPrice rate="20%" countryCode="US" />

Note that for attributes, I also use CamelCase, except the first word is always begun with a lowercase letter, as in "countryCode". In fact, the terms UpperCamelCase (as I use for elements) and lowerCamelCase (as I use for attributes) are often used to make this distinction more clear. One reason that I favor this convention is that in any context (including documentation), it's easy to distinguish elements from attributes.

It would be just as reasonable, however, to use all uppercase letters for elements, all lowercase for attributes, and a hyphen to separate multipart terms as in the following examples, or even to use all uppercase for elements and attributes.

<DISCOUNT-PRICE rate="20%" country-code="US" />

As stated earlier, for XHTML, the W3C elected to use all lowercase letters. The most important thing is to pick a convention for your project (or your company) and to be consistent across developers and applications.

We've seen UpperCamelCase for elements and lowerCamelCase for attributes in the employee example: Employee with its sex attribute, Address, PhoneNumbers, and so on. The following fragment from the W3C's SOAP 1.2 Part 2 Adjuncts Working Draft (http://www.w3.org/TR/2001/WD-soap12-part2-20011002/#N4008D) illustrates its use of UpperCamelCase for element names and lowerCamelCase for attributes, as well as for namespace prefixes.

<env:Body >
 <m:GetLastTradePrice
   env:encodingStyle="http://www.w3.org/2001/09/soap-encoding"
   xmlns:m="http://example.org/2001/06/quotes" >
  <m:Symbol>DEF</m:Symbol>
 </m:GetLastTradePrice>
</env:Body>

Root Element Contains All Others

There must be one root element, also known as the document element, which is the parent of all other elements. That is, all elements are nested within the root element. All descendants of the root, whether immediate children or not, represent the content of the root. Recall that the name of the root element is given in the DOCTYPE line if a DTD is referenced (either an external or internal one). We also noted that this document element must be the first element the parser encounters (after the XML prolog, which does not contain elements).

A somewhat surprising aspect, at least to this author, is that the XML Recommendation does not preclude a recursive root! In other words, it is possible for a root element to be defined in a DTD as containing itself. Although this is not common, it is worth noting. For example, in NASA's IML DTD, we allowed that the root element Instrument could contain other Instrument children. (The DTD syntax shown here is formally described in chapter 4.)

<!ELEMENT Instrument (Instrument | Port | CommandProcedureSet)* >

Start and End Tags Must Match

Every start tag must have a corresponding end tag to properly delimit the content of the element the tags represent. The start and end tags are indicated exactly as they are in HTML, with < denoting the beginning of a start tag and </ indicating the beginning of the end tag. The end delimiter of each tag is >.

<ElementName>content</ElementName>

Empty Elements

An exception to the rule about start and end tags is the case in which an element has no content. Such empty elements convey information simply by their presence or possibly by their attributes, if any. Examples from XHTML 1.0 include:

<br />
<hr />
<img src="someImage.gif" width="100" height="200" alt="Some Image" />

An empty element begins like a start tag but terminates with the sequence />. Optional white space may be used before the two terminating characters. This author prefers to include a space to emphasize empty elements. The space before /> is necessary for XHTML 1.0 to be handled correctly by older browser versions. Of course, it's also possible to specify an empty element by using regular start and end tags, and this is syntactically identical (from the parser's viewpoint) to the use of empty-element notation.

<img src="someImage.gif" width="100" height="200" alt="Some Image"></img>

Note that just like in HTML (or more appropriately, XHTML), an empty element is often used as a separator, such as <br /> and <hr />, or to indicate by its presence a particular piece of data, or to convey metadata by its attributes. If the term empty element seems strange to you when attributes are involved, just think in terms of the content of the element. There is no content, even when there are attributes, which is why it's called empty.

Proper Nesting of Start and End Tags

No overlapping of start and end tags from different elements is permitted. Although this might seem like an obvious requirement, HTML as implemented by major browsers is considerably more forgiving and recovers from improper tag overlap. Correct nesting looks like this:

<OuterElement>
 <InnerElement>inner content</InnerElement>
</OuterElement>

An example of improper nesting is:

<OuterElement>
 <InnerElement>inner content</OuterElement>
</InnerElement>

Believe it or not, most browsers recover from this type of error in HTML, but they cannot and will not in XML or any language based on XML syntax. The improper nesting example results in either one or two fatal errors, with a message similar to this (depending on the parser):

Fatal error: end tag '</OuterElement>' does not match start tag. Expected
  '</InnerElement>'
Fatal error: end tag '</InnerElement>' does not match start tag. Expected
  '</OuterElement>'

Parent, Child, Ancestor, Descendant

The notion of the root element and the proper nesting rules leads us to some conclusions and terminology about the hierarchy of elements that are invariant across all XML documents. The terms ancestor and descendant are not used in the XML 1.0 Recommendation, but they certainly are in the DOM, XSLT, XPath, and so on, which is why they are introduced here:

An element is a child of exactly one parent, which is the element that contains it.
A parent may have more than one child.
Immediate children and also children of a child are descendants of the parent.
An element is an ancestor of all its descendants.
The root is the ancestor of all elements.
Every element is a descendant of the root.
Every element has exactly one parent, except the root, which has no parent.

Attribute Values Must Be Quoted

In HTML (but not in XHTML), we are permitted to be inconsistent in the use of quotation marks to delimit the values of attributes. Generally, single-word values do not require quotes in HTML. For example, both of these are acceptable and equivalent in HTML:

<IMG SRC=someImage.gif>

<IMG SRC="someImage.gif">

In XML (and in XHTML), however, we are not allowed to be so cavalier about quotes. All attribute values must be quoted, even if there are no embedded spaces.

<img src="someImage.gif" />
<img src='someImage.gif' />
<img src="someImage.gif" width="34" height="17"/>

Notice that either single or double quotes may be used to delimit the attribute values. Of course, if the attribute value contains double quotes, then you must use single quotes as the delimiter, and vice versa.

<Book title="Tudor's Guide to Paris" />
<Object width='5.3"' height='7.1"' />

White Space Is Significant

White space consists of one or more space characters, tabs, carriage returns, line feeds (denoted as #x20, #x9, #xD, and #xA, respectively). In the XML 1.0 Recommendation, white space is symbolized in production rules by a capital "S", with the following definition (See http://www.w3.org/TR/REC-xml#sec-common-syn and http://www.w3.org/TR/REC-xml#sec-white-space):

  S ::= (#x20 | #x9 | #xD | #xA)+

In contrast to HTML, in which a sequence of white space characters is collapsed into a single white space and in which newlines are ignored, in XML all white space is taken literally. This means that the following two examples are not equivalent:

<Publication>
 <Published>1992</Published>
 <Publisher>Harmony Books</Publisher>
</Publication>
<Publication>
 <Published>1992</Published>
 <Publisher>Harmony 
Books</Publisher>
</Publication>

By default, XML parsers handle the Publisher element differently since in the second example, the string "Harmony Books" contains a newline between the two words. The application that invokes the parser can either consider the white space important, ignore it (i.e., strip it), or inform the parser that it wants white space normalized (collapsed like in HTML).

Comments

Comments in XML are just like they are in HTML. They begin with the character sequence . The parser ignores what appears between them, except to verify that the comment is well-formed.

<Publication>
 <Published>1992</Published>
 <!-- This appears to be the second edition. -->
 <Publisher>Harmony Books</Publisher>
</Publication>

In XML, however; there are several restrictions regarding comments:

Comments cannot contain the double hyphen combination "--" anywhere except as part of the comment's start and end tags. Thus, this comment is illegal: 
Comments cannot be nested. This means you need to take care when commenting out a section that already contains comments.
Comments cannot precede the XML declaration because that part of the prolog must be the very first line in the document.
Comments are not permitted in a start or end tag. They can appear only between tags (as if they were content) or surrounding tags.
Comments may be used to cause the parser to ignore blocks of elements, provided that the result, once the commented-out block is effectively removed by the parser, is still well-formed XML.
Parsers are not required to make comments available to the application, so don't use them to pass data to an application; use Processing Instructions, discussed next.
Comments are also permitted in the DTD, as discussed in chapter 4.

Processing Instructions

Processing instructions (often abbreviated as PI) are directives intended for an application other than the XML parser. Unlike comments, parsers are required to pass processing instructions on to the application. The general syntax for a PI is:

<?targetApplication applicationData ?>

Where targetApplication is the name (any XML Name) of the application that should receive the instruction, and applicationData is any arbitrary string that doesn't contain the end delimiter. Often applicationData consists of name/value pairs that resemble attributes with values, but there is no requirement concerning the format. Aside from the delimiters "<?" and "?>", which must appear exactly as shown, the only restriction is that there can be no space between the initial question mark and the target. Some examples follow.

<?xml-stylesheet type="text/xsl" href="foo.xsl" ?>
<?MortgageRateHandler rate="7%" period="30 years" ?>
<?javaApp class="MortgageRateHandler" ?>
<?javaApp This is the data for the MortgageRateHandler, folks! ?>
<?acroread file="mortgageRates.pdf" ?>

Processing instructions are not part of the actual structure of the document, so they may appear almost anywhere, except before the XML declaration or in a CDATA section. The parser's responsibility is merely to pass the PI and its data on to the application. Since the same XML document could be processed by multiple applications, it is entirely possible that some applications will ignore a given PI and just pass it down the chain. In that case, the processing instruction will be acted upon only by the application for which it is intended (has meaning).

Although an XML declaration looks like a processing instruction because it is wrapped in the delimiters "<?" and "?>", it is not considered a PI. It is simply an XML declaration, the one-of-a-kind markup that may or may not be the first line of the document.

The target portion of the processing instruction can be a notation (defined in chapter 4). For example:

<!NOTATION AcrobatReader SYSTEM "/usr/local/bin/acroread">

The corresponding PI would be:

<?AcrobatReader file="Readme.pdf" size="75%" ?>

Entity References

Entity references are markup that the parser replaces with character data. In HTML, there are hundreds of predefined character entities, including the Greek alphabet, math symbols, and the copyright symbol. There are only five predefined entity references in XML, however, as shown in Table 3-2.

TABLE 3-2 Predefined Entity References

Character	Entity Reference	Decimal Representation	Hexidecimal Representation
`<`	`<`	`<`	`<`
`>`	`>`	`>`	`>`
`&`	`&`	`&`	`&`
`"`	`"`	`"`	`"`
`'`	`'`	`'`	`'`

We've already seen how entity references can be used as content. They can also appear within attribute values. According to Table 3-2,

 <CD title="Brooks &amp; Dunn&apos;s Greatest Hits" />

is equivalent to the decimal representation:

 <CD title="Brooks &#38; Dunn&#39;s Greatest Hits" />

and to the hexidecimal representation:

 <CD title="Brooks &#x26; Dunn&#x27;s Greatest Hits" />

However, the next line is illegal because ampersand ("&") must be escaped by using either the entity reference or one of its numeric representations:

<CD title="Brooks & Dunn&apos;s Greatest Hits" />

This is because ampersand and less-than are special cases.

NOTE

You are required to use the predefined entities < and & to escape the characters < and & in all cases other than when these characters are used as markup delimiters, or in a comment, a processing instruction, or a CDATA section. In other words, the literal < and & characters can appear only as markup delimiters, or within a comment, a processing instruction, or a CDATA section.

Listing 3-3 illustrates the use of all five predefined character entities, several decimal representations of Greek letters, and the three legal variations of the Brooks & Dunn example. If we run this through an XML parser, we can verify that it is well-formed; we did not use the literal ampersand or the literal less-than before the word StockWatch. Figure 3-1 shows how this example looks in Internet Explorer, which renders the characters that are represented by the entities. It also confirms that the three Brooks & Dunn variations are equivalent.

Listing 3-3 Examples of Predefined Entities and Greek Letters (predefined-entities.xml)

<?xml version="1.0" standalone="yes"?>
<Predefined>
 <Test>The hot tip from today&apos;s &lt;StockWatch&gt; column is:
&quot;AT&amp;T stock is doing better than 
Ralph Spoilsports Motors&apos; stock.&quot;
 </Test>
 <PS>Now, wasn&apos;t that as easy as &#928;?
Or &#945;, &#946;, &#947;?</PS>
 <CD title="Brooks &amp; Dunn&apos;s Greatest Hits" />
 <CD title="Brooks &#38; Dunn&#39;s Greatest Hits" />
 <CD title="Brooks &#x26; Dunn&#x27;s Greatest Hits" />
</Predefined>

FIGURE 3-1 Predefined entities displayed in Internet Explorer

HTML (and therefore XHTML) includes three large sets of predefined entities: Latin1, Special, and Symbols. You can pull these definitions into your XML document using external entities, covered in chapter 4. The files containing the entities are:

http://www.w3.org/TR/xhtml1/DTD/xhtml-lat1.ent
http://www.w3.org/TR/xhtml1/DTD/xhtml-special.ent
http://www.w3.org/TR/xhtml1/DTD/xhtml-symbol.ent

CDATA Sections

Sometimes it is necessary to indicate that a particular block of text should not be interpreted by the parser. One example is a large number of occurrences of the five predefined entities in a block of text that contains no markup, such as a section of code that needs to test for the numeric less-than or Boolean &&. In this case, we want text that would normally be considered markup to be treated simply as literal character data. CDATA sections are designated portions of an XML document in which all markup is ignored by the parser and all text is treated as character data instead. The main uses of CDATA sections are:

To delimit blocks of source code (JavaScript, Java, etc.) embedded in XML
To embed XML, XHTML, or even HTML examples in an XML document

The general syntax for a CDATA section is:

<![CDATA[
multi-line text block to be treated as character data
]]>

No spaces are permitted within the two delimiters "<![CDATA[" and "]]>".

Here's a CDATA section used to escape a block of code:

<![CDATA[
function doIt()
{
  var foo = 3;
  var bar = 13;
  if (foo < 8 && bar > 8)
    alert("Help!");
  else
    alert("I'm Down");
}
]]>

An example of embedded XML in XML follows.

<Example>
 <Number>2.4</Number>
 <XMLCode>
<![CDATA[
<?xml version="1.0" standalone="no" ?>
<!DOCTYPE Message SYSTEM "message.dtd">
<Message mime-type="text/plain">
<!-- This is a trivial example. -->
 <From>The Kenster</From>
 <To>Silly Little Cowgirl</To>
 <Body>
 Hi, there. How is your gardening going?
 </Body>
</Message>
]]>
 </XMLCode>
</Example>

In contrast to our earlier use of the Message example, the character data is not simply the three lines of content of the From, To, and Body elements. When this example is embedded within a CDATA section, the entire block is character data, which in this case means from the XML declaration to and including the </Message> end tag. In other words, the XML prolog, the comment, the start and end tags, and so on, are no longer markup; in this context, they constitute the character data contained by the CDATA section.

< Back Page 4 of 8 Next >

🔖 Save To Your Account

InformIT Promotional Mailings & Special Offers

I would like to receive exclusive offers and hear about products from InformIT and its family of brands. I can unsubscribe at any time.

Privacy Notice

Overview

Pearson Education, Inc., 221 River Street, Hoboken, New Jersey 07030, (Pearson) presents this site to provide information about products and services that can be purchased through this site.

This privacy notice provides an overview of our commitment to privacy and describes how we collect, protect, use and share personal information collected through this site. Please note that other Pearson websites and online products and services have their own separate privacy policies.

Collection and Use of Information

To conduct business and deliver products and services, Pearson collects and uses personal information in several ways in connection with this site, including:

Questions and Inquiries

For inquiries and questions, we collect the inquiry or question, together with name, contact details (email address, phone number and mailing address) and any other additional information voluntarily submitted to us through a Contact Us form or an email. We use this information to address the inquiry and respond to the question.

Online Store

For orders and purchases placed through our online store on this site, we collect order details, name, institution name and address (if applicable), email address, phone number, shipping and billing addresses, credit/debit card information, shipping options and any instructions. We use this information to complete transactions, fulfill orders, communicate with individuals placing orders or visiting the online store, and for related purposes.

Surveys

Pearson may offer opportunities to provide feedback or participate in surveys, including surveys evaluating Pearson products, services or sites. Participation is voluntary. Pearson collects information requested in the survey questions and uses the information to evaluate, support, maintain and improve products, services or sites, develop new products and services, conduct educational research and for other purposes specified in the survey.

Contests and Drawings

Occasionally, we may sponsor a contest or drawing. Participation is optional. Pearson collects name, contact information and other information specified on the entry form for the contest or drawing to conduct the contest or drawing. Pearson may collect additional personal information from the winners of a contest or drawing in order to award the prize and for tax reporting purposes, as required by law.

Newsletters

If you have elected to receive email newsletters or promotional mailings and special offers but want to unsubscribe, simply email information@informit.com.

Service Announcements

On rare occasions it is necessary to send out a strictly service related announcement. For instance, if our service is temporarily suspended for maintenance we might send users an email. Generally, users may not opt-out of these communications, though they can deactivate their account information. However, these communications are not promotional in nature.

Customer Service

We communicate with users on a regular basis to provide requested services and in regard to issues relating to their account we reply via email or phone in accordance with the users' wishes when a user submits their information through our Contact Us form.

Other Collection and Use of Information

Application and System Logs

Pearson automatically collects log data to help ensure the delivery, availability and security of this site. Log data may include technical information about how a user or visitor connected to this site, such as browser type, type of computer/device, operating system, internet service provider and IP address. We use this information for support purposes and to monitor the health of the site, identify problems, improve service, detect unauthorized access and fraudulent activity, prevent and respond to security incidents and appropriately scale computing resources.

Web Analytics

Pearson may use third party web trend analytical services, including Google Analytics, to collect visitor information, such as IP addresses, browser types, referring pages, pages visited and time spent on a particular site. While these analytical services collect and report information on an anonymous basis, they may use cookies to gather web trend information. The information gathered may enable Pearson (but not the third party web trend services) to link information with application and system log data. Pearson uses this information for system administration and to identify problems, improve service, detect unauthorized access and fraudulent activity, prevent and respond to security incidents, appropriately scale computing resources and otherwise support and deliver this site and its services.

Cookies and Related Technologies

This site uses cookies and similar technologies to personalize content, measure traffic patterns, control security, track use and access of information on this site, and provide interest-based messages and advertising. Users can manage and block the use of cookies through their browser. Disabling or blocking certain cookies may limit the functionality of this site.

Do Not Track

This site currently does not respond to Do Not Track signals.

Security

Pearson uses appropriate physical, administrative and technical security measures to protect personal information from unauthorized access, use and disclosure.

Children

This site is not directed to children under the age of 13.

Marketing

Pearson may send or direct marketing communications to users, provided that

Pearson will not use personal information collected or processed as a K-12 school service provider for the purpose of directed or targeted advertising.
Such marketing is consistent with applicable law and Pearson's legal obligations.
Pearson will not knowingly direct or send marketing communications to an individual who has expressed a preference not to receive marketing.
Where required by applicable law, express or implied consent to marketing exists and has not been withdrawn.

Pearson may provide personal information to a third party service provider on a restricted basis to provide marketing solely on behalf of Pearson or an affiliate or customer for whom Pearson is a service provider. Marketing preferences may be changed at any time.

Correcting/Updating Personal Information

If a user's personally identifiable information changes (such as your postal address or email address), we provide a way to correct or update that user's personal data provided to us. This can be done on the Account page. If a user no longer desires our service and desires to delete his or her account, please contact us at customer-service@informit.com and we will process the deletion of a user's account.

Choice/Opt-out

Users can always make an informed choice as to whether they should proceed with certain services offered by InformIT. If you choose to remove yourself from our mailing list(s) simply visit the following page and uncheck any communication you no longer want to receive: www.informit.com/u.aspx.

Sale of Personal Information

Pearson does not rent or sell personal information in exchange for any payment of money.

While Pearson does not sell personal information, as defined in Nevada law, Nevada residents may email a request for no sale of their personal information to NevadaDesignatedRequest@pearson.com.

Supplemental Privacy Statement for California Residents

California residents should read our Supplemental privacy statement for California residents in conjunction with this Privacy Notice. The Supplemental privacy statement for California residents explains Pearson's commitment to comply with California law and applies to personal information of California residents collected in connection with this site and the Services.

Sharing and Disclosure

Pearson may disclose personal information, as follows:

As required by law.
With the consent of the individual (or their parent, if the individual is a minor)
In response to a subpoena, court order or legal process, to the extent permitted or required by law
To protect the security and safety of individuals, data, assets and systems, consistent with applicable law
In connection the sale, joint venture or other transfer of some or all of its company or assets, subject to the provisions of this Privacy Notice
To investigate or address actual or suspected fraud or other illegal activities
To exercise its legal rights, including enforcement of the Terms of Use for this site or another contract
To affiliated Pearson companies and other companies and organizations who perform work for Pearson and are obligated to protect the privacy of personal information consistent with this Privacy Notice
To a school, organization, company or government agency, where Pearson collects or processes the personal information in a school setting or on behalf of such organization, company or government agency.

Links

This web site contains links to other sites. Please be aware that we are not responsible for the privacy practices of such other sites. We encourage our users to be aware when they leave our site and to read the privacy statements of each and every web site that collects Personal Information. This privacy statement applies solely to information collected by this web site.

Requests and Contact

Please contact us about this Privacy Notice or if you have any requests or questions relating to the privacy of your personal information.

Changes to this Privacy Notice

We may revise this Privacy Notice through an updated posting. We will identify the effective date of the revision in the posting. Often, updates are made to provide greater clarity or to comply with changes in regulatory requirements. If the updates involve material changes to the collection, protection, use or disclosure of Personal Information, Pearson will provide notice of the change through a conspicuous notice on this site or other appropriate way. Continued use of the site after the effective date of a posted revision evidences acceptance. Please contact us if you have questions or concerns about the Privacy Notice or any objection to any revisions.

Last Update: November 17, 2020

Email Address

XML Syntax and Parsing Concepts

This chapter is from the book

This chapter is from the book

This chapter is from the book 

XML Syntax Rules

Well-Formedness

TABLE 3-1 XML Syntax Rules (Well-Formedness Constraints)

Legal XML Name Characters

Listing 3-2 Legal, Illegal, and Questionable XML Names

Elements and Attributes Are Case-Sensitive

Uppercase Keywords

Case Conventions or Guidelines

Root Element Contains All Others

Start and End Tags Must Match

Empty Elements

Proper Nesting of Start and End Tags

Parent, Child, Ancestor, Descendant

Attribute Values Must Be Quoted

White Space Is Significant

Comments

Processing Instructions

Entity References

TABLE 3-2 Predefined Entity References

Listing 3-3 Examples of Predefined Entities and Greek Letters (predefined-entities.xml)

CDATA Sections

InformIT Promotional Mailings & Special Offers

Overview

Collection and Use of Information

Questions and Inquiries

Online Store

Surveys

Contests and Drawings

Newsletters

Service Announcements

Customer Service

Other Collection and Use of Information

Application and System Logs

Web Analytics

Cookies and Related Technologies

Do Not Track

Security

Children

Marketing

Correcting/Updating Personal Information

Choice/Opt-out

Sale of Personal Information

Supplemental Privacy Statement for California Residents

Sharing and Disclosure

Links

Requests and Contact

Changes to this Privacy Notice