- Sams Teach Yourself XML in 21 Days, Third Edition
- Table of Contents
- About the Author
- Acknowledgments
- We Want to Hear from You!
- Introduction
- Part I: At a Glance
- Day 1. Welcome to XML
- All About Markup Languages
- All About XML
- Looking at XML in a Browser
- Working with XML Data Yourself
- Structuring Your Data
- Creating Well-Formed XML Documents
- Creating Valid XML Documents
- How XML Is Used in the Real World
- Online XML Resources
- Summary
- Q&A
- Workshop
- Day 2. Creating XML Documents
- Choosing an XML Editor
- Using XML Browsers
- Using XML Validators
- Creating XML Documents Piece by Piece
- Creating Prologs
- Creating an XML Declaration
- Creating XML Comments
- Creating Processing Instructions
- Creating Tags and Elements
- Creating CDATA Sections
- Handling Entities
- Summary
- Q&A
- Workshop
- Day 3. Creating Well-Formed XML Documents
- What Makes an XML Document Well-Formed?
- Creating an Example XML Document
- Understanding the Well-Formedness Constraints
- Using XML Namespaces
- Understanding XML Infosets
- Understanding Canonical XML
- Summary
- Q&A
- Workshop
- Day 4. Creating Valid XML Documents: DTDs
- All About DTDs
- Validating a Document by Using a DTD
- Creating Element Content Models
- Commenting a DTD
- Supporting External DTDs
- Handling Namespaces in DTDs
- Summary
- Q&A
- Workshop
- Declaring Attributes in DTDs
- Day 5. Handling Attributes and Entities in DTDs
- Specifying Default Values
- Specifying Attribute Types
- Handling Entities
- Summary
- Q&A
- Workshop
- Day 6. Creating Valid XML Documents: XML Schemas
- Using XML Schema Tools
- Creating XML Schemas
- Dissecting an XML Schema
- The Built-in XML Schema Elements
- Creating Elements and Types
- Specifying a Number of Elements
- Specifying Element Default Values
- Creating Attributes
- Summary
- Q&A
- Workshop
- Day 7. Creating Types in XML Schemas
- Restricting Simple Types by Using XML Schema Facets
- Creating XML Schema Choices
- Using Anonymous Type Definitions
- Declaring Empty Elements
- Declaring Mixed-Content Elements
- Grouping Elements Together
- Grouping Attributes Together
- Declaring all Groups
- Handling Namespaces in Schemas
- Annotating an XML Schema
- Summary
- Q&A
- Workshop
- Part I. In Review
- Well-Formed Documents
- Valid Documents
- Part II: At a Glance
- Day 8. Formatting XML by Using Cascading Style Sheets
- Our Sample XML Document
- Introducing CSS
- Connecting CSS Style Sheets and XML Documents
- Creating Style Sheet Selectors
- Using Inline Styles
- Creating Style Rule Specifications in Style Sheets
- Summary
- Q&A
- Workshop
- Day 9. Formatting XML by Using XSLT
- Introducing XSLT
- Transforming XML by Using XSLT
- Writing XSLT Style Sheets
- Using <xsl:apply-templates>
- Using <xsl:value-of> and <xsl:for-each>
- Matching Nodes by Using the match Attribute
- Working with the select Attribute and XPath
- Using <xsl:copy>
- Using <xsl:if>
- Using <xsl:choose>
- Specifying the Output Document Type
- Summary
- Q&A
- Workshop
- Day 10. Working with XSL Formatting Objects
- Introducing XSL-FO
- Using XSL-FO
- Using XSL Formatting Objects and Properties
- Building an XSL-FO Document
- Handling Inline Formatting
- Formatting Lists
- Formatting Tables
- Summary
- Q&A
- Workshop
- Part II. In Review
- Using CSS
- Using XSLT
- Using XSL-FO
- Part III: At a Glance
- Day 11. Extending HTML with XHTML
- Why XHTML?
- Writing XHTML Documents
- Validating XHTML Documents
- The Basic XHTML Elements
- Organizing Text
- Formatting Text
- Selecting Fonts: <font>
- Comments: <!-->
- Summary
- Q&A
- Workshop
- Day 12. Putting XHTML to Work
- Creating Hyperlinks: <a>
- Linking to Other Documents: <link>
- Handling Images: <img>
- Creating Frame Documents: <frameset>
- Creating Frames: <frame>
- Creating Embedded Style Sheets: <style>
- Formatting Tables: <table>
- Creating Table Rows: <tr>
- Formatting Table Headers: <th>
- Formatting Table Data: <td>
- Extending XHTML
- Summary
- Q&A
- Workshop
- Day 13. Creating Graphics and Multimedia: SVG and SMIL
- Introducing SVG
- Creating an SVG Document
- Creating Rectangles
- Adobe's SVG Viewer
- Using CSS Styles
- Creating Circles
- Creating Ellipses
- Creating Lines
- Creating Polylines
- Creating Polygons
- Creating Text
- Creating Gradients
- Creating Paths
- Creating Text Paths
- Creating Groups and Transformations
- Creating Animation
- Creating Links
- Creating Scripts
- Embedding SVG in HTML
- Introducing SMIL
- Summary
- Q&A
- Workshop
- Day 14. Handling XLinks, XPointers, and XForms
- Introducing XLinks
- Beyond Simple XLinks
- Introducing XPointers
- Introducing XBase
- Introducing XForms
- Summary
- Workshop
- Part III. In Review
- Part IV: At a Glance
- Day 15. Using JavaScript and XML
- Introducing the W3C DOM
- Introducing the DOM Objects
- Working with the XML DOM in JavaScript
- Searching for Elements by Name
- Reading Attribute Values
- Getting All XML Data from a Document
- Validating XML Documents by Using DTDs
- Summary
- Q&A
- Workshop
- Day 16. Using Java and .NET: DOM
- Using Java to Read XML Data
- Finding Elements by Name
- Creating an XML Browser by Using Java
- Navigating Through XML Documents
- Writing XML by Using Java
- Summary
- Q&A
- Workshop
- Day 17. Using Java and .NET: SAX
- An Overview of SAX
- Using SAX
- Using SAX to Find Elements by Name
- Creating an XML Browser by Using Java and SAX
- Navigating Through XML Documents by Using SAX
- Writing XML by Using Java and SAX
- Summary
- Q&A
- Workshop
- Day 18. Working with SOAP and RDF
- Introducing SOAP
- A SOAP Example in .NET
- A SOAP Example in Java
- Introducing RDF
- Summary
- Q&A
- Workshop
- Part IV. In Review
- Part V: At a Glance
- Day 19. Handling XML Data Binding
- Introducing DSOs
- Binding HTML Elements to HTML Data
- Binding HTML Elements to XML Data
- Binding HTML Tables to XML Data
- Accessing Individual Data Fields
- Binding HTML Elements to XML Data by Using the XML DSO
- Binding HTML Tables to XML Data by Using the XML DSO
- Searching XML Data by Using a DSO and JavaScript
- Handling Hierarchical XML Data
- Summary
- Q&A
- Workshop
- Day 20. Working with XML and Databases
- XML, Databases, and ASP
- Storing Databases as XML
- Using XPath with a Database
- Introducing XQuery
- Summary
- Q&A
- Workshop
- Day 21. Handling XML in .NET
- Creating and Editing an XML Document in .NET
- From XML to Databases and Back
- Reading and Writing XML in .NET Code
- Using XML Controls to Display Formatted XML
- Creating XML Web Services
- Summary
- Q&A
- Workshop
- Part V. In Review
- Appendix A. Quiz Answers
- Quiz Answers for Day 1
- Quiz Answers for Day 2
- Quiz Answers for Day 3
- Quiz Answers for Day 4
- Quiz Answers for Day 5
- Quiz Answers for Day 6
- Quiz Answers for Day 7
- Quiz Answers for Day 8
- Quiz Answers for Day 9
- Quiz Answers for Day 10
- Quiz Answers for Day 11
- Quiz Answers for Day 12
- Quiz Answers for Day 13
- Quiz Answers for Day 14
- Quiz Answers for Day 15
- Quiz Answers for Day 16
- Quiz Answers for Day 17
- Quiz Answers for Day 18
- Quiz Answers for Day 19
- Quiz Answers for Day 20
- Quiz Answers for Day 21
The Basic XHTML Elements
To be able to use XHTML, you need to know what's available, so the following sections describe the basic XHTML elements and their attributes, with examples. These elements might look like HTML, but they're XHTML, which means they're really XML—and that means that there are rigid rules about which element can or must contain what other elements, which attributes are required, and so on. An HTML author should know these rules when making the transition to XHTML. The following sections list what versions of XHTML support each element and attribute (with the exception of XHTML 2.0, whose list of supported elements isn't even available yet). Let's start with the XHTML document element, <html>.
Using the Document Element: <html>
As you saw earlier today, the document element for all XHTML elements is <html>. This element must contain all other elements in an XHTML document. The <html> element is supported in XHTML 1.0 Strict, XHTML 1.0 Transitional, XHTML 1.0 Frameset, and XHTML 1.1. Here are the attributes of the <html> element:
- dir — Sets the direction of text that doesn't have an inherent direction, called directionally neutral text. This attribute can be set to ltr, for left-to-right text, or rtl, for right-to-left text. (Supported in XHTML 1.0 Strict, XHTML 1.0 Transitional, XHTML 1.0 Frameset, and XHTML 1.1.)
- lang — Specifies the base language of the element. (Supported in XHTML 1.0 Strict, XHTML 1.0 Transitional, XHTML 1.0 Frameset, and XHTML 1.1.)
- xml:lang — Specifies the base language for the element when the document is treated as XML. (Supported in XHTML 1.0 Strict, XHTML 1.0 Transitional, XHTML 1.0 Frameset, and XHTML 1.1.)
- xmlns — Is a required attribute that should be set to "http://www.w3.org/1999/xhtml". (Supported in XHTML 1.0 Strict, XHTML 1.0 Transitional, XHTML 1.0 Frameset, and XHTML 1.1.)
As you've already seen today, <html> is the document element for an XHTML document, and it looks like this:
<?xml version="1.0" encoding="UTF-8"?> <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"> <html xmlns="http://www.w3.org/1999/xhtml"> <head> . . . </body> </html>
In HTML, the <HTML> tag is optional because it's the default, and browsers have long come to accept nearly any form of HTML markup. In XHTML, however, the <html> element is required as the document element.
The xmlns attribute is also required, and you use it to set the namespace for the document to "http://www.w3.org/1999/xhtml", as in today's example. The other attributes are optional.
In the XHTML DTDs, the <html> element is declared in such a way that it can contain a <head> element and a <body> element (or a <head> element and a <frameset> element in the XHTML 1.0 Frameset documents). The following section describes the <head> element.
Creating a Document Head: <head>
In an XHTML document, the <head> element contains the document's head, which holds data about the document, scripting elements, and other data not intended for direct display. In XHTML, every XHTML document should have a <head> element, and every <head> element must contain at least a <title> element. The <head> element is supported in XHTML 1.0 Strict, XHTML 1.0 Transitional, XHTML 1.0 Frameset, and XHTML 1.1. Here are the attributes of this element:
- dir — Sets the direction of directionally neutral text. This attribute can be set to ltr, for left-to-right text, or rtl, for right-to-left text. (Supported in XHTML 1.0 Strict, XHTML 1.0 Transitional, XHTML 1.0 Frameset, and XHTML 1.1.)
- lang — Specifies the base language of the element. (Supported in XHTML 1.0 Strict, XHTML 1.0 Transitional, XHTML 1.0 Frameset, and XHTML 1.1.)
- profile — Specifies the location of one or more profile URIs. (Supported in XHTML 1.0 Strict, XHTML 1.0 Transitional, XHTML 1.0 Frameset, and XHTML 1.1.)
- xml:lang — Specifies the base language for the element when the document is treated as XML. (Supported in XHTML 1.0 Strict, XHTML 1.0 Transitional, XHTML 1.0 Frameset, and XHTML 1.1.)
You've already seen a <head> element in the sample XHTML document:
<?xml version="1.0" encoding="UTF-8"?> <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"> <html xmlns="http://www.w3.org/1999/xhtml"> <head> <title> An XHTML Document </title> </head> . . . </html>
XHTML specifies that the following elements may appear in the <head> element:
- <base>
- <isindex>
- <link>
- <meta>
- <noscript>
- <object>
- <script>
- <style>
- <title>
The only one of these elements that's required in the <head> element is the <title> element, which is described in the following section.
Giving a Document a Title: <title>
The <title> element holds the title of the document. A browser displays the document's title in its title bar. The <title> element is supported in XHTML 1.0 Strict, XHTML 1.0 Transitional, XHTML 1.0 Frameset, and XHTML 1.1. Here are its attributes:
- dir — Sets the direction of directionally neutral text. This attribute can be set to ltr, for left-to-right text, or rtl, for right-to-left text. (Supported in XHTML 1.0 Strict, XHTML 1.0 Transitional, XHTML 1.0 Frameset, and XHTML 1.1.)
- lang — Specifies the base language of the element. (Supported in XHTML 1.0 Strict, XHTML 1.0 Transitional, XHTML 1.0 Frameset, and XHTML 1.1.)
- xml:lang — Specifies the base language for the element when the document is treated as XML. (Supported in XHTML 1.0 Strict, XHTML 1.0 Transitional, XHTML 1.0 Frameset, and XHTML 1.1.)
The W3C XHTML DTDs say that "exactly one title is required per document." You've already seen how to use this element in the XHTML document ch11_02.html.
Giving a Document a Body: <body>
The document's body holds the document's content—all the data that the document is meant to display. The <body> element is supported in XHTML 1.0 Strict, XHTML 1.0 Transitional, and XHTML 1.1. Here are this element's attributes:
- alink — Sets the color of hyperlinks when they're being activated. This attribute was deprecated in HTML 4.0. (Supported in XHTML 1.0 Transitional and XHTML 1.0 Frameset.)
- background — Specifies the URI of an image to be used for the browser's background. This attribute was deprecated in HTML 4.0. (Supported in XHTML 1.0 Transitional and XHTML 1.0 Frameset.)
- bgcolor — Specifies the color of the browser's background. This attribute was deprecated in HTML 4.0. (Supported in XHTML 1.0 Transitional and XHTML 1.0 Frameset.)
- class — Sets the style class of the element. (Supported in XHTML 1.0 Strict, XHTML 1.0 Transitional, XHTML 1.0 Frameset, and XHTML 1.1.)
- dir — Sets the direction of directionally neutral text. This attribute can be set to ltr, for left-to-right text, or rtl, for right-to-left text. (Supported in XHTML 1.0 Strict, XHTML 1.0 Transitional, XHTML 1.0 Frameset, and XHTML 1.1.)
- id — Specifies the ID with which to refer to the element. You should set this attribute to a unique identifier. (Supported in XHTML 1.0 Strict, XHTML 1.0 Transitional, XHTML 1.0 Frameset, and XHTML 1.1.)
- lang — Sets the base language of the element. (Supported in XHTML 1.0 Strict, XHTML 1.0 Transitional, XHTML 1.0 Frameset, and XHTML 1.1.)
- link — Sets the color of hyperlinks that have not yet been visited. This attribute was deprecated in HTML 4.0. (Supported in XHTML 1.0 Transitional and XHTML 1.0 Frameset.)
- style — Indicates how a browser should display the element. You should set this to an inline style. (Supported in XHTML 1.0 Strict, XHTML 1.0 Transitional, XHTML 1.0 Frameset, and XHTML 1.1.)
- text — Specifies the color of text in the document. This attribute was deprecated in HTML 4.0. (Supported in XHTML 1.0 Transitional and XHTML 1.0 Frameset.)
- title — Specifies the title of the body. (Supported in XHTML 1.0 Strict, XHTML 1.0 Transitional, XHTML 1.0 Frameset, and XHTML 1.1.)
- vlink — Specifies the color of hyperlinks that have been visited. This attribute was deprecated in HTML 4.0. (Supported in XHTML 1.0 Transitional and XHTML 1.0 Frameset.)
- xml:lang — Specifies the base language for the element when the document is treated as XML. (Supported in XHTML 1.0 Strict, XHTML 1.0 Transitional, XHTML 1.0 Frameset, and XHTML 1.1.)
You've seen the <body> element in the sample XHTML document already:
<?xml version="1.0" encoding="UTF-8"?> <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"> <html xmlns="http://www.w3.org/1999/xhtml"> <head> . . . </head> <body> <h1> Long Live XHTML! </h1> This is an XHTML document. <br/> Pretty good, eh? </body> </html>
A number of common attributes of the HTML <BODY> element were deprecated in HTML 4.0, so they are not part of either XHTML 1.0 Strict or XHTML 1.1. These attributes include such favorites as alink, background, bgcolor, link, text, and vlink. You're now supposed to use style sheets properties instead of these attributes. Listing 11.3 shows an example that assigns values to some of these properties.
Example 11.3. An XHTML Document That Uses Styles (ch11_03.html)
<?xml version="1.0" encoding="UTF-8"?> <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"> <html xmlns="http://www.w3.org/1999/xhtml"> <head> <title> An XHTML Document </title> <style type="text/css"> body {background: coral} a:link {color: black} a:active {color: green} a:visited {color: blue} </style> </head> <body> <h1> Long Live XHTML! </h1> This is an <a href="http://www.w3.org/MarkUp/">XHTML</a>. document. <br/> Pretty good, eh? </body> </html>
Figure 11.5 shows the results of Listing 11.3. The W3C's idea in turning toward style sheets to handle the display is that it wants to separate the display details from the data details. That idea is reaching its zenith in XHTML 2.0, which doesn't support even the usual HTML display elements. Theoretically, this transition makes sense, but it has also delayed the acceptance of XHTML by HTML authors.
Figure 11.5 sing an embedded style sheet.
Note also that if a style sheet contains sensitive characters, such as < or &, you should use external style sheets, as you did on Day 8, "Formatting XML by Using Cascading Style Sheets," because the entire contents of an XHTML document is intended to be valid XML.