- Sams Teach Yourself XML in 21 Days, Third Edition
- Table of Contents
- About the Author
- Acknowledgments
- We Want to Hear from You!
- Introduction
- Part I: At a Glance
- Day 1. Welcome to XML
- All About Markup Languages
- All About XML
- Looking at XML in a Browser
- Working with XML Data Yourself
- Structuring Your Data
- Creating Well-Formed XML Documents
- Creating Valid XML Documents
- How XML Is Used in the Real World
- Online XML Resources
- Summary
- Q&A
- Workshop
- Day 2. Creating XML Documents
- Choosing an XML Editor
- Using XML Browsers
- Using XML Validators
- Creating XML Documents Piece by Piece
- Creating Prologs
- Creating an XML Declaration
- Creating XML Comments
- Creating Processing Instructions
- Creating Tags and Elements
- Creating CDATA Sections
- Handling Entities
- Summary
- Q&A
- Workshop
- Day 3. Creating Well-Formed XML Documents
- What Makes an XML Document Well-Formed?
- Creating an Example XML Document
- Understanding the Well-Formedness Constraints
- Using XML Namespaces
- Understanding XML Infosets
- Understanding Canonical XML
- Summary
- Q&A
- Workshop
- Day 4. Creating Valid XML Documents: DTDs
- All About DTDs
- Validating a Document by Using a DTD
- Creating Element Content Models
- Commenting a DTD
- Supporting External DTDs
- Handling Namespaces in DTDs
- Summary
- Q&A
- Workshop
- Declaring Attributes in DTDs
- Day 5. Handling Attributes and Entities in DTDs
- Specifying Default Values
- Specifying Attribute Types
- Handling Entities
- Summary
- Q&A
- Workshop
- Day 6. Creating Valid XML Documents: XML Schemas
- Using XML Schema Tools
- Creating XML Schemas
- Dissecting an XML Schema
- The Built-in XML Schema Elements
- Creating Elements and Types
- Specifying a Number of Elements
- Specifying Element Default Values
- Creating Attributes
- Summary
- Q&A
- Workshop
- Day 7. Creating Types in XML Schemas
- Restricting Simple Types by Using XML Schema Facets
- Creating XML Schema Choices
- Using Anonymous Type Definitions
- Declaring Empty Elements
- Declaring Mixed-Content Elements
- Grouping Elements Together
- Grouping Attributes Together
- Declaring all Groups
- Handling Namespaces in Schemas
- Annotating an XML Schema
- Summary
- Q&A
- Workshop
- Part I. In Review
- Well-Formed Documents
- Valid Documents
- Part II: At a Glance
- Day 8. Formatting XML by Using Cascading Style Sheets
- Our Sample XML Document
- Introducing CSS
- Connecting CSS Style Sheets and XML Documents
- Creating Style Sheet Selectors
- Using Inline Styles
- Creating Style Rule Specifications in Style Sheets
- Summary
- Q&A
- Workshop
- Day 9. Formatting XML by Using XSLT
- Introducing XSLT
- Transforming XML by Using XSLT
- Writing XSLT Style Sheets
- Using <xsl:apply-templates>
- Using <xsl:value-of> and <xsl:for-each>
- Matching Nodes by Using the match Attribute
- Working with the select Attribute and XPath
- Using <xsl:copy>
- Using <xsl:if>
- Using <xsl:choose>
- Specifying the Output Document Type
- Summary
- Q&A
- Workshop
- Day 10. Working with XSL Formatting Objects
- Introducing XSL-FO
- Using XSL-FO
- Using XSL Formatting Objects and Properties
- Building an XSL-FO Document
- Handling Inline Formatting
- Formatting Lists
- Formatting Tables
- Summary
- Q&A
- Workshop
- Part II. In Review
- Using CSS
- Using XSLT
- Using XSL-FO
- Part III: At a Glance
- Day 11. Extending HTML with XHTML
- Why XHTML?
- Writing XHTML Documents
- Validating XHTML Documents
- The Basic XHTML Elements
- Organizing Text
- Formatting Text
- Selecting Fonts: <font>
- Comments: <!-->
- Summary
- Q&A
- Workshop
- Day 12. Putting XHTML to Work
- Creating Hyperlinks: <a>
- Linking to Other Documents: <link>
- Handling Images: <img>
- Creating Frame Documents: <frameset>
- Creating Frames: <frame>
- Creating Embedded Style Sheets: <style>
- Formatting Tables: <table>
- Creating Table Rows: <tr>
- Formatting Table Headers: <th>
- Formatting Table Data: <td>
- Extending XHTML
- Summary
- Q&A
- Workshop
- Day 13. Creating Graphics and Multimedia: SVG and SMIL
- Introducing SVG
- Creating an SVG Document
- Creating Rectangles
- Adobe's SVG Viewer
- Using CSS Styles
- Creating Circles
- Creating Ellipses
- Creating Lines
- Creating Polylines
- Creating Polygons
- Creating Text
- Creating Gradients
- Creating Paths
- Creating Text Paths
- Creating Groups and Transformations
- Creating Animation
- Creating Links
- Creating Scripts
- Embedding SVG in HTML
- Introducing SMIL
- Summary
- Q&A
- Workshop
- Day 14. Handling XLinks, XPointers, and XForms
- Introducing XLinks
- Beyond Simple XLinks
- Introducing XPointers
- Introducing XBase
- Introducing XForms
- Summary
- Workshop
- Part III. In Review
- Part IV: At a Glance
- Day 15. Using JavaScript and XML
- Introducing the W3C DOM
- Introducing the DOM Objects
- Working with the XML DOM in JavaScript
- Searching for Elements by Name
- Reading Attribute Values
- Getting All XML Data from a Document
- Validating XML Documents by Using DTDs
- Summary
- Q&A
- Workshop
- Day 16. Using Java and .NET: DOM
- Using Java to Read XML Data
- Finding Elements by Name
- Creating an XML Browser by Using Java
- Navigating Through XML Documents
- Writing XML by Using Java
- Summary
- Q&A
- Workshop
- Day 17. Using Java and .NET: SAX
- An Overview of SAX
- Using SAX
- Using SAX to Find Elements by Name
- Creating an XML Browser by Using Java and SAX
- Navigating Through XML Documents by Using SAX
- Writing XML by Using Java and SAX
- Summary
- Q&A
- Workshop
- Day 18. Working with SOAP and RDF
- Introducing SOAP
- A SOAP Example in .NET
- A SOAP Example in Java
- Introducing RDF
- Summary
- Q&A
- Workshop
- Part IV. In Review
- Part V: At a Glance
- Day 19. Handling XML Data Binding
- Introducing DSOs
- Binding HTML Elements to HTML Data
- Binding HTML Elements to XML Data
- Binding HTML Tables to XML Data
- Accessing Individual Data Fields
- Binding HTML Elements to XML Data by Using the XML DSO
- Binding HTML Tables to XML Data by Using the XML DSO
- Searching XML Data by Using a DSO and JavaScript
- Handling Hierarchical XML Data
- Summary
- Q&A
- Workshop
- Day 20. Working with XML and Databases
- XML, Databases, and ASP
- Storing Databases as XML
- Using XPath with a Database
- Introducing XQuery
- Summary
- Q&A
- Workshop
- Day 21. Handling XML in .NET
- Creating and Editing an XML Document in .NET
- From XML to Databases and Back
- Reading and Writing XML in .NET Code
- Using XML Controls to Display Formatted XML
- Creating XML Web Services
- Summary
- Q&A
- Workshop
- Part V. In Review
- Appendix A. Quiz Answers
- Quiz Answers for Day 1
- Quiz Answers for Day 2
- Quiz Answers for Day 3
- Quiz Answers for Day 4
- Quiz Answers for Day 5
- Quiz Answers for Day 6
- Quiz Answers for Day 7
- Quiz Answers for Day 8
- Quiz Answers for Day 9
- Quiz Answers for Day 10
- Quiz Answers for Day 11
- Quiz Answers for Day 12
- Quiz Answers for Day 13
- Quiz Answers for Day 14
- Quiz Answers for Day 15
- Quiz Answers for Day 16
- Quiz Answers for Day 17
- Quiz Answers for Day 18
- Quiz Answers for Day 19
- Quiz Answers for Day 20
- Quiz Answers for Day 21
Using XML Schema Tools
Today and tomorrow you're going to see how to write XML schemas, but before you dig into the details (and there are plenty of them), it's worth noting that more and more software tools are appearing that can generate XML schemas for you. Although XML authors should know how to write XML schemas, tools that do the work for you can be very handy, so you'll start by introducing them.
Creating Schemas by Using XML Schema-Creation Tools
A growing number of XML schema-creation tools are becoming available; here's a sampling of the ones that are out there as of this writing:
- HiT Software( http://www.hitsw.com/xml_utilites/ )— This is an online automatic XML schema generator and DTD to XML schema converter. You just let it upload a document, and it creates an XML schema for free.
- xmlArchitect ( http://www.sysonyx.com/products/xmlArchitect )— This is an XML editor for creating schemas.
- XMLspy ( http://www.xmlspy.com )— XMLspy is a product family of tools that aid in the creation of XML schemas.
- XRay ( http://architag.com/xray )— This tool provides support for XML schemas and has an integrated online XML tutorial system.
- Microsoft Visual Studio .NET ( http://www.microsoft.com )— Visual Studio .NET can also generate XML schemas for you automatically.
As an example of the schema-creation process, let's take a look at the XML schema generator in the Microsoft Visual Studio .NET development tool. Take a look at this XML document:
<?xml version="1.0"?> <document xmlns="http://xmlpowercorp"> <text> Welcome to XML Schemas! </text> </document>
What if you want to automatically generate an XML schema for this document? You start by opening the XML document in Visual Studio .NET (that is, in Visual Studio .NET, you create a new project and then select Project, Add Existing Item to open the Add Existing Item dialog box, where you can browse to the XML document for which you want to create a schema). Figure 6.1 shows the short sample XML document opened in Visual Studio .NET.
Figure 6.1 An XML document in Visual Studio .NET.
Next, you select XML, Create Schema to create an XML schema for the document. The XML that is generated is shown in Figure 6.2.
Figure 6.2 Creating an XML schema in Visual Studio .NET.
Here's what the generated XML schema looks like (note that it's about four times as long as the original XML document) :
<?xml version="1.0"?> <xs:schema id="NewDataSet" targetNamespace="http://xmlpowercorp" xmlns:mstns="http://xmlpowercorp" xmlns="http://xmlpowercorp" xmlns:xs="http://www.w3.org/2001/XMLSchema" xmlns:msdata="urn:schemas-microsoft-com:xml-msdata" attributeFormDefault="qualified" elementFormDefault="qualified"> <xs:element name="document" msdata:Prefix="ch06"> <xs:complexType> <xs:sequence> <xs:element name="text" msdata:Prefix="ch06" type="xs:string" minOccurs="1" /> </xs:sequence> </xs:complexType> </xs:element> <xs:element name="NewDataSet" msdata:IsDataSet="true" msdata:Prefix="ch06" msdata:EnforceConstraints="False"> <xs:complexType> <xs:choice maxOccurs="unbounded"> <xs:element ref="document" /> </xs:choice> </xs:complexType> </xs:element> </xs:schema>
This is a valid XML schema, although it uses a namespace for Microsoft-specific data types that you're not going to use, and it adds more elements than you'll need.
As mentioned earlier today, you can also use free online XML schema generators such as the one at http://www.hitsw.com/xml_utilities/ to upload XML documents and create XML schemas. Figure 6.3 shows the XML schema that this generator creates for the sample XML document.
Figure 6.3 Using an online XML schema generator.
Validating XML Documents by Using XML Schemas
When you want to validate an XML document by using an XML schema, you can choose from the many XML validators that are available. Here's a starter list:
- Visual Studio .NET ( http://www.microsoft.com )— You just select XML, Validate XML Data to begin validating an XML document.
- Topologi Schematron Validator ( http://www.topologi.com )— This is a free Windows-based tool that validates XML schemas.
- XML Schema Quality Checker ( http://www.alphaworks.ibm.com/tech/xmlsqc )— This is the IBM AlphaWorks XML schema validator.
- Xerces ( http://xml.apache.org/xerces2-j/index.html )— The Apache Project's Xerces 2 XML processor includes XML schema validation.
- XSD Schema Validator ( http://apps.gotdotnet.com/xmltools/xsdvalidator )— This tool is a .NET-based XML schema validator.
- XSV ( http://www.w3.org/2001/03/webdata/xsv )— This is the online W3C XML schema validator.
- Xerces J ( http://tools.decisionsoft.com/schemaValidate.html )— This is a DecisionSoft XML validator.
- Internet Explorer ( http://www.microsoft.com/windows/ie/default.asp )— You can use Microsoft Internet Explorer to validate schemas.
The most widely used of these XML schema validation tools is Internet Explorer. The XML support in Internet Explorer is built into the MSXML package (which was called the Microsoft XML Parser until MSXML version 4.0, when it was named the Microsoft XML Core Services). MSXML version 4.0 is the version that supports XML Schema Definition Language (XSD) schemas. (Note that before version 4.0, MSXML only supported a smaller and different version of XML schemas, which Microsoft called XML-Data Reduced [XDR] schemas.) Table 6.1 lists support for XML schemas, by MSXML version.
Table 6.1. XML Schema Support by MSXML Version
Version |
Support |
MSXML |
No support. |
MSXML 2.0 |
Support only for XDR schemas. |
MSXML 2.6 |
Support only for XDR schemas. |
MSXML 3.0 |
Support only for XDR schemas. |
MSXML 4.0 |
Support for XSD and XDR schemas. Note that XSD support is not yet complete. |
Let's put MSXML 4.0 and Internet Explorer 6 to work now. If you want Internet Explorer to validate an XML document by using an XML schema, you have to do a little extra work by using JScript, Internet Explorer's version of JavaScript. You'll take a more detailed look at JavaScript on Day 15, "Using JavaScript and XML," and you don't have to write any JavaScript until then—all the work is already done for you in the HTML document ch06_01.html, which is shown in Listing 6.1. You can load this HTML document in an XML document, as stored in a file named ch06_02.xml, and validate it.
Example 6.1. An HTML Document That Can Validate an XML Document (ch06_01.xml)
<HTML> <HEAD> <TITLE> Validating With XML Schemas </TITLE> <SCRIPT LANGUAGE="JavaScript"> document.write("<H1>Validating With XML Schemas</H1>"); var parser = new ActiveXObject("MSXML2.DOMDocument.4.0"); parser.validateOnParse = true; if (parser.load("ch06_02.xml")) { document.write("The document is valid!"); } else { if (parser.parseError.errorCode != 0) { document.write(parser.parseError.reason); } } </SCRIPT> </HEAD> <BODY></BODY> </HTML>
As an example, you can validate the following sample XML document, which you have already seen today:
<?xml version="1.0"?> <document xmlns="http://xmlpowercorp"> <text> Welcome to XML Schemas! </text> </document>
How do you connect an XML schema to an XML document? Different XML processors do it in different ways, but, slowly, a standard is emerging. If you define a namespace, usually named xsi, for the URI www.w3.org/2001/XMLSchema-instance, you can then use an attribute named xsi:schemaLocation in the document element to specify the URI of the document's XML schema.
Not many XML processors support this attribute yet, but Internet Explorer does. To use this attribute so that Internet Explorer will understand it, you assign it a text string, giving the namespace you're using in our XML document, which is http://xmlpowercorp here, and the URI of the XML schema, which is ch0603.xsd in this case (assuming that ch0603.xsd is in the same directory as the XML document), like this: xsi:schemaLocation="http://xmlpowercorp ch06_03.xsd". (If you're not using a namespace in our XML document, you can use the xsi:noNamespaceSchemaLocation attribute.) Listing 6.2 shows how this works in the XML document ch06_02.xml.
Example 6.2. A Sample XML Document to Verify (ch06_02.xml)
<?xml version="1.0"?> <document xmlns="http://xmlpowercorp" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://xmlpowercorp ch06_03.xsd"> <text> Welcome to XML Schemas! </text> </document>
Listing 6.3 shows an XML schema (ch06_03.xsd) you can use for this example.
Example 6.3. The XML Schema for the First Example (ch06_03.xsd)
<?xml version="1.0"?> <xsd:schema targetNamespace="http://xmlpowercorp" xmlns="http://xmlpowercorp" xmlns:xsd="http://www.w3.org/2001/XMLSchema" attributeFormDefault="qualified" elementFormDefault="qualified"> <xsd:element name="document"> <xsd:complexType> <xsd:sequence> <xsd:element name="text" type="xsd:string" minOccurs="1" /> </xsd:sequence> </xsd:complexType> </xsd:element> </xsd:schema>
Just by looking at the XML schema in Listing 6.3, you can get an idea of what's going on: You declare an element named <document> and another one named <text> to match what's in the XML document. Then when you open ch06_01.html, Internet Explorer loads in the XML document, ch06_02.xml, and then the XML schema, ch06_03.xsd, and it validates the XML document, as shown in Figure 6.4. (Note that to run this example, ch06_01.html, ch06_02.xml, and ch06_03.xsd should all be in the same directory. )
Figure 6.4 Validating with an XML schema in Internet Explorer.
On the other hand, what if the XML processor you're using objects to the xsi:schemaLocation attribute, which you use to connect the XML schema to an XML document, as nonstandard? It turns out that it's actually not necessary to embed an xsi:schemaLocation attribute in an XML document to validate it by using Internet Explorer—you can use JavaScript to tell Internet Explorer where to find the XML schema. Listing 6.4 shows an HTML document (ch06_04.html) that does that.
Example 6.4. An HTML Document That Verifies an XML Document (ch06_04.html)
<HTML> <HEAD> <TITLE> Validating With XML Schemas </TITLE> <SCRIPT LANGUAGE="JScript"> document.write("<H1>Validating With XML Schemas</H1>"); var schemaHandler = new ActiveXObject("MSXML2.XMLSchemaCache.4.0"); schemaHandler.add("http://xmlpowercorp", "ch06_03.xsd"); var parser = new ActiveXObject("MSXML2.DOMDocument.4.0"); parser.schemas = schemaHandler; parser.validateOnParse = true; if (parser.load("ch06_05.xml")) { document.write("The document is valid!"); } else { if (parser.parseError.errorCode != 0) { document.write(parser.parseError.reason); } } </SCRIPT> </HEAD> <BODY></BODY> </HTML>
By using ch06_04.html, you don't have to use the xsi:schemaLocation attribute, as you can see in the new version of the sample XML document, ch06_05.xml, which is shown in Listing 6.5.
Example 6.5. The XML Document to Be Verified (ch06_05.xml)
<?xml version="1.0"?> <document xmlns="http://xmlpowercorp"> <text> Welcome to XML Schemas! </text> </document>
When you open this HTML document, ch06_04.html, in Internet Explorer, you get the same results shown in Figure 6.4 as it validates the new version of the XML document. (Note that to run this example, ch06_04.html, ch06_05.xml, and ch06_03.xsd should all be in the same directory.)
Now that you have some experience using software both in generating XML schemas and using them to validate XML, it's time to get into the meat of today's discussion: creating our own XML schemas.