Home > Articles > Web Services > XML

📄 Contents

  1. Sams Teach Yourself XML in 21 Days, Third Edition
  2. Table of Contents
  3. About the Author
  4. Acknowledgments
  5. We Want to Hear from You!
  6. Introduction
  7. Part I: At a Glance
  8. Day 1. Welcome to XML
  9. All About Markup Languages
  10. All About XML
  11. Looking at XML in a Browser
  12. Working with XML Data Yourself
  13. Structuring Your Data
  14. Creating Well-Formed XML Documents
  15. Creating Valid XML Documents
  16. How XML Is Used in the Real World
  17. Online XML Resources
  18. Summary
  19. Q&A
  20. Workshop
  21. Day 2. Creating XML Documents
  22. Choosing an XML Editor
  23. Using XML Browsers
  24. Using XML Validators
  25. Creating XML Documents Piece by Piece
  26. Creating Prologs
  27. Creating an XML Declaration
  28. Creating XML Comments
  29. Creating Processing Instructions
  30. Creating Tags and Elements
  31. Creating CDATA Sections
  32. Handling Entities
  33. Summary
  34. Q&A
  35. Workshop
  36. Day 3. Creating Well-Formed XML Documents
  37. What Makes an XML Document Well-Formed?
  38. Creating an Example XML Document
  39. Understanding the Well-Formedness Constraints
  40. Using XML Namespaces
  41. Understanding XML Infosets
  42. Understanding Canonical XML
  43. Summary
  44. Q&A
  45. Workshop
  46. Day 4. Creating Valid XML Documents: DTDs
  47. All About DTDs
  48. Validating a Document by Using a DTD
  49. Creating Element Content Models
  50. Commenting a DTD
  51. Supporting External DTDs
  52. Handling Namespaces in DTDs
  53. Summary
  54. Q&A
  55. Workshop
  56. Declaring Attributes in DTDs
  57. Day 5. Handling Attributes and Entities in DTDs
  58. Specifying Default Values
  59. Specifying Attribute Types
  60. Handling Entities
  61. Summary
  62. Q&A
  63. Workshop
  64. Day 6. Creating Valid XML Documents: XML Schemas
  65. Using XML Schema Tools
  66. Creating XML Schemas
  67. Dissecting an XML Schema
  68. The Built-in XML Schema Elements
  69. Creating Elements and Types
  70. Specifying a Number of Elements
  71. Specifying Element Default Values
  72. Creating Attributes
  73. Summary
  74. Q&A
  75. Workshop
  76. Day 7. Creating Types in XML Schemas
  77. Restricting Simple Types by Using XML Schema Facets
  78. Creating XML Schema Choices
  79. Using Anonymous Type Definitions
  80. Declaring Empty Elements
  81. Declaring Mixed-Content Elements
  82. Grouping Elements Together
  83. Grouping Attributes Together
  84. Declaring all Groups
  85. Handling Namespaces in Schemas
  86. Annotating an XML Schema
  87. Summary
  88. Q&A
  89. Workshop
  90. Part I. In Review
  91. Well-Formed Documents
  92. Valid Documents
  93. Part II: At a Glance
  94. Day 8. Formatting XML by Using Cascading Style Sheets
  95. Our Sample XML Document
  96. Introducing CSS
  97. Connecting CSS Style Sheets and XML Documents
  98. Creating Style Sheet Selectors
  99. Using Inline Styles
  100. Creating Style Rule Specifications in Style Sheets
  101. Summary
  102. Q&A
  103. Workshop
  104. Day 9. Formatting XML by Using XSLT
  105. Introducing XSLT
  106. Transforming XML by Using XSLT
  107. Writing XSLT Style Sheets
  108. Using <xsl:apply-templates>
  109. Using <xsl:value-of> and <xsl:for-each>
  110. Matching Nodes by Using the match Attribute
  111. Working with the select Attribute and XPath
  112. Using <xsl:copy>
  113. Using <xsl:if>
  114. Using <xsl:choose>
  115. Specifying the Output Document Type
  116. Summary
  117. Q&A
  118. Workshop
  119. Day 10. Working with XSL Formatting Objects
  120. Introducing XSL-FO
  121. Using XSL-FO
  122. Using XSL Formatting Objects and Properties
  123. Building an XSL-FO Document
  124. Handling Inline Formatting
  125. Formatting Lists
  126. Formatting Tables
  127. Summary
  128. Q&A
  129. Workshop
  130. Part II. In Review
  131. Using CSS
  132. Using XSLT
  133. Using XSL-FO
  134. Part III: At a Glance
  135. Day 11. Extending HTML with XHTML
  136. Why XHTML?
  137. Writing XHTML Documents
  138. Validating XHTML Documents
  139. The Basic XHTML Elements
  140. Organizing Text
  141. Formatting Text
  142. Selecting Fonts: <font>
  143. Comments: <!-->
  144. Summary
  145. Q&A
  146. Workshop
  147. Day 12. Putting XHTML to Work
  148. Creating Hyperlinks: <a>
  149. Linking to Other Documents: <link>
  150. Handling Images: <img>
  151. Creating Frame Documents: <frameset>
  152. Creating Frames: <frame>
  153. Creating Embedded Style Sheets: <style>
  154. Formatting Tables: <table>
  155. Creating Table Rows: <tr>
  156. Formatting Table Headers: <th>
  157. Formatting Table Data: <td>
  158. Extending XHTML
  159. Summary
  160. Q&A
  161. Workshop
  162. Day 13. Creating Graphics and Multimedia: SVG and SMIL
  163. Introducing SVG
  164. Creating an SVG Document
  165. Creating Rectangles
  166. Adobe's SVG Viewer
  167. Using CSS Styles
  168. Creating Circles
  169. Creating Ellipses
  170. Creating Lines
  171. Creating Polylines
  172. Creating Polygons
  173. Creating Text
  174. Creating Gradients
  175. Creating Paths
  176. Creating Text Paths
  177. Creating Groups and Transformations
  178. Creating Animation
  179. Creating Links
  180. Creating Scripts
  181. Embedding SVG in HTML
  182. Introducing SMIL
  183. Summary
  184. Q&A
  185. Workshop
  186. Day 14. Handling XLinks, XPointers, and XForms
  187. Introducing XLinks
  188. Beyond Simple XLinks
  189. Introducing XPointers
  190. Introducing XBase
  191. Introducing XForms
  192. Summary
  193. Workshop
  194. Part III. In Review
  195. Part IV: At a Glance
  196. Day 15. Using JavaScript and XML
  197. Introducing the W3C DOM
  198. Introducing the DOM Objects
  199. Working with the XML DOM in JavaScript
  200. Searching for Elements by Name
  201. Reading Attribute Values
  202. Getting All XML Data from a Document
  203. Validating XML Documents by Using DTDs
  204. Summary
  205. Q&A
  206. Workshop
  207. Day 16. Using Java and .NET: DOM
  208. Using Java to Read XML Data
  209. Finding Elements by Name
  210. Creating an XML Browser by Using Java
  211. Navigating Through XML Documents
  212. Writing XML by Using Java
  213. Summary
  214. Q&A
  215. Workshop
  216. Day 17. Using Java and .NET: SAX
  217. An Overview of SAX
  218. Using SAX
  219. Using SAX to Find Elements by Name
  220. Creating an XML Browser by Using Java and SAX
  221. Navigating Through XML Documents by Using SAX
  222. Writing XML by Using Java and SAX
  223. Summary
  224. Q&A
  225. Workshop
  226. Day 18. Working with SOAP and RDF
  227. Introducing SOAP
  228. A SOAP Example in .NET
  229. A SOAP Example in Java
  230. Introducing RDF
  231. Summary
  232. Q&A
  233. Workshop
  234. Part IV. In Review
  235. Part V: At a Glance
  236. Day 19. Handling XML Data Binding
  237. Introducing DSOs
  238. Binding HTML Elements to HTML Data
  239. Binding HTML Elements to XML Data
  240. Binding HTML Tables to XML Data
  241. Accessing Individual Data Fields
  242. Binding HTML Elements to XML Data by Using the XML DSO
  243. Binding HTML Tables to XML Data by Using the XML DSO
  244. Searching XML Data by Using a DSO and JavaScript
  245. Handling Hierarchical XML Data
  246. Summary
  247. Q&A
  248. Workshop
  249. Day 20. Working with XML and Databases
  250. XML, Databases, and ASP
  251. Storing Databases as XML
  252. Using XPath with a Database
  253. Introducing XQuery
  254. Summary
  255. Q&A
  256. Workshop
  257. Day 21. Handling XML in .NET
  258. Creating and Editing an XML Document in .NET
  259. From XML to Databases and Back
  260. Reading and Writing XML in .NET Code
  261. Using XML Controls to Display Formatted XML
  262. Creating XML Web Services
  263. Summary
  264. Q&A
  265. Workshop
  266. Part V. In Review
  267. Appendix A. Quiz Answers
  268. Quiz Answers for Day 1
  269. Quiz Answers for Day 2
  270. Quiz Answers for Day 3
  271. Quiz Answers for Day 4
  272. Quiz Answers for Day 5
  273. Quiz Answers for Day 6
  274. Quiz Answers for Day 7
  275. Quiz Answers for Day 8
  276. Quiz Answers for Day 9
  277. Quiz Answers for Day 10
  278. Quiz Answers for Day 11
  279. Quiz Answers for Day 12
  280. Quiz Answers for Day 13
  281. Quiz Answers for Day 14
  282. Quiz Answers for Day 15
  283. Quiz Answers for Day 16
  284. Quiz Answers for Day 17
  285. Quiz Answers for Day 18
  286. Quiz Answers for Day 19
  287. Quiz Answers for Day 20
  288. Quiz Answers for Day 21
Recommended Book

Using XSL-FO

You've already seen a short XSL-FO example, but in fact, XML documents are only rarely formatted by directly using formatting objects as in the example you've seen because the process becomes too complex. What usually happens is that you take an XML document (with the extension .xml), run it through an XSLT processor to create a new XML document that uses XSL-FO (with the extension .fo), and then use an XSL-FO processor to create the formatted display document (with the extension .pdf in today's discussion). You do things this way because any document except a nontrivial one has many paragraphs of text or data, and to format each one by hand would be a time-consuming process.

In fact, XSLT was originally developed for formatting XML documents by using XSL-FO objects. Since then, being able to access XML data without writing software using XSLT in itself has become so powerful that XSLT has outstripped XSL-FO in popularity.

For example, say that you want to use XSL-FO to format the XML document with state data that you worked with yesterday. Let's use that document today, renaming it ch10_01.xml, as shown in Listing 10.1.

Example 10.1. An Example of an XML Document (ch10_01.xml)

<?xml version="1.0" encoding ="UTF-8"?>
<states>

    <state>
        <name>California</name>
        <population units="people">33871648</population><!--2000 census-->
        <capital>Sacramento</capital>
        <bird>Quail</bird>
        <flower>Golden Poppy</flower>
        <area units="square miles">155959</area>
    </state>

    <state>
        <name>Massachusetts</name>
        <population units="people">6349097</population><!--2000 census-->
        <capital>Boston</capital>
        <bird>Chickadee</bird>
        <flower>Mayflower</flower>
        <area units="square miles">7840</area>
    </state>

    <state>
        <name>New York</name>
        <population units="people">18976457</population><!--2000 census-->
        <capital>Albany</capital>
        <bird>Bluebird</bird>
        <flower>Rose</flower>
        <area units="square miles">47214</area>
    </state>

</states>

As you'll see later today, the XSL-FO document holding this data that you're going to feed into an XSL-FO processor is about three times the length of ch10_01.xml, which is why it makes sense to use XSLT to convert ch10_01.xml into a document that uses XSL-FO. Let's take a look at the XSLT style sheet that will do that.

Using XSLT to Create an XSL-FO Document

Although you could format ch10_01.xml by hand, it's easier to use an XSLT style sheet to do so. Listing 10.2 contains the style sheet (ch10_02.xml) that you're going to use in this example. In this case, you're just going to extract the state data from ch10_01.xml and present that data in a list form, using 18-point font.

Example 10.2. An XSL Document That Adds XSL-FO Formatting (ch10_02.xsl)

<?xml version='1.0'?>
<xsl:stylesheet xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
    xmlns:fo="http://www.w3.org/1999/XSL/Format"
    version='1.0'>

    <xsl:template match="states">
        <fo:root>

            <fo:layout-master-set>
                 <fo:simple-page-master master-name="mainPage"
                     page-height="300mm" page-width="200mm"
                     margin-top="20mm" margin-bottom="20mm"
                     margin-left="20mm" margin-right="20mm">

                     <fo:region-body
                       margin-top="0mm" margin-bottom="10mm"
                       margin-left="0mm" margin-right="0mm"/>

                     <fo:region-after extent="20mm"/>
                 </fo:simple-page-master>
             </fo:layout-master-set>

             <fo:page-sequence master-reference="mainPage">
                 <fo:flow  flow-name="xsl-region-body">
                     <xsl:apply-templates/>
                 </fo:flow>
             </fo:page-sequence>

        </fo:root>
    </xsl:template>

    <xsl:template match="state/name">
        <fo:block font-weight="bold" font-size="18pt"
            line-height="24pt" font-family="sans-serif"
            text-decoration="underline">
            Name:
            <xsl:value-of select="."/>
        </fo:block>
    </xsl:template>

    <xsl:template match="state/population">
        <fo:block font-size="18pt" line-height="24pt"
            font-family="sans-serif">
            Population (people):
            <xsl:value-of select="."/>
        </fo:block>
    </xsl:template>

    <xsl:template match="state/capital">
        <fo:block font-size="18pt" line-height="24pt" font-family="sans-serif">
            Capital:
            <xsl:value-of select="."/>
        </fo:block>
    </xsl:template>

    <xsl:template match="state/bird">
        <fo:block font-size="18pt" line-height="24pt" font-family="sans-serif">
            Bird:
            <xsl:value-of select="."/>
        </fo:block>
    </xsl:template>

    <xsl:template match="state/flower">
        <fo:block font-size="18pt" line-height="24pt" font-family="sans-serif">
            Flower:
            <xsl:value-of select="."/>
        </fo:block>
    </xsl:template>

    <xsl:template match="state/area">
        <fo:block font-size="18pt" line-height="24pt" font-family="sans-serif">
            Area (square miles):
            <xsl:value-of select="."/>
        </fo:block>
    </xsl:template>

</xsl:stylesheet>

Today you'll see how ch10_02.xsl works and what it does. The first step, as described in the following section, is to put this style sheet to work and create the XSL-FO document.

Creating an XSL-FO Document by Using an XSLT Style Sheet

You're ready to create an XSL-FO document, which you can call ch10_03.fo. This document is the one you'll feed into an XSL-FO processor to create a formatted PDF document that will display the data from the XML document ch10_01.xml. To create ch10_03.fo, you only need to apply the XSLT style sheet ch10_02.xsl to the XML document ch10_01.xml. For example, if you have Java 1.4 or later installed, you can use the Java file ch09_05.class that is in the code download area for this book that you used yesterday, like this:

%java ch09_05 ch10_01.xml ch10_02.xsl ch10_03.fo

This creates ch10_03.fo, which is ch10_01.xml formatted with XSL-FO. Listing 10.3 presents ch10_03.fo; note the length of it compared to the original XML document, ch10_01.xml, and you can see why it's a good idea to use XSLT to create XSL-FO documents if you're working with data of any significant length.

Example 10.3. An XSL-FO Document (ch10_03.fo)

<?xml version="1.0" encoding="UTF-8"?>
<fo:root xmlns:fo="http://www.w3.org/1999/XSL/Format">
    <fo:layout-master-set>
        <fo:simple-page-master margin-right="20mm"
            margin-left="20mm" margin-bottom="20mm" margin-top="20mm"
            page-width="200mm" page-height="300mm" master-name="mainPage">
            <fo:region-body margin-right="0mm" margin-left="0mm"
                margin-bottom="10mm" margin-top="0mm"/>
            <fo:region-after extent="20mm"/>
        </fo:simple-page-master>
    </fo:layout-master-set>

    <fo:page-sequence master-reference="mainPage">
        <fo:flow flow-name="xsl-region-body">

            <fo:block text-decoration="underline" font-family="sans-serif"
                line-height="24pt" font-size="18pt" font-weight="bold">
                Name:
                California
            </fo:block>
            <fo:block font-family="sans-serif" line-height="24pt"
                font-size="18pt">
                Population (people):
                33871648
            </fo:block>
            <fo:block font-family="sans-serif" line-height="24pt"
                font-size="18pt">
                Capital:
                Sacramento
            </fo:block>
            <fo:block font-family="sans-serif" line-height="24pt"
                font-size="18pt">
                Bird:
                Quail
            </fo:block>
            <fo:block font-family="sans-serif" line-height="24pt"
                font-size="18pt">
                Flower:
                Golden Poppy
            </fo:block>
            <fo:block font-family="sans-serif" line-height="24pt"
                font-size="18pt">
                Area (square miles) :
                155959
            </fo:block>

            <fo:block text-decoration="underline" font-family="sans-serif"
                line-height="24pt" font-size="18pt" font-weight="bold">
                Name:
                Massachusetts
            </fo:block>
            <fo:block font-family="sans-serif" line-height="24pt"
                font-size="18pt">
                Population (people):
                6349097
            </fo:block>
            <fo:block font-family="sans-serif" line-height="24pt"
                font-size="18pt">
                Capital:
                Boston
            </fo:block>
            <fo:block font-family="sans-serif" line-height="24pt"
                font-size="18pt">
                Bird:
                Chickadee
            </fo:block>
            <fo:block font-family="sans-serif" line-height="24pt"
                font-size="18pt">
                Flower:
                Mayflower
            </fo:block>
            <fo:block font-family="sans-serif" line-height="24pt"
                font-size="18pt">
                Area (square miles):
                7840
            </fo:block>

            <fo:block text-decoration="underline" font-family="sans-serif"
                line-height="24pt" font-size="18pt" font-weight="bold">
                Name:
                New York
            </fo:block>
            <fo:block font-family="sans-serif" line-height="24pt"
                font-size="18pt">
                Population (people):
                18976457
            </fo:block>
            <fo:block font-family="sans-serif" line-height="24pt"
                font-size="18pt">
                Capital:
                Albany
            </fo:block>
            <fo:block font-family="sans-serif" line-height="24pt"
                font-size="18pt">
                Bird:
                Bluebird
            </fo:block>
            <fo:block font-family="sans-serif" line-height="24pt"
                font-size="18pt">
                Flower:
                Rose
            </fo:block>
            <fo:block font-family="sans-serif" line-height="24pt"
                font-size="18pt">
                Area (square miles):
                47214
            </fo:block>

        </fo:flow>
    </fo:page-sequence>
</fo:root>

You now have ch10_03.fo, which is ready to feed into an XSL-FO processor.

Creating a PDF Document

To use ch10_03.fo and convert it into a PDF file, ch10_04.pdf, you can use what is probably the most popular XSL-FO processor, the Apache XML Project's FOP. You can get FOP for free at http://xml.apache.org/fop; just click the Download button. The current version as of this writing is 0.20.4 (0.20.5 is available in a release candidate version, but it's not yet official), and it's written in Java, which means you have to have Java installed in order to use it. The compressed file you download is fop-0.20.4-bin.tar.gz. (.tar.gz files are targeted to Unix, but Windows unzip utilities, such as WinZip, from http://www.winzip.com, can unzip them as well.)

Here's how to use FOP to convert ch10_03.fo into ch10_04.pdf, assuming that ch10_03.fo is in the same directory where you unzipped FOP (the -cp switch here sets the Java classpath variable, which you'll discuss further on Day 16, "Using Java and .NET: DOM"):

%java -cp build\fop.jar;lib\batik.jar;lib\xalan-2.3.1.jar;
lib\xercesImpl-2.0.1.jar;lib\xml-apis.jar;
lib\avalon-framework-cvs-20020315.jar;lib\logkit-1.0.jar;
lib\jimi-1.0.jar org.apache.fop.apps.Fop ch10_03.fo ch10_04.pdf

This is not very easy to type, so FOP also supplies shell scripts for most shells, including a .bat file for Windows, which means that the following is usually all you have to type:

%fop ch10_03.fo ch10_04.pdf

This creates ch10_04.pdf, which is the goal you've been working toward. To view this PDF document, you can use the Adobe Acrobat PDF viewer, which you can download for free from Adobe at http://www.adobe.com (currently, Adobe Acrobat reader is available at http://www.adobe.com/products/acrobat/readermain.html). Figure 10.1 shows ch10_04.pdf in Adobe Acrobat. As you can see in the figure, the data from the XML document has indeed been formatted into a PDF document.

10fig01.jpg

Figure 10.1 Formatting XML data in PDF format.

Now take a look at how to create your own XSL-FO formatting.

InformIT Promotional Mailings & Special Offers

I would like to receive exclusive offers and hear about products from InformIT and its family of brands. I can unsubscribe at any time.