XML - Tutorial PDF
XML - Tutorial PDF
XML - Tutorial PDF
Tutorial
XML
version 1.0
General information
The KickStart Tutorial XML is free. If somebody asked you money for it, you've
been swindled!
You're allowed to distribute this ebook as long as you leave the orginal pdf-file
intact and you don't charge anything for it.
Happy XML-ing
Jan Kampherbeek
Webmaster of SpiderPro
http://www.spiderpro.com/
[email protected]
This tutorial
You'll understand the basic XML syntax. An you will know what's
needed to make XML usable.
You won't be an XML expert after following this kickstart tutorial. But
you'll understand the basics of XML. And you'll be able to
understand XML Documents and most of XML DTD's.
Index
General information 2
This Tutorial 3
Index 4
Why do we needXML ? 5
What is XML ? 6
The general structure of XML 7
XML Tags 8
Elements and sub elements 9
XML documents 10
XML attributes 12
Well formed XML documents 13
Valid XML documents 14
XML: The DTD 15
Presenting XML documents 18
About SpiderPro 19
Disclaimer 19
More possibilities
XML makes communication easy. It's a great tool for transactions
between businesses.
But it has much more possibilities.
You can define other languages with XML. A good example is WML
(Wireless Markup Language), the language used in WAP-
communications.
WML is just an XML dialect.
What is XML ?
Simpler SGML
XML is a meta-language.
A meta-language is a language that's used to define other languages. You can use
XML for instance to define a language like WML.
XML is a smaller version of SGML. It's easy to master and that's a major advantage
compared to SGML which is a very complex meta-language.
Define how your data is shown. To show data, you need other techniques.
DTD or Schema
If you want to use a tag, you'll have to define it's meaning.
This definition is stored in a DTD (Document Type Definition). You can define your own
DTD or use an existing one.
Defining a DTD actually means defining a XML language.
An alternative for a DTD is Schema.
XML Tags
Tags
XML tags are created like HTML tags. There's a start tag and a closing tag.
<TAG>content</TAG>
The closing tag uses a slash after the opening bracket, just like in HTML.
The text between the brackets is called an element.
Syntax
The following rules are used for using XML tags:
Tags are case sensitive. The tag <TRAVEL> differs from the tags <Travel> and
<travel>
Starting tags always need a closing tag
All tags must be nested properly
Comments can be used like in HTML: <!-- Comments -->
Between the starting tag and the end tag XML expects the content.
<amount>135</amount> is a valid tag for an element amount that has the content
135
Empty tags
Besides a starting tag and a closing tag, you can use an empty tag. An empty tag does
not have a closing tag.
The syntax differs from HTML: <TAG/>
<car>
<brand>volvo</brand>
<type>v40</type>
<color>green</color>
</car>
Besides the element car three other elements are used: brand, type and color.
Brand, type and color are sub-elements of the element car. In the XML-code the tags of
the sub-elements are enclosed within the tags of the element car. Sub-elements are
also called children
XML documents
The XML declaration
The first line of an XML document is the XML declaration.
It's a special kind of tag:
<?xml version="1.0"?>
<?xml version="1.0"?>
<sales>
<shop>
<number>
100
</number>
<manager>
Ray Bradbury
</manager>
</shop>
<product>
<name>
carrots
</name>
<totalprice>
10
</totalprice>
</product>
</sales>
XML Attributes
Attributes
Elements in XML can use attributes. The syntax is:
<element attribute-name = "attribute-value">....</element>
Avoid attributes
When possible try to avoid attributes. Data structures are more easy described in XML-
tags.
Software that checks XML-documents can do a better job with tags than with attributes.
The rules
To be well formed a document needs to comply to the following rules:
it contains a root element
all other elements are children of the root element
all elements are correctly paired
the element name in a start-tag and an end-tag are exactly the same
attribute names are used only once within the same element
Note
There are more rules, some of them have to do with entities. In this quick tutorial,
entities are not covered.
If a document is valid, it's clearly defined what the data in the document really means.
There's no possibility to use a tag that's not defined in the DTD. Companies that
exchange XML-documents can check them with the same DTD.
Because a valid XML document is also well formed, there's no possibility for typo's in
the tags.
Elements
A DTD describes elements. It uses the following syntax:
The text <! ELEMENT, followed by the name of the element, followed by a description
of the element.
For instance:
<!ELEMENT brand (#PCDATA)>
This DTD description defines the XML tag <brand>.
Data
The description (#PCDATA) stands for parsed character data.
It's the tag that is shown and also will be parsed (interpreted) by the program that reads
the XML document.
You can also define (#CDATA), this stands for character data.
CDATA will not be parsed or shown.
Sub elements
An element that contains sub elements is described thus:
<!ELEMENT car (brand, type) >
<!ELEMENT brand (#PCDATA) >
<!ELEMENT type (#PCDATA) >
-16-
This means that the element car has two subtypes: brand and type. Each subtype can
contain characters.
+ must occur at least one time but may occur more often
* may occur more often but may also be omitted
? may occur once or not at all
The indications are used behind the sub element name. For instance:
<!ELEMENT animal (color+) >
Making choices
With the sign '|' you define a choice between two sub elements.
You enter the sign between the names of the sub elements.
<!ELEMENT animal (wingsize|legsize) >
Empty elements
Empty elements get the description EMPTY.
For instance
<!ELEMENT separator EMPTY>
that could define a separator line to be shown if the XML document appears in a
browser.
DTD: external
A DTD can be an external document that's referred to.
Such a DTD starts with the text
<!DOCTYPE name of root-element SYSTEM "address">
The address is an URL that points to the DTD.
-17-
In the XML document you make clear that you'll use this DTD with the line:
<!DOCTYPE name of root-element SYSTEM "address">
that should be typed after the line <?xml version="1.0"?>
CSS
Cascading Style sheets (CSS offer possibilities to show XML.
It works just like adding styles to HTML elements.
XSL
The preferred solution is using XSL (eXtensible Style sheet Language).
XSL can convert XML documents into HTML.
It can be used client side but the best solution is to use XSL server side. You can
convert your XML documents to HTML, thus making them visible to any browser.
SpiderPro http://www.spiderpro.com/ab/abmlist.html
Mailing List
SpiderPro's mailing list informs you
when new information becomes
available. Subscription is free.
If you subscribe, you'll receive mails
about updates to SpiderPro and about
new released ebooks.
Expect to receive this mails about 2
times a month.
SpiderPro will use your email address
for the sole purpose of sending you
mails from the list.
We will never use your address for any
other purpose. We will never give your