This tutorial
In
this tutorial you will learn what XML is about.
You'll understand the basic XML
syntax. An you will know what's needed to make XML usable.
You won't be an XML expert after
following this kickstart tutorial. But you'll understand the basics of XML. And
you'll be able to understand XML Documents and most of XML DTD's.
Why do we need XML?
Data-exchange
XML
is used to aid the exchange of data. It makes it possible to define data in a
clear way.
Both
the sending and the receiving party will use XML to understand the kind of data
that's been sent. By using XML everybody knows that the same interpretation of
the data is used
Replacement for EDI
EDI
(Electronic Data Interchange) has been for
several years the way to exchange data between businesses.
EDI
is expensive, it uses a dedicated communication infrastructure. And the
definitions used are far from flexible.
XML
is a good replacement for EDI. It uses the Internet for the data exchange. And
it's very flexible.
More possibilities
XML
makes communication easy. It's a great tool for transactions between
businesses.
But it has much more
possibilities.
You
can define other languages with XML. A good example is WML (Wireless Markup
Language), the language used in WAP-communications.
WML is just an XML dialect.
-6-
What is XML ?
Simpler SGML
XML is a meta-language.
A
meta-language is a language that's used to define other languages. You can use
XML for instance to define a language like WML.
XML
is a smaller version of SGML. It's easy to master and that's a major advantage
compared to SGML which is a very complex meta-language.
XML: What it can do
With XML you can :
Define data structures
Make these structures
platform independent Process XML defined data automatically Define your own
tags
With XML you cannot
Define how your data is shown. To show data,
you need other techniques.
-7-
The general structure of XML
Define your own tags
In XML, you define your own tags.
If you need a tag or ,
that's no problem.
DTD or Schema
If you want to use a tag, you'll have to define it's meaning.
This
definition is stored in a DTD (Document Type Definition).
You can define your own DTD or use an existing one.
Defining
a DTD actually means defining a XML language. An alternative for a DTD is
Schema.
Showing the results
Often
it's not necessary to display the data in a XML document. It's for instance
possible to store the data in a database right away.
If you want to show the data, you can. XML
itself is not capable of doing so.
But
XML documents can be made visible with the aid of a language that defines the
presentation.
XSL
(eXtensible Stylesheet Language) is created
for this purpose. But the presentation can also be defined with CSS
(Cascading Style Sheets).
XML Tags
Tags
XML tags are created like HTML tags. There's
a start tag and a closing tag.
The
closing tag uses a slash after the opening bracket, just like in HTML. The text
between the brackets is called an element.
Syntax
The following rules are used for using XML tags:
Tags are case sensitive. The tag
differs from the tags and
Starting
tags always need a closing tag All tags must be nested properly
Comments can be used like in HTML:
Between
the starting tag and the end tag XML expects the content. 135
Empty tags
Besides a starting tag and a
closing tag, you can use an empty tag. An empty tag does not have a closing
tag.
The syntax differs from HTML:
Elements
and sub elements
Elements
and children
With XML tags
you define the type of data. But often data is more complex. It can consist of
several parts.
To describe the
element car you can define the tags mercedes . This model
might look like this:
Besides
the element car three other elements are used: brand, type and color.
Brand, type and color are sub-elements
of the element car. In the XML-code the tags of the sub-elements are enclosed
within the tags of the element car. Sub-elements are also called children
XML
documents
The
XML declaration
The first line of an XML document is the XML
declaration.
It's a special kind of tag:
The version 1.0 is the actual version of XML.
The XML
declaration makes clear that we're talking XML and also which version is used.
The version identification will become important after
new versions of XML are used.
The
root element
All
XML documents must have a root element.
All other
elements in the same document are children of this root element. The root
element is the top level of the structure in an XML document.
Structure
of an XML page
content
element>
All
elements must be nested. The level of nesting can be arbitrarily deep.
Ray Bradbury
10
XML
Attributes
Attributes
Elements in XML can use attributes. The syntax is:
The value of an
attribute needs to be quoted, even if it contains only numbers. An example
The
same information can also be defined without using attributes:
Avoid
attributes
When possible
try to avoid attributes. Data structures are more easy described in XML-tags.
Software that checks XML-documents can do a better job
with tags than with attributes.
Well formed XML documents
Well formedness
An
XML document needs to be well formed. Well formed means that the document
applies to the syntax rules for XML.
The rules
To be well formed a document needs to comply to the following
rules:
it
contains a root element
all
other elements are children of the root element
all
elements are correctly paired
the element name in a start-tag and an end-tag are
exactly the same
attribute
names are used only once within the same element
Note
There
are more rules, some of them have to do with entities. In this quick tutorial,
entities are not covered.
Valid XML documents
Valid
To
be of practical use, an XML document needs to be valid. To be valid an XML
document needs to apply to the following rules:
The
document must be well formed. (More on well formed in the previous page).
The document must apply to the rules as defined in a
Document Type Definition (DTD), (More on DTD's in the next page)
If a document is valid, it's clearly defined what the data in the
document really means.
There's
no possibility to use a tag that's not defined in the DTD. Companies that
exchange XML-documents can check them with the same DTD.
Because
a valid XML document is also well formed, there's no possibility for typo's in
the tags.
Valid is about structure
A
valid XML-document has a structure that's valid. That's the part you can check.
There's no check for the content.
XML:
the DTD
Defining
the language
To
use XML you need a DTD (Document Type Definition).
A DTD contains the rules for a particular type of
XML-documents.
Actually it's the DD that defines the language.
Elements
A DTD
describes elements. It uses the following syntax:
The text <!
ELEMENT, followed by the name of the element, followed by a description of
the element.
For instance:
This
DTD description defines the XML tag .
Data
The
description (#PCDATA) stands for parsed character data.
It's the tag that
is shown and also will be parsed (interpreted) by the program that reads the
XML document.
You can also
define (#CDATA), this stands for character data. CDATA will not be
parsed or shown.
Sub elements
An
element that contains sub elements is described thus:
Number of sub elements
If you use , the sub elements brand and type can occur once
inside the element car. To change the number of possible occurrences the
following indications can be used:
+ must occur at least one
time but may occur more often * may occur more often but may also be omitted
? may occur once or not at all
The indications are used behind the sub
element name. For instance:
Making choices
With
the sign '|' you define a choice between two sub elements. You enter the sign
between the names of the sub elements.
Empty elements
Empty
elements get the description EMPTY. For instance
that
could define a separator line to be shown if the XML document appears in a
browser.
DTD: external
A
DTD can be an external document that's referred to. Such a DTD starts with the
text
The address is an URL that points to the DTD.
In the XML document you make clear that you'll use this DTD with
the line:
that
should be typed after the line
DTD: internal
A DTD can also be included
in the XML document itself. After the line you must type
followed by the
element definitions. The DTD part is closed with
]>
-
Presenting XML documents
Showing XML documents
XML
is about defining data. With XML you can define documents that are understood
by computers.
But to make these documents understandable
to humans, you need to show them.
CSS
Cascading Style sheets (CSS offer possibilities to show XML.
It works just like adding styles to HTML
elements.
XSL
The
preferred solution is using XSL (eXtensible Style sheet Language). XSL can
convert XML documents into HTML.
It
can be used client side but the best solution is to use XSL server side. You
can convert your XML documents to HTML, thus making them visible to any
browser.
No comments:
Post a Comment