XML

SGML

Mechanism to describe markup languages and conventions

Character set definition + tag format specification + document type definition (DTD)

Based on a descriptive definition

Several features to reduce size of markup (e.g., tag minimization, shortrefs)

Applications: CALS, AAP, TEI, HTML

Tagged text in the OED project at Waterloo

PAT regions (search)

Lector (display)

Goedel (structured query)

MAD (interprocess communication)

XML

Goals:

SGML variant for a variety of applications operating over the Internet

formally defined, simple to process, human readable (but not necessarily terse)

no features that are optionally implemented

XML processor vs. XML application

Data components:

markup vocabulary: elements, attributes

"start tag ... content ... end tag" (<name type="first">Frank</name>) vs. "empty tag" (<name/>)

character entity references (&lt;) vs. character references (&#x3C; or &#60;)

white space, parsed and unparsed character data, entity references

comments <!-- ... -->, processing instructions <? ... ?>

XML declaration: <?xml version="1.0"?>

Example: Melton and Buxton's Movies, Michael Ley's DBLP (not viewable from Firefox)

Markup declarations using Document Type Definition (DTD):

internal vs. external

root document type

elements

attribute lists (including IDs and IDREFs)

notations

entities

Conformance:

well-formed:

syntactically correct tags, matching tags, nested elements, all entities declared

valid:

well-formed + DTD + doctype matches DTD, unique IDs, no dangling IDREFs

Example: Movies DTD, DBLP DTD

W3C specifications

Stages: Working draft, candidate recommendation , proposed recommendation, recommendation

XML suite of specifications:

XML Transducers

XML Applications

CSS, XSL

Web Publishing

Semantic Web

Web Communications

Non-textual Data

XSLT

XHTML

RDF

CC/PP

MathML

XQuery

XHTML-Print

RDF Schema

SOAP

SMIL

Canonical XML

XML Events

GRDDL

P3P, APPEL

SMIL Animation

XInclude

XFrames 

OWL

XML-Signature

SVG

DISelect

XForms 

 RIF

XML Encryption

Ruby Annotation

EXI

 DIAL

 

XKMS

VoiceXML, SSML

XProc

 

 

 EARL

CCXML


 

 

REX

EMMA


 

 

SML

InkML

 

 

 

 POWDER

 Timed Text

XML Accessories

XML Names, XML Schema, XPath, XLink, XML Base, XPointer, XBL, ITS, XML Stylesheet, xml:id

XML

 

References and related reading

Querying XML, Chapters 1, 16, B.2

Goldfarb90

TEI07

 

Walsh98

Bray98

W3C-xml

Salminen09

Salminen01