Sample DTDTrees generated using Matra

Here are a few dtdtrees that I generated using Matra.

RSS 0.91

RSS is an XML/RDF vocabulary for describing metadata about websites and for enabling the display of "channels" on the "My Netscape" website. It is based on the RSS DTD originally created by Lars Marius Garshol.

Link to the RSS 0.9 dtdtree.
Link to the RSS 0.91 dtdtree.

MARC

MARC stands for "MAchine Readable Cataloging". It is developed and maintained by the Library of Congress. There are two DTDs for MARC: the MARC Authority/Classification Record DTD and the MARC Bibliographic/Holdings/Community Information Record DTD. The dtdtrees for the Feb 19 2004 versions of the MARC DTDs (MARC Authority/Classification Record DTD and MARC Bibliographic/Holdings/Community Information Record DTD) show a problem of orphan elements, as mentioned in the introduction. I notified the MARC maintainers of the problem, and they have fixed it in the May 5 2004 versions (MARC Authority/Classification Record DTD and MARC Bibliographic/Holdings/Community Information Record DTD).
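To make the orphan-element problem concrete, here is a minimal sketch of how such elements could be spotted in a DTD. It is not how Matra works internally. It assumes that "orphan" means an element that is declared but never mentioned in any other element's content model, and the function name `find_unreferenced_elements` and the sample DTD are made up for illustration. The sketch does not expand parameter entities, so it only suits simple DTDs.

```python
import re

# Matches declarations such as <!ELEMENT name (child1, child2*)>
ELEMENT_DECL = re.compile(r"<!ELEMENT\s+([\w.:-]+)\s+([^>]*)>")
# Matches XML-style names inside a content model
NAME = re.compile(r"[\w.:-]+")
# Content-model keywords that are not element names
KEYWORDS = {"EMPTY", "ANY", "PCDATA"}


def find_unreferenced_elements(dtd_text):
    """Return declared element names that no other element's content model mentions.

    Assumption: parameter entities are NOT expanded, so this is only a rough check.
    """
    # Drop comments so commented-out declarations are ignored.
    dtd_text = re.sub(r"<!--.*?-->", "", dtd_text, flags=re.S)

    # Map each declared element to the names used in its content model.
    declared = {}
    for name, model in ELEMENT_DECL.findall(dtd_text):
        declared[name] = set(NAME.findall(model)) - KEYWORDS

    # An element referring to itself does not count as being referenced.
    referenced = set()
    for name, children in declared.items():
        referenced |= children - {name}

    return sorted(set(declared) - referenced)


# Hypothetical sample DTD: "note" is declared but never used.
SAMPLE_DTD = """
<!ELEMENT record (leader, field+)>
<!ELEMENT leader (#PCDATA)>
<!ELEMENT field (#PCDATA)>
<!ELEMENT note (#PCDATA)>
"""

print(find_unreferenced_elements(SAMPLE_DTD))  # ['note', 'record']
```

The intended root element (`record` here) is always reported, since nothing references it. Any additional name in the result is a candidate orphan.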

Scalable Vector Graphics (SVG)

SVG stands for "Scalable Vector Graphics". SVG is a W3C (World Wide Web Consortium) Recommendation.

Link to the SVG dtdtree.

Reach

Reach is an initiative by the Irish Government to build or procure the Public Services Broker.

Link to the IAMS Message Envelope DTDTree

Project Gutenberg

Project Gutenberg was started in 1971 by Michael Hart to convert the world's great literature into electronic form.

I have the dtdtrees for the following Gutenberg DTDs:
Gutenberg Poems DTD
Gutenberg Book DTD
Gutenberg Play DTD

Mathematical Markup Language (MathML)

Mathematical Markup Language (MathML) is an XML application for describing mathematical notation and capturing both its structure and content. MathML is a W3C Recommendation.

Link to the MathML dtdtree.

Protein Sequence Database Markup Language

The Protein Information Resource (PIR), located at Georgetown University Medical Center (GUMC), is an integrated public bioinformatics resource that supports genomic and proteomic research and scientific studies.

Link to the Protein Sequence Database Markup Language dtdtree.

Microarray Gene Expression Database Markup Language

Microarray Gene Expression Markup Language (MAGE-ML) is a language designed to describe and communicate information about microarray-based experiments.

Link to the Microarray Gene Expression Database Markup Language dtdtree.

Bioinformatic Sequence Markup Language

Bioinformatic Sequence Markup Language™ (BSML) is an open XML data standard created to facilitate the interchange of data from diverse technologies into a consistent format for more efficient communication within the life sciences community. In 1997 the National Human Genome Research Institute (NHGRI) funded the development of BSML as an evolving public domain standard for the bioinformatics community.

Link to the Bioinformatic Sequence Markup Language dtdtree.

WAP Wireless Markup Language

The WML specification is defined by the Open Mobile Alliance.

Link to the WAP Wireless Markup Language dtdtree.

Speech Synthesis Markup Language

"The Voice Browser Working Group has sought to develop standards to enable access to the Web using spoken interaction. The Speech Synthesis Markup Language Specification is one of these standards and is designed to provide a rich, XML-based markup language for assisting the generation of synthetic speech in Web and other applications. The essential role of the markup language is to provide authors of synthesizable content a standard way to control aspects of speech such as pronunciation, volume, pitch, rate, etc. across different synthesis-capable platforms."

Link to the Speech Synthesis Markup Language dtdtree.

Extensible HTML (XHTML)

XHTML is a family of current and future document types and modules that reproduce, subset, and extend HTML 4 [HTML4]. XHTML family document types are XML based, and ultimately are designed to work in conjunction with XML-based user agents.

Link to the Extensible HTML version 1.0 Strict DTDTree.

GEDCOM

GEDCOM is the de facto standard for storing genealogical data. The GEDCOM XML 6.0 specification has been published, and the dtdtree for GEDCOM XML 6.0 is online.

Michael Kay's GedML is a way of encoding genealogical data sets in XML. It combines the well-established GEDCOM data model with the XML standard for encoding complex information. The dtdtree for GedML is online.