A DTD is an SGML-compliant (it's written according to the SGML
standard) set of rules for marking text content. It specifies that every
time I mean that the word "Xanadu" is from a poem (not a song by Olivia
Newton-John), I will write '
<poemgeography>Xanadu</poemgeography>'.
Everyone in my field--say the Humanities--will also do this.
DTD's
can be made by anyone for any purpose. Getting everyone to agree on
these, however, is a problem. It's like getting everyone to speak
Esperanto. In addition, a DTD just says how I want to identify content,
not how it looks. To determine how it looks, I have to write a DSL. (Document Style Language) file,
which says--in effect--any time I start a new chapter, center it, put it
in bold, and font size such and such. Both DTD's and DSL's are
not easy to write, and are equivalent in tedium to any
detailed programming language.
Scholars made a DTD called TEI (Text
Encoding Initiative), but--sort of like a Microsoft product--it's huge, it
tries to do too much for too many people, and ultimately fails to be
efficient. VT has a DTD for ETD's (these acronyms get worse), called
ETD-ML. Archivists have "EAD-ML" and so on. This is all well and good if
a bunch of folks agree, but that still doesn't solve finding software to
read it. XML solves this.