Document Type Definition

Version 2.0 beta (30 June 2004)

 

Back to list of contents

 

All XML-encoded documents should refer to a Document Type Definition (DTD) or a schema to ensure consistency in their document structure. We have decided to base our encoding on the XML version of TEI P4. In order to create a single Document Type Definition out of TEI's library of DTD-fragments we have used the PizzaChef program, which is available through the TEI website.

We have made some changes to the original DTD to make it better suited to some special needs in our project. One of the more significant changes in version 1.0 of the handbook was the introduction of the <dipl>, <facs> and <norm> elements, as well as modifications to the elements <w> and <supplied>. We also included all elements for manuscript description as recommended by the MASTER project (MASTER reference manual).

In version 2.0 of the handbook, we have made a number of additional changes:

1. A new element <punct> has been added in order to encode punctuation on various levels of transcription. It has identical contents to the <w> element.
2. A new element <expunged> has been added to encode deletions made by the editor/transcriber. The usage of the <expunged> element is parallel to the <supplied> element. Note that the elements <del> and <add> are used for deletions and additions made by the scribe of the manuscript, not the editor/transcriber.
3. The element <p> has been extended to contain the elements <lg> and <l>. Thus "inline" poems can be encoded with the <lg> element within <p>.
4. The element <lg> has been extended to contain the element <p>. Thus short inline prose parts of poems (e.g. 'inquit') can be encoded with the <p> element within <lg>.
5. The element <add> has a compulsory place attribute. Default value is inline.
6. A new element <pal> has been introduced in <w>, in order to enable the encoding of detailed paleographical information. It is parallel to the elements <facs>, <dipl> and <norm>.
7. The elements <lemma> and <pos> have been added to <w>. Thus, lemma and grammatical form (part of speech) may be encoded by way of elements rather than by way of the attributes lemma and pos. TEI P5 may discourage the latter solution.
8. The element <textSpan> has been introduced in order to cover a number of overlapping structures. We recommend using this element rather than <addSpan>, <delSpan>, <suppliedSpan>, <expungedSpan>, <unclearSpan>, etc. However, in order to describe the type of text span, an attribute category has been added. This attribute has the values add, del, supplied, expunged, unclear, gap, seg, sic, corr, other; cf. the deprecated elements <addSpan> and <delSpan>.

All changes and modifications to the DTD have been written into the two files menota.extensions.ent and menota.extensions.dtd.

 

Current version of the DTD

menota.dtd 30 June 2004

Commencing with version 2.0 of the handbook, the current version of the Document Type Definition will always be called menota.dtd and located here. Older versions will be numbered sequentially and archived below.

Current versions of the extension files

menota.extensions.ent 30 June 2004
menota.extensions.dtd 30 June 2004

Commencing with version 2.0 of the handbook, the current versions of the extension fileswill always be called menota.extensions.dtd and menota.extensions.ent. They will be located here, while older versions will be numbered sequentially and archived below.

 

Version 1.0 published 20 May 2003. Version 2.0 beta published 30 June 2004.