Chapter 7. Altered, corrected and unreadable text

Version 2a-040830-TW

 

7.1 Introduction
7.2 Additions, deletions and substitutions
7.3 Damage and illegibility
7.4 Editorial interventions

 

7.1 Introduction

This chapter deals with the encoding of additions, deletions and corrections made in the manuscript by the scribe or later users, and of similar changes made in the transcription, e.g. by the transcriber or encoder of the manuscript text. It also deals with the encoding of damage to the manuscript that affects the reading of the manuscript text. §7.2 treats corrections, deletions and additions made by the scribe or later users of the manuscript. §7.3 treats damage to the manuscript that affects the reading of the manuscript text. §7.4 treats corrections, deletions and additions made by the transcriber of the manuscript text, e.g. on the basis of other text witnesses or earlier editions of the text. The encoding recommended here is based on the TEI P4 Guidelines, ch. 6 and ch. 18, where the following elements are defined:

Elements Contents
<add> contains letters, words, or phrases inserted in the manuscript text or in the margins of the manuscript by an author, scribe, annotator or corrector.
<addSpan> used instead of <add> to mark the beginning of an addition where the added text does not correspond to the XML hierarchy.
<del> contains a letter, word or passage deleted, marked as deleted, or otherwise indicated as superfluous or spurious in the manuscript text by an author, scribe, annotator or corrector.
<delSpan> used instead of <del> to mark the beginning of a deletion where the deleted text does not correspond to the XML hierarchy.
<gap/> indicates a point where material has been omitted in a transcription, normally because the manuscript text is illegible, but potentially for some other reason.
<space/> indicates a significant or deliberate space in the manuscript.
<textSpan/> (non-TEI) marks the start of a non-linguistic feature in the manuscript, the equivalent of unclearSpan, suppliedSpan, sicSpan and corrSpan.
<unclear> contains a word, phrase or passage which cannot be transcribed with certainty because it is illegible in the manuscript.
<supplied> signifies text supplied by the transcriber, encoder or editor in place of text which cannot be read, either because of physical damage or loss in the original or because it is illegible for any reason.
<sic> contains text reproduced in the transcription although apparently incorrect or inaccurate.
<corr> contains the correct form of a passage apparently erroneous in the manuscript text. This element should only be used for corrections made in the transcription or encoding of the manuscript text. It should not be used for corrections made within the manuscript (e.g. by the scribe or a later hand).
<anchor/> marks the end of a non-linguistic feature in the manuscript which began with a -Span element.

Within the tradition of medieval philology there are several different schools concerning the transcription and editing of texts. Scholars have devised different systems for handling the problems that arise when phenomena within the manuscript are to be rendered in a printed edition. When working with electronic transcriptions and encoding of manuscript texts the same problems are encountered. Although the electronic medium presents new possibilities, there are obvious parallels between traditional transcription and editing and the work with electronic texts, and these can be used as a starting point for a manual for handling the latter.

In a discussion of editorial practice for Old Norse texts Helle Jensen, with reference to Stefán Karlsson 1963, LXVII f., outlines aspects of the manuscript text which should be noted in an edition (Jensen 1988). Jensen's suggestions start with structural markup of e.g. linebreaks in the manuscript. She also gives special signs for each of the features that have to do with scribal or later changes in the manuscript, as follows (Jensen 1988, 102 f.):

Sign Explanation
` ´ Includes something that has been added above the line in the manuscript.
´ ` Includes something that has been added in the margins. Unless stated in a footnote these additions are considered to be the work of the hand that has written the main text.
|- -| Text that has been struck through, underdotted or erased is placed within these brackets.
-| |- Text that has been written twice without being marked by the scribe in the manuscript is placed within these brackets.
< > Text not present in the exemplar, but supplied in the edition by the editor.
* The following word is corrected by the editor. In a footnote the original form is given.
[ ] The text of the manuscript is illegible due to use or damage. The text included could be supplied from another manuscript or be a conjecture made by the transcriber or editor. If the addition is made from another manuscript it should be given diplomatically, if from other sources, such as editions or transcriptions, it should be rendered in a form normalized in accordance with the manuscript text.
[[ ]] The text of the original has been illegible at the time of a former diplomatic transcription. Characters within double brackets have, however, been legible at the time of the present transcription.
000 Unreadable characters or characters lost e.g. through damage to the manuscript. The number of zeros corresponds to the number of characters presumed missing.
000...000 The number of unreadable characters is not known.

In addition, Helle Jensen suggests that uncertain readings should be subpunctuated. In editions from the Arnamagnæan Institutes in Reykjavík and Copenhagen these suggestions are in general followed, and in most editions of medieval Scandinavian texts similar systems are used. This gives us a starting point when we are transcribing Old Icelandic and Old Scandinavian manuscripts.

The principles presented in this handbook are based on the tradition of producing scholarly editions of texts and individual manuscripts. The system for printed editions outlined by Helle Jensen can therefore very often be translated into the electronic markup language presented in this chapter.

Text written in the margins can be of various kinds and of varying interest for our knowledge about the main text and the history of the manuscript. Notes on the main text in the margins are of course valuable when we are interested in the text tradition. Other notes could indicate that someone at a certain stage has used it for example in a transcription of the text.

In medieval manuscripts, however, we often also find notes in the margins that have nothing whatsoever to do with the manuscript text. These notes can at first sight seem to be of no value to philological investigation, but in a larger context they can sometimes give information as to where a manuscript has been at a certain stage of its history. If e.g. the same type of scribbles are found in a group of manuscripts where one of the manuscripts can be geographically pin-pointed, this could indicate the whereabouts of the whole group. Information of this kind can also lead to the establishing of new connections between manuscripts that were not previously seen as connected. There are thus good arguments for including information also on this kind of marginal note, but these are more properly contained in the manuscript description in the header (cf. ch. 10) than within the encoded transcription.

The first kind of notes, i.e. comments or additions to the main text, are often treated in foot-notes in printed editions. They are considered relevant to the reading of the text, and are therefore given in relation to the main text. Marginal notes that indicate the owner or user of the manuscript in any obvious way are often treated in the introduction to the edition as they are considered relevant to the history of the text or manuscript.

The third category of notes, the ones that do not seem to give any relevant information, is often excluded or treated only briefly in the introduction. This is of course a rational way to handle these scribbles when the printed edition sets the limits, and the information often is obscure and cannot be easily related to parallel information concerning other manuscripts. In the electronic transcription of a manuscript, however, there is no reason to make this limitation. The information can be given in the same way as for the other categories, and thereby give us the possibility to search for all kinds of obscure information.

Medieval manuscripts have often become damaged through use, sometimes with relevance for our reading of the text. Pieces of parchment may for example have been torn out, leaving a physical gap in the manuscript. Parts of the text may be illegible because of use or deliberate erasure, or they may be darkened to such an extent that the text is no longer readable. In printed editions, unreadable sections of a text are marked as suggested by Helle Jensen. In the introductions to printed editions problems related to illegible text and damage to the manuscript are often discussed at length. If there are other text witnesses these are often used to replace missing stretches of text. In a diplomatic transcription of a manuscript text, however, the missing or unreadable parts are most often just marked as such. In the following sections the relation between the traditional markup of these kinds of textual and editorial difficulties and electronic encoding will be obvious. It is therefore relevant to take traditional transcription and editing as a starting point for the electronic encoding of transcriptions of manuscript texts.

The primary aim of the following sections is to give recommendations for the transcription and encoding of manuscript texts. In some instances, however, they also give recommendations for editorial encoding, e.g. markup that refers to corrections or additions made by the transcriber or encoder. It is therefore important to keep the transcription and encoding of the manuscript text on the one hand and the editorial changes on the other consistently separated, so that the former provides a starting point for the editorial work.

 

7.2 Additions, deletions and substitutions

In the manuscript text and in the margins of the manuscript we often find different kinds of corrections, deletions and additions that we want to encode. These changes can be divided into different groups depending on the nature of the change and its relevance for the reading of the manuscript text or our knowledge about the manuscript. The main division is between additions or substitutions to the manuscript text, within the text or in the margins, and deletions made in the manuscript text. The former should be marked with the <add> element while the latter should be marked with the <del> element. Additions and substitutions made by the transcriber or editor are treated in the last section (§7.4).

 

7.2.1 Additions

The following elements are recommended for describing additions made by the author of the text, a compiler, scribe, annotator or corrector in the manuscript text. The TEI Guidelines recommend the use of the <add> element to describe additions in the manuscript (ch. 18.1.4). In the following the use of <add> in relation to our recommended encoding of the individual word within the element <w> and on the three different levels <facs>, <dipl> and <norm> is treated.

Elements Contents
<add> Contains letters, words or phrases inserted in the manuscript text or in the margins of the manuscript by an author, scribe, annotator or corrector. Attributes include:
hand Signifies the agent which made the addition. The value is an XML IDREF, referring to a <hand> element included in the header under <handList>. See the Menota header.
resp Signifies the transcriber or editor responsible for identifying the hand. The value is an XML IDREF, referring to an agent described in the header (cf. also ch. 10).
place Indicates where the addition is made. Suggested values include:
inline The addition is made in a space originally left empty by the scribe.
supralinear The addition is made above the line.
infralinear The addition is made below the line.
left The addition is made in the left margin.
right The addition is made in the right margin.
top The addition is made in the top margin.
bottom The addition is made in the bottom margin.
verso The addition is made on the other side of the leaf.
<addSpan/> An empty element to be used when an addition straddles structural boundaries, e.g. a <div> or when it goes from outside a <w> element to within a <w> element (or vice versa). The <addSpan/> element indicates the beginning of the addition and will typically be linked to an <anchor/> element indicating the end of the addition. The <addSpan/> element has the same attributes as the <add> element.
to Points to the end of the addition: its value is identical to the id attribute of the <anchor/> element placed there. The value may be built from the location of the beginning of the addition (e.g. a line number). Note that the value of the to and the corresponding id attributes must start with an ASCII letter and consist of letters, digits and/or a '.' or '-'.
<anchor/> An empty element which can be used to indicate e.g. the end of an addition, in conjunction with the <addSpan/> element. Attributes must include:
id A valid XML ID, referred to by the to attribute of the <addSpan/> element.
type "addSpan" value to identify the type of anchor.
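The rule for the values of the to and id attributes can be checked mechanically. The following is a minimal sketch in Python, not part of the recommended encoding: the function name is hypothetical, and we assume that "letters" means ASCII letters throughout (the XML specification itself allows more characters than the rule stated above).

```python
import re

# The rule as stated in this chapter: an ASCII letter first, then any mix
# of letters, digits, '.' and '-'. Restricting the later characters to
# ASCII letters is an assumption made for this sketch.
ID_PATTERN = re.compile(r"[A-Za-z][A-Za-z0-9.\-]*")


def is_valid_span_id(value: str) -> bool:
    """Return True if value may be used in a to/id pair under the rule above."""
    return ID_PATTERN.fullmatch(value) is not None


# Values of the kind used in the examples in this chapter.
print(is_valid_span_id("ds-p97.21"))
print(is_valid_span_id("97.21"))
```

A value such as "97.21" is rejected because it starts with a digit; prefixing it with letters, as in "ds-p97.21", makes it acceptable.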

Additions which can be ascribed to the author of a text are rare in medieval Nordic manuscripts. The additions being described with the above-mentioned attribute hand will therefore primarily be ascribed to the values scribe, compiler, annotator or corrector. Scribal additions are probably the most common changes to be recorded in the transcription and encoding of a manuscript text. The list of hands in the header (cf. ch. 10) should identify the individual hand, either as anonymous or, if possible, by name. The main hand in a manuscript will normally be marked as mainscribe.

If the addition consists of a series of complete words, the <add> tag should surround the word(s). In the following example from Rómverjasögur (AM 595 a-b 4to; f. 14r22), 'en' has been added by the main scribe, identified as mainscribe in the header (note that for the sake of clarity we have limited the use of encoding to the relevant sequence, simplified the orthography and avoided entities):

en skyllda þa til herfararinnar er þu uillder
<add hand="mainscribe"><w>
 <facs>en</facs>
 <dipl>en</dipl>
 <norm>en</norm>
</w></add>
gæyma allz uti ok inni

Note that to ensure correct rendering of the addition, no space is included between the <add> and <w> elements.

In cases where the addition forms part of a word, the markup should normally be restricted to the facsimile level. In any case, the addition need not be marked up as part of the normalised text (AM 748 Ib 4to, 1r15):

annat af
<w>
 <facs><add hand="scribe" place="supralinear">v</add>r&eogon;riligv<abbr>&bar;</abbr></facs>
 <dipl>vr&eogon;riligv<expan>m</expan></dipl>
 <norm>uhr&aelig;riligum</norm>
</w>
annat

The location of the addition in the above markup is indicated by the attribute place; in this case, the addition is made above the line of the manuscript text and therefore uses the value supralinear.

The diplomatic and normalised text will normally include the addition if made by a scribal hand. Additions made by later hands will normally be omitted from the diplomatic and normalised text without markup.
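As a minimal sketch of how the three levels of such a word can be read out by a program, the following Python fragment parses a simplified copy of the word from AM 748 Ib 4to. The function names are hypothetical, and the entities of the original example are replaced by plain characters (the abbreviation mark is written as a macron), which is an assumption made only to keep the sketch self-contained.

```python
import xml.etree.ElementTree as ET

# Simplified copy of the word above: entities are replaced by plain
# characters so that the fragment parses without an entity declaration.
WORD = """
<w>
 <facs><add hand="scribe" place="supralinear">v</add>ręriligv<abbr>¯</abbr></facs>
 <dipl>vręriligv<expan>m</expan></dipl>
 <norm>uhræriligum</norm>
</w>
"""


def level_text(w, level):
    """Return the plain text of one level (facs, dipl or norm) of a <w>."""
    node = w.find(level)
    if node is None:
        return ""
    # itertext() descends into child elements such as <add> and <expan>,
    # so their content is kept while the tags themselves are dropped.
    return "".join(node.itertext()).strip()


def additions(w):
    """List (hand, place, text) for every <add> found inside the word."""
    return [
        (a.get("hand", ""), a.get("place", ""), "".join(a.itertext()))
        for a in w.iter("add")
    ]


word = ET.fromstring(WORD)
print(level_text(word, "dipl"))
print(additions(word))
```

Because the addition is marked up only on the facsimile level, the diplomatic and normalised levels can be read directly, while the record of the addition remains available for those who need it.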

Additions are sometimes made by an annotator, i.e. as comments to the text. Additions of this kind can be encoded like the marginal note “vantar ekkert F. J.” by Finnur Jónsson in Codex Wormianus (AM 242 fol. p. 60):

<add hand="FJ">
<w>
 <facs>vantar</facs>
</w>
<w>
 <facs>ekkert</facs>
</w>
<w>
 <facs>F&dot;</facs>
</w>
<w>
 <facs>J&dot;</facs>
</w>
</add>

It is also possible to indicate with the attribute place where on the manuscript page the annotation is made. Finnur Jónsson's annotation that is made in the bottom margin should be encoded as follows:

<add hand="FJ" place="bottom">
<w>
 <facs>vantar</facs>
</w>
<w>
 <facs>ekkert</facs>
</w>
<w>
 <facs>F&dot;</facs>
</w>
<w>
 <facs>J&dot;</facs>
</w>
</add>

In (unusual) cases where the addition reaches over structural boundaries in the manuscript, we recommend using the <addSpan/> element to indicate the beginning of the addition and the <anchor/> element to indicate the end. The <addSpan/> element should be specified with the to attribute linked to an identical id attribute of the <anchor/> element; the <anchor/> element should also be classified using type="addSpan" so that it can be formatted appropriately:

[find example]

In cases where the addition consists of a part of a word and extends to another word or words, both the <addSpan/> and <anchor/> elements should be included within the facsimile level of the relevant words:

[find example]

Changes in scribal hands are not considered additions. For the markup of such phenomena, use the <handShift> element.

 

7.2.2 Deletions

The TEI Guidelines recommend the use of the <del> element to describe deletions in the manuscript (ch. 18.1.4). In the following the use of <del> in relation to our recommended encoding of the individual word within the element <w> and on the three different levels <facs>, <dipl> and <norm> is treated.

Elements Contents
<del> Contains a letter, word or passage deleted, marked as deleted, or otherwise indicated as superfluous or spurious in the manuscript text by an author, scribe, annotator or corrector. Attributes include:
hand Signifies the agent which made the deletion. The value is an XML IDREF, referring to a <hand> element included in the header under <handList>.
resp Signifies the editor or transcriber responsible for identifying the hand of the deletion. The value is an XML IDREF, referring to an agent described in the header (cf. ch. 10). This information can also be given in the header.
rend Classifies the deletion as displayed, using any convenient typology. Sample values include:
overstrike The text has been struck through.
erasure The text has been erased.
bracketed Deletion indicated by brackets in the text or margin.
subpunction Deletion indicated by dots beneath the letters deleted.
<delSpan/> An empty element to be used when a deletion straddles structural boundaries, e.g. a <div> or when it goes from outside a <w> element to within a <w> element (or vice versa). The <delSpan/> element indicates the beginning of the deletion and will typically be linked to an <anchor/> element indicating the end of the deletion. The <delSpan/> element has the same attributes as the <del> element.
to Points to the end of the deletion: its value is identical to the id attribute of the <anchor/> element placed there. The value may be built from the location of the beginning of the deletion (e.g. a line number). Note that the value of the to and the corresponding id attributes must start with an ASCII letter and consist of letters, digits and/or a '.' or '-'.
<anchor/> An empty element which can be used to indicate e.g. the end of a deletion, in conjunction with the <delSpan/> element.
id A valid XML ID, referred to by the to attribute of the <delSpan/> element.
type "delSpan" value to identify the type of anchor.

Deletions that can be ascribed to the author of a manuscript text are rare in medieval Nordic manuscripts. The deletions being described with the above mentioned attribute hand will therefore primarily be ascribed to scribe or corrector.

Deletions of one or more words made by the scribe(s) or corrector(s) of a manuscript are encoded as in the following passage from Rómverjasögur (AM 595 a-b 4to). Note that, for clarity, we limit the use of encoding to the relevant sequence, and that only the <facs> level is shown:

en tuenner flokkar þeirar þioðar er<lb n="3r:15"/>
<del hand="mainscribe"><w>
 <facs>liguri</facs>
</w>
<w>
 <facs>hæita</facs>
</w>
<w>
 <facs>er</facs>
</w></del>
traceum hæiter.

In the TEI Guidelines cited above a number of possible types of deletion are described with the attribute type; in the encoding recommended here the same typology is given with the attribute rend (see the table above). These values can be applied to deletions made both by scribe(s) and corrector(s). If a deletion is made e.g. by overstriking the deleted text it could be encoded as follows (here only presented on the <facs> level):

en tuenner flokkar þeirar þioðar er<lb n="3r:15"/>
<del hand="mainscribe" rend="overstrike"><w>
 <facs>liguri</facs>
</w>
<w>
 <facs>hæita</facs>
</w>
<w>
 <facs>er</facs>
</w></del>
traceum hæiter.

This could then be displayed on the computer screen or in a printed edition in the manner suggested above (ch. 7.1):

14   ...en tuenner flokkar þeirar þioðar er
15   |-liguri hæita er-| traceum hæiter...
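How such a display can be derived from the encoding is sketched below in Python. This is an illustration under stated assumptions rather than a recommended implementation: the function names are hypothetical, the passage is wrapped in a <p> element only so that it parses as a single tree, and only the <facs> level, <lb/> and <del> are handled.

```python
import xml.etree.ElementTree as ET

# The passage above, wrapped in <p> so that it parses as one tree.
# The wrapper is a convenience for this sketch only.
FRAGMENT = """<p>en tuenner flokkar þeirar þioðar er<lb n="3r:15"/>
<del hand="mainscribe" rend="overstrike"><w>
 <facs>liguri</facs>
</w>
<w>
 <facs>hæita</facs>
</w>
<w>
 <facs>er</facs>
</w></del>
traceum hæiter.</p>"""


def tokens(node):
    """Yield words, deletion signs and line breaks in document order."""
    if node.tag == "w":
        facs = node.find("facs")
        if facs is not None:
            yield "".join(facs.itertext()).strip()
        return
    if node.tag == "lb":
        yield "\n"
        return
    is_del = node.tag == "del"
    if is_del:
        yield "|-"
    yield from (node.text or "").split()
    for child in node:
        yield from tokens(child)
        yield from (child.tail or "").split()
    if is_del:
        yield "-|"


def render(root):
    """Join the tokens, attaching the deletion signs to the enclosed words."""
    out = ""
    for tok in tokens(root):
        if tok == "\n":
            out = out.rstrip() + "\n"
        elif tok == "-|":
            out = out.rstrip() + "-| "
        elif tok == "|-":
            out += "|-"
        else:
            out += tok + " "
    return out.strip()


print(render(ET.fromstring(FRAGMENT)))
```

The sketch ignores the hand and rend attributes; a fuller renderer could use them, for instance to choose a different sign for each kind of deletion.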

The text that is marked as deleted must be at least partly legible in the manuscript so that it can be read by the transcriber. If the deleted text is not legible the deletion should be marked up with the <gap/> element, described below (7.3.1). The <gap/> element could be enclosed in the <del> element to indicate that the gap is in some way intentional. Parts of the deleted text that are legible could be indicated by the <unclear> element in combination with the <gap/> element as described below (ch. 7.3.2).

As for the markup of additions using <add>, if the deletion is of part of a word, it should normally only be marked up at the <facs> level. If the deletion is by a scribal hand, the deleted text will be omitted from the <dipl> and <norm> levels without markup. The following is from AM 242 fol, p. 97:21 (slightly simplified):

r&uacute;m, er
<w>
 <facs><del hand="unknown" rend="overstrike">t</del>ok</facs>
 <dipl>ok</dipl>
 <norm>ok</norm>
</w>
h&aelig;gra

In cases where the deletion reaches over structural boundaries in the manuscript, we recommend using a <delSpan/> element to indicate the beginning of the deletion and an <anchor/> element to indicate the end. The <delSpan/> element should be specified with the to attribute linked to an identical id attribute of the <anchor/> element. In the following (fictional) example, the deletion occurs as 'rúm er er tok hægra' in the manuscript, that is, includes part of a word and another word, and therefore crosses word boundaries:

rúm, er
<w>
 <facs><delSpan hand="scribe" rend="overstrike" to="ds-p97.21"/>er</facs>
</w>
<w>
 <facs>t<anchor id="ds-p97.21" type="delSpan"/>ok</facs>
</w>
hægra

As with deletions within words, the <delSpan/> and <anchor/> will normally only be included at the <facs> level, if required. The deleted text will be removed for the other textual levels.
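Since the link between a -Span element and its <anchor/> is carried only by attribute values, it is easy to break by a typing error. The following Python sketch shows one way such links could be checked. The function name and the <p> wrapper are hypothetical, and the expected type values are taken from the tables in this chapter (the element name for <addSpan/> and <delSpan/>, the value of the type attribute for <textSpan/>).

```python
import xml.etree.ElementTree as ET

SPAN_TAGS = {"addSpan", "delSpan", "textSpan"}

# The fictional example above, wrapped in <p> so that it parses as one tree.
GOOD = """<p>rúm, er
<w>
 <facs><delSpan hand="scribe" rend="overstrike" to="ds-p97.21"/>er</facs>
</w>
<w>
 <facs>t<anchor id="ds-p97.21" type="delSpan"/>ok</facs>
</w>
hægra</p>"""


def check_spans(root):
    """Return a list of problems found in the span/anchor pairs under root."""
    anchors = {a.get("id"): a for a in root.iter("anchor")}
    problems = []
    for el in root.iter():
        if el.tag not in SPAN_TAGS:
            continue
        target = el.get("to")
        anchor = anchors.get(target)
        if anchor is None:
            problems.append(f"{el.tag} to={target!r} has no matching anchor")
            continue
        # addSpan/delSpan: the anchor type repeats the element name;
        # textSpan: the anchor type repeats the textSpan's own type value.
        expected = el.get("type") if el.tag == "textSpan" else el.tag
        if anchor.get("type") != expected:
            problems.append(
                f"anchor {target!r} has type {anchor.get('type')!r}, "
                f"expected {expected!r}"
            )
    return problems


print(check_spans(ET.fromstring(GOOD)))
```

The check does not verify that the anchor follows the span in document order; that would be a natural extension.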

 

7.2.3 Substitutions

In medieval manuscripts a rather common phenomenon is the combination of deleted text and added text. It is not always possible, however, to ascertain the relation between the two. If someone has deleted the originally written text inline this does not automatically mean that a corresponding addition above the line or in the margin is made by the same scribe. It can therefore not be stated as certain whether the correspondence is intentional or not. We suggest that substitutions made in the manuscript should be marked primarily with the two core tags <del> and <add>. In cases where we can be relatively sure about the agent of the whole substitution this could be indicated with a combination of the <del> and the <add> elements as illustrated below.

<del><w>
 <facs>deleted word</facs>
</w></del>
<add><w>
 <facs>added word</facs>
</w></add>

If someone has deleted part of the manuscript text this could be encoded as has been demonstrated above (ch. 7.2.2), and if someone, the same hand or someone else in the manuscript history, has supplied new text for the deletion, this could be encoded as in the following example from Codex Wormianus (AM 242 fol.; here only presented on the <facs> level):

<lb n="5:14"/> ....
<del rend="subpunction" hand="scribe"><w>
 <facs>bar</facs>
</w></del>
<add place="supralinear" hand="scribe"><w>
 <facs>tok</facs>
</w></add>

In this case the attribute hand indicates that both the subpunction and the supralinear addition can be attributed to the scribe of the manuscript text.

In cases where we wish to keep the sequence of <del> and <add> together we can use the empty elements <textSpan type="substitution"/> and <anchor type="substitution"/>, with an ID/IDREF attribute to link them. The above example can then be marked as follows:

<lb n="5:14"/> ....
<textSpan type="substitution" to="sub-5.14"/>
<del rend="subpunction" hand="scribe"><w>
 <facs>bar</facs>
</w></del>
<add place="supralinear" hand="scribe"><w>
 <facs>tok</facs>
</w></add>
<anchor type="substitution" id="sub-5.14"/>

The same type of markup can be used for substitutions which span structural boundaries.

 

7.3 Damage and illegibility

The following section deals with text omitted in the transcription or editing of text due to damage or illegibility in the manuscript, and text supplied from other sources such as other text witnesses or earlier editions.

 

7.3.1 Text omitted from the transcription

When the manuscript is illegible we suggest the use of the elements <gap/> and <supplied> to indicate the illegible text, its extension and how it has been supplied (for the <supplied> element see ch. 7.4.1). The <space/> element is used to represent deliberate omissions from the manuscript which have some significance, e.g. spaces left for decorated initials or words.

Elements Contents
<gap/> Is an element without extension in the encoded manuscript text. It indicates a point where material has been omitted in a transcription because the manuscript text is illegible. Attributes include:
desc Gives a description of the omitted text.
reason Gives the reason for omission. Sample values include: 'sampling', 'illegible', 'irrelevant', 'cancelled', 'cancelled and illegible'.
extent Indicates approximately how much text has been omitted from the transcription, in the way suggested by Helle Jensen and referred to above (ch. 7.1). Values can be given as e.g. number of signs, number of lines or number of pages in the manuscript.
resp Indicates the transcriber, encoder or editor responsible for the decision not to provide any transcription and hence the application of the <gap/> element.
hand In instances where text is omitted from the transcription because of deliberate deletion by an identifiable hand, this attribute signifies the hand which made the deletion.
agent In instances where text is omitted from the transcription because of damage resulting from an identifiable cause, this attribute signifies the causative agent.
<space/> Is an element without extension in the encoded manuscript text. It indicates a point in a transcription of a manuscript where the manuscript has a deliberate omission. Attributes include:
extent The extent of the space. Values can be given as e.g. number of signs, number of lines or number of pages in the manuscript.

In medieval manuscripts we often find sections that for some reason are illegible. This can be due to e.g. damage or use. In the transcription we primarily wish to register the sections that are illegible and the extent of the illegibility. We suggest that the illegible sections should be indicated by the <gap/> element. The extent of the illegible section could be encoded as the following two lines from Völuspá in Hauksbók (AM 544 4to):

<lb n="20v:41"/>viðars niðia.<gap extent="00...00"/>naðr<gap extent="00...00"/>
<lb n="20v:42"/>munv halir<gap extent="00...00"/>yðia<gap extent="00...00"/>mið
<gap extent="00...00"/>

With this markup the extent of the illegible section is not defined. It can be presented on the computer screen or in a printed edition in the manner suggested above (ch. 7.1):

41   viðars niðia. 00...00 naðr 00...00
42   munv halir 00...00 yðia 00...00 mið 00...00 ...

If the transcriber or encoder of the text wishes to define the section more accurately it can be done as in the following example. The number of missing signs is given as a value to the attribute extent. It should be noted that the number given in the example is not intended as an exact evaluation of the number of signs missing in the present manuscript.

<lb n="20v:41"/>viðars niðia.<gap extent="40"/>naðr<gap extent="17"/>
<lb n="20v:42"/>munv halir<gap extent="31"/>yðia<gap extent="10"/>mið
<gap extent="11"/>

This could be represented as follows on the computer screen or in a printed edition. As the accuracy of this kind of evaluation is questionable it should not have the highest priority to display this in e.g. a printed edition.

41   viðars niðia. 0000000000000000000000000000000000000000naðr 00000000000000000
42   munv halir 0000000000000000000000000000000yðia 0000000000 mið00000000000 ...
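The conversion from the extent attribute to the zeros of the display can be sketched as follows in Python. The function names and the <p> wrapper are hypothetical, and we assume that a numeric extent counts signs, while any other value is shown as an extent of unknown length.

```python
import xml.etree.ElementTree as ET

UNKNOWN = "00...00"  # sign used above when the extent is not known


def gap_sign(extent):
    """Return one zero per missing sign, or the sign for an unknown extent."""
    if extent and extent.isascii() and extent.isdigit():
        return "0" * int(extent)
    return UNKNOWN


def render_line(xml_line):
    """Render one transcribed line, replacing each <gap/> by its sign."""
    # Wrap the line so that text and empty elements form a single tree.
    root = ET.fromstring(f"<p>{xml_line}</p>")
    parts = [root.text or ""]
    for child in root:
        if child.tag == "gap":
            parts.append(gap_sign(child.get("extent")))
        parts.append(child.tail or "")
    return "".join(parts)


line = '<lb n="20v:41"/>viðars niðia.<gap extent="40"/>naðr<gap extent="17"/>'
print(render_line(line))
```

The sketch adds no spaces around the zeros; how gaps are spaced in the display is left to the style of the edition.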

See below (ch. 7.4.1) for this example with text supplied by the editor using the <supplied> element.

A deliberate space in a manuscript should be indicated by the <space/> element. For example, the following line in AM 242 fol (95:30) has a space left by the original scribe:

<lb n="95:30"/>hefer æina <space extent="10"/> fyRi samstafa ...

If the text omitted in the space is supplied using the <supplied> element, the <space/> tag will normally be omitted and instead the reason="space" attribute set in the <supplied> element. See ch. 7.4.1.

 

7.3.2 Uncertain readings in the manuscript

In medieval manuscripts we often encounter problems of illegibility due to use or damage. In the following the encoding of such sequences is treated. To some extent this has already been treated in the above section (ch. 7.3.1). In cases where the text is readable to some extent the <gap/> and <supplied> elements should not be used. The TEI Guidelines (ch. 18.2.3) recommend that the <unclear> element be used for encoding damage and illegibility where the text of the damaged or illegible area can be read with some, but not full, certainty.

Elements Contents
<unclear> Contains a letter, word, phrase or passage which cannot be transcribed with certainty because it is illegible in the manuscript text. Attributes include:
reason Indicates why the material is hard to transcribe.
resp Indicates the individual responsible for the transcription of the letter, word, phrase or passage contained within the <unclear> element.
hand Signifies the hand responsible for the action where the difficulty in transcription arises from action (partial deletion, etc.) assignable to an identifiable hand. Note that this attribute has the same function in the <del> element above (ch. 7.2.2).
agent Where the difficulty in transcription arises from an identifiable cause, signifies the causative agent.
rend Describes how the unclear reading should be displayed.
<textSpan/> Marks the beginning of a stretch of unclear text which straddles structural boundaries, e.g. a <div> or when it goes from outside a <w> element to within a <w> element (or vice versa). This element is linked to an <anchor/> element indicating the end of the unclear text. The <textSpan/> element includes the same attributes as the <unclear> element.
type "unclear" value must be set to indicate the type of text span being marked up is unclear.
to Points to the end of the unclear text: its value is identical to the id attribute of the <anchor/> element placed there. The value may be built from the location of the beginning of the unclear text (e.g. a line number). Note that the value of the to and the corresponding id attributes must start with an ASCII letter and consist of letters, digits and/or a '.' or '-'.
<anchor/> An empty element which is used to indicate the end of the unclear text, in conjunction with the <textSpan type="unclear"/> element.
id A valid XML ID, referred to by the to attribute of the <textSpan type="unclear"/> element.
type "unclear" value to identify the type of anchor.

The example given above from the version of Völuspá in Hauksbók (AM 544 4to) can be further encoded with the <unclear> element, to indicate that the text marked with the <unclear> element is read with some certainty while the <gap/> element indicates fully illegible text within the span of unclear text. For the sake of simplicity, we have given identical readings on all three levels below.

<lb n="20v:41"/>
 
<unclear>
<w>
 <facs>viðars</facs>
 <dipl>viðars</dipl>
 <norm>viðars</norm>
</w>
 
<w>
 <facs>niðia</facs>
 <dipl>niðia</dipl>
 <norm>niðia</norm>
</w>
 
<gap extent="00...00"/>
 
<w>
 <facs>naðr</facs>
 <dipl>naðr</dipl>
 <norm>naðr</norm>
</w>
 
<gap extent="00...00"/>
<lb n="20v:42"/>
 
<w>
 <facs>munv</facs>
 <dipl>munv</dipl>
 <norm>munv</norm>
</w>
 
<w>
 <facs>halir</facs>
 <dipl>halir</dipl>
 <norm>halir</norm>
</w>
</unclear>

With this encoding the text could be presented with subpunction for all the words that the editor cannot read with absolute certainty.
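
Where the unclear text begins or ends inside a word, the <textSpan/> and <anchor/> pair described above can be used instead of <unclear>. The following is only a sketch: it assumes, purely for illustration and not as a reading of the manuscript, that the uncertain reading in line 42 runs from the third letter of munv to the second letter of halir. The id value u-20v42 is a hypothetical name, and only the <facs> level is shown.

```xml
<lb n="20v:42"/>
<w>
 <!-- start of the unclear span; to must match the id of the anchor below -->
 <facs>mu<textSpan type="unclear" to="u-20v42"/>nv</facs>
</w>
<w>
 <!-- end of the unclear span; "u-20v42" is a hypothetical id -->
 <facs>ha<anchor type="unclear" id="u-20v42"/>lir</facs>
</w>
```

Both empty elements sit at the same level (<facs>), and the id starts with an ASCII letter, as required for the to and id attributes.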

 

7.4 Editorial interventions

When transcribing medieval material we often encounter words or longer sequences of text that we consider corrupt in one way or another. Sometimes it may also be obvious that text is missing in the manuscript we are transcribing or that the scribe has made a mistake. The transcriber of the manuscript text may in these instances wish to indicate the mistake or even correct the text, either directly from other versions of the same text or based on already existing editions of the text. Sometimes the transcriber or editor may also wish to make obvious grammatical corrections in the text without having any other text witness or any precedent in an earlier edition. In the following, the encoding of corrections made by the transcriber of the text or by an editor is treated. Note that we do not recommend the use of the attribute hand for changes made in the transcription or encoding of the manuscript text. The attribute resp should be used consistently for corrections or additions made in the transcription or encoding of the text, to distinguish clearly between what is found in the manuscript text and what is introduced in the transcription and encoding of the text.

 

7.4.1 Additions made by the transcriber or editor

If text is obviously missing in the manuscript text we may wish to supply it. This could be based on, for example, another text witness or an earlier edition of the text. The markup of such additions should give information about the source as well as about the responsibility for the addition. To encode additions made in the transcription we recommend the use of the <supplied> element as described in the TEI Guidelines (ch. 18.1.5). In the following, the use of <supplied> in relation to our recommended encoding of the individual word within the element <w> and on the three different levels <facs>, <dipl> and <norm> is treated.

Elements Contents
<supplied> Signifies text supplied by the transcriber, encoder or editor in place of text which cannot be read, either because of physical damage or loss in the original or because it is illegible for any reason. Attributes include:
source States the source of the supplied text if this can be located.
resp Indicates the individual responsible for the addition of letters, words or passages contained within the <supplied> tag. It can be given values like:
transcriber The person responsible for the transcription of the manuscript text.
encoder The person responsible for the encoding of the manuscript text.
editor The editor of the text used for the addition or responsible for the addition in editing the manuscript text.
reason Indicates why the text has had to be supplied.
agent Where the presumed loss of text leading to the supplying of text arises from an identifiable cause, signifies the causative agent.
<textSpan/> Marks the beginning of a stretch of supplied text which straddles structural boundaries, e.g. a <div> or when it goes from outside a <w> element to within a <w> element (or vice versa). This element is linked to an <anchor/> element indicating the end of the supplied text. The <textSpan/> element includes the same attributes as the <supplied> element.
type The value "supplied" must be set to indicate that the span of text being marked up is supplied by the transcriber, encoder or editor.
to This attribute points to the end of the supplied text: its value (e.g. based on a line number) must match the id attribute of the <anchor/> element placed at the end of the supplied text. Note that the value of the to and the corresponding id attributes must start with an ASCII letter and consist of letters, digits and/or a '.' or '-'.
<anchor/> An empty element which is used to indicate the end of the supplied text, in conjunction with the <textSpan type="supplied"/> element.
id A valid XML ID, referred to by the to attribute of the <textSpan type="supplied"/> element.
type The value "supplied" must be set to identify the type of anchor.

If the transcriber or editor wishes to supply text that is missing in the transcribed manuscript text, for example from another text witness, this can be handled with the <supplied> element. The interpolated text could be transcribed as in this instance from Rómverjasögur (AM 595 a-b 4to). Note that for clarity we limit the use of encoding to the relevant sequence:

þa uar inumidia að þeir er epter uoru latner uoru með herinum af liði Calpurnii <lb n="1v:4"/>fylgðu siðum sins havfðingia
<supplied>
<w><facs>ok</facs></w>
</supplied>
giorðu marga glæpsamliga
<supplied>
<w><facs>luti</facs></w>
</supplied>.

This could then be displayed on the computer screen or in a printed edition in the manner suggested above (ch. 7.1):

3   ...þa uar inumidia að þeir er epter uoru latner uoru með herinum af liði Calpurnii
4   fylgðu siðum sins havfðingia <ok> giorðu marga glæpsamliga <luti>.
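
The attributes listed above can be added to the same passage. The following is a minimal sketch only: the value transcriber for resp is taken from the list above, while the reason value "omitted" is a hypothetical label chosen for illustration. No source attribute is given, because the source of the supplied words is not stated for this example.

```xml
<lb n="1v:4"/>fylgðu siðum sins havfðingia
<!-- resp uses a value from the list above; reason="omitted" is a hypothetical label -->
<supplied resp="transcriber" reason="omitted">
<w><facs>ok</facs></w>
</supplied>
giorðu marga glæpsamliga
<supplied resp="transcriber" reason="omitted">
<w><facs>luti</facs></w>
</supplied>.
```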

In the example from Völuspá in Hauksbók (AM 544 4to) given above, other sources are available for the illegible text. The omitted text can then be supplied from these sources within the <supplied> element as follows, where the supplied text is from Gustav Neckel's edition of the Edda (Hans Kuhn's revised 5th edition 1983, p. 13). In the following example there are two additions, one consisting of a series of words, and another consisting of a part of a word and additional words:

<lb n="20v:41"/>viðars niðia.
<gap extent="32"/>
<supplied resp="KGJ" source="Neckel1983:13">
 <w>
  <facs>Gengr</facs>
  <dipl>Gengr</dipl>
 </w>
 <w>
  <facs>inn</facs>
  <dipl>inn</dipl>
 </w>
 <w>
  <facs>mæri</facs>
  <dipl>mæri</dipl>
 </w>
 <w>
  <facs>er</facs>
  <dipl>er</dipl>
 </w>
 <w>
  <facs>af</facs>
  <dipl>af</dipl>
 </w>
 <w>
  <facs>móði</facs>
  <dipl>móði</dipl>
 </w>
 <w>
  <facs>drepr</facs>
  <dipl>drepr</dipl>
 </w>
 <w>
  <facs>neppr</facs>
  <dipl>neppr</dipl>
 </w>
 <w>
  <facs>at</facs>
  <dipl>at</dipl>
 </w>
</supplied>
<w>
 <facs>naðr <gap extent="14"/></facs>
 <dipl>naðr<textSpan type="supplied" to="s-20v42" resp="KGJ" source="Neckel1983:13"/>i</dipl>
</w>
<w>
 <facs></facs>
 <dipl>mögr</dipl>
</w>
<w>
 <facs></facs>
 <dipl>Hlóðyniar<anchor type="supplied" id="s-20v42"/></dipl>
</w>
<lb n="20v:42"/>
<w>
 <facs>munv</facs>
 <dipl>munv</dipl>
</w>
<w>
 <facs>halir</facs>
 <dipl>halir</dipl>
</w>
<gap/>

The second unreadable part in the manuscript text marked with <gap/> here starts within a word. Because the supplied text is not a feature of the <facs> level, it is not encoded here but rather at the <dipl> level; the <anchor/> should always be included at the same level as the tag that refers to it. With this encoding it will be possible to display the text as shown in the following example (using "ö" for "o ogonek"):

41   viðars niðia. [Gengr inn mæri er af móði drepr neppr at] naðr [i mögr Hlóðyniar]
42   munv halir [ _ _ _ ]

Editorial additions of this kind are, however, not compulsory. In a primary transcription and encoding, the <gap/> element need only give the essential manuscript information. The attributes of <gap/> and <supplied>, such as source or resp, can of course be included voluntarily, to the extent that the information is available.
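
A primary transcription of the same passage, without any editorial supplement, might then be reduced to something like the following sketch. The extent values are those used above; limiting the encoding to plain text, <lb/> and <gap/> elements is a simplification made here for illustration only.

```xml
<lb n="20v:41"/>viðars niðia.
<!-- illegible stretch; nothing is supplied at this stage -->
<gap extent="32"/>
naðr<gap extent="14"/>
<lb n="20v:42"/>munv halir
<!-- illegible stretch of unstated extent -->
<gap/>
```

The <supplied> elements, with their resp and source attributes, can be added at a later stage without changing the <gap/> elements.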

 

7.4.2 Corrections

In the manuscript it is not always possible to say anything with certainty about the intention of changes in the text. When transcribing the text, however, corrections of obvious mistakes in the manuscript text could be marked with the following tag set recommended in the TEI Guidelines (ch. 6.5.1). In the following the use of <sic> and <corr> in relation to our recommended encoding of the individual word within the element <w> and on the three different levels <facs>, <dipl> and <norm> is treated.

Elements Contents
<sic> Contains text reproduced although apparently incorrect or inaccurate.
<corr> Contains the correct form of a passage apparently erroneous in the manuscript text.
resp Indicates the individual responsible for the correction of letters, words or passages contained within the <corr> and <sic> elements. It can be given values like:
transcriber The person responsible for the transcription of the manuscript text.
encoder The person responsible for the encoding of the manuscript text.
editor Signifies the editor responsible for suggesting the correction.
rend Describes how incorrect readings in the manuscript text should be displayed.
<textSpan/> Marks the beginning of a stretch of incorrect or uncorrected text which straddles structural boundaries, e.g. a <div> or when it goes from outside a <w> element to within a <w> element (or vice versa). This element is linked to an <anchor/> element indicating the end of the text. The <textSpan/> element includes the same attributes as the <sic> and <corr> elements.
type Indicates the type of text span being marked up. Values can be:
sic The span of text is equivalent to the contents of a <sic> element.
corr The span of text is equivalent to the contents of a <corr> element.
to This attribute gives the location of the end of the span of text (e.g. as a line number) and is linked to a corresponding id attribute of an <anchor/> element. Note that the value of the to and the corresponding id attributes must start with an ASCII letter and consist of letters, digits and/or a '.' or '-'.
<anchor/> An empty element which is used to indicate the end of the span of text, in conjunction with the <textSpan type="sic"/> or <textSpan type="corr"/> element.
id A valid XML ID, referred to by the to attribute of the <textSpan type="sic"/> or <textSpan type="corr"/> element.
type Identifies the type of anchor. The value should be the same as that of the corresponding <textSpan/> element.

In a first-level transcription it can be relevant just to mark the obviously corrupted instances in the manuscript text. This could be done with the <sic> element as in this instance from Rómverjasögur (AM 595 a-b 4to). Note that for clarity we limit the use of encoding to the relevant sequence, and that the encoding is only presented on the <facs> level.

ok sua mikinn avrugglæik hafa þeir að giora illa
<sic><w><facs>uitier</facs></w></sic>
<lb n="1r:15"/>lavst að æigi munu þeir af lata nema þeim se bannað.

In this example the word uitier is marked as corrupt. There is no indication as to what is corrupt or how it should be corrected. The next step is to correct the corrupted instance, which could be done by keeping the <sic> element at the <facs> level and adding a <corr> element at the <dipl> level.

ok sua mikinn avrugglæik hafa þeir að giora illa
<w>
 <facs><sic>uitier</sic></facs>
 <dipl><corr>uitis</corr></dipl>
</w>
<lb n="1r:15"/>lavst að æigi munu þeir af lata nema þeim se bannað.

This means that the correct reading of this passage should be as follows:

ok sua mikinn avrugglæik hafa þeir að giora illa uitis<lb n="1r:15"/>lavst að æigi munu þeir af lata nema þeim se bannað.

With this markup it is possible to show the text on the computer screen or in a printed edition in accordance with the suggestions above (ch. 7.1):

14   ...ok sua mikinn avrugglæik hafa þeir að giora illa *uitis
15   lavst að æigi munu þeir af lata nema þeim se bannað...

with the uncorrected form from the manuscript text underneath the edited text:

* uitier

It is also possible to include information about the person responsible for the correction with the attribute resp and its values:

ok sua mikinn avrugglæik hafa þeir að giora illa
<w>
 <facs><sic resp="transcriber">uitier</sic></facs>
 <dipl><corr resp="transcriber">uitis</corr></dipl>
</w>
<lb n="1r:15"/>lavst að æigi munu þeir af lata nema þeim se bannað.
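
If the passage regarded as corrupt did not coincide with a single word, the <textSpan/> and <anchor/> pair described above could be used instead of <sic>. The following sketch assumes, purely for illustration and not as a reading of the manuscript, that the corrupt sequence runs from the last two letters of uitier to the end of lavst. The id value c-1r15 is a hypothetical name, and only the <facs> level is shown.

```xml
<w>
 <!-- start of the span regarded as corrupt; to must match the anchor id below -->
 <facs>uiti<textSpan type="sic" to="c-1r15" resp="transcriber"/>er</facs>
</w>
<lb n="1r:15"/>
<w>
 <!-- end of the span; the type value repeats that of the textSpan -->
 <facs>lavst<anchor type="sic" id="c-1r15"/></facs>
</w>
```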

 


Version 1.0 published 20 May 2003. Version 1.1 published 5 May 2004. Version 2.0 ....