|
MUFI
proposal for coordinated usage of the Private Use
Area
Ed. by Odd Einar
Haugen, University of Bergen, Norway
Version 0.9.2 (15
September 2003)
Changes
from version 0.9.1:
- One character removed from the end of subrange 2,
LATIN ABBREVIATION Z FORM
- Link to ISO/IEC
JTC 1/SC 2 N 3126
added for several characters in subrange 2
- One character added at the end of subrange 4,
LATIN SMALL LETTER U BAR
Background &
contributors
The present
proposal is a major revision and extension of two
proposals published on the MUFI web
site:
A
proposal for subranges within the Private Use Area
of Unicode
(15 June 2002)
A
proposal for supplementary characters in
Unicode
(5 February 2003)
These proposal
received a number of helpful comments from (in
alphabetical order) Jim Allan (U.S.), Deborah W.
Anderson (Berkely, CA), Peter S. Baker
(Charlottesville, VA), Michael Beddow (Leeds),
António H.A. Emiliano (Lisboa), Michael
Everson (Dublin), Jost Gippert (Frankfurt),
Juan-José G. Marcos (Plasencia, Spain),
Susana T. Pedro (Lisboa), David J. Perry (Rye, NY),
Gerhard Schumacher (Köln), Ken Whistler
(Unicode consortium), and several other Unicode
officers.
The present
proposal is the result of a meeting held in Bergen,
Norway, 30-31 August 2003. Participants at this
meeting were Odd Einar Haugen (Bergen), Michael
Everson (Dublin), Michael Irlenbusch (Bergen), Alec
McAllister (Leeds), Gerhard Schumacher (Köln),
and Tarrin Wills (Sydney).
This proposal was
published 8 September 2003 on the MUFI site for
review. It is hoped that the proposal may lead to a
commonly accepted proposal by the end of September
2003, to be posted as version 1.0 around 1 October
on the MUFI site.
Many aspects of
this proposal will be controversial, and more than
one of the contributors and advisors listed above
may disagree with the solutions chosen in the
proposal. It is, however, clear that this proposal
would have progressed much slower and been much
inferior had it not been for all the help and
comments received.
Contents
Introduction
This proposal
contains a set of characters (or variant character
forms) for the use of medievalists and to some
extent classicists. The aim of the proposal is to
establish a coordinated usage of code points in the
Private Usea Area, and it particularly aims at
coordinating the usage of code points in existing
Unicode fonts such as Titus and Junicode with fonts
under development.
The proposal
contains a representative glyph for each character,
a categorisation, a recommended entity name, a
Unicode code point, and a descriptive
name.
Glyphs
The glyphs used in this proposal are for
guidance only.
Categories
Characters are divided into six
categories:
0. Characters
which already are in the Unicode standard. They
have been included in the proposal for the sake of
information.
1.
Characters which should be proposed for inclusion
in the Unicode standard (or currently are pending
approval). Characters approved by Unicode should
subsequently be removed and substituted with a
warning sign, preferably the character in question
placed within a triangular sign.
2.
Precomposed characters which can be encoded with a
sequence of Unicode characters, but which can not
be displayed easily or properly with existing font
technology. Characters in this category should be
removed as soon as there is mature smart font
technology available.
3.
Variant character form (e.g. regional or
chronological variants). Characters in this
category might be proposed as variants in the
Unicode standard. They should be removed as soon as
there is mature smart font technology available for
handling variant letter forms.
4.
Characters that are under review for inclusion in
the MUFI proposal. For this reason, space has been
reserved for them.
5. Characters
already in the Unicode standard which ought to have
a separate entity name for semantic reasons, e.g.
to distinguish the semicolon used as a punctuation
mark from the semicolon used as an abbreviation
mark. This category is relevant only for encoding
purposes.
Entity
names
Entities are used in numerous encoding schemes
such as SGML/XML. For the sake of transparency and
interchangeability, it is recommended that entities
as far as possible conform to the standard ISO
entity sets. An updated list of ISO conformant
entities can be found at the Oasis
web site.
In addition to
the ISO entities, a number of entities for
characters not designated in this standard is
needed. This proposal uses the syntax and inventory
recommended in the Menota
handbook,
ch. 2 and 5, summarised in the table below. Note
that not all slots need to be filled in; in most
cases only one or two slots are used in addition to
the base line character.
|
base
line character
|
main
type
|
variant
|
ligature
|
fixed
modification
|
loose
modification
|
|
a
A
etc.
|
comb
scap
enl
ins
unc
run
|
rot
squ
tall
dotless
brk
cl
|
lig
|
slash
str
ovl
ogon
hook
loopupright
looplowleft
des
|
acute
dblac
dot
uml
grave
brev
macr
dotbl
|
The inventory is
more or less self-explanatory:
|
main
type / variant /ligature
|
full
form
|
fixed
and loose modification
|
full
form
|
|
comb
scap
enl
ins
unc
run
rot
squ
tall
dotless
brk
close
lig
|
combining
mark
small capital
enlarged minuscule
insular form
uncial form
runic form
rotunda (round form)
square form
tall
dotless
broken
closed form
ligature
|
slash
str
ovl
ogon
hook
loopupright
looplowleft
des
curl
acute
dblac
dot
uml
grave
breve
macr
dotbl
|
slash
(diagonal)
stroke (horizontal, part-width)
overline (horizontal, full-width)
ogonek
hook
loop upper right
loop lower left
descending
curl (ogonek above)
acute
double acute
dot above
umlaut (= diaeresis)
grave
breve
macron
dot below
|
Note that if
there is a conflict between the standard ISO
entities as of 01.09.2003 and the syntax suggested
here ISO entites should be preferred.
MUFI code
point
This is a code point within the Private Use
Area in the Basic Multilingual Plane of Unicode,
E000 - F8FF (6400 code points). In this proposal, a
section of 512 characters have been set aside for
MUFI characters, from F400 to F5FF.
If a character
already is in the Unicode standard (or there is a
near-identical character in the standard) official,
non-PUA code points will be given here.
Junicode code
point
Junicode code points are located in the range F100
- F19F. No MUFI code points are allocated to this
section, but Junicode characters in this section
have been allocated duplicate code points in the
MUFI section.
Titus code
point
Titus code points are located in the range E000 -
F0FF. Precomposed MUFI characters are allocated to
the code points used by Titus to the extent that
they are included in Titus. The remaining
precomposed MUFI characters are located in the last
part of the MUFI section, F540 - F5FF.
Descriptive
name
Each character has been given a descriptive name,
as far as possible according to the rules in the
Unicode standard. The Menota
handbook
ch. 2 has further details on the interpretation of
the Unicode naming rules.
Top
of document
MUFI
subrange 1: Mixed script characters (prev. subrange
1)
Medieval
manuscripts were written in several styles, such as
Uncial, Insular, Carolingian, and Gothic. As long
as the manuscript is written in a uniform style,
whether it is Insular or Carolingian, it is usually
not advisable to encode each character as belonging
to a specific style. Thus, the character "b" should
be encoded as "b" whether it is writtin in Insular
or Carolingian style. However, in many Medieval
manuscripts there is a mixture of styles, e.g. of
Insular and Carolingian letter forms, and some
transcribers would like to encode extraneous letter
forms as such, e.g. sporadic Insular letter forms
in an otherwise Carolingian style. This applies
especially to the most distinct letter forms, such
as "f", "r" and "v" in Insular style, and "d", "e",
"m" and "t" in Uncial style.
This range also
includes other types of variant letter forms.
|
Glyph
|
Cat.
|
Entity
|
MUFI
|
Junicode
|
Descriptive
name
|
|

|
1
|
&dunc;
|
F400
|
F109
|
LATIN
LETTER UNCIAL
D
|
|

|
3
|
&eunc;
|
F401
|
F10A
|
LATIN
LETTER UNCIAL E
|
|

|
4
|
&Fins;
|
F402
|
|
LATIN
CAPITAL LETTER INSULAR F
|
|

|
3
|
&fins;
|
F403
|
F103
|
LATIN
SMALL LETTER INSULAR F
|
|

|
3
|
&Gins;
|
F404
|
|
LATIN
CAPITAL LETTER INSULAR G
|
|

|
3
|
&gins;
|
F405
|
F10F
|
LATIN
SMALL LETTER INSULAR G
|
|

|
3
|
&hins;
|
F406
|
F110
|
LATIN
CAPITAL LETTER INSULAR H
|
|

|
1
|
&jdl;
|
F407
|
|
LATIN
SMALL LETTER J DOTLESS
|
|

|
1
|
&kunc;
|
F408
|
|
LATIN
LETTER UNCIAL K
|
|

|
3
|
&munc;
|
F409
|
F11A
|
LATIN
LETTER UNCIAL M
|
|

|
1
|
&rrot;
|
F40A
|
|
LATIN
SMALL LETTER R ROTUNDA
|
|

|
3
|
&rins;
|
F40B
|
F125
|
LATIN
SMALL LETTER INSULAR R
|
|

|
0
|
&rdes;
|
027C
|
|
LATIN
SMALL LETTER R WITH LONG LEG
|
|

|
3
|
&stalldes;
|
F40C
|
F127
|
LATIN
SMALL LETTER TALL S
DESCENDING
|
|
|
|
* This
character extends below the base line,
while the ordinary tall s is located on
the base line.
|
|

|
3
|
&tunc;
|
F40D
|
F129
|
LATIN
LETTER UNCIAL T
|
|

|
1
|
&Vins;
|
F40E
|
|
LATIN
CAPITAL LETTER INSULAR V
(VENTH)
|
|
|
|
*
Partially similar to 01F7 LATIN CAPITAL
LETTER WYNN in Latin
Extended-B.
Note that the Old Norse variant is open,
resembling the character "Y", and is
translitterated with "v", not with "w"
like in Old English.
|
|

|
1
|
&vins;
|
F40F
|
|
LATIN
SMALL LETTER INSULAR V
(VENTH)
|
|
|
|
*
Partially similar to 01BF LATIN LETTER
WYNN in Latin
Extended-B.
Note that the Old Norse variant is open,
resembling the character "y", and is
translitterated with "v", not with "w"
like in Old English.
|
Reserved
space after this range: F410 to F41F (16 code
points).
Top
of document
MUFI
subrange 2: Base line abbreviation characters
(prev. part of subrange 7)
This range
includes those abbreviation signs which typically
occupy a position on the base line.
Very few
abbreviation signs are included in Unicode
4.0. An exception is the sign for "et", which
is found in the range General
punctuation
as TIRONEAN SIGN ET (204A).
The Runic
characters "f" and "m" are sometimes used as
abbreviation marks; they are now included in the
Unicode range Runic,
as 16AO and 16D8 respectively. Since they are used
with their alphabetical names as abbreviation
("fé" and "maðr" respectively), it is
not necessary to define them as separate
characters.
|
Glyph
|
Cat.
|
Entity
|
MUFI
|
Junicode
|
Descriptive
name
|
|

|
5
|
&apo;
|
0027
|
|
APOSTROPHE
|
|
|
|
A sign
similar to the apostrophe was used as an
abbreviation for "i" or "e". Cf. D.A.
Seip, Palæografi: Norge og
Island (Nordisk kultur 23:B), Oslo
etc. 1954, p. 125.
|
|

|
1
|
&con9;
|
F420
|
F156
|
LATIN
ABBREVIATION SIGN CON
|
|

|
5
|
&cono;
|
0254
|
|
LATIN
SMALL LETTER OPEN O
|
|

|
5
|
✗
|
271D
|
|
LATIN
CROSS
|
|

|
0
|
&obiit;
|
03B8
|
|
GREEK
SMALL LETTER THETA
|
|
|
|
Cf. D.A.
Seip, Palæografi: Norge og
Island (Nordisk kultur 23:B), Oslo
etc. 1954, p. 30.
|
|

|
5
|
&est;
|
223B
|
F150
|
LATIN
ABBREVIATION SIGN EST
|
|
|
|
Cf. D.A.
Seip, Palæografi: Norge og
Island (Nordisk kultur 23:B), Oslo
etc. 1954, p. 82.
|
|

|
4
|
&ET;
|
F421
|
F142
|
TIRONEAN
LARGE SIGN ET
|
|
|
|
Cf. D.A.
Seip, Palæografi: Norge og
Island (Nordisk kultur 23:B), Oslo
etc. 1954, p. 30 and 81.
|
|

|
0
|
&et;
|
204A
|
F143
|
TIRONEAN
SIGN ET
|
|

|
2
|
&etbar;
|
F422
|
|
TIRONEAN
SIGN ET WITH CROSSBAR
|
|
|
|
Identical
to 03/09 LATIN CONTRACTION ET in
ISO/IEC
JTC 1/SC 2 N
3126.
|
|

|
0
|
&
|
0026
|
|
AMPERSAND
|
|
|
|
Used in
some Medieval Nordic manuscripts for
"ok".
|
|

|
1
|
&rum;
|
F423
|
F154
|
LATIN
ABBREVIATION SIGN RUM
|
|

|
1
|
&de;
|
F424
|
|
LATIN
ABBREVIATION SIGN DE
|
|
|
|
This
sign looks similar to "eth", but is used
as an abbreviation for "de" and should
therefore have its own code point. Cf.
D.A. Seip, Palæografi: Norge og
Island (Nordisk kultur 23:B), Oslo
etc. 1954, p. 25, 44 and 60.
|
|

|
1
|
&is;
|
F425
|
|
LATIN
ABBREVIATION SIGN IS
|
|
|
|
Cf.
Johs. Brøndum-Nielsen, ed.,
Palæografi: Danmark og
Sverige (Nordisk kultur 23:A),
Stockholm etc. [1943], p. 32, and
D.A. Seip, Palæografi: Norge og
Island (Nordisk kultur 23:B), Oslo
etc. 1954, p. 125. Possibly identical to
02/14 LATIN CONTRACTION IS in
ISO/IEC
JTC 1/SC 2 N
3126.
|
|

|
0
|
&ed;
|
003B
|
|
SEMICOLON
|
|

|
1
|
&etfin;
|
F426
|
F155
|
LATIN
ABBREVIATION SIGN FINAL ET
|
|
|
|
Cf. D.A.
Seip, Palæografi: Norge og
Island (Nordisk kultur 23:B), Oslo
etc. 1954, p. 125. Possibly identical to
03/12 LATIN CONTRACTION UM in
ISO/IEC
JTC 1/SC 2 N
3126.
|
Reserved
space after this range: F427 to F42F (9 code
points).
Top
of document
MUFI
subrange 3: Base line characters with overline
(prev. part of subrange 7)
The overline (bar
above) is probably the most used and also the most
ambiguous of all abbreviation marks. There are two
typical positions of the overline: above the full
height of the majuscules and above the x-height of
the minuscules. In the latter position it typically
crosses the ascender of characters like "b", "d",
"h", "k", "l" "þ" and tall "s". If the word
has a mixture of characters with and without
ascenders, the bar should sometimes be kept in the
upper position over all characters.
There are two
typical lengths of the overline: less than the
width of a character, like the macron, or the full
width of the character, so that it can extend as a
continuous line over several characters.
With present font
technology, the overline is particularly difficult.
With some fonts and operating systems it will
change its vertical position depending on the
height of each character. Thus, in an abbreviation
such as "ihc" for "Iesus", the overline may have
one position over "c", a slightly higher position
over "i" and an even higher position over
"h".
This range is
intended as a work-around until there is mature
smart font technology. It has separate code points
for all characters where the overline crosses the
ascender, either as a single stroke (macron-length)
or as a continuous stroke (overline). Some of these
characters are already in the standard, such as "b"
with bar across. The font designer should take care
to align the overlines on all
characters.
This range also
has two combining overlines, one for minuscules (in
the same heigth as the dot over "i") and one for
majuscules (in the same height as the accents).
Note that these combining overlines should have
"hard" positions, so that they will have the same
height regardless of the characters below. In this
respect, they will differ from 0304 COMBINING
MACRON and 0305 COMBINING OVERLINE, which may be
displayed with variable heigth due to limitations
in present font technology (cf. subrange
12 below).
The combining overlines ought to have medium width,
suitable for characters like "O" and
"o".
Finally, the
range includes a few precomposed characters for
extra narrow and wide characters, such as "i", "j",
"l" and "m".
|
Glyph
|
Cat.
|
Entity
|
MUFI
|
Descriptive
name
|
|

|
3
|
¯fixhi;
|
F430
|
COMBINING
FIXED-HEIGHT HIGH MACRON
|
|

|
3
|
¯fixmed;
|
F431
|
COMBINING
FIXED-HEIGHT MEDIUM-HIGH
MACRON
|
|

|
3
|
&ovlfixhi;
|
F432
|
COMBINING
FIXED-HEIGHT HIGH OVERLINE
|
|

|
3
|
&ovlfixmed;
|
F433
|
COMBINING
FIXED-HEIGHT MEDIUM-HIGH
OVERLINE
|
|

|
0
|
&bstr;
|
0180
|
LATIN
SMALL LETTER B WITH STROKE
|
|

|
2
|
&bovl;
|
F434
|
LATIN
SMALL LETTER B WITH FULL-WIDTH
STROKE
|
|

|
0
|
&dstr;
|
0111
|
LATIN
SMALL LETTER D WITH STROKE
|
|

|
2
|
&dovl;
|
F435
|
LATIN
SMALL LETTER D WITH FULL-WIDTH
STROKE
|
|

|
0
|
&hstr;
|
F127
|
LATIN
SMALL LETTER H WITH STROKE
|
|

|
2
|
&hovl;
|
F436
|
LATIN
SMALL LETTER H WITH FULL-WIDTH
STROKE
|
|

|
2
|
&kstr;
|
F453
|
LATIN
SMALL LETTER K WITH STROKE
|
|
|
|
* This
character is also listed in subrange 4
below.
|
|

|
2
|
&kovl;
|
F437
|
LATIN
SMALL LETTER K WITH FULL-WIDTH
STROKE
|
|

|
2
|
&lstr;
|
F438
|
LATIN
SMALL LETTER L WITH STROKE
|
|

|
2
|
&lovl;
|
F439
|
LATIN
SMALL LETTER L WITH FULL-WIDTH
STROKE
|
|

|
2
|
&stallstr;
|
F455
|
LATIN
SMALL LETTER TALL S WITH
STROKE
|
|
|
|
* This
character is also listed in subrange 4
below.
|
|

|
2
|
&stallovl;
|
F43A
|
LATIN
SMALL LETTER TALL S WITH FULL-WIDTH
STROKE
|
|

|
2
|
þstr;
|
F454
|
LATIN
SMALL LETTER THORN WITH
STROKE
|
|
|
|
* This
character is also listed in subrange 4
below.
|
|

|
2
|
þovl;
|
F43B
|
LATIN
SMALL LETTER THORN WITH FULL-WIDTH
STROKE
|
|

|
0
|
Ī
|
012A
|
LATIN
CAPITAL LETTER I WITH MACRON
|
|

|
2
|
&Iovl;
|
F43C
|
LATIN
CAPITAL LETTER I WITH
OVERLINE
|
|

|
0
|
ī
|
012B
|
LATIN
SMALL LETTER I WITH MACRON
|
|

|
2
|
&iovl;
|
F43D
|
LATIN
SMALL LETTER I WITH
OVERLINE
|
|

|
2
|
&Jmacr;
|
F43E
|
LATIN
CAPITAL LETTER J WITH
MACRON
|
|

|
2
|
&jovl;
|
F43F
|
LATIN
CAPITAL LETTER J WITH
OVERLINE
|
|

|
2
|
&jmacr;
|
F440
|
LATIN
SMALL LETTER J WITH MACRON
|
|

|
2
|
&jovl;
|
F441
|
LATIN
SMALL LETTER J WITH
OVERLINE
|
|

|
2
|
&lmacr;
|
F442
|
LATIN
SMALL LETTER L WITH MACRON
|
|

|
2
|
&lovl;
|
F443
|
LATIN
SMALL LETTER L WITH
OVERLINE
|
|

|
2
|
&Mmacr;
|
F444
|
LATIN
CAPITAL LETTER M WITH
MACRON
|
|

|
2
|
&Movl;
|
F445
|
LATIN
CAPITAL LETTER M WITH
OVERLINE
|
|

|
2
|
&mmacr;
|
F446
|
LATIN
SMALL LETTER M WITH MACRON
|
|

|
2
|
&movl;
|
F447
|
LATIN
SMALL LETTER M WITH
OVERLINE
|
Reserved
space after this range: F448 to F44F (8 code
points).
Top
of document
MUFI
subrange 4: Complex abbreviational characters
(prev. subrange 9)
This range
includes a number of abbreviated characters,
typically a base line character with a bar
across.
|
Glyph
|
Cat.
|
Entity
|
MUFI
|
Junicode
|
Descriptive
name
|
|

|
5
|
&hhook;
|
0266
|
|
LATIN
SMALL LETTER H WITH HOOK (LIGATURE OF H
AND TALL S)
|
|

|
5
|
&khook;
|
0199
|
|
LATIN
SMALL LETTER K WITH HOOK (LIGATURE OF K
AND TALL S)
|
|

|
1
|
þhook;
|
F450
|
|
LATIN
SMALL LETTER THORN WITH HOOK (LIGATURE OF
THORN AND TALL S)
|
|
|
|
*
Unicode 4.0 has a "p with hook", 01A5, but
that should probably not be used for
"thorn". In many fonts it does not look
like a thorn at all.
|
|

|
1
|
&hhookstr;
|
F451
|
|
LATIN
SMALL LETTER H WITH HOOK (LIGATURE OF H
AND TALL S) AND STROKE
|
|

|
1
|
&khookstr;
|
F452
|
|
LATIN
SMALL LETTER K WITH HOOK (LIGATURE OF K
AND TALL S) AND STROKE
|
|

|
1
|
þhookstr;
|
F453
|
|
LATIN
SMALL LETTER THORN WITH HOOK (LIGATURE OF
THORN AND TALL S) AND
STROKE
|
|

|
1
|
&kstr;
|
F454
|
F14B
|
LATIN
SMALL LETTER K WITH STROKE
|
|

|
1
|
þstr;
|
F455
|
F149
|
LATIN
SMALL LETTER THORN WITH
STROKE
|
|

|
1
|
&stallstr;
|
F456
|
F14F
|
LATIN
SMALL LETTER TALL S WITH
STROKE
|
|

|
1
|
&Pbardes;
|
F457
|
F144
|
LATIN
CAPITAL LETTER P WITH STROKE THROUGH
DESCENDER
|
|

|
1
|
&pbardes;
|
F458
|
F145
|
LATIN
SMALL LETTER P WITH STROKE THROUGH
DESCENDER
|
|

|
1
|
&qbardes;
|
F459
|
F14D
|
LATIN
SMALL LETTER Q WITH STROKE THROUGH
DESCENDER
|
|

|
1
|
&thbardes;
|
F45A
|
|
LATIN
SMALL LETTER THORN WITH STROKE THROUGH
DESCENDER
|
|

|
1
|
&Pflour;
|
F45B
|
F146
|
LATIN
CAPITAL LETTER P WITH
FLOURISH
|
|

|
1
|
&pflour;
|
F45C
|
F147
|
LATIN
SMALL LETTER P WITH
FLOURISH
|
|

|
0
|
&ubar;
|
0289
|
|
LATIN
SMALL LETTER U BAR
|
|
|
|
The "u"
with a bar across was used in some Late
Medieval Danish manuscripts for /y/. Cf.
Johs. Brøndum-Nielsen, ed.,
Palæografi: Danmark og
Sverige (Nordisk kultur 23:A),
Stockholm etc. [1943], p. 53 and
55.
|
Reserved
space after this range: F45D to F46F (19 code
points).
Top
of document
MUFI
subrange 5: Punctuation marks (prev. subrange
6)
This range
includes those punctuation marks that are not
included in the official Unicode ranges. The common
marks, such as full stop, comma, colon, semicolon,
question mark, hyphen and solidus, are all found in
Unicode 4.0 Basic
Latin.
The middle dot is included in Latin-1
Supplement,
(00B7), and although it is not defined specifically
as a punctuation mark in this range, it is probably
not necessary to duplicate it in the present
range.
|
Glyph
|
Cat.
|
Entity
|
MUFI
|
Junicode
|
Descriptive
name
|
|

|
1
|
&seminv;
|
F470
|
F160
|
PUNCTUATION
MARK PUNCTUS ELEVATUS (INVERTED
SEMICOLON)
|
|

|
3
|
&seminvdiag;
|
F471
|
|
PUNCTUATION
MARK PUNCTUS ELEVATUS DIAGONAL
STROKE
|
|

|
3
|
&questcurl;
|
F472
|
F161
|
PUNCTUATION
MARK PUNCTUS INTERROGATIVUS
|
|

|
3
|
&quest8;
|
F473
|
|
QUESTION
MARK HORIZONTAL 8 FORM
|
|
|
|
Cf. D.A.
Seip, Palæografi: Norge og
Island (Nordisk kultur 23:B), Oslo
etc. 1954, p. 63.
|
|

|
1
|
&diacom;
|
F474
|
|
PUNCTUATION
MARK DIAERESIS ABOVE COMMA
|
|
|
|
* Cf.
Hreinn Benediktsson, Early Icelandic
Script, Reykjavík 1965, p. 95,
and D.A. Seip, Palæografi: Norge
og Island (Nordisk kultur 23:B), Oslo
etc. 1954, p. 63. Note that the bottom
part should look like a comma, not an
ogonek.
|
|

|
4
|
&brevdot;
|
F475
|
|
PUNCTUATION
MARK BREVE ABOVE DOT
|
|
|
|
Cf. D.A.
Seip, Palæografi: Norge og
Island (Nordisk kultur 23:B), Oslo
etc. 1954, p. 34.
|
|

|
0
|
&tridotsupw;
|
2234
|
|
THEREFORE
|
|

|
0
|
&tridotsright;
|
10FB
|
F162
|
GEORGIAN
PARAGRAPH SEPARATOR
|
Reserved
space after this range: F476 to F47F (10 code
points).
Top
of document
MUFI
subrange 6: Critical and epigraphical signs (prev.
subrange 12)
Critical signs
are used in printed editions, indicating
corruptions, deletions, additions etc. The majority
of these signs are already in Unicode 4.0,
such as the asterisk (002A), the obelus (= dagger,
2020), curly brackets (007B, 007D), square brackets
(005B, 005D), single vertical line (007C), double
vertical line (2016), and open brackets (= angle
brackets, 3008, 3009).
|
Glyph
|
Cat.
|
Entity
|
MUFI
|
Junicode
|
Descriptive
name
|
|

|
1
|
&whsqbl;
|
F480
|
|
LEFT
WHITE SQUARE BRACKET
|
|
|
|
* Or
possibly 301A LEFT WHITE SQUARE BRACKET in
CJK
Symbols and
Punctuation
(problematic since this sign typically is
very wide).
|
|

|
1
|
&whsqbr;
|
F481
|
|
RIGHT
WHITE SQUARE BRACKET
|
|
|
|
* Or
possibly 301B RIGHT WHITE SQUARE BRACKET
in CJK
Symbols and
Punctuation
(problematic since this sign typically is
very wide).
|
|

|
1
|
&hsqblu;
|
F482
|
|
LEFT
UPPER HALF SQUARE BRACKET
|
|
|
|
* Or
possibly 2308 LEFT CEILING in
Miscellaneous
Technical.
|
|

|
1
|
&hsqbru;
|
F483
|
|
RIGHT
UPPER HALF SQUARE BRACKET
|
|
|
|
* Or
possibly 2309 RIGHT CEILING in
Miscellaneous
Technical.
|
|

|
4
|
&hsqbll;
|
F484
|
|
LEFT
LOWER HALF SQUARE BRACKET
|
|
|
|
* Or
possibly 230A LEFT FLOOR in
Miscellaneous
Technical.
This character is not being used in
Medieval Nordic contexts, but as the upper
variant is being used, space has been
reserved.
|
|

|
4
|
&hsqbrl;
|
F485
|
|
RIGHT
LOWER HALF SQUARE BRACKET
|
|
|
|
* Or
possibly 230B RIGHT FLOOR in
Miscellaneous
Technical.
This character is not being used in
Medieval Nordic contexts, but as the upper
variant is being used, space has been
reserved.
|
|

|
1
|
&slstlu;
|
F486
|
F16E
|
LEFT
UPPER SLANTED STROKE
|
|

|
1
|
&slstru;
|
F487
|
F16F
|
RIGHT
UPPER SLANTED STROKE
|
|

|
1
|
&slstll;
|
F488
|
|
LEFT
LOWER SLANTED STROKE
|
|

|
1
|
&slstrl;
|
F489
|
|
RIGHT
LOWER SLANTED STROKE
|
Reserved
space after this range: F48A to F48F (6 code
points).
Top
of document
MUFI
subrange 7: Metrical symbols (prev. subrange
11)
This is the type
of symbols used by Eduard Sievers in his
Altgermanische metrik (Halle: Max Niemeyer,
1893). They are still frequently used in text
books, monographs and articles on Medieval Nordic
metrics.
Thesaurus
Linguae Grecae has recently proposed a set of
metrical symbols for Greek. This proposal has been
approved by the Unicode Technical Committee, but
not yet by ISO-10646.
Thesaurus
Linguae Grecae: proposal for metrical
symbols
This proposal
includes symbols for the short syllable
(breve) and for the combination of a short
and long syllable (metrical short over
long). The default character (anceps) is
identified with MULTIPLICATION SIGN (00D7) in the
range Latin-1
Supplement,
and the symbol for long syllable (longum) is
identified with FIGURE DASH (2012) or EN DASH
(2013) in the range General
Punctuation.
Combinations of anceps, breve,
longum and grave or acute accent can
presumably be achieved by using COMBINING ACUTE
ACCENT (0300) and COMBINING GRAVE ACCENT (0301) in
the range Combining
Diacritical Marks.
However, this does not work well in most
applications, so for the time being precomposed
characetrs will be needed.
|
Glyph
|
Cat.
|
Entity
|
MUFI
|
Junicode
|
Descriptive
name
|
|

|
3
|
&anc;
|
F490
|
F164
|
METRICAL
SYMBOL ANCEPS
|
|
|
|
* This
symbol is almost identical to the
MULTIPLICATION SIGN (00D7) in the range
Latin-1
Supplement,
but the anceps should be placed slightly
lower, touching the base line and wholly
within the x height of the
font.
|
|

|
2
|
&ancac;
|
F491
|
F165
|
METRICAL
SYMBOL ANCEPS WITH PRIMARY
STRESS
|
|

|
2
|
&ancgr;
|
F492
|
F166
|
METRICAL
SYMBOL ANCEPS WITH SECONDARY
STRESS
|
|

|
3
|
&sht;
|
F493
|
F16A
|
METRICAL
SYMBOL SHORT SYLLABLE
|
|
|
|
* This
symbol is similar to SPACING BREVE (02D8)
in the range Spacing
Modifying
letters,
but should be positioned much closer to
the base line.
|
|

|
2
|
&shtac;
|
F494
|
F16B
|
METRICAL
SYMBOL SHORT SYLLABLE WITH PRIMARY
STRESS
|
|

|
2
|
&shtacall;
|
F495
|
|
METRICAL
SYMBOL SHORT SYLLABLE WITH PRIMARY STRESS
AND ALLITERATION
[Similar
to F494, but with two stress marks like a
double acute]
|
|

|
2
|
&shtgr;
|
F496
|
F16C
|
METRICAL
SYMBOL SHORT SYLLABLE WITH SECONDARY
STRESS
|
|

|
3
|
&lng;
|
F497
|
F167
|
METRICAL
SYMBOL LONG SYLLABLE
|
|
|
|
* This
symbol is similar to FIGURE DASH (2012) or
EN DASH (2013) in the range
General
Punctuation,
but usually positioned closer to the base
line. FIGURE DASH seems to have the same
graphical properties as EN DASH.
|
|

|
2
|
&lngac;
|
F498
|
F168
|
METRICAL
SYMBOL LONG SYLLABLE WITH PRIMARY
STRESS
|
|

|
2
|
&lngacall;
|
F499
|
|
METRICAL
SYMBOL LONG SYLLABLE WITH PRIMARY STRESS
AND ALLITERATION
[Similar
to F498, but with two stress marks like a
double acute]
|
|

|
2
|
&lnggr;
|
F49A
|
F169
|
METRICAL
SYMBOL LONG SYLLABLE WITH SECONDARY
STRESS
|
|

|
2
|
&shtlng;
|
F49B
|
|
METRICAL
SYMBOL SHORT OR LONG
SYLLABLE
|
|

|
2
|
&shtlngac;
|
F49C
|
|
METRICAL
SYMBOL SHORT OR LONG SYLLABLE WITH PRIMARY
STRESS
|
|

|
2
|
&shtlnggr;
|
F49D
|
|
METRICAL
SYMBOL SHORT OR LONG SYLLABLE WITH
SECONDARY STRESS
|
|

|
2
|
&shtlnggr;
|
F49E
|
|
METRICAL
SYMBOL RESOLVED LIFT
[From
bottom upwards: two breve symbols, like in
F493, side by side, a horizontal line
immediately over these, and finally a
stress mark like in F498 symmetrically
positioned over this line]
|
Reserved
space after this range: F49F to F4A5 (7 code
points).
Top
of document
MUFI
subrange 8: Small capitals (prev. subrange
3)
Small capitals
have the same form as majuscules (capital letters),
but are usually drawn with the same height as a
minuscule (small) letter such as "x". In Medieval
Nordic manuscripts, small capitals were used to
denote geminates, i.e. long consonants, or they
were used ornamentally. The letters "B", "D", "G",
"M", "N", "R", "S" and "T" were most frequently
used as geminates, while these and other letters
might also be used as ornaments in the whole or in
parts of highlighted words. Some of the small
capitals, e.g. "O" and "C", are difficult to
distinguish from minuscule letters.
Unicode
4.0 has defined nine small capitals in the
IPA
Extensions
range, sc. "B", "G", "H", "I", "L", "N", "",
"R" and "Y", and another 14 small capitals for the
Uralic Phonetic Alphabet in the Phonetic
Extensions
range , "A", "C", "D", "ETH", "E", "J", "K", "M",
"O", "P", "T", "U", "V", "W" and "Z". Thus, only a
handful of small capitals remain. Of these, only
small capital "S" and "F" can appear as geminates.
The rest, i.e. "Q", "THORN" and "X" can only appear
as small capitals in ornamental
usage.
|
Glyph
|
Cat.
|
Entity
|
MUFI
|
Descriptive
name
|
|

|
1
|
&fscap;
|
F4A6
|
LATIN
LETTER SMALL CAPITAL F
|
|

|
4
|
&qscap;
|
F4A7
|
LATIN
LETTER SMALL CAPITAL Q
|
|

|
1
|
&sscap;
|
F4A8
|
LATIN
LETTER SMALL CAPITAL S
|
|

|
4
|
&thscap;
|
F4A9
|
LATIN
LETTER SMALL CAPITAL THORN
|
|

|
4
|
&xscap;
|
F4AA
|
LATIN
LETTER SMALL CAPITAL X
|
Reserved
space after this range: F4AB to F4AF (5 code
points).
Top
of document
MUFI
subrange 9: Enlarged minuscules (prev. subrange
4)
Enlarged
minuscules are recognized as separate characters by
some scholars, cf. e.g. Andrea de Leeuw van Weenen
(A Grammar of Möðruvallabók,
CNWS 85, Leiden 2000). The traditional view has
been to interpret these characters as variants of
majuscules and encode them as such. It can be
argued that this is a functional rather than a
graphemic point of view and that it obscures the
obvious distinction between e.g. "A" (the
majuscule) and "a"
(the enlarged minuscule).
|
Glyph
|
Cat.
|
Entity
|
MUFI
|
Descriptive
name
|
|

|
3
|
&aenl;
|
F4B0
|
LATIN
ENLARGED LETTER SMALL A
|
|

|
3
|
&benl;
|
F4B1
|
LATIN
ENLARGED LETTER SMALL B
|
|

|
4
|
&cenl;
|
F4B2
|
LATIN
ENLARGED LETTER SMALL C
|
|

|
3
|
&denl;
|
F4B3
|
LATIN
ENLARGED LETTER SMALL D
|
|

|
3
|
&duncenl;
|
F4B4
|
LATIN
ENLARGED LETTER UNCIAL D
|
|

|
4
|
ðenl;
|
F4B5
|
LATIN
ENLARGED LETTER SMALL ETH
|
|

|
3
|
&eenl;
|
F4B6
|
LATIN
ENLARGED LETTER SMALL E
|
|

|
3
|
&fenl;
|
F4B7
|
LATIN
ENLARGED LETTER SMALL F
|
|

|
3
|
&genl;
|
F4B8
|
LATIN
ENLARGED LETTER SMALL G
|
|

|
3
|
&henl;
|
F4B9
|
LATIN
ENLARGED LETTER SMALL H
|
|

|
3
|
&ienl;
|
F4BA
|
LATIN
ENLARGED LETTER SMALL I
|
|

|
3
|
&jenl;
|
F4BB
|
LATIN
ENLARGED LETTER SMALL J
|
|

|
3
|
&kenl;
|
F4BC
|
LATIN
ENLARGED LETTER SMALL K
|
|

|
4
|
&lenl;
|
F4BD
|
LATIN
ENLARGED LETTER SMALL L
|
|

|
3
|
&menl;
|
F4BE
|
LATIN
ENLARGED LETTER SMALL M
|
|

|
3
|
&nenl;
|
F4BF
|
LATIN
ENLARGED LETTER SMALL N
|
|

|
4
|
&oenl;
|
F4B0
|
LATIN
ENLARGED LETTER SMALL O
|
|

|
3
|
&penl;
|
F4C1
|
LATIN
ENLARGED LETTER SMALL P
|
|

|
3
|
&qenl;
|
F4C2
|
LATIN
ENLARGED LETTER SMALL Q
|
|

|
3
|
&renl;
|
F4C3
|
LATIN
ENLARGED LETTER SMALL R
|
|

|
4
|
&senl;
|
F4C4
|
LATIN
ENLARGED LETTER SMALL S
|
|

|
3
|
&tenl;
|
F4C5
|
LATIN
ENLARGED LETTER SMALL T
|
|

|
3
|
&thenl;
|
F4C6
|
LATIN
ENLARGED LETTER SMALL THORN
|
|

|
4
|
&uenl;
|
F4C7
|
LATIN
ENLARGED LETTER SMALL U
|
|

|
4
|
&venl;
|
F4C8
|
LATIN
ENLARGED LETTER SMALL V
|
|

|
4
|
&wenl;
|
F4C9
|
LATIN
ENLARGED LETTER SMALL W
|
|

|
4
|
&xenl;
|
F4CA
|
LATIN
ENLARGED LETTER SMALL X
|
|

|
3
|
¥l;
|
F4CB
|
LATIN
ENLARGED LETTER SMALL Y
|
|

|
4
|
&zenl;
|
F4CC
|
LATIN
ENLARGED LETTER SMALL Z
|
Reserved
space after this range: F4CD to F4CF (3 code
points).
Top
of document
MUFI
subrange 10: Ligatures (prev. subrange
5)
Ligatures are two
base line characters which are joined so that they
form a new, composite base line character. Some
consist of two identical characters, e.g. "a+a",
others of different characters, e.g. "a+v". In
Medieval Nordic manuscripts, ligatures may be used
to denote length, "a+a", diphthong, "a+v", or a
distinct vowel quality, often mutation (Umlaut),
"a+v". Only ligatures which reflect a distinct
phonological value should be recognised as
characters of their own. - Finally, the broken
character "l" representing "ll" should be seen as a
ligature of two stems, broken in the
middle.
|
Glyph
|
Cat.
|
Entity
|
MUFI
|
Descriptive
name
|
|

|
1
|
&AAlig;
|
F4D0
|
LATIN
CAPITAL LIGATURE AA
|
|

|
1
|
&aalig;
|
F4D1
|
LATIN
SMALL LIGATURE AA
|
|

|
1
|
&AOlig;
|
F4D2
|
LATIN
CAPITAL LIGATURE AO
|
|

|
1
|
&aolig;
|
F4D3
|
LATIN
SMALL LIGATURE AO
|
|

|
1
|
&AUlig;
|
F4D4
|
LATIN
CAPITAL LIGATURE AU
|
|

|
1
|
&aulig;
|
F4D5
|
LATIN
SMALL LIGATURE AU
|
|

|
1
|
&AVlig;
|
F4D6
|
LATIN
CAPITAL LIGATURE AV
|
|

|
1
|
&avlig;
|
F4D7
|
LATIN
SMALL LIGATURE AV
|
|

|
4
|
&AVligslash;
|
F4D8
|
LATIN
CAPITAL LIGATURE AV WITH
STROKE
|
|

|
2
|
&avligslash;
|
F4D9
|
LATIN
SMALL LIGATURE AV WITH
STROKE
|
|

|
1
|
&AYlig;
|
F4DA
|
LATIN
CAPITAL LIGATURE AY
|
|

|
1
|
&aylig;
|
F4DB
|
LATIN
SMALL LIGATURE AY
|
|

|
3
|
&OOlig;
|
F4DC
|
LATIN
CAPITAL LIGATURE OO
|
|

|
3
|
&oolig;
|
F4DD
|
LATIN
SMALL LIGATURE OO
|
|

|
3
|
&Olll;
|
F4DE
|
LATIN
CAPITAL LIGATURE REDUCED AO
|
|

|
3
|
&olll;
|
F4DF
|
LATIN
SMALL LIGATURE REDUCED AO
|
|

|
3
|
&Ourl;
|
F4EO
|
LATIN
CAPITAL LIGATURE REDUCED OE
|
|

|
3
|
&ourl;
|
F4E1
|
LATIN
SMALL LIGATURE REDUCED OE
|
|

|
4
|
&YYlig;
|
F4E2
|
LATIN
CAPITAL LIGATURE YY
|
|

|
4
|
&yylig;
|
F4E3
|
LATIN
SMALL LIGATURE YY
|
|

|
1
|
&lbrk;
|
F4E4
|
LATIN
SMALL LETTER BROKEN L
|
Reserved
space after this range: F4E5 to F4EF (11 code
points).
Top
of document
Subrange
11: Combining superscript characters (prev.
subrange 10)
This range
includes superscript characters, typically placed
above another base line character. They are found
in many early German printed texts, and in a large
number of Medieval manuscripts. The position
immediately above a base line character
distinguishes them from raised interlinear
characters typically occupying a position
immediately after another base line
character. This latter type includes a handful of
phonetic modifiers such as a raised "w" indicating
rounding, a raised "h" indicating aspiration
etc.
Unicode
4.0 has a selection of 13 superscript
characters, namely "a", "e", "i", "o", "u", "c",
"d", "h", "m", "r", "t", "v", "x". They are located
at the end of the range Combining
diacritical marks,
0363-036F.
The characters in
the list below are documented in Andrea de Leeuw
van Weenen, A Grammar of
Möðruvallabók (CNWS 85), Leiden
2000.
|
Glyph
|
Cat.
|
Entity
|
MUFI
|
Descriptive
name
|
|

|
1
|
æsup;
|
F4F0
|
COMBINING
LATIN SMALL LETTER AE
|
|

|
1
|
&bsup;
|
F4F1
|
COMBINING
LATIN SMALL LETTER B
|
|

|
1
|
&bscapsup;
|
F4F2
|
COMBINING
LATIN LETTER SMALL CAPITAL
B
|
|

|
1
|
&dhsup;
|
F4F3
|
COMBINING
LATIN SMALL LETTER ETH
|
|

|
1
|
&dscapsup;
|
F4F4
|
COMBINING
LATIN LETTER SMALL CAPITAL
D
|
|

|
1
|
&fsup;
|
F4F5
|
COMBINING
LATIN SMALL LETTER F
|
|

|
1
|
&gsup;
|
F4F6
|
COMBINING
LATIN SMALL LETTER G
|
|

|
1
|
&gscapsup;
|
F4F7
|
COMBINING
LATIN LETTER SMALL CAPITAL
G
|
|

|
1
|
&ksup;
|
F4F8
|
COMBINING
LATIN SMALL LETTER K
|
|

|
1
|
&kscapsup;
|
F4F9
|
COMBINING
LATIN LETTER SMALL CAPITAL
K
|
|

|
1
|
&lsup;
|
F4FA
|
COMBINING
LATIN SMALL LETTER L
|
|

|
1
|
&lscapsup;
|
F4FB
|
COMBINING
LATIN LETTER SMALL CAPITAL
L
|
|

|
1
|
&mscapsup;
|
F4FC
|
COMBINING
LATIN LETTER SMALL CAPITAL
M
|
|

|
1
|
⊅
|
F4FD
|
COMBINING
LATIN SMALL LETTER N
|
|

|
1
|
&nscapsup;
|
F4FE
|
COMBINING
LATIN LETTER SMALL CAPITAL
N
|
|

|
1
|
&psup;
|
F4FF
|
COMBINING
LATIN SMALL LETTER P
|
|

|
1
|
&rscapsup;
|
F500
|
COMBINING
LATIN LETTER SMALL CAPITAL
R
|
|

|
1
|
&ssup;
|
F501
|
COMBINING
LATIN SMALL LETTER S
|
|

|
1
|
&stallsup;
|
F502
|
COMBINING
LATIN LETTER TALL S
|
|

|
1
|
&tscapsup;
|
F503
|
COMBINING
LATIN LETTER SMALL CAPITAL
T
|
|

|
1
|
&ysup;
|
F504
|
COMBINING
LATIN SMALL LETTER Y
|
|

|
1
|
&zsup;
|
F505
|
COMBINING
LATIN SMALL LETTER Z
|
Reserved
space after this range: F506 to F50F (10 code
points).
Top
of document
MUFI
subrange 12: Combining abbreviation and diacritical
marks (prev. subrange 8)
This range
includes those abbreviation signs which typically
occupy a position above, through or below another
base line character. Combining diacritical marks
are also included in this range.
|
Glyph
|
Cat.
|
Entity
|
MUFI
|
Junicode
|
Descriptive
name
|
|

|
5
|
&bar;
|
0305
|
|
COMBINING
ABBREVIATION MARK BAR ABOVE
|
|

|
2
|
&arbar;
|
F510
|
|
COMBINING
ABBREVIATION MARK BAR ABOVE WITH
DOT
|
|

|
5
|
&baracr;
|
0336
|
|
COMBINING
ABBREVIATION MARK BAR ACROSS
|
|

|
5
|
&barbl;
|
0332
|
|
COMBINING
ABBREVIATION MARK BAR BELOW
|
|

|
1
|
&er;
|
F511
|
F152
|
COMBINING
ABBREVIATION MARK SUPRALINEAR "ER"
(ZIG-ZAG SIGN)
|
|

|
1
|
&ra;
|
F512
|
F157
|
COMBINING
ABBREVIATION MARK SUPRALINEAR "RA" (OMEGA
SIGN)
|
|

|
2
|
&rabar;
|
F513
|
|
COMBINING
ABBREVIATION MARK SUPRALINEAR "RA" (OMEGA
SIGN) WITH BAR ABOVE
|
|

|
1
|
&ur2;
|
F514
|
F153
|
COMBINING
ABBREVIATION MARK SUPRALINEAR "UR"
(2-SIGN)
|
|

|
1
|
&ur8;
|
F515
|
|
COMBINING
ABBREVIATION MARK SUPRALINEAR "UR"
(8-SIGN)
|
|

|
3
|
&ur8open;
|
F516
|
|
COMBINING
ABBREVIATION MARK SUPRALINEAR "UR" (OPEN
8-SIGN)
|
|

|
1
|
&us;
|
F517
|
F151
|
COMBINING
ABBREVIATION MARK SUPRALINEAR "US"
(9-SIGN)
|
|

|
4
|
|
F518
|
|
COMBINING
ABBREVIATION MARK SUPRALINEAR "US"
(9-SIGN)
|
|

|
1
|
&combcurl;
|
F519
|
|
COMBINING
CURL
|
|

|
4
|
|
F51A
|
F163
|
COMBINING
CIRCUMFLEX OVER TWO
CHARACTERS
|
|

|
1
|
|
F51A
|
|
COMBINING
FLOURISH
|
|
|
|
The
combining flourish looks like a large
ogonek and is attached to the end of a
character, typically "d", to indicate a
suspended ending. Cf. P.L. Hjorth, ed.,
Karl Magnus' Krønike,
Copenhagen 1960, p. XXXI.
|
Reserved
space after this range: F51B to F528 (13 code
points).
Top
of document
MUFI
subrange 13: Variant letter forms
This range (if
accepted) will contain a number of commonly
recognised variant letter forms. A handful of the
Junicode PUA characters have been included here in
addition to some of the Medieval Nordic ones. It is
likely that many of the letter forms in the
Portuguese proposal will fit into this range as
well.
|
Glyph
|
Cat.
|
Entity
|
MUFI
|
Junicode
|
Descriptive
name
|
|

|
3
|
&Ainsqu;
|
F528
|
F13A
|
LATIN
CAPITAL LETTER INSULAR A SQUARE
FORM
|
|

|
3
|
&Cinssqu;
|
F529
|
F106
|
LATIN
CAPITAL LETTER INSULAR C SQUARE
FORM
|
|

|
3
|
&Ginssqu;
|
F52A
|
F10E
|
LATIN
CAPITAL LETTER INSULAR G SQUARE
FORM
|
|

|
3
|
&sins;
|
F52B
|
F126
|
LATIN
SMALL LETTER INSULAR S
|
|
|
|
|
|
|
|
|

|
3
|
&ains;
|
F52C
|
|
LATIN
SMALL LETTER INSULAR A
|
|
|
|
This is
the single-storey "a" of the Insular
script, similar in shape to modern italic
"a".
|
|

|
3
|
&ainsenl;
|
F52D
|
|
LATIN
ENLARGED SMALL LETTER INSULAR
A
|
|
|
|
This is
the enlarged variant av the single-storey
"a", triangular in shape. Cf. D.A. Seip,
Palæografi: Norge og Island
(Nordisk kultur 23:B), Oslo etc. 1954, p.
27.
|
|

|
3
|
&aopen;
|
F52E
|
|
LATIN
SMALL LETTER A OPEN FORM
|
|
|
|
This is
the traditional "open a" from early
Carolingian script, similar in shape to
"cc".
|
|

|
3
|
&aclose;
|
F52F
|
|
LATIN
SMALL LETTER A CLOSE FORM
|
|
|
|
Cf. D.A.
Seip, Palæografi: Norge og
Island (Nordisk kultur 23:B), Oslo
etc. 1954, p. 70.
|
|

|
3
|
|
F530
|
|
LATIN
SMALL LIGATURE AE WITH HOOK
|
|
|
|
Cf.
Johs. Brøndum-Nielsen, ed.,
Palæografi: Danmark og
Sverige (Nordisk kultur 23:A),
Stockholm etc. [1943], p.
102.
|
|

|
3
|
&finsclose;
|
F531
|
|
LATIN
SMALL LETTER INSULAR F CLOSED
FORM
|
|
|
|
|
|
|
|
Reserved
space after this range: F532 to F53F (14 code
points).
Top
of document
MUFI
subrange 14: Precomposed characters with acute
accent (prev. part of subrange 2)
Unicode
4.0 includes acute over the vowels small and
capital "a", "e", "i", "o", "u", "y", "æ",
and "ø", and also over the consonants (small
and capital form) "c", "g", "k", "l", "m", "n",
"p", "r", "s", "w", and "z".
The list below
contains additional character
combinations.
|
Glyph
|
Cat.
|
Entity
|
MUFI
|
Titus
|
Junicode
|
Descriptive
name
|
|

|
2
|
&AAligacute;;
= &AAlig; + &combacute;
|
F540
= F4D0
+ 0301
|
|
|
LATIN
CAPITAL LIGATURE AA WITH
ACUTE
= LATIN
CAPITAL LIGATURE
AA
+ COMBINING ACUTE ACCENT
|
|

|
2
|
&aaligacute;;
= &aalig; + &combacute;
|
F541
= F4D1
+ 0301
|
|
|
LATIN
SMALL LIGATURE AA WITH
ACUTE
= LATIN
SMALL LIGATURE
AA
+ COMBINING ACUTE ACCENT
|
|

|
2
|
&AOligacute;;
= &AOlig; + &combacute;
|
F542
= F4D2
+ 030
|
|
|
LATIN
CAPITAL LIGATURE AO WITH
ACUTE
= LATIN
CAPITAL LIGATURE
AO
+ COMBINING ACUTE ACCENT
|
|

|
2
|
&aoligacute;;
= &aolig; + &combacute;
|
F543
= F4D3
+ 0301
|
|
|
LATIN
SMALL LIGATURE AO WITH
ACUTE
= LATIN
SMALL LIGATURE
AO
+ COMBINING ACUTE ACCENT
|
|

|
2
|
&AUligacute;;
= &AUlig; + &combacute;
|
F544
= F4D4
+ 0301
|
|
|
LATIN
CAPITAL LIGATURE AU WITH
ACUTE
= LATIN
CAPITAL LIGATURE
AU
+ COMBINING ACUTE ACCENT
|
|

|
2
|
&auligacute;;
= &aulig; + &combacute;
|
F545
= F4D5
+ 0301
|
|
|
LATIN
SMALL LIGATURE AU WITH
ACUTE
= LATIN
SMALL LIGATURE
AU
+ COMBINING ACUTE ACCENT
|
|

|
2
|
&AVligacute;;
= &AVlig; + &combacute;
|
F546
= F4D6
+ 0301
|
|
|
LATIN
CAPITAL LIGATURE AV WITH
ACUTE
= LATIN
CAPITAL LIGATURE
AV
+ COMBINING ACUTE ACCENT
|
|

|
2
|
&avligacute;;
= &avlig; + &combacute;
|
F547
= F4D7
+ 0301
|
|
|
LATIN
SMALL LIGATURE AV WITH
ACUTE
= LATIN
SMALL LIGATURE
AV
+ COMBINING ACUTE ACCENT
|
|

|
2
|
&Eogonacute;
= Ę + &combacute;
= E + &combogon; +
&combacute;
|
E099
= 0118 + 0301
= 0035 + 0328 + 0301
|
E099
|
|
LATIN
CAPITAL LETTER E WITH OGONEK AND
ACUTE
= LATIN CAPITAL LETTER E WITH OGONEK +
COMBINING ACUTE ACCENT
= LATIN CAPITAL LETTER E + COMBINING
OGONEK + COMBINING ACUTE ACCENT
|
|

|
2
|
&eogonacute;
= ę + &combacute;
= e + &combogon; +
&combacute;
|
E499
= 0119 + 0301
= 0065 + 0328 + 0301
|
E499
|
|
LATIN
SMALL LETTER E WITH OGONEK AND
ACUTE
= LATIN SMALL LETTER E WITH OGONEK +
COMBINING ACUTE ACCENT
= LATIN SMALL LETTER E + COMBINING OGONEK
+ COMBINING ACUTE ACCENT
|
|

|
2
|
&Jacute;
= J + &combacute;
|
E153
= 004A + 0301
|
E153
|
|
LATIN
CAPITAL LETTER J WITH
ACUTE
= LATIN CAPITAL LETTER J + COMBINING ACUTE
ACCENT
|
|

|
2
|
&jacute;
= j + &combacute;
|
E553
= 006A + 0301
|
E553
|
|
LATIN
SMALL LETTER J WITH
ACUTE
= LATIN SMALL LETTER J + COMBINING ACUTE
ACCENT
|
|

|
2
|
&Oogonacute;
= &Oogon; + &combacute;
= O + &combogon; +
&combacute;
|
E20C
= 01EA + 0301
= 004F + 0328 + 0301
|
E20C
|
F190
|
LATIN
CAPITAL LETTER O WITH OGONEK AND
ACUTE
= LATIN CAPITAL LETTER O WITH OGONEK +
COMBINING ACUTE ACCENT
= LATIN CAPITAL LETTER O + COMBINING
OGONEK + COMBINING ACUTE ACCENT
|
|

|
2
|
&oogonacute;
= &oogon; + &combacute;
= o + &combogon; +
&combacute;
|
E60C
= 01EB + 0301
= 006F + 0328 + 0301
|
E60C
|
F191
|
LATIN
SMALL LETTER O WITH OGONEK AND
ACUTE
= LATIN SMALL LETTER O WITH OGONEK +
COMBINING ACUTE ACCENT
= LATIN SMALL LETTER O + COMBINING OGONEK
+ COMBINING ACUTE ACCENT
|
|

|
2
|
Øogonacute;
= Øogon; + &combacute;
= Ø + &combogon; +
&combacute;
|
F548
= 01FE + 0328
= 00D8 + 0328 + 0301
|
|
|
LATIN
CAPITAL LETTER O WITH STROKE AND OGONEK
AND ACUTE
= LATIN
CAPITAL LETTER O WITH STROKE AND
OGONEK
+ COMBINING OGONEK
= LATIN CAPITAL LETTER O WITH STROKE +
COMBINING OGONEK + COMBINING ACUTE
ACCENT
|
|

|
2
|
øogonacute;
= øacute; + ˛
= ø + &combogon; +
&combacute;
|
F549
= 01FF + 0328
= 00F8 + 0328 + 0301
|
|
|
LATIN
SMALL LETTER O WITH STROKE AND OGONEK AND
ACUTE
= LATIN
SMALL LETTER O WITH STROKE AND
OGONEK
+ COMBINING OGONEK
= LATIN SMALL LETTER O WITH STROKE +
COMBINING OGONEK + COMBINING ACUTE
ACCENT
|
|

|
2
|
&OOligacute;;
= &OOlig; + &combacute;
|
F54A
= F4DA
+ 0301
|
|
|
LATIN
CAPITAL LIGATURE OO WITH
ACUTE
= LATIN
CAPITAL LIGATURE
OO
+ COMBINING ACUTE ACCENT
|
|

|
2
|
&ooligacute;;
= &oolig; + &combacute;
|
F54B
= F4DB
+ 0301
|
|
|
LATIN
SMALL LIGATURE OO WITH
ACUTE
= LATIN
SMALL LIGATURE
OO
+ COMBINING ACUTE ACCENT
|
|

|
2
|
&Vacute;
= V + &combacute;
|
E33A
= 0056 + 0301
|
E33A
|
|
LATIN
CAPITAL LETTER V WITH
ACUTE
= LATIN CAPITAL LETTER V + COMBINING ACUTE
ACCENT
|
|

|
2
|
&vacute;
= v + &combacute;
|
E73A
= 0076 + 0301
|
E73A
|
|
LATIN
SMALL LETTER V WITH
ACUTE
= LATIN SMALL LETTER V + COMBINING ACUTE
ACCENT
|
Reserved
space after this range: F54C to F54F (4 code
points).
Top
of document
MUFI
subrange 15: Precomposed characters with double
acute accent (prev. part of subrange
2)
Unicode
4.0 includes double acute over the small and
capital "o" and "u". The
list below contains additional character
combinations.
|
Glyph
|
Cat.
|
Entity
|
MUFI
|
Titus
|
Descriptive
name
|
|

|
2
|
&Adacute;
= A + &combdblac;
|
E025
= 0041 + 030B
|
E025
|
LATIN
CAPITAL LETTER A WITH DOUBLE
ACUTE
= LATIN CAPITAL LETTER A + COMBINING
DOUBLE ACUTE ACCENT
|
|

|
2
|
&adblac;
= a + &combdblac;
|
E425
= 0061 + 030B
|
E425
|
LATIN
SMALL LETTER A WITH DOUBLE
ACUTE
= LATIN SMALL LETTER A + COMBINING DOUBLE
ACUTE ACCENT
|
|

|
2
|
&AAligdblac;
= &AAlig; + &combdblac;
|
F550
= F4D0
+ 030B
|
|
LATIN
CAPITAL LIGATURE AA WITH DOUBLE
ACUTE
= LATIN
CAPITAL LIGATURE
AA
+ COMBINING DOUBLE ACUTE ACCENT
|
|

|
2
|
&aaligdblac;
= &aalig; + &combdblac;
|
F551
= F4D1
+ 030B
|
|
LATIN
SMALL LIGATURE AA WITH DOUBLE
ACUTE
= LATIN
SMALL LIGATURE
AA
+ COMBINING DOUBLE ACUTE ACCENT
|
|

|
2
|
Ædblac;
= Æ + &combdblac;
|
F552
= 00C6 + 030B
|
|
LATIN
CAPITAL LETTER AE WITH DOUBLE
ACUTE
= LATIN CAPITAL LETTER AE + COMBINING
DOUBLE ACUTE ACCENT
|
|

|
2
|
ædblac;
= æ + &combdblac;
|
F553
= 00E6 + 030B
|
|
LATIN
SMALL LETTER AE WITH DOUBLE
ACUTE
= LATIN SMALL LETTER AE + COMBINING DOUBLE
ACUTE ACCENT
|
|

|
2
|
&Edblac;
= E + &combdblac;
|
E0D1
= 0035 + 030B
|
E0D1
|
LATIN
CAPITAL LETTER E WITH DOUBLE
ACUTE
= LATIN CAPITAL LETTER E + COMBINING
DOUBLE ACUTE ACCENT
|
|

|
2
|
&edblac;
= e + &combdblac;
|
E4D1
= 0065 + 030B
|
E4D1
|
LATIN
SMALL LETTER E WITH DOUBLE
ACUTE
= LATIN SMALL LETTER E + COMBINING DOUBLE
ACUTE ACCENT
|
|

|
2
|
&Idblac;
= I + &combdblac;
|
E143
= 0049 + 030B
|
E143
|
LATIN
CAPITAL LETTER I WITH DOUBLE
ACUTE
= LATIN CAPITAL LETTER I + COMBINING
DOUBLE ACUTE ACCENT
|
|

|
2
|
&idblac;
= i + &combdblac;
|
E153
= 0069 + 030B
|
E153
|
LATIN
SMALL LETTER I WITH DOUBLE
ACUTE
= LATIN SMALL LETTER I + COMBINING DOUBLE
ACUTE ACCENT
|
|

|
2
|
&Jdblac;
= J + &combdblac;
|
F554
= 004A + 030B
|
|
LATIN
CAPITAL LETTER J WITH DOUBLE
ACUTE
= LATIN CAPITAL LETTER J + COMBINING
DOUBLE ACUTE ACCENT
|
|

|
2
|
&jdblac;
= j + &combdblac;
|
F555
= 006A + 030B
|
|
LATIN
SMALL LETTER J WITH DOUBLE
ACUTE
= LATIN SMALL LETTER J + COMBINING DOUBLE
ACUTE ACCENT
|
|

|
2
|
&OOligdblac;
= &OOlig; + &combdblac;
|
F556
= F4DA
+ 030B
|
|
LATIN
CAPITAL LIGATURE OO WITH DOUBLE
ACUTE
= LATIN
CAPITAL LIGATURE
OO
+ COMBINING DOUBLE ACUTE ACCENT
|
|

|
2
|
&ooligdblac;
= &oolig; + &combdblac;
|
F557
= F4DB
+ 030B
|
|
LATIN
SMALL LIGATURE OO WITH DOUBLE
ACUTE
= LATIN
SMALL LIGATURE
OO
+ COMBINING DOUBLE ACUTE ACCENT
|
|

|
2
|
&Vdblac;
= V + &combdblac;
|
F558
= 0056 + 030B
|
|
LATIN
CAPITAL LETTER V WITH DOUBLE
ACUTE
= LATIN CAPITAL LETTER V + COMBINING
DOUBLE ACUTE ACCENT
|
|

|
2
|
&vdblac;
= v + &combdblac;
|
F559
= 0076 + 030B
|
|
LATIN
SMALL LETTER V WITH DOUBLE
ACUTE
= LATIN SMALL LETTER V + COMBINING DOUBLE
ACUTE ACCENT
|
|

|
2
|
&Ydblac;
= Y + &combdblac;
|
E37C
= 0059 + 030B
|
E37C
|
LATIN
CAPITAL LETTER Y WITH DOUBLE
ACUTE
= LATIN CAPITAL LETTER Y + COMBINING
DOUBLE ACUTE ACCENT
|
|

|
2
|
&ydblac;
= y + &combdblac;
|
E77C
= 0079 + 030B
|
E77C
|
LATIN
SMALL LETTER Y WITH DOUBLE
ACUTE
= LATIN SMALL LETTER Y + COMBINING DOUBLE
ACUTE ACCENT
|
|

|
2
|
&Ydotacute;
= Y + &combdot; +
&combacute;
|
F55A
= 0059 + 030B + 0301
|
|
LATIN
CAPITAL LETTER Y WITH DOT ABOVE AND
ACUTE
= LATIN CAPITAL LETTER Y + COMBINING DOT
ABOVE + COMBINING DOUBLE ACUTE
ACCENT
|
|

|
2
|
&ydotacute;
= y + &combdot; +
&combacute;
|
F55B
= 0079 + 030B + 0301
|
|
LATIN
SMALL LETTER Y WITH DOT ABOVE AND
ACUTE
= LATIN SMALL LETTER Y + COMBINING DOT
ABOVE + COMBINING DOUBLE ACUTE
ACCENT
|
Reserved
space after this range: F55C to F55F (4 code
points).
Top
of document
MUFI
subrange 16: Precomposed characters with dot above
(prev. part of subrange 2)
Unicode
4.0 includes approx. 40 characters with a dot
above in three ranges, Latin
Extended-A,
Latin
Extended-B
and Latin
Extended
Additional,
intended for use in several languages, mostly Irish
Gaelic (old orthography), and - added in 3.2 - also
for Livonian. The precomposed characters are small
and capital forms of "a", "b", "c", "d", "e", "f",
"g", "h", "m", "n", "o", "p", "r", "s", "t", "w",
"x", "y", "z", and tall "s" (no capital
version). The
list below contains additional character
combinations.
|
Glyph
|
Cat.
|
Entity
|
MUFI
|
Titus
|
Descriptive
name
|
|

|
2
|
&AAligdot;
= &AAlig; + &combdot;
|
F560
= F4D0
+ 0307
|
|
LATIN
CAPITAL LIGATURE AA WITH DOT
ABOVE
= LATIN
CAPITAL LIGATURE
AA
+ COMBINING DOT ABOVE
|
|

|
2
|
&aaligdot;
= &aalig; + &combdot;
|
F561
= F4D1
+ 0307
|
|
LATIN
SMALL LIGATURE AA WITH DOT
ABOVE
= LATIN
SMALL LIGATURE
AA
+ COMBINING DOT ABOVE
|
|

|
2
|
&AYligdot;
= &AYlig; + &combdot;
|
F562
= F4D8
+ 0307
|
|
LATIN
CAPITAL LIGATURE AY WITH DOT
ABOVE
= LATIN
CAPITAL LIGATURE
AY
+ COMBINING DOT ABOVE
|
|

|
2
|
&ayligdot;
= &aylig; + &combdot;
|
F563
= F4D9
+ 0307
|
|
LATIN
SMALL LIGATURE AY WITH DOT
ABOVE
= LATIN
SMALL LIGATURE
AY
+ COMBINING DOT ABOVE
|
|

|
2
|
&gscapdot;
= &gscap; + &combdot;
|
F564
= 0262 + 0307
|
|
LATIN
LETTER SMALL CAPITAL G WITH DOT
ABOVE
= LATIN LETTER SMALL CAPITAL G + COMBINING
DOT ABOVE
|
|

|
2
|
&kdot;
= k + &combdot;
|
F565
= 006B + 0307
|
|
LATIN
SMALL LETTER K WITH DOT
ABOVE
=
= LATIN SMALL LETTER K + COMBINING DOT
ABOVE
|
|

|
2
|
&nscapdot;
= &nscap; + &combdot;
|
F566
= 0274 + 0307
|
|
LATIN
LETTER SMALL CAPITAL N WITH DOT
ABOVE
= LATIN LETTER SMALL CAPITAL N + COMBINING
DOT ABOVE
|
|

|
2
|
&rscapdot;
= &rscap; + &combdot;
|
F567
= 0280 + 0307
|
|
LATIN
LETTER SMALL CAPITAL R WITH DOT
ABOVE
= LATIN LETTER SMALL CAPITAL R + COMBINING
DOT ABOVE
|
|

|
2
|
&sscapdot;
= &sscap; + &combdot;
|
F568
= F4A5
+ 0307
|
|
LATIN
LETTER SMALL CAPITAL S WITH DOT
ABOVE
= LATIN
LETTER SMALL CAPITAL
S
+ COMBINING DOT ABOVE
|
|

|
2
|
&tscapdot;
= &tscap; + &combdot;
|
F569
= 1D1B + 0307
|
|
LATIN
LETTER SMALL CAPITAL T WITH DOT
ABOVE
= LATIN LETTER SMALL CAPITAL T + COMBINING
DOT ABOVE
|
|

|
2
|
&Vdot;
= V + &combdot;
|
F56A
= 0056 + 0307
|
|
LATIN
CAPITAL LETTER V WITH DOT
ABOVE
= LATIN CAPITAL LETTER V + COMBINING DOT
ABOVE
|
|

|
2
|
&vdot;
= v + &combdot;
|
F56B
= 0076 + 0307
|
|
LATIN
SMALL LETTER V WITH DOT
ABOVE
= LATIN SMALL LETTER V + COMBINING DOT
ABOVE
|
|

|
2
|
&Vinsdot;
= &Vins; + &combdot;
|
F56C
= F40E
+ 0307
|
|
LATIN
CAPITAL LETTER INSULAR V (VENTH) WITH DOT
ABOVE
= LATIN
CAPITAL LETTER INSULAR V
(VENTH)
+ COMBINING DOT ABOVE
|
|

|
2
|
&vinsdot;
= &vins; + &combdot;
|
F56D
= F40F
+ 0307
|
|
LATIN
SMALL LETTER INSULAR V (VENTH) WITH DOT
ABOVE
= LATIN
SMALL LETTER INSULAR V
(VENTH)
+ COMBINING DOT ABOVE
|
Reserved
space after this range: F56E to F577 (10 code
points).
Top
of document
MUFI
subrange 17: Precomposed characters with dot below
(prev. part of subrange 2)
Unicode
4.0 includes no less than 38 characters with a
dot below, basically the whole alphabet, "a-z" in
the Latin
Extended
Additional
range. The
list below contains additional character
combinations.
|
Glyph
|
Cat.
|
Entity
|
MUFI
|
Titus
|
Descriptive
name
|
|

|
2
|
&AAligdotbl;
= &AAlig; + &combdotbl;
|
F578
=
F4D0
+ 0323
|
|
LATIN
CAPITAL LIGATURE AA WITH DOT BELOW
=
LATIN
CAPITAL LIGATURE
AA
+ COMBINING DOT BELOW
|
|

|
2
|
&aaligdotbl;
= &aalig; + &combdotbl;
|
F579
=
F4D1
+ 0323
|
|
LATIN
SMALL LIGATURE AA WITH DOT BELOW
=
LATIN
SMALL LIGATURE
AA
+ COMBINING DOT BELOW
|
|

|
2
|
&AOligdotbl;
= &AOlig; + &combdotbl;
|
F57A
=
F4D2
+ 0323
|
|
LATIN
CAPITAL LIGATURE AO WITH DOT BELOW
=
LATIN
CAPITAL LIGATURE
AO
+ COMBINING DOT BELOW
|
|

|
2
|
&aoligdotbl;
= &aolig; + &combdotbl;
|
F57B
=
F4D3
+ 0323
|
|
LATIN
SMALL LIGATURE AO WITH DOT BELOW
=
LATIN
SMALL LIGATURE
AO
+ COMBINING DOT BELOW
|
|

|
2
|
&AUligdotbl;
= &AUlig; + &combdotbl;
|
F57C
=
F4D4
+ 0323
|
|
LATIN
CAPITAL LIGATURE AU WITH DOT BELOW
=
LATIN
CAPITAL LIGATURE
AU
+ COMBINING DOT BELOW
|
|

|
2
|
&auligdotbl;
= &aulig; + &combdotbl;
|
F57D
=
F4D5
+ 0323
|
|
LATIN
SMALL LIGATURE AU WITH DOT BELOW
=
LATIN
SMALL LIGATURE
AU
+ COMBINING DOT BELOW
|
|

|
2
|
&AVligdotbl;
= &AVlig; + &combdotbl;
|
F57E
=
F4D6
+ 0323
|
|
LATIN
CAPITAL LIGATURE AV WITH DOT BELOW
=
LATIN
CAPITAL LIGATURE
AV
+ COMBINING DOT BELOW
|
|

|
2
|
&avligdotbl;
= &avlig; + &combdotbl;
|
F57F
=
F4D7
+ 0323
|
|
LATIN
SMALL LIGATURE AV WITH DOT BELOW
=
LATIN
SMALL LIGATURE
AV
+ COMBINING DOT BELOW
|
|

|
2
|
&AYligdotbl;
= &AYlig; + &combdotbl;
|
F580
=
F4D8
+ 0323
|
|
LATIN
CAPITAL LIGATURE AY WITH DOT BELOW
=
LATIN
CAPITAL LIGATURE
AY
+ COMBINING DOT BELOW
|
|

|
2
|
&ayligdotbl;
= &aylig; + &combdotbl;
|
F581
=
F4D9
+ 0323
|
|
LATIN
SMALL LIGATURE AY WITH DOT BELOW
=
LATIN
SMALL LIGATURE
AY
+ COMBINING DOT BELOW
|
|

|
2
|
&bscapdotbl;
= &bscap; + &combdotbl;
|
F582
=
0299 + 0323
|
|
LATIN
LETTER SMALL CAPITAL B WITH DOT
BELOW
= LATIN LETTER SMALL CAPITAL B + COMBINING
DOT BELOW
|
|

|
2
|
&Cdotbl;
= C + &combdotbl;
|
E066
=
0043 + 0323
|
E066
|
LATIN
CAPITAL LETTER C WITH DOT BELOW
=
LATIN CAPITAL LETTER C + COMBINING DOT
BELOW
|
|

|
2
|
&cdotbl;
= c + &combdotbl;
|
E466
=
0063 + 0323
|
E466
|
LATIN
SMALL LETTER C WITH DOT BELOW
=
LATIN SMALL LETTER C + COMBINING DOT
BELOW
|
|

|
2
|
&dscapdotbl;
= &dscap; + &combdotbl;
|
F583
=
1D05 + 0323
|
|
LATIN
LETTER SMALL CAPITAL D WITH DOT
BELOW
= LATIN LETTER SMALL CAPITAL D + COMBINING
DOT BELOW
|
|

|
2
|
Ðdotbl;
= Ð + &combdotbl;
|
F584
=
00D0 + 0323
|
|
LATIN
CAPITAL LETTER ETH WITH DOT BELOW
=
LATIN CAPITAL LETTER ETH + COMBINING DOT
BELOW
|
|

|
2
|
ðdotbl;
= ð + &combdotbl;
|
F585
=
00F0 + 0323
|
|
LATIN
SMALL LETTER ETH WITH DOT BELOW
=
LATIN SMALL LETTER ETH + COMBINING DOT
BELOW
|
|

|
2
|
&Eogondotbl;
= Ę + &combdotbl;
|
F586
=
0118 + 0323
|
|
LATIN
CAPITAL LETTER E WITH OGONEK AND DOT
BELOW
=
LATIN CAPITAL LETTER E WITH OGONEK +
COMBINING DOT BELOW
|
|

|
2
|
&eogondotbl;
= ę + &combdotbl;
|
F587
=
0119 + 0323
|
|
LATIN
SMALL LETTER E WITH OGONEK AND DOT
BELOW
=
LATIN SMALL LETTER E WITH OGONEK +
COMBINING DOT BELOW
|
|

|
2
|
&Fdotbl;
= F + &combdotbl;
|
E0EE
=
0046 + 0323
|
E0EE
|
LATIN
CAPITAL LETTER F WITH DOT BELOW
=
LATIN CAPITAL LETTER F + COMBINING DOT
BELOW
|
|

|
2
|
&fdotbl;
= f + &combdotbl;
|
E4EE
=
0066 + 0323
|
E4EE
|
LATIN
SMALL LETTER F WITH DOT BELOW
=
LATIN SMALL LETTER F + COMBINING DOT
BELOW
|
|

|
2
|
&Finsdotbl;
= &Fins; + &combdotbl;
|
F588
=
F402
+ 0323
|
|
LATIN
CAPITAL LETTER INSULAR
F
+ COMBINING DOT BELOW
|
|

|
2
|
&finsdotbl;
= &fins; + &combdotbl;
|
F589
=
F403
+ 0323
|
|
LATIN
SMALL LETTER INSULAR F WITH DOT BELOW
=
LATIN
SMALL LETTER INSULAR
F
+ COMBINING DOT BELOW
|
|

|
2
|
&Gdotbl;
= G + &combdotbl;
|
E101
=
0047 + 0323
|
E101
|
LATIN
CAPITAL LETTER F WITH DOT BELOW
=
LATIN CAPITAL LETTER G + COMBINING DOT
BELOW
|
|

|
2
|
&gdotbl;
= g + &combdotbl;
|
E501
=
0067 + 0323
|
E501
|
LATIN
SMALL LETTER G WITH DOT BELOW
=
LATIN SMALL LETTER G + COMBINING DOT
BELOW
|
|

|
2
|
&gscapdotbl;
= &gscap; + &combdotbl;
|
F58A
=
0262 + 0323
|
|
LATIN
LETTER SMALL CAPITAL G WITH DOT
BELOW
= LATIN LETTER SMALL CAPITAL G + COMBINING
DOT BELOW
|
|

|
2
|
&Jdotbl;
= J + &combdotbl;
|
F58B
=
004A + 0323
|
|
LATIN
CAPITAL LETTER J WITH DOT BELOW
=
LATIN CAPITAL LETTER J + COMBINING DOT
BELOW
|
|

|
2
|
&jdotbl;
= j + &combdotbl;
|
F58C
=
006A + 0323
|
|
LATIN
SMALL LETTER J WITH DOT BELOW
=
LATIN SMALL LETTER J + COMBINING DOT
BELOW
|
|

|
2
|
&lscapdotbl;
= &lscap; + &combdotbl;
|
F58D
=
029F + 0323
|
|
LATIN
LETTER SMALL CAPITAL L WITH DOT
BELOW
= LATIN LETTER SMALL CAPITAL L + COMBINING
DOT BELOW
|
|

|
2
|
&mscapdotbl;
= &mscap; + &combdotbl;
|
F58E
=
1D0D + 0323
|
|
LATIN
LETTER SMALL CAPITAL M WITH DOT
BELOW
= LATIN LETTER SMALL CAPITAL M + COMBINING
DOT BELOW
|
|

|
2
|
&nscapdotbl;
= &nscap; + &combdotbl;
|
F58F
=
0274 + 0323
|
|
LATIN
LETTER SMALL CAPITAL N WITH DOT
BELOW
= LATIN LETTER SMALL CAPITAL N + COMBINING
DOT BELOW
|
|

|
2
|
&Oogondotbl;
= &Oogon; + &combdotbl;
|
F590
=
01EA + 0323
|
|
LATIN
CAPITAL LETTER O WITH OGONEK AND DOT
BELOW
=
LATIN CAPITAL LETTER O WITH OGONEK+
COMBINING DOT BELOW
|
|

|
2
|
&oogondotbl;
= &oogon; + &combdotbl;
|
F591
=
01EB + 0323
|
|
LATIN
SMALL LETTER O WITH OGONEK AND DOT
BELOW
=
LATIN SMALL LETTER O WITH OGONEK +
COMBINING DOT BELOW
|
|

|
2
|
&OOligdotbl;
= &OOlig; + &combdotbl;
|
F592
=
F4DA
+ 0323
|
|
LATIN
CAPITAL LIGATURE OO WITH DOT BELOW
=
LATIN
CAPITAL LIGATURE
OO
+ COMBINING DOT BELOW
|
|

|
2
|
&ooligdotbl;
= &oolig; + &combdotbl;
|
F593
=
F4DB
+ 0323
|
|
LATIN
SMALL LIGATURE OO WITH DOT BELOW
=
LATIN
SMALL LIGATURE
OO
+ COMBINING DOT BELOW
|
|

|
2
|
&Pdotbl;
= P + &combdotbl;
|
E26D
=
0050 + 0323
|
E26D
|
LATIN
CAPITAL LETTER P WITH DOT BELOW
=
LATIN CAPITAL LETTER P + COMBINING DOT
BELOW
|
|

|
2
|
&pdotbl;
= p + &combdotbl;
|
E66D
=
0070 + 0323
|
E66D
|
LATIN
SMALL LETTER P WITH DOT BELOW
=
LATIN SMALL LETTER P + COMBINING DOT
BELOW
|
|

|
2
|
&Qdotbl;
= Q + &combdotbl;
|
E288
=
0051 + 0323
|
E288
|
LATIN
CAPITAL LETTER Q WITH DOT BELOW
=
LATIN CAPITAL LETTER Q + COMBINING DOT
BELOW
|
|

|
2
|
&qdotbl;
= q + &combdotbl;
|
E688
=
0071 + 0323
|
E688
|
LATIN
SMALL LETTER P WITH DOT BELOW
=
LATIN SMALL LETTER Q + COMBINING DOT
BELOW
|
|

|
2
|
&rscapdotbl;
= &rscap; + &combdotbl;
|
F594
=
0280 + 0323
|
|
LATIN
LETTER SMALL CAPITAL R WITH DOT
BELOW
= LATIN LETTER SMALL CAPITAL R + COMBINING
DOT BELOW
|
|

|
2
|
&rrotdotbl;
= &rrot; + &combdotbl;
|
F595
=
F40A
+ 0323
|
|
LATIN
SMALL LETTER R ROTUNDA WITH DOT BELOW
=
LATIN
SMALL LETTER R
ROTUNDA
+ COMBINING DOT BELOW
|
|

|
2
|
&sscapdotbl;
= &sscap; + &combdotbl;
|
F596
=
F4A5
+ 0323
|
|
LATIN
LETTER SMALL CAPITAL S WITH DOT
BELOW
= LATIN LETTER SMALL CAPITAL S + COMBINING
DOT BELOW
|
|

|
2
|
&stalldotbl;
= &stall; + &combdotbl;
|
F597
=
017F + 0323
|
|
LATIN
SMALL LETTER TALL S WITH DOT BELOW
=
LATIN SMALL LETTER TALL S + COMBINING DOT
BELOW
|
|

|
2
|
&tscapdotbl;
= &tscap; + &combdotbl;
|
F598
=
1D1B + 0323
|
|
LATIN
LETTER SMALL CAPITAL S WITH DOT
BELOW
= LATIN LETTER SMALL CAPITAL S + COMBINING
DOT BELOW
|
|

|
2
|
Þdotbl;
= Þ + &combdotbl;
|
F599
=
00DE + 0323
|
|
LATIN
CAPITAL LETTER THORN WITH DOT BELOW
=
LATIN CAPITAL LETTER THORN + COMBINING DOT
BELOW
|
|

|
2
|
þdotbl;
= þ + &combdotbl;
|
F59A
=
00FE + 0323
|
|
LATIN
SMALL LETTER THORN WITH DOT BELOW
=
LATIN SMALL LETTER THORN + COMBINING DOT
BELOW
|
|

|
2
|
&Vinsdotbl;
= &Vins; + &combdotbl;
|
F59B
=
F40E
+ 0323
|
|
LATIN
CAPITAL LETTER INSULAR V (VENTH) WITH DOT
BELOW
=
LATIN
CAPITAL LETTER INSULAR V
(VENTH)
+ COMBINING DOT BELOW
|
|

|
2
|
&vinsdotbl;
= &vins; + &combdotbl;
|
F59C
=
F40F
+ 0323
|
|
LATIN
SMALL LETTER INSULAR V (VENTH) WITH DOT
BELOW
=
LATIN
SMALL LETTER INSULAR V
(VENTH)
+ COMBINING DOT BELOW
|
Reserved
space after this range: F59D to F5A4 (7 code
points).
Top
of document
MUFI
subrange 18: Precomposed characters with diaeresis
(prev. part of subrange 2)
Unicode
4.0 includes double dot (diaeresis) over the
small and capital characters "a", "e", "i", "o",
"u", "y" in the ranges Latin-1
Supplement
(all of these except capital "y") and
Latin
Extended-A
(capital "y"), as well as small and capital "h",
"w", "x" and "t" (the latter only small) in the
range Latin
Extended
Additional.
The list below
contains additional character
combinations.
|
Glyph
|
Cat.
|
Entity
|
MUFI
|
Titus
|
Descriptive
name
|
|

|
2
|
&AAliguml;
= &AAlig; + &combuml;
|
F5A5
= F4D0
+ 0308
|
|
LATIN
CAPITAL LIGATURE AA WITH
DIAERESIS
= LATIN
CAPITAL LIGATURE
AA
+ COMBINING DIAERESIS
|
|

|
2
|
&aaliguml;
= &aalig; + &combuml;
|
F5A6
= FED1
+ 0308
|
|
LATIN
SMALL LIGATURE AA WITH
DIAERESIS
= LATIN
SMALL LIGATURE
AA
+ COMBINING DIAERESIS
|
|

|
2
|
Æuml;
= Æ + &combuml;
|
F5A7
=
00C6 + 0308
|
|
LATIN
CAPITAL LIGATURE AE WITH
DIAERESIS
= LATIN CAPITAL LIGATURE AE + COMBINING
DIAERESIS
|
|

|
2
|
æuml;
= æ + &combuml;
|
F5A8
=
00E6 + 0308
|
|
LATIN
SMALL LIGATURE AE WITH
DIAERESIS
= LATIN
CAPITAL LIGATURE
AE
+ COMBINING DIAERESIS
|
Reserved
space after this range: F5A9 to F5AF (6 code
points).
Top
of document
MUFI
subrange 19: Precomposed characters with a hook
above (prev. part of subrange 2)
Unicode
4.0 does not include any characters with a hook
above.
|
Glyph
|
Cat.
|
Entity
|
MUFI
|
Titus
|
Descriptive
name
|
|

|
2
|
&Acurl;
= A + &combcurl;
|
F5B0
= 0041 + F518
|
|
LATIN
CAPITAL LETTER A WITH
CURL
= LATIN CAPITAL LETTER A +
COMBINING
CURL
|
|

|
2
|
&acurl;
= a + &combcurl;
|
F5B1
= 0061 + F518
|
|
LATIN
SMALL LETTER A WITH
CURL
= LATIN SMALL LETTER A +
COMBINING
CURL
|
|

|
2
|
&Ecurl;
= E + &combcurl;
|
F5B2
= 0035 + F518
|
|
LATIN
CAPITAL LETTER E WITH
CURL
= LATIN CAPITAL LETTER E +
COMBINING
CURL
|
|

|
2
|
&ecurl;
= e + &combcurl;
|
F5B3
= 0065 + F518
|
|
LATIN
SMALL LETTER E WITH
CURL
= LATIN SMALL LETTER E +
COMBINING
CURL
|
|

|
0
|
&Icurl;
= I + &combcurl;
|
1EC8
= 0049 + F518
|
|
LATIN
CAPITAL LETTER I WITH HOOK ABOVE
(CURL)
= LATIN CAPITAL LETTER I +
COMBINING
CURL
|
|

|
0
|
&icurl;
= i + &combcurl;
|
1EC9
= 0131 + F518
|
|
LATIN
SMALL LETTER I WITH HOOK ABOVE (CURL)
= LATIN SMALL LETTER DOTLESS I +
COMBINING
CURL
|
|

|
2
|
&Jcurl;
= J + &combcurl;
|
F5B4
= 0049 + F518
|
|
LATIN
CAPITAL LETTER J WITH
CURL
= LATIN CAPITAL LETTER J +
COMBINING
CURL
|
|

|
2
|
&jcurl;
= j + &combcurl;
|
F5B5
= F40B
+ F518
|
|
LATIN
SMALL LETTER J WITH
CURL
= LATIN SMALL LETTER DOTLESS J +
COMBINING
CURL
|
|

|
2
|
&Ocurl;
= O + &combcurl;
|
F5B6
= 004F + F518
|
|
LATIN
CAPITAL LETTER O WITH
CURL
= LATIN CAPITAL LETTER O +
COMBINING
CURL
|
|

|
2
|
&ocurl;
= o + &combcurl;
|
F5B7
= 006F + F518
|
|
LATIN
SMALL LETTER O WITH
CURL
= LATIN SMALL LETTER O +
COMBINING
CURL
|
|

|
2
|
Øcurl;
= Ø + &combcurl;
|
F5B8
= 004F + F518
|
|
LATIN
CAPITAL LETTER O WITH STROKE AND
CURL
= LATIN CAPITAL LETTER O WITH STROKE +
COMBINING
CURL
|
|

|
2
|
øcurl;
= ø + &combcurl;
|
F5B9
= 0048 + F518
|
|
LATIN
SMALL LETTER O WITH STROKE AND
CURL
= LATIN SMALL LETTER O WITH STROKE +
COMBINING
CURL
|
|

|
2
|
&Ucurl;
= U + &combcurl;
|
F5BA
= 0055 + F518
|
|
LATIN
CAPITAL LETTER U WITH
CURL
= LATIN CAPITAL LETTER U +
COMBINING
CURL
|
|

|
2
|
&ucurl;
= u + &combcurl;
|
F5BB
= 0075 + F518
|
|
LATIN
SMALL LETTER U WITH
CURL
= LATIN SMALL LETTER U +
COMBINING
CURL
|
|

|
2
|
&Ycurl;
= Y + &combcurl;
|
F5BC
= 0059 + F518
|
|
LATIN
CAPITAL LETTER Y WITH
CURL
= LATIN CAPITAL LETTER Y +
COMBINING
CURL
|
|

|
2
|
&ycurl;
= y + &combcurl;
|
|
|