<html xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns:m="http://schemas.microsoft.com/office/2004/12/omml" xmlns="http://www.w3.org/TR/REC-html40">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=Windows-1252">
<meta name="Generator" content="Microsoft Word 15 (filtered medium)">
<style><!--
/* Font Definitions */
@font-face
{font-family:"Cambria Math";
panose-1:2 4 5 3 5 4 6 3 2 4;}
@font-face
{font-family:"Yu Gothic";
panose-1:2 11 4 0 0 0 0 0 0 0;}
@font-face
{font-family:Calibri;
panose-1:2 15 5 2 2 2 4 3 2 4;}
@font-face
{font-family:"\@Yu Gothic";
panose-1:2 11 4 0 0 0 0 0 0 0;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
{margin:0cm;
margin-bottom:.0001pt;
font-size:11.0pt;
font-family:"Calibri",sans-serif;}
p.MsoListParagraph, li.MsoListParagraph, div.MsoListParagraph
{mso-style-priority:34;
margin-top:0cm;
margin-right:0cm;
margin-bottom:0cm;
margin-left:36.0pt;
margin-bottom:.0001pt;
font-size:11.0pt;
font-family:"Calibri",sans-serif;}
.MsoChpDefault
{mso-style-type:export-only;}
@page WordSection1
{size:612.0pt 792.0pt;
margin:72.0pt 72.0pt 72.0pt 72.0pt;}
div.WordSection1
{page:WordSection1;}
/* List Definitions */
@list l0
{mso-list-id:808790078;
mso-list-type:hybrid;
mso-list-template-ids:719630890 -1 134807577 134807579 134807567 134807577 134807579 134807567 134807577 134807579;}
@list l0:level1
{mso-level-tab-stop:none;
mso-level-number-position:left;
text-indent:-18.0pt;}
@list l0:level2
{mso-level-number-format:alpha-lower;
mso-level-tab-stop:none;
mso-level-number-position:left;
text-indent:-18.0pt;}
@list l0:level3
{mso-level-number-format:roman-lower;
mso-level-tab-stop:none;
mso-level-number-position:right;
text-indent:-9.0pt;}
@list l0:level4
{mso-level-tab-stop:none;
mso-level-number-position:left;
text-indent:-18.0pt;}
@list l0:level5
{mso-level-number-format:alpha-lower;
mso-level-tab-stop:none;
mso-level-number-position:left;
text-indent:-18.0pt;}
@list l0:level6
{mso-level-number-format:roman-lower;
mso-level-tab-stop:none;
mso-level-number-position:right;
text-indent:-9.0pt;}
@list l0:level7
{mso-level-tab-stop:none;
mso-level-number-position:left;
text-indent:-18.0pt;}
@list l0:level8
{mso-level-number-format:alpha-lower;
mso-level-tab-stop:none;
mso-level-number-position:left;
text-indent:-18.0pt;}
@list l0:level9
{mso-level-number-format:roman-lower;
mso-level-tab-stop:none;
mso-level-number-position:right;
text-indent:-9.0pt;}
ol
{margin-bottom:0cm;}
ul
{margin-bottom:0cm;}
--></style>
</head>
<body lang="EN-GB">
<div class="WordSection1">
<p class="MsoNormal">Hi All</p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">1071 Hieroglyphs have been available in Unicode since version 5.2 (2009). Six formatting characters are now in the pipeline (since May). Eventually there will be more hieroglyphs and likely control characters too.</p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">The idea of defining a data file format “UMdC” acknowledging Unicode was discussed at I&E 2006 and afterwards but the lack of Unicode availability in the standard and issues of application and system support made this seem a little premature.
It seems to me the time is now ripe to revisit the topic.</p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">The basics of UMdC (as I see it) are as follows:</p>
<p class="MsoNormal"><o:p> </o:p></p>
<ol style="margin-top:0cm" start="1" type="1">
<li class="MsoListParagraph" style="margin-left:0cm;mso-list:l0 level1 lfo1">A well defined file type “umdc” containing plain text and markup (capable of being edited in simple text editors such as Windows Notepad and HTML textarea blocks).
</li><li class="MsoListParagraph" style="margin-left:0cm;mso-list:l0 level1 lfo1">Guidance on subset usage in database records.
</li><li class="MsoListParagraph" style="margin-left:0cm;mso-list:l0 level1 lfo1">Basic plain text including the 1071 + 6 for Egyptian characters (plus e.g. transliteration formats).</li><li class="MsoListParagraph" style="margin-left:0cm;mso-list:l0 level1 lfo1">Markup to deal with elements missing from Unicode such as hieroglyphs not in the 1071 set.</li><li class="MsoListParagraph" style="margin-left:0cm;mso-list:l0 level1 lfo1">Optional markup to help with preparing data for use with other formats such as HTML/CSS and Office applications.</li><li class="MsoListParagraph" style="margin-left:0cm;mso-list:l0 level1 lfo1">Optional markup to help with interoperability with MdC88 based data formats (including extensions such as JSesh).</li><li class="MsoListParagraph" style="margin-left:0cm;mso-list:l0 level1 lfo1">Specification of font requirements needed for representation of UMdC data.</li></ol>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">So long as the markup system is sufficiently flexible (e.g. use of XML-like tags) version 1 of UMdC need not be overly featured and then additions can be made as need is proven. It should be possible to create a version 1 specification
supported with basic tools in months not years.</p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">I expect I’m not the only person who has already done related work. Has anyone any points to make of what they would like to see in UMdC? Anyone like to get involved in defining the markup scheme?</p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">Thanks</p>
<p class="MsoNormal">Bob Richmond</p>
<p class="MsoNormal"><o:p> </o:p></p>
</div>
</body>
</html>