[Egyptian] UMdC - representation of hieroglyphs not available in Unicode

Bob Richmond bobqq at live.co.uk
Wed Aug 2 11:57:57 BST 2017


Hi Simon

Indeed, TEI P5 does not appear to explicitly identify concepts such as variants or arrangement in tag structures and has no <gv></gv> or <ga></ga> tags. Some related functionality is achieved using <g></g> along with information in <char></char> definitions in a <charDecl></charDecl> element and this gives a powerful and flexible way of encoding and documenting non-Unicode 10 characters in texts. Personally I find <charDecl> type data as very useful but for basic UMdC plain text and lightweight markup purposes its overkill to require a definitions table whereas <gv> and <ga> are succinct and easy to understand.

By unencoded signs I meant those not (yet) in the Unicode standard.

As I suggested earlier you can use <g corresp="src:idIBUBjhtZ6Fghe4swalp908klPGB" ref="#H10">?</g> in TEI for a hieroglyph without a variant in Unicode to latch on to - just use a question mark or another character/string if more useful. Unlike <g corresp="src:idIBUBjhtZ6Fghe4swalp908klPGB" ref="#H10" /> this at least gives a visible indication theres content present. In my experience self-closing tags like <g/> are a bad idea for representation of text content.

Regards,
Bob

From: Simon Schweitzer<mailto:schweitzer at bbaw.de>
Sent: 02 August 2017 07:30
To: Bob Richmond<mailto:bobqq at live.co.uk>; Egyptian Hieroglyphs in the UCS<mailto:egyptian at evertype.com>
Subject: Re: [Egyptian] UMdC - representation of hieroglyphs not available in Unicode

Dear Bob,

I cannot find the <ga> and the <gv> element in the TEI P5 guidelines.
Did I miss something?

> Has anyone any opinions on use of tags for unencoded signs?
>

"unencoded signs": Are these "unencoded signs" signs we cannot identify?
Or are these "unencoded signs" signs with no representaion in Unicode?
If so, we could differentiate two cases in the TLA material for a TEI
output:
1) hieroglyphs in Unicode
<g corresp="src:idIBUBdwG9XU3NGkkQi0UjnryUQg0" ref="#G1">𓄿</g>
2) hieroglyphs not in Unicode
<g corresp="src:idIBUBjhtZ6Fghe4swalp908klPGB" ref="#H10" />
If there ist no represantation in Unicode, we cannot transform in the
mentioned way. So we would retain the element in the original form.

All the best,

Simon

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://evertype.com/pipermail/egyptian_evertype.com/attachments/20170802/c36fbc29/attachment.htm>


More information about the Egyptian mailing list