n3976 Mymr Extras
n3976 Mymr Extras
n3976 Mymr Extras
L2/11-130R
L2/12-012
2011-04-19
(replaces L2/11-130)
Introduction
This proposal is to add 28 extra characters to the Myanmar script for the Tai Laing and Shwe Palaung
languages.
The Tai Laing are a language group of about 100,000 speakers living along the Irrawaddy River in Myanmar.
The writing system is part of their history that has not completely died out and there is interest in reviving it.
While the script is not taught formally in schools, it is taught during school breaks. The Shwe Palaung have
about 200,000 speakers, primarily in Shan State, Myanmar. 20% of Shwe Palaung are literate in Shwe
Palaung with ongoing literacy development happening indigenously. The orthography has been in
development since the 1930s with the latest revision occurring in the 1980s when the proposed characters
were added.
Charts: The proposed characters are to be added to two existing Myanmar Extended blocks. This fills out
Myanmar Extended-A and adds to Myanmar Extended-B.
1
U+AA70 Myanmar Extended-A
AA7C ꩼꩼ MYANMAR SIGN TAI LAING TONE-2
AA7 AA7D ꩼꩽ MYANMAR SIGN TAI LAING TONE-5
2
3
4
5
6
7
8
9
A
B
C ꩼꩼ
D ꩼꩽ
E ꩾ
F ꩿ
2
U+A9E0 Myanmar Extended-B
Consonants
ꧧ
A9E A9F A9E7 MYANMAR LETTER TAI LAING NYA
A9E8 ꧨ MYANMAR LETTER TAI LAING FA
0 ꧰ A9E9 ꧩ MYANMAR LETTER TAI LAING GA
A9EA ꧪ MYANMAR LETTER TAI LAING GHA
1 ꧱ A9EB ꧫ MYANMAR LETTER TAI LAING JA
A9EC ꧬ MYANMAR LETTER TAI LAING JHA
2 ꧲ A9ED ꧭ MYANMAR LETTER TAI LAING DDA
A9EE ꧮ MYANMAR LETTER TAI LAING DDHA
Digits
4 ꧴ A9F0 ꧰ MYANMAR TAI LAING DIGIT ZERO
A9F1 ꧱ MYANMAR TAI LAING DIGIT ONE
5 ꧵ A9F2 ꧲ MYANMAR TAI LAING DIGIT TWO
A9F3 ꧳ MYANMAR TAI LAING DIGIT THREE
6 ꧶ A9F4 ꧴ MYANMAR TAI LAING DIGIT FOUR
꧵
ꧧ ꧷
A9F5 MYANMAR TAI LAING DIGIT FIVE
7 A9F6 ꧶ MYANMAR TAI LAING DIGIT SIX
A9F7 ꧷ MYANMAR TAI LAING DIGIT SEVEN
8 ꧨ ꧸ A9F8 ꧸ MYANMAR TAI LAING DIGIT EIGHT
A9F9 ꧹ MYANMAR TAI LAING DIGIT NINE
9 ꧩ ꧹ Consonants
A ꧪ ꧺ
A9FA ꧺ MYANMAR LETTER TAI LAING LLA
A9FB ꧻ MYANMAR LETTER TAI LAING DA
ꧼ
B ꧫ ꧻ
A9FC MYANMAR LETTER TAI LAING DHA
A9FD ꧽ MYANMAR LETTER TAI LAING BA
A9FE ꧾ MYANMAR LETTER TAI LAING BHA
C ꧬ ꧼ
D ꧭ ꧽ
E ꧮ ꧾ
F ꧯ
3
Rationale
The various subgroups of characters will be considered separately, in encoding order.
Tai Laing Tone Marks. Tai Laing has 5 tone marks. Of these, 3 are already encoded and this proposal adds
the remaining two.
Figure 2. Use of Tai Laing tone marks in conjuction with other diacritics.
Shwe Palaung Consonants: The two proposed characters for Shwe Palaung have a visual representation
that is very close to an existing sequence:
The medial is added into the middle of the sequence representing the consonant. This is a problem for data
entry. For sorting, while the example dictionary here sorts them as though the components are medials, being
distinct consonants, these characters should have their own consonantal position. The encoding of these
4
characters follows in the tradition of encoding such characters atomically. Other examples are: U+106F (c.f.
U+101F U+103E), U+1070 (c.f. U+1003 U+103E), U+107E (c.f. U+107D U+103E).
AA7E;MYANMAR LETTER SHWE PALAUNG CHA;Lo;0;L;;;;;N;;;;;
AA7F;MYANMAR LETTER SHWE PALAUNG SHA;Lo;0;L;;;;;N;;;;;
It is also worth noting that the rendering behaviour with medial wa may be different to the way the sequence
is rendered. Some styles prefer them to be the same, others for them to be different.
5
Figure 4. Tai Laing modern alphabet
A9E7;MYANMAR LETTER TAI LAING NYA;Lo;0;L;;;;;N;;;;;
A9E8;MYANMAR LETTER TAI LAING FA;Lo;0;L;;;;;N;;;;;
Tai Laing Pali Consonants: As with most Myanmar script based writing systems, Tai Laing adds character
support for the Pali language.
6
Figure 5. Shiksha showing Devanagari, Burmese, Tai Laing and Roman scripts.
Notice that the character for pha is the same as U+A9E4 MYANMAR LETTER SHAN BHA and that likewise the
code for bha is based on that shape.
7
A9E9;MYANMAR LETTER TAI LAING GA;Lo;0;L;;;;;N;;;;;
A9EA;MYANMAR LETTER TAI LAING GHA;Lo;0;L;;;;;N;;;;;
A9EB;MYANMAR LETTER TAI LAING JA;Lo;0;L;;;;;N;;;;;
A9EC;MYANMAR LETTER TAI LAING JHA;Lo;0;L;;;;;N;;;;;
A9ED;MYANMAR LETTER TAI LAING DDA;Lo;0;L;;;;;N;;;;;
A9EE;MYANMAR LETTER TAI LAING DDHA;Lo;0;L;;;;;N;;;;;
A9EF;MYANMAR LETTER TAI LAING NNA;Lo;0;L;;;;;N;;;;;
Tai Laing Digits: Tai Laing has its own set of digits, which are proposed here:
Figure 6. Tai Laing Digits. The colums are: character name in Tai Laing, Tai Laing digit, Shan digit (from U+1090-
U+1099), Arabic digit.
A9F0;MYANMAR TAI LAING DIGIT ZERO;Nd;0;L;;0;0;0;N;;;;;
A9F1;MYANMAR TAI LAING DIGIT ONE;Nd;0;L;;1;1;1;N;;;;;
A9F2;MYANMAR TAI LAING DIGIT TWO;Nd;0;L;;2;2;2;N;;;;;
A9F3;MYANMAR TAI LAING DIGIT THREE;Nd;0;L;;3;3;3;N;;;;;
A9F4;MYANMAR TAI LAING DIGIT FOUR;Nd;0;L;;4;4;4;N;;;;;
A9F5;MYANMAR TAI LAING DIGIT FIVE;Nd;0;L;;5;5;5;N;;;;;
A9F6;MYANMAR TAI LAING DIGIT SIX;Nd;0;L;;6;6;6;N;;;;;
A9F7;MYANMAR TAI LAING DIGIT SEVEN;Nd;0;L;;7;7;7;N;;;;;
8
A9F8;MYANMAR TAI LAING DIGIT EIGHT;Nd;0;L;;8;8;8;N;;;;;
A9F9;MYANMAR TAI LAING DIGIT NINE;Nd;0;L;;9;9;9;N;;;;;
Sort Order
The default sort order is integrated into the existing default sorting for the Myanmar script. Given that all
Myanmar based languages require complex sort tailoring, the precise values here can be somewhat arbitrary.
The sort order information given here also includes the other characters from the Myanmar Extended-B
block as specified in N3906 (L2/10-345).
&1003 MYANMAR LETTER GHA < A9E0 MYANMAR LETTER SHAN GHA < A9EA MYANMAR LETTER TAI LAING GHA
&1006 MYANMAR LETTER CHA < A9E1 MYANMAR LETTER SHAN CHA
&AA62 MYANMAR LETTER KHAMTI CHA < AA7E MYANMAR LETTER SHWE PALAUNG CHA
&105B MYANMAR LETTER MON JHA < A9E2 MYANMAR LETTER SHAN JHA
&AA64 MYANMAR LETTER KHAMTI JHA < A9EC MYANMAR LETTER TAI LAING JHA
&1061 MYANMAR LETTER SGAW KAREN SHA < AA7F MYANMAR LETTER SHWE PALAUNG SHA
&AA65 MYANMAR LETTER KHAMTI NYA < A9E7 MYANMAR LETTER TAI LAING NYA
&106E MYANMAR LETTER EASTERN PWO KAREN NNA < A9E3 MYANMAR LETTER SHAN NNA < A9EE MYANMAR
LETTER TAI LAING NNA
&108E MYANMAR LETTER RUMAI PALAUNG FA < A9E8 MYANMAR LETTER TAI LAING FA
&1018 MYANMAR LETTER BHA < A9E4 MYANMAR LETTER SHAN BHA < A9FE MYANMAR LETTER TAI LAING BHA
&1068 MYANMAR VOWEL SIGN WESTERN PWO KAREN UE < A9E5 MYANMAR SIGN SHAN SAW
&108D MYANMAR SIGN SHAN COUNCIL EMPHATIC TONE << AA7C MYANMAR SIGN TAI LAING TONE-2
&1089 MYANMAR SIGN SHAN TONE-5 < AA7D MYANMAR SIGN TAI LAING TONE-5
&AA70 MYANMAR MODIFIER LETTER KHAMTI REDUPLICATION < A9E6 MYANMAR MODIFIER LETTER SHAN
REDUPLICATION
&AA60 MYANMAR LETTER KHAMTI GA < A9E9 MYANMAR LETTER TAI LAING GA
&105B MYANMAR LETTER MON JA < A9EB MYANMAR LETTER TAI LAING JA
&AA68 MYANMAR LETTER KHAMTI DDA < A9ED MYANMAR LETTER TAI LAING DDA
&AA69 MYANMAR LETTER KHAMTI DDHA < A9EE MYANMAR LETTER TAI LAING DDHA
&100F MYANMAR LETTER NNA < A9EF MYANMAR LETTER TAI LAING NNA
&1020 MYANMAR LETTER LLA < A9FA MYANMAR LETTER TAI LAING LLA
&107B MYANMAR LETTER SHAN DA < A9FB MYANMAR LETTER TAI LAING DA
&1013 MYANMAR LETTER DHA < A9FC MYANMAR LETTER TAI LAING DHA
&107F MYANMAR LETTER SHAN BA < A9FD MYANMAR LETTER TAI LAING BA
MYANMAR TAI LAING DIGIT characters are sorted following their corresponding MYANMAR SHAN DIGIT
characters.
Bibliography
Hosken, Martin “Representing Myanmar in Unicode” (Unicode Technical Note 11, version 3).
ၸꩫ့်င
ႍဝ် ꧤိုꩼ ꩫ်လ့်တႆးလꧥ င်ꩽ
SonNgaw Phawnla TaiLaing
O Thuwa Palaung-Burmese Dictionary (Namhfan, 2003)
Acknowledgements
Thanks go to Payap University Linguistics Institute, Chiang Mai, Thailand, under whose auspices this work
is done.
9
ISO/IEC JTC 1/SC 2/WG 2
PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS
FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646 1 TP PT
A. Administrative
B. Technical – General
1. Choose one of the following:
a. This proposal is for a new script (set of characters):
Proposed name of script:
b. The proposal is for addition of character(s) to an existing block: X
Name of the existing block: Myanmar Extended-A, Myanmar Extended-B
2. Number of characters in proposal: 28
3. Proposed category (select one from below - see section 2.2 of P&P document):
A-Contemporary X B.1-Specialized (small collection) B.2-Specialized (large collection)
C-Major extinct D-Attested extinct E-Minor extinct
F-Archaic Hieroglyphic or Ideographic G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? yes
a. If YES, are the names in accordance with the “character naming guidelines”
in Annex L of P&P document? yes
b. Are the character shapes attached in a legible form suitable for review? yes
5. Who will provide the appropriate computerized font (ordered preference: True Type, or PostScript format) for
publishing the standard? SIL
If available now, identify source(s) for the font (include address, e-mail, ftp-site, etc.) and indicate the tools
used:
6. References:
a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? yes
b. Are published examples of use (such as samples from newspapers, magazines, or other sources)
of proposed characters attached? yes
7. Special encoding issues:
Does the proposal address other aspects of character data processing (if applicable) such as input,
presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? yes
8. Additional Information:
Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist
in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties
are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths
etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts,
Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at
http://www.unicode.org for such information on other scripts. Also see http://www.unicode.org/Public/UNIDATA/UCD.html
HTU UTH HTU UTH
and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for
inclusion in the Unicode Standard.
1 Form number: N3102-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01,
TPPT
10
C. Technical - Justification
1. Has this proposal for addition of character(s) been submitted before? no
If YES explain
2. Has contact been made to members of the user community (for example: National Body,
user groups of the script or characters, other experts, etc.)? yes
If YES, with whom? local experts
If YES, available relevant documents: see bibliography
3. Information on the user community for the proposed characters (for example:
size, demographics, information technology use, or publishing use) is included? yes
Reference: this document
4. The context of use for the proposed characters (type of use; common or rare) common
Reference:
5. Are the proposed characters in current use by the user community? yes
If YES, where? Reference: see bibliography
6. After giving due considerations to the principles in the P&P document must the proposed characters be entirely
in the BMP? yes
If YES, is a rationale provided? yes
If YES, reference: addition to existing BMP blocks
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? no
8. Can any of the proposed characters be considered a presentation form of an existing
character or character sequence? yes
If YES, is a rationale for its inclusion provided? yes
If YES, reference: this document
9. Can any of the proposed characters be encoded using a composed character sequence of either
existing characters or other proposed characters? no
If YES, is a rationale for its inclusion provided?
If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function)
to an existing character? yes
If YES, is a rationale for its inclusion provided? yes
If YES, reference: this document
11. Does the proposal include use of combining characters and/or use of composite sequences? yes
If YES, is a rationale for such use provided? yes
If YES, reference: this document
Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided? no
If YES, reference:
12. Does the proposal contain characters with any special properties such as
control function or similar semantics? no
If YES, describe in detail (include attachment if necessary)
11