SC22/WG20 N728

Subject: Gurmukhi Collation

The second action item for me from the Copenhagen meeting was to provide Gurmukhi collation for 14651. Here it is.

The Observations section, which deals with the order within groups, is followed by the Recommended Groupings section, which orders the groups. The Collation Order section has not been expanded out as it was for Devanagari, but the same principle can be used to stream out the details.

Baldev.

Observations

1. The alphabet, in order:

   U+0A73, U+0A05, U+0A72, U+0A38, U+0A39
   U+0A15, U+0A16, U+0A17, U+0A18, U+0A19
   U+0A1A, U+0A1B, U+0A1C, U+0A1D, U+0A1E
   U+0A1F, U+0A20, U+0A21, U+0A22, U+0A23
   U+0A24, U+0A25, U+0A26, U+0A27, U+0A28
   U+0A2A, U+0A2B, U+0A2C, U+0A2D, U+0A2E
   U+0A2F, U+0A30, U+0A32, U+0A35, U+0A5C
   and
   U+0A36, U+0A59, U+0A5A, U+0A5B, U+0A5E

2. The digits, in order, are: U+0A66 (digit 0) to U+0A6F (digit 9).

3. The unmodified character precedes the modified character, where "modified" means that the character is associated with any of a number of modifiers, such as the nukta, vowel signs, etc. This means that a non-nukta character (i.e. one without a nukta) precedes its nukta-modified form (i.e. one with a nukta).

The various signs (generically called "modifiers" above) are applied to the alphabetic characters listed in 1. Not all of the signs are applicable in all cases. As noted above, the unmodified character precedes the modified character in sorting. Within the modifier set, the order is:

   nukta
   bindi
   tippi
   addak

4. Dependent vowels, like the modifiers above, are combining characters. They generally combine with the consonants.
The following order works for the dependent vowels:

   vowel sign aa
   vowel sign i
   vowel sign ii
   vowel sign u
   vowel sign uu
   vowel sign ee
   vowel sign ai
   vowel sign oo
   vowel sign au

5. The signs, in order, are:

   U+0A74 (ek onkar)
   danda (missing from the Gurmukhi range, but present in the Devanagari range)
   double danda (as above)

Recommended Groupings

These groups are in sort order:

Signs: Ek Onkar (given the highest priority since it is a revered religious symbol), followed by the danda and the double danda.

Digits: the Gurmukhi digits.

Base alphabet character: the base character, followed by the modified base character (see 3 above), followed by the base character combined with dependent vowels.

Dependent vowels and modifiers are not given a separate place in the collation order. Essentially, they are combining marks and as such should not occur independently in any Gurmukhi text stream. If they do occur, by error, then they will be treated like any undefined characters: after the defined sequences and in code-point order.

Collation Order

Stream out following the recommended groupings above.

Regards,
Baldev.
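
As an illustration only, the groupings above could be streamed out roughly as in the following Python sketch. It is not the 14651 table itself. The names (sort_key, BASE_RANK, MARK_RANK and so on) are hypothetical, and comparing strings cluster by cluster is a simplifying assumption of the sketch; a multi-level tailoring may order some strings differently. The code points for the modifiers, the dependent vowel signs, the danda and the double danda are taken from the Unicode code charts, not from this note. The sketch also assumes that a combining mark attaches only to an alphabet letter immediately before it; otherwise it is treated as an undefined character.

```python
# Hypothetical sketch of the recommended groupings; not the 14651 table itself.

SIGNS = [0x0A74, 0x0964, 0x0965]        # ek onkar, danda, double danda
DIGITS = list(range(0x0A66, 0x0A70))    # digit 0 .. digit 9
ALPHABET = [
    0x0A73, 0x0A05, 0x0A72, 0x0A38, 0x0A39,
    0x0A15, 0x0A16, 0x0A17, 0x0A18, 0x0A19,
    0x0A1A, 0x0A1B, 0x0A1C, 0x0A1D, 0x0A1E,
    0x0A1F, 0x0A20, 0x0A21, 0x0A22, 0x0A23,
    0x0A24, 0x0A25, 0x0A26, 0x0A27, 0x0A28,
    0x0A2A, 0x0A2B, 0x0A2C, 0x0A2D, 0x0A2E,
    0x0A2F, 0x0A30, 0x0A32, 0x0A35, 0x0A5C,
    0x0A36, 0x0A59, 0x0A5A, 0x0A5B, 0x0A5E,
]
# Assumption: these code points come from the Unicode charts, not from the note.
MODIFIERS = [0x0A3C, 0x0A02, 0x0A70, 0x0A71]     # nukta, bindi, tippi, addak
VOWEL_SIGNS = [0x0A3E, 0x0A3F, 0x0A40, 0x0A41, 0x0A42,
               0x0A47, 0x0A48, 0x0A4B, 0x0A4C]   # aa i ii u uu ee ai oo au

# Groups in sort order: signs, then digits, then the base alphabet.
BASE_RANK = {cp: i for i, cp in enumerate(SIGNS + DIGITS + ALPHABET)}
# Modifiers rank before dependent vowel signs (see the base alphabet grouping).
MARK_RANK = {cp: i for i, cp in enumerate(MODIFIERS + VOWEL_SIGNS)}
LETTERS = set(ALPHABET)


def sort_key(text):
    """Return a list of (weight, marks) clusters usable as a sort key."""
    key = []
    can_attach = False  # True while the last cluster is an alphabet letter
    for ch in text:
        cp = ord(ch)
        if cp in MARK_RANK and can_attach:
            # Combining mark: record it on the preceding letter's cluster.
            weight, marks = key[-1]
            key[-1] = (weight, marks + (MARK_RANK[cp],))
        elif cp in BASE_RANK:
            # Defined character; an empty marks tuple sorts before any
            # non-empty one, so the unmodified form comes first.
            key.append(((0, BASE_RANK[cp]), ()))
            can_attach = cp in LETTERS
        else:
            # Undefined or stray combining character: after all defined
            # characters, in code-point order.
            key.append(((1, cp), ()))
            can_attach = False
    return key
```

A list of words would then be ordered with sorted(words, key=sort_key). With this key, a bare letter sorts before the same letter with a nukta, which in turn sorts before the same letter with a dependent vowel sign.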