Bengali Unicode characters missing in GlyphData.xml

Greetings!

The following Unicode characters are not included in the GlyphData.xml file. I’m on Glyphs 3.2.3 (3260).

<glyph unicode="09FC" name="vedicAnusvara-beng" sortName="be248" category="Letter" subCategory="Spacing" script="bengali" production="uni09FC" altNames="vedicAnusvara-bengali" description="BENGALI LETTER VEDIC ANUSVARA" />
<glyph unicode="09FD" name="abbreviation-beng" sortName="be250" category="Punctuation" script="bengali" production="uni09FD" altNames="abbreviation-bengali" description="BENGALI ABBREVIATION SIGN" />
<glyph unicode="09FE" name="sandhi-beng" sortName="be252" category="Mark" subCategory="Nonspacing" script="bengali" production="uni09FE" altNames="sandhi-bengali" description="BENGALI SANDHI MARK" />
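
For anyone who wants to check their own copy, here is a minimal Python sketch that lists which code points have no entry. It assumes the file is a flat list of `<glyph>` elements with a `unicode` attribute, as in the entries above. The `<glyphData>` root element, the function name `missing_codepoints`, and the inline sample are illustrative assumptions, not taken from the actual file.

```python
import xml.etree.ElementTree as ET

# Illustrative sample only; a real GlyphData.xml is much larger.
SAMPLE = """<glyphData>
<glyph unicode="0982" name="anusvara-beng" category="Mark" subCategory="Spacing" script="bengali" />
<glyph unicode="09FD" name="abbreviation-beng" category="Punctuation" script="bengali" />
</glyphData>"""


def missing_codepoints(xml_text, wanted):
    """Return the code points in `wanted` that have no <glyph unicode="..."> entry."""
    root = ET.fromstring(xml_text)
    # Collect every unicode attribute, normalised to upper-case hex.
    present = {
        g.get("unicode").upper()
        for g in root.iter("glyph")
        if g.get("unicode")
    }
    return [cp for cp in wanted if cp.upper() not in present]


print(missing_codepoints(SAMPLE, ["09FC", "09FD", "09FE"]))
```

To run it against a real file, read the file's text and pass it in instead of `SAMPLE`.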

Also, any insight into why the sortName values for the Bengali range are all even numbers (be000, be002, and so on)?

Why is “vedicAnusvara-beng” a letter and “anusvara-beng” a spacing mark?

It is like that for many entries. It makes it easier to reorder things: you can fit in some entries without needing to change all the sortNames.
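
To illustrate the idea, a small sketch in Python. The helper `sort_name_between` is hypothetical and assumes sortNames follow the pattern seen in this thread, a two-letter prefix plus a zero-padded number; it is not how Glyphs itself assigns them.

```python
def sort_name_between(before, after):
    """Return a sortName that sorts strictly between two existing ones.

    Assumes names like "be248": a shared two-letter prefix followed by a
    zero-padded number. Raises ValueError when the gap is already used up.
    """
    prefix = before[:2]
    if after[:2] != prefix:
        raise ValueError("sortNames must share the same prefix")
    low, high = int(before[2:]), int(after[2:])
    if high - low < 2:
        raise ValueError("no free slot; neighbours would need renumbering")
    width = len(before) - 2  # keep the original zero padding
    return f"{prefix}{(low + high) // 2:0{width}d}"


# With even-numbered entries there is one free odd slot between neighbours.
print(sort_name_between("be248", "be250"))  # be249
```

Once that odd slot is taken, the next insertion at the same spot would need the neighbours renumbered, which is the case the function reports with an error.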

Why is “vedicAnusvara-beng” a letter and “anusvara-beng” a spacing mark?

This is based on the character’s Unicode properties. The requester didn’t provide sufficient rationale for these properties in their proposal (page 2). In fact, they suggested that “[…] the character (and other similar characters for Vedic usage) should be treated equivalent with the anusvara, since it only occurs in place of the latter in specific contexts”, which is… interesting.

Anyway, it’s only used in the reproduction of classical/ancient texts, so not seen at all in contemporary use.
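
For reference, the properties can be checked with Python’s standard `unicodedata` module. This is only a minimal sketch: the results depend on the Unicode version bundled with your Python (see `unicodedata.unidata_version`), and the helper name `describe` is my own.

```python
import unicodedata

# Code points discussed in this thread, plus the regular anusvara for comparison.
CODEPOINTS = [0x0982, 0x09FC, 0x09FD, 0x09FE]


def describe(cp):
    """Return (label, general category, name) for a code point."""
    ch = chr(cp)
    # Fall back to a placeholder if this Unicode version lacks the character.
    name = unicodedata.name(ch, "<unassigned in this Unicode version>")
    return f"U+{cp:04X}", unicodedata.category(ch), name


print("Unicode data version:", unicodedata.unidata_version)
for cp in CODEPOINTS:
    print(*describe(cp))
```

On a recent Python, U+0982 reports `Mc` (spacing mark) while U+09FC reports `Lo` (other letter), which is the difference in question.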

Hope that helps!

Then we should treat it as a spacing mark, too. The preview I get in UnicodeChecker even has the dotted circle.

Sounds good.