icu4c--> ushape.c 塑造过程中缺少字符？

发布于 2024-09-26 08:20:50 字数 959 浏览 3 评论 0原文

在我们的语言中，我们使用阿拉伯字符进行书写，但存在一些差异， icu 的 ushape.c （阿拉伯语整形器）仅适用于主要阿拉伯语字符，不会塑造我的语言特定字符（即 0x6D5 等），我更改了 ushape.c 以适用于我的语言，除了字符外，它运行良好，即是 0x649，在阿拉伯语中它们只有 2 个形状，在我的语言中我们有 4 个形状。

我将第 183 行更改

1                + 256 * 0x7F,/*0x0649*/

为

1+2+8             + 256 * 0x98 /*0x649*/

并将第 121 行更改

static const UChar yehHamzaToYeh[] =
{
/* isolated*/ 0xFEEF,
/* final   */ 0xFEF0
};

为

static const UChar yehHamzaToYeh[] =
    {
        /* isolated */0xFEEF, 
                       0xFBE8, // my language specific
                      0xFBE9,// my language specific
        /* final */   0xFEF0 
   };

ushape.c

现在它可以毫无问题地生成 3 个形状（开始、孤立和最终），但中间形状显示为正方形（缺少字符）。

我尝试用其他数字替换“* 0x98”，但这是我能得到的最好的。

我应该怎么办？

原文

in our langauge we use arabic characters in writing with some differences,
icu's ushape.c ( arabic shaper) only works with main arabic characters and dosn't shape my language specific characters ( i.e 0x6D5 etc) i'v changed ushape.c to work with my language and it worked well except for on character, that is 0x649, in arabic they have only 2 shapes, in my langauge we have 4 shapes for it.

i'v changed line 183

1                + 256 * 0x7F,/*0x0649*/

1+2+8             + 256 * 0x98 /*0x649*/

and changed line 121

static const UChar yehHamzaToYeh[] =
{
/* isolated*/ 0xFEEF,
/* final   */ 0xFEF0
};

static const UChar yehHamzaToYeh[] =
    {
        /* isolated */0xFEEF, 
                       0xFBE8, // my language specific
                      0xFBE9,// my language specific
        /* final */   0xFEF0 
   };

from ushape.c

now it can produce 3 shapes with no problem ( the beginning,isolated and final), but middle shape is displayed as a square ( missing character ) .

i tried replacing "* 0x98" with other numbers, but this best i can get.

what should i do ?

分享到QQ

分享到微博