当前位置：文江博客话题详情

我如何使用 php 检测字符串中的 iso8859-8 和 utf8 希伯来语字符

发布于 2024-08-11 00:13:24 字数 69 浏览 12 评论 0原文

我希望能够检测（使用正则表达式）字符串是否包含 php 编程语言中的 utf8 和 iso8859-8 希伯来语字符。谢谢！

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

半﹌身腐败 2024-08-18 00:13:24

这是iso8859-8 字符集映射。 E0 - FA 范围似乎是为希伯来语保留的。您可以检查字符类中的这些字符：

[\xE0-\xFA]

对于 UTF-8，为希伯来语保留的范围似乎是 0591 到 05F4。因此，您可以通过以下方式检测到：

[\u0591-\u05F4]

这是 PHP 中正则表达式匹配的示例：

echo preg_match("/[\u0591-\u05F4]/", $string);

Here's map of the iso8859-8 character set. The range E0 - FA appears to be reserved for Hebrew. You could check for those characters in a character class:

[\xE0-\xFA]

For UTF-8, the range reserved for Hebrew appears to be 0591 to 05F4. So you could detect that with:

[\u0591-\u05F4]

Here's an example of a regex match in PHP:

echo preg_match("/[\u0591-\u05F4]/", $string);

回复收藏 0 原文

感情旳空白 2024-08-18 00:13:24

如果您的 PHP 文件是用 UTF-8 编码的（在其中包含希伯来语的情况下），您应该使用以下 RegX：

$string="אבהג";
echo preg_match("/\p{Hebrew}/u", $string);
// output: 1

well if your PHP file is encoded with UTF-8 as should be in cases that you have hebrew in it, you should use the following RegX:

$string="אבהג";
echo preg_match("/\p{Hebrew}/u", $string);
// output: 1

回复收藏 0 原文

各空 2024-08-18 00:13:24

这是一个小函数，用于检查字符串中的第一个字符是否为希伯来语：

function IsStringStartsWithHebrew($string)
{
    return (strlen($string) > 1 && //minimum of chars for hebrew encoding
        ord($string[0]) == 215 && //first byte is 110-10111
        ord($string[1]) >= 144 && ord($string[1]) <= 170 //hebrew range in the second byte.
        );
}

祝你好运:)

Here's a small function to check whether the first character in a string is in hebrew:

function IsStringStartsWithHebrew($string)
{
    return (strlen($string) > 1 && //minimum of chars for hebrew encoding
        ord($string[0]) == 215 && //first byte is 110-10111
        ord($string[1]) >= 144 && ord($string[1]) <= 170 //hebrew range in the second byte.
        );
}

good luck :)

回复收藏 0 原文

凉宸 2024-08-18 00:13:24

首先，这样的字符串完全没有用——两种不同字符集的混合？

iso8859-8 中的希伯来语字符和 UTF-8 中多字节序列的每个字节都有一个值 ord($char) > 127..所以我要做的就是找到值大于 127 的所有字节，然后检查它们是否像 is8859-8 一样有意义，或者您认为它们作为 UTF8 序列是否更有意义......

回复收藏 0 原文

戏蝶舞 2024-08-18 00:13:24

function is_hebrew($string)
{
    return preg_match("/\p{Hebrew}/u", $string);
}

function is_hebrew($string)
{
    return preg_match("/\p{Hebrew}/u", $string);
}

回复收藏 0 原文

~没有更多了~

关于作者

辞慾

暂无简介

文章

29 人气

关注发私信

qq_VRzBBA45

文章 0 评论 0

关注

痴情

文章 0 评论 0

关注

。

文章 0 评论 0

关注

Mu.

文章 0 评论 0

关注

凉薄对峙

文章 0 评论 0

关注

不落城

文章 0 评论 0

友情链接

文江博客

我如何使用 php 检测字符串中的 iso8859-8 和 utf8 希伯来语字符

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（5）

关于作者

相关话题

热门标签

推荐作者

qq_VRzBBA45

痴情

。

Mu.

凉薄对峙

不落城

友情链接

我如何使用 php 检测字符串中的 iso8859-8 和 utf8 希伯来语字符

如果你对这篇内容有疑问，欢迎到本站社区发帖提问 参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（5）

关于作者

相关话题

热门标签

推荐作者

qq_VRzBBA45

痴情

。

Mu.

凉薄对峙

不落城

友情链接

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。