Ruby 1.8 和 UTF-8 字符串大小写语句比较

发布于 2024-07-10 04:43:13 字数 382 浏览 8 评论 0原文

我有一个 Rake 任务（在 lib/tasks 目录中），我在共享虚拟主机上使用 cron 运行该任务。问题是我想使用 case 语句比较 UTF-8 字符串，但我的源代码不是 UTF-8 编码的。如果我将源代码保存为 UTF-8，当我尝试启动它时会出现错误:(

我必须做什么？

可以从外部 UTF-8 txt 文件读取此字符串吗？

PS 我正在使用 Ruby 1.8

P.S. 我的意思是比较这边走：

result = case utf8string
   when 'АБВ': 1
   when 'ГДИ': 2
   when 'ЙКЛ': 3
   when 'МНО': 4
   else 5
end

原文

I have a Rake task (in lib/tasks directory) that I run with cron on my shared web hosting. The problem is that I want to compare a UTF-8 string using case statment but my source code is not UTF-8 encoded. If I save source code as UTF-8 there is error when I try to start it :(

What I have to do?

May be read this strings from external UTF-8 txt file?

P.S. I'm using Ruby 1.8

P.S. I mean compare this way:

result = case utf8string
   when 'АБВ': 1
   when 'ГДИ': 2
   when 'ЙКЛ': 3
   when 'МНО': 4
   else 5
end

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

时光是把杀猪刀 2024-07-17 04:43:13

我发现我的问题不在于 case 语句，

问题在于当我以 UTF-8 格式保存源代码时，我的文本编辑器在开头添加 3 个字节（BOM）以指示编码是 UTF-8。

问：什么是 BOM？
A: 字节顺序标记 (BOM) 由字符代码组成
U+FEFF 位于数据流的开头，可用作
定义字节顺序和编码形式的签名，主要是未标记的
纯文本文件。在某些更高级别的协议下，BOM 的使用可能是
在该协议中定义的 Unicode 数据流中强制（或禁止）。

UTF-8、UTF-16、UTF-32 和 BOM

我得到的错误是：

1: Invalid char `\357' in expression
1: Invalid char `\273' in expression
1: Invalid char `\277' in expression

I found that my problem was not in case statment

The problem was that when I save my source code in UTF-8 format, my text editor add 3 bytes (BOM) at the beginning to indicate that encoding is UTF-8.

Q: What is a BOM?
A: A byte order mark (BOM) consists of the character code
U+FEFF at the beginning of a data stream, where it can be used as a
signature defining the byte order and encoding form, primarily of unmarked
plaintext files. Under some higher level protocols, use of a BOM may be
mandatory (or prohibited) in the Unicode data stream defined in that protocol.

UTF-8, UTF-16, UTF-32 & BOM

The error that I get was:

1: Invalid char `\357' in expression
1: Invalid char `\273' in expression
1: Invalid char `\277' in expression

回复收藏 0 原文

呆萌少年 2024-07-17 04:43:13

我想说您需要更改文本编辑器，因为 UTF-8 不需要 BOM。 UTF-8 不依赖于字节顺序。有关详细信息，请参阅链接文本。

回复收藏 0 原文

阳光下的泡沫是彩色的 2024-07-17 04:43:13

尝试使用 Rails 的 ActiveSupportmb_chars 方法a> 框架：

result = case utf8string.mb_chars
   when 'АБВ': 1
   when 'ГДИ': 2
   when 'ЙКЛ': 3
   when 'МНО': 4
   else 5
end

Try using the mb_chars method from Rails' ActiveSupport framework:

result = case utf8string.mb_chars
   when 'АБВ': 1
   when 'ГДИ': 2
   when 'ЙКЛ': 3
   when 'МНО': 4
   else 5
end

回复收藏 0 原文

~没有更多了~

关于作者

硬不硬你别怂

暂无简介

0 文章

0 评论

23 人气

关注发私信

友情链接

文江博客

Ruby 1.8 和 UTF-8 字符串大小写语句比较

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（3）

关于作者

相关话题

热门标签

推荐作者

1CH1MKgiKxn9p

ゞ记忆︶ㄣ

JackDx

信远

yaoduoduo1995

霞映澄塘

友情链接

Ruby 1.8 和 UTF-8 字符串大小写语句比较

如果你对这篇内容有疑问，欢迎到本站社区发帖提问 参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（3）

关于作者

相关话题

热门标签

推荐作者

1CH1MKgiKxn9p

ゞ记忆︶ㄣ

JackDx

信远

yaoduoduo1995

霞映澄塘

友情链接

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。