Fast, Unicode-aware, cross-platform programmer's text editor that can display invisibles like ZWSP?

Posted on 2024-08-29 23:27:11

Our publishing workflow includes Windows and Linux machines (there are some Macs too, but not in the critical-path workflow). Many texts include both English and Khmer and are marked up in XML.

XML Copy Editor is the best cross-platform open-source XML editor I've discovered. It uses the Scintilla editing component, which is generally good with Unicode but cannot display non-printing or invisible characters like U+200B (zero-width space) and U+200C (zero-width non-joiner). Khmer does not separate words with a space character as Western languages do, so ZWSP is used in electronic texts to enable applications to break lines easily.
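As a stopgap when the editor itself cannot show these characters, a small script can make them visible in a copy of the text. This is only a minimal sketch: the marker strings `[ZWSP]` and `[ZWNJ]` and the function name `reveal_invisibles` are hypothetical choices for illustration, not part of any editor mentioned here.

```python
# Hypothetical visible markers for the two invisibles named above.
MARKERS = {
    "\u200b": "[ZWSP]",  # U+200B zero-width space
    "\u200c": "[ZWNJ]",  # U+200C zero-width non-joiner
}

# Translation table built once; maps each invisible to its marker string.
_TABLE = str.maketrans(MARKERS)


def reveal_invisibles(text):
    """Return a copy of text with each listed invisible replaced by its marker."""
    return text.translate(_TABLE)


if __name__ == "__main__":
    sample = "ភាសា\u200bខ្មែរ"  # two Khmer words separated by a ZWSP
    print(reveal_invisibles(sample))
```

The output is meant for inspection only; the original file should stay untouched so the real characters are preserved.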

Ideally I'd edit the markup and the content in a single editor, but XML awareness is less important at times than being able to display invisibles. (OpenOffice.org Writer and Microsoft Word are the only two apps I know that will display ZWSP. They are not suitable for the markup and text manipulations that need to be done to prepare manuscripts for publication, unfortunately, although I guess they're fine for authoring.)

I tried out a promising editor last week, but a search-and-replace regex operation that took under a second in TextPad 4.7.3 lasted over twenty seconds. So I want to mention that speed and the ability to handle large files (up to 150 MB) are also concerns.

Is there a good, fast, free or not too expensive text editor, with versions for Windows and Linux and maybe Mac too, that is Unicode-aware and capable of displaying invisibles like ZWSP? One that has syntax highlighting, can handle large files, and is customizable enough that I won't tear my hair out in frustration?

Comments (3)

缱绻入梦 2024-09-05 23:27:11

I don't know about ZWSP in particular, but EditPad Pro is good, fast, not expensive, has a very good regex engine and is Unicode-aware (and well-suited to editing XML, too). The developer (Jan Goyvaerts) lives in Thailand and knows about requirements for Eastern scripts and languages, so chances are good that it will be able to handle these texts.

℡Ms空城旧梦 2024-09-05 23:27:11

EditPad Pro does not (yet) have the ability to visualize non-printable characters other than the ASCII space and tab. Version 6 does recognize ZWSP as a word boundary when doing word wrapping and selecting words by double-clicking or Ctrl+Shift+Left/Right.

What you can do is search for the regular expression \u200B. Though this doesn't make the zero-width space visible, it will select it and put the cursor after it. You could use the regex \u200B\X and turn on the Highlight button on the search panel to highlight each grapheme after U+200B. You could even use the syntax coloring scheme editor to edit the provided XML scheme so that it uses that regex to always highlight each grapheme after U+200B.
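Outside EditPad Pro, the same search idea can be sketched in a few lines of Python to list where each U+200B sits. The function name `find_zwsp` is a hypothetical one chosen for this illustration. Python's standard `re` module does not support `\X`, so this sketch only locates the zero-width space itself rather than the grapheme after it.

```python
import re

# Pattern matching a single zero-width space (U+200B).
ZWSP = re.compile("\u200b")


def find_zwsp(text):
    """Yield 1-based (line, column) pairs for every U+200B in text."""
    # Split on "\n" only, so line numbers match what most editors show.
    for line_no, line in enumerate(text.split("\n"), start=1):
        for match in ZWSP.finditer(line):
            yield line_no, match.start() + 1


if __name__ == "__main__":
    sample = "ab\u200bc\n\u200bd"
    for line_no, col in find_zwsp(sample):
        print(f"U+200B at line {line_no}, column {col}")
```

Columns are counted in code points, which may differ from the column an editor reports for text with combining marks.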

EditPad Pro easily handles 150 MB files and has a powerful regex engine (same as used in RegexBuddy and PowerGREP). Maximum file size is 2 GB. Windows only.

云裳 2024-09-05 23:27:11

I'm using CKEditor; it's cross-platform and completely supports Unicode.

Take a look at it
