我应该使用什么编程语言、算法来进行词典翻译?
我需要使用英语-孟加拉语词典将英语文档中的单词翻译(或者更确切地说,映射)为孟加拉语。我收集了大量这些文件。我应该使用什么算法,什么语言?文档和词典是UTF-8编码的文本文件。翻译很幼稚和直接,因为单词会被盲目地替换成字典上所说的任何单词,无论是单个单词的含义还是短语或句子。
I need to translate (or rather, map) the words in an English document to Bengali using an English-to-Bengali dictionary. I have a huge collection of these documents. What algorithm should I use, and what language? The documents and dictionary are UTF-8 encoded text files. The translation is naive and straightforward, as words will be blindly replaced with whatever the dictionary says, be it a single-word meaning or a phrase or a sentence.
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(2)
您可以使用任何您想要的主流语言。只需检查它对 unicode 的支持类型以及是否具有内置关联数据结构(例如 C++ 中的
stl::map
、Java 中的HashMap/TreeMap
、< code>dict(Python 等)。几乎任何现代语言都有它们,或者有一个免费的库。You can use any mainstream language you want. Just check what sort of support it has for unicode and whether it has a built in associative data structure (e.g.
stl::map
in C++,HashMap/TreeMap
in Java,dict
in Python etc.). Pretty much any modern language has them, or has a freely available library for them.几乎任何主流语言。如果时间紧迫,我会坚持使用 python。如果你有时间并且想利用你的编码技能赚大钱(如果你选择开发它们),我会选择 c#
Pretty much any mainstream language. If time is short I'd stick with python. If you have time and want to make excellent money with your coding skills (should you chose to develop them) I'd go with c#