需要统计自然语言处理资源
我正在用 Java 编写一个需要解析自然语言的程序。我需要使用概率和统计来完成此操作。是否有任何资源可以轻松解释统计自然语言处理技术?
I'm writing a program in Java that needs to parse natural language. I need this to be done using probability and statistics. Are there any resources that can easily explain Statistical Natural Language Processing techniques?
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(1)
通常引用的“介绍性”参考是 统计自然语言处理基础 (1999) Manning &舒兹。虽然内容全面、相对容易理解,而且无疑是一个很好的参考,但对于对该领域的更随意的介绍来说,这可能有点过头了。
您也许可以找到一些在线课程,例如NLP 统计方法短期课程< /a>
而且,既然你提到了 java,你可以找到一个通用的“工具箱”,例如
并开始亲自接触 NLP 的特定领域,例如词性标记或实体提取。
另外值得一提的是,“虽然它与基于 Python 的 NLTK 有关,但使用 Python 进行自然语言处理” 在线(和硬拷贝)书籍 构成了常见 NLP 任务的非常实用的指南。
涉足 NLP 有一个第 22 条军规:它是一个相当广泛的研究和实践领域。它充满了学术研究以及经过时间和行业考验的实践和图书馆。在人们更好地掌握适合给定问题的 NLP 特定应用之前,人们可能会浪费大量时间来研究那些不成熟或不太适合当前问题的技术。
A commonly cited "introductory" reference is Foundations of Statistical Natural Language Processing (1999) by Manning & Shutze. While comprehensive, relatively accessible and certainly a excellent reference, this may be overkill for a more casual introduction to the field.
You can maybe find some online courses such as Short course on Statistical Methods in NLP
And also, since you mentioned java, you can find a generic "toolbox" such as
and start getting hands-on exposure to specific areas of NLP such as, say, POS Tagging or Entity Extraction.
Also worthy of mention, 'though it is related to a the Python-based NLTK, the Natural Language Processing with Python online (and hardcopy) book constitutes a very practical guide into common NLP tasks.
There is a bit of a catch-22 with getting one's feet wet with NLP: it is a rather extensive field of study and practice. It is rife with both scholarly research and time and industry tested practices and libraries. Until one has a better grasp of the particular applications of NLP that are suitable for a given problem, one may waste a lot time poking technologies that are either immature or not well suited to the problems at hand.