I have not used mg4j. I have used Lucene, though. The number one feature of Lucene IMO is its wide adoption and wonderful community of users/developers/committers. This means that there is a fair chance that somebody worked on a use case similar to yours using Lucene. Current weak points of Lucene are its scoring model and its ability to scale to large collections of text. The Lucene developers are working on these issues.
I believe that the choice of a search library is very dependent on your (academic or industrial) setting, the other parts of your application and your use case.
发布评论
评论(1)
Jeff Dalton 评论了许多开源搜索引擎 包括2007年的Lucene和mg4j,2009年更新了对比。
我没有使用过mg4j。不过我用过Lucene。在我看来,Lucene 的第一个特性是它的广泛采用和精彩的用户/开发者/提交者社区。这意味着有人很有可能使用 Lucene 处理过与您类似的用例。
Lucene 当前的弱点是其评分模型及其扩展到大型文本集合的能力。 Lucene 开发人员正在努力解决这些问题。
我相信搜索库的选择很大程度上取决于您的(学术或工业)设置、应用程序的其他部分以及您的用例。
Jeff Dalton reviewed many open source search engines including Lucene and mg4j in 2007, and updated the comparison in 2009.
I have not used mg4j. I have used Lucene, though. The number one feature of Lucene IMO is its wide adoption and wonderful community of users/developers/committers. This means that there is a fair chance that somebody worked on a use case similar to yours using Lucene.
Current weak points of Lucene are its scoring model and its ability to scale to large collections of text. The Lucene developers are working on these issues.
I believe that the choice of a search library is very dependent on your (academic or industrial) setting, the other parts of your application and your use case.