Apache lucene 和文本含义

发布于 2024-12-09 03:20:34 字数 552 浏览 5 评论 0原文

我有一个关于 lucene/ 中搜索过程的问题。我使用此代码进行搜索

    Directory directory = FSDirectory.GetDirectory(@"c:\index");
    Analyzer analyzer = new StandardAnalyzer();

    QueryParser qp = new QueryParser("content", analyzer);
    qp.SetDefaultOperator(QueryParser.Operator.AND);

    Query query = qp.Parse(search string);

在一个文档中，我为字段设置了“我想要去购物”，在其他文档中我设置了“我想要去购物” ”。

两个句子的意思是一样的！

lucene 有什么好的解决方案来理解句子的含义或标准化气味吗？例如，保存“我想要/想要/去购物”等字段，并使用结果中的正则表达式删除注释。

原文

I have a question about searching process in lucene/.
I use this code for search

    Directory directory = FSDirectory.GetDirectory(@"c:\index");
    Analyzer analyzer = new StandardAnalyzer();

    QueryParser qp = new QueryParser("content", analyzer);
    qp.SetDefaultOperator(QueryParser.Operator.AND);

    Query query = qp.Parse(search string);

In one document I've set "I want to go shopping" for a field and in other document I've set "I wanna go shopping".

the meaning of both sentences is same!

is there any good solution for lucene to understand meaning of sentences or kind of normalize the scentences ? for example save the fields like "I wanna /want to/ go shopping" and remove the comment with regexp in result.

分享到QQ

分享到微博