Getting text snippets from a search index generated by Solr and Nutch

Posted on 2024-11-27 04:57:02

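I have just configured Nutch and Solr by following the getting-started tutorials, and I can successfully crawl and index the text on a web site. Now I am trying to build a search page by modifying the example Velocity templates.

Now to my question: how can I tell Solr to return a relevant text snippet from the content of each hit? I only get the following fields associated with each hit:

score, boost, digest, id, segment, title, date, tstamp and url.

The content really is indexed, because I can search for words that I know appear only in the full text, but I still do not get the full text back with the hit.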


浅浅 2024-12-04 04:57:02

Don't forget: indexed is not the same as stored.

If all fields are indexed but none are stored, you can still search for words in a document, but Solr cannot return the text itself.
To get the content of a specific field back, that field must also have stored="true" in schema.xml.
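
To make the indexed/stored distinction concrete, here is a small Python sketch that scans a schema.xml fragment and lists the fields that are searchable but can never be returned. The fragment itself (field names, types, and the stored="false" on content) is an assumption made up for illustration, not copied from any particular Nutch or Solr release, so check your own schema.xml.

```python
import xml.etree.ElementTree as ET

# Hypothetical schema.xml fragment: the field names and types are
# assumptions for illustration, not taken from a real installation.
SCHEMA_FRAGMENT = """
<fields>
  <field name="title" type="text" indexed="true" stored="true"/>
  <field name="content" type="text" indexed="true" stored="false"/>
  <field name="url" type="string" indexed="true" stored="true"/>
</fields>
"""


def searchable_but_not_returned(schema_xml):
    """Return the names of fields that are indexed but not stored."""
    root = ET.fromstring(schema_xml)
    return [
        field.get("name")
        for field in root.iter("field")
        if field.get("indexed") == "true" and field.get("stored") != "true"
    ]


hidden_fields = searchable_but_not_returned(SCHEMA_FRAGMENT)
print(hidden_fields)
```

Any field this reports would need stored="true" before Solr can hand its text back. Stored values are written at index time, so after changing the schema the documents have to be indexed again.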

If your full-text field is stored but still missing from the results, the default field list probably does not include it.
You can add it with the fl parameter:

http://<solr-url>:port/select/?......&fl=mytext,*

This example assumes your full text is stored in a field called mytext.
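
As a rough sketch, the same request can be assembled in Python. The base URL http://localhost:8983/solr and the helper name build_select_url are assumptions for illustration; mytext is the placeholder field name from the answer (in a Nutch-generated index the full-text field may well have a different name, such as content).

```python
from urllib.parse import urlencode


def build_select_url(base_url, query, extra_fields):
    """Build a Solr select URL that asks for extra fields plus all defaults.

    base_url and the field names are placeholders; adjust them to your setup.
    """
    params = {
        "q": query,
        # "*" keeps every stored field; the named fields are listed explicitly.
        "fl": ",".join(list(extra_fields) + ["*"]),
    }
    # Keep "," and "*" unescaped so the URL stays readable.
    return f"{base_url}/select?{urlencode(params, safe=',*')}"


url = build_select_url("http://localhost:8983/solr", "nutch", ["mytext"])
print(url)
```

The fl parameter only selects among stored fields, so this has no effect on a field that is indexed but not stored.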

Finally, if you want only a snippet of the text containing the searched words (not the whole text), look at the highlighting component in Solr/Lucene.
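
A minimal sketch of a highlighting request follows, assuming the full text lives in a stored field named content and that Solr answers at http://localhost:8983/solr (both are assumptions). The parameters hl, hl.fl, hl.snippets and hl.fragsize are standard Solr highlighting parameters; the sample response is hand-written to mirror the shape of Solr's JSON "highlighting" section and is not real output.

```python
from urllib.parse import urlencode


def build_highlight_url(base_url, query, field, snippets=2, fragsize=150):
    """Build a select URL that asks Solr for highlighted snippets of `field`."""
    params = {
        "q": query,
        "wt": "json",
        "hl": "true",             # turn highlighting on
        "hl.fl": field,           # field to take snippets from
        "hl.snippets": snippets,  # maximum snippets per document
        "hl.fragsize": fragsize,  # approximate snippet length in characters
    }
    return f"{base_url}/select?{urlencode(params)}"


def snippets_by_doc(response, field):
    """Map each document id to its list of highlighted snippets for `field`."""
    highlighting = response.get("highlighting", {})
    return {doc_id: fields.get(field, []) for doc_id, fields in highlighting.items()}


highlight_url = build_highlight_url("http://localhost:8983/solr", "nutch", "content")

# Hand-written example response (not real Solr output).
sample_response = {
    "highlighting": {
        "http://example.com/a": {"content": ["Crawling with <em>nutch</em> is easy"]},
        "http://example.com/b": {},
    }
}
snippets = snippets_by_doc(sample_response, "content")
print(highlight_url)
print(snippets)
```

With the standard highlighter the highlighted field generally has to be stored as well, so the stored="true" change still applies. In a Velocity template the same data would be read from the highlighting section of the response rather than from the document fields.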

About the author

橙幽之幻
