将多个可搜索文件添加到一个 Solr-index-document

发布于 2024-12-11 17:25:32 字数 837 浏览 3 评论 0原文

有没有一种方法或最佳实践可以使用提取处理程序将多个文件（例如 2 个 pdf 和 1 个 doc）添加到一个 solr-index-doc 中？查询时的结果应该看起来像这样：

<result name="response">
 <str name="id">123</str>

  <doc>
   <arr name="attr_content">
    content of pdf-1
   </arr>
  </doc>

  <doc>
   <arr name="attr_content">
    content of pdf-2
   </arr>
  </doc>

  <doc>
   <arr name="attr_content">
    content of doc-1
   </arr>
  </doc>

</result>

在我的 java 应用程序中，我将文件添加到 Solr-Index 中，就像只添加一个文件一样：

ContentStreamUpdateRequest up = new ContentStreamUpdateRequest("/update/extract");
up.addFile(new File("c:\\document1.pdf"));
up.setParam("literal.id", solrId);
up.setAction(AbstractUpdateRequest.ACTION.COMMIT, true, true);
solr.request(up);

原文

Is there a way or best practice to add more than one file (e.g. 2 pdfs and 1 doc) into one solr-index-doc using the extract handler? The result when querying should look somehow like this:

<result name="response">
 <str name="id">123</str>

  <doc>
   <arr name="attr_content">
    content of pdf-1
   </arr>
  </doc>

  <doc>
   <arr name="attr_content">
    content of pdf-2
   </arr>
  </doc>

  <doc>
   <arr name="attr_content">
    content of doc-1
   </arr>
  </doc>

</result>

In my java application I am adding files to the Solr-Index like that which adds only one file:

ContentStreamUpdateRequest up = new ContentStreamUpdateRequest("/update/extract");
up.addFile(new File("c:\\document1.pdf"));
up.setParam("literal.id", solrId);
up.setAction(AbstractUpdateRequest.ACTION.COMMIT, true, true);
solr.request(up);

分享到QQ

分享到微博