Elasticsearch索引JSON带有逃脱的引号标记 - “总字段的限制[1000]已超过”

发布于 2025-01-22 11:29:36 字数 999 浏览 6 评论 0 原文

在升级了VOM Elasticsearch 5.6.10到7.15.1之后，JSON字符串带有逃逸的引号。当然，这导致了胡说八道。当我意识到这是我得到以下例外的那一刻：

primary java.lang.illegalgumentException拒绝了映射更新：总字段的限制[1000]已超过

索引代码类似于：

for (...){
  def idx_record = buildEsRecord(r)     // getting a valid map without escape characters
  if (idx_record != null) {
    IndexRequest singleRequest = new IndexRequest(myIndex)
    singleRequest.id(idx_record['_id'].toString())
    idx_record.remove('_id')
    singleRequest.source(idx_record as JSON, XContentType.JSON)
    bulkRequest.add(singleRequest)
  }
}
BulkResponse bulkResponse = esClient.bulk(bulkRequest, RequestOptions.DEFAULT)

debugging IDX_RECORD作为JSON 显示一个完全很好的JSON字符串，而没有引号，例如：

{
    "uuid": "63fa7627-7d03-465b-93a3-a498feeb6689",
    "contentType": null,
    "description": null,
    "descriptionURL": null,
    ...
}

Elasticsearch 7的配置中是否有一些我错过的东西？我们可以在Elasticsearch客户端上设置任何参数吗？还有其他想法吗？

原文

After having upgraded vom Elasticsearch 5.6.10 to 7.15.1, Json strings are indexed with escaped quotation marks. This leads to nonsense data of course. The moment I realised it was when I got the following exception:

mapping update rejected by primary java.lang.IllegalArgumentException: Limit of total fields [1000] has been exceeded

The indexing code is like:

for (...){
  def idx_record = buildEsRecord(r)     // getting a valid map without escape characters
  if (idx_record != null) {
    IndexRequest singleRequest = new IndexRequest(myIndex)
    singleRequest.id(idx_record['_id'].toString())
    idx_record.remove('_id')
    singleRequest.source(idx_record as JSON, XContentType.JSON)
    bulkRequest.add(singleRequest)
  }
}
BulkResponse bulkResponse = esClient.bulk(bulkRequest, RequestOptions.DEFAULT)

Debugging idx_record as JSON shows a totally fine Json string without quotation marks being escaped, like:

{
    "uuid": "63fa7627-7d03-465b-93a3-a498feeb6689",
    "contentType": null,
    "description": null,
    "descriptionURL": null,
    ...
}

Is there something in the configuration of Elasticsearch 7 that I have missed? Can we set any parameters on the Elasticsearch client? Any other ideas?

分享到QQ

分享到微博