将mySQL聚合查询转换为alasticsearch
我有一个评论
表,在过去的一年中,我将其移至Elasticsearch。
问题在于,我需要调整我当前在MySQL中的查询,该查询可以返回过去7天内每天的评论总数。
这是我现在拥有的MySQL查询:
SELECT count(*) AS number, DATE(created_at) AS date
FROM `comments`
WHERE `post_id` = ?
GROUP BY `date`
ORDER BY `date` DESC
LIMIT 7
我的索引看起来像:
{
"mappings": {
"_doc": {
"properties": {
"id": {
"type": "keyword"
},
"post_id": {
"type": "integer"
},
"subject": {
"analyzer": "custom_html_strip",
"type": "text"
},
"body": {
"analyzer": "custom_html_strip",
"type": "text"
},
"created_at": {
"format": "yyyy-MM-dd HH:mm:ss",
"type": "date"
}
}
}
}
}
是否可以重现Elasticsearch的查询?如果是这样,那会怎样?
我的Elasticsearch知识有限,我知道它提供了聚合
,但我真的不知道如何将所有内容放在一起。
I have a comments
table that over the past year has grown considerably and I'm moving it to ElasticSearch.
The problem is that I need to adapt a query that I currently have in MySQL which returns the total number of comments for each day in the last 7 days for a given post.
Here's the MySQL query that I have now:
SELECT count(*) AS number, DATE(created_at) AS date
FROM `comments`
WHERE `post_id` = ?
GROUP BY `date`
ORDER BY `date` DESC
LIMIT 7
My index looks like this:
{
"mappings": {
"_doc": {
"properties": {
"id": {
"type": "keyword"
},
"post_id": {
"type": "integer"
},
"subject": {
"analyzer": "custom_html_strip",
"type": "text"
},
"body": {
"analyzer": "custom_html_strip",
"type": "text"
},
"created_at": {
"format": "yyyy-MM-dd HH:mm:ss",
"type": "date"
}
}
}
}
}
Is it possible to reproduce that query for ElasticSearch? If so, how would that look like?
My ElasticSearch knowledge is kinda limited, I know that it offers aggregation
, but I don't really know how to put it all together.
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
data:image/s3,"s3://crabby-images/d5906/d59060df4059a6cc364216c4d63ceec29ef7fe66" alt="扫码二维码加入Web技术交流群"
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(3)
Elasticsearch也支持SQL查询(尽管有限)
因此,有了一点更改,您可以使用此类内容
,可以使用
_SQL/Translate
看到相应的查询,这将返回说,不需要翻译查询中使用的某些内容,因此这将是一个更好的本地查询
Elasticsearch supports sql queries as well (though they are limited)
so with a little change you can use something like this
You can see the corresponding query using
_sql/translate
, which will returnThat being said, some of the stuff used in the translated query is not needed, so this will be a better native query
首先,建议定期汇总您的时间字段,因此使用直接写入“ Yyyy-MM-DD”格式的字段将提高您的性能并减少资源的消费
First, it is recommended that your time fields be regularly aggregated, so using fields written directly to a 'YYYY-MM-DD' format will improve your performance and reduce your consumption of resources
在过去的7天中,使用以下查询在给定的“ post_id”上获取所有评论。
从聚合中,在客户端应用程序中获取前7个存储桶。
Use the following query to get all the comments on a given "post_id" for the last 7 days.
From the aggregation, pick up the first 7 buckets in your client application.