如何使用mongoDB在每个嵌套阵列上平均间隔
我们正在使用数据尺寸的尺寸存储量图案,我们想知道如何在此嵌套数组上汇总每1、5或10分钟的平均值。结构如下:
{
"_id" : ObjectId("62ad1548a7d67adf4943d2f7"),
"data" : ObjectId("6262a3048cbdacd24c38fbc4"),
"day" : ISODate("2022-06-17T00:00:00.000Z"),
"first" : ISODate("2022-06-17T23:59:04.659Z"),
"last" : ISODate("2022-06-17T23:59:59.915Z"),
"nSamples" : 97,
"samples" : [
{
"time" : ISODate("2022-06-17T23:59:04.659Z"),
"value" : 8
},
{
"time" : ISODate("2022-06-17T23:59:50.706Z"),
"value" : 6
},
///// MAX 200 Documents in nested array
{
"time" : ISODate("2022-06-17T23:59:59.342Z"),
"value" : 4
},
{
"time" : ISODate("2022-06-17T23:59:59.915Z"),
"value" : 12
}
],
"version" : 0
}
{
"_id" : ObjectId("62ad1548a7d67adf4943d2f7"),
"data" : ObjectId("62629d178cbdac1e9938fb9e"),
"day" : ISODate("2022-06-17T00:00:00.000Z"),
"first" : ISODate("2022-06-17T23:32:04.659Z"),
"last" : ISODate("2022-06-17T23:40:59.915Z"),
"nSamples" : 97,
"samples" : [
{
"time" : ISODate("2022-06-17T23:32:04.659Z"),
"value" : 2
},
{
"time" : ISODate("2022-06-17T23:33:50.706Z"),
"value" : 65
},
///// MAX 200 Documents in nested array
{
"time" : ISODate("2022-06-17T23:36:59.342Z"),
"value" : 45
},
{
"time" : ISODate("2022-06-17T23:40:59.915Z"),
"value" : 22
}
],
"version" : 0
}
有可能获得这种结构吗?
{
"_id" : ObjectId("62ad1548a7d67adf4943d2f7"),
"data" : ObjectId("6262a3048cbdacd24c38fbc4"),
"day" : ISODate("2022-06-17T00:00:00.000Z"),
"first" : ISODate("2022-06-17T23:59:04.659Z"),
"last" : ISODate("2022-06-17T23:59:59.915Z"),
"nSamples" : 97,
"samples" : [
// Less documents
{
"time" : ISODate("2022-06-17T23:56:00.000Z"),
"value" : 7
},
{
"time" : ISODate("2022-06-17T23:58:00.000Z"),
"value" : 6
},
{
"time" : ISODate("2022-06-17T23:59:00.000Z"),
"value" : 4
},
],
"version" : 0
}
{
"_id" : ObjectId("62ad1548a7d67adf4943d2f7"),
"data" : ObjectId("62629d178cbdac1e9938fb9e"),
"day" : ISODate("2022-06-17T00:00:00.000Z"),
"first" : ISODate("2022-06-17T23:59:04.659Z"),
"last" : ISODate("2022-06-17T23:59:59.915Z"),
"nSamples" : 97,
"samples" : [
// Less docs
{
"time" : ISODate("2022-06-17T23:32:00.000Z"),
"value" : 2
},
{
"time" : ISODate("2022-06-17T23:33:00.000Z"),
"value" : 65
},
{
"time" : ISODate("2022-06-17T23:36:00.000Z"),
"value" : 45
},
{
"time" : ISODate("2022-06-17T23:37:00.000Z"),
"value" : 22
}
],
"version" : 0
}
到目前为止,我首先是从$ match
选择值开始的,然后添加一个“ fromDate”值以进行搜索:
$match: {
$and: [
{
data: {"$in": [ObjectId("62629d178cbdac1e9938fb9e"), ObjectId("62629d178cbdac1e9938fb9f"), ObjectId("62629d178cbdac1e9938fb9d")]} ,
},
{
first: {
$gte: new Date('2022-07-08T08:10:22.525+00:00')
}
}
],
}
然后使用$ undind
在样本上汇总
$unwind: {
path: "$samples",
preserveNullAndEmptyArrays: true
}
,然后分组有一个间隔的值:
$group: {
_id: {
"$toDate": {
"$subtract": [
{ "$toLong": "$samples.time" },
{ "$mod": [ { "$toLong": "$samples.time" }, 1000 * 60 * 1 ] } // 1 for 1 minute could be replaced by 5, 10 etc...
]
}
},
values: {
$avg: "$samples.value"
}
}
这里的问题是,考虑到我有多个“数据” ...我如何也可以按数据进行分组?是否可以保持相同的结构并拥有“较少的根文档”?
We are using the size bucket pattern for our datas and we are wondering how could we aggregate the nested array for getting a average every 1, 5 or 10 minutes on this nested array. The structure is as follow:
{
"_id" : ObjectId("62ad1548a7d67adf4943d2f7"),
"data" : ObjectId("6262a3048cbdacd24c38fbc4"),
"day" : ISODate("2022-06-17T00:00:00.000Z"),
"first" : ISODate("2022-06-17T23:59:04.659Z"),
"last" : ISODate("2022-06-17T23:59:59.915Z"),
"nSamples" : 97,
"samples" : [
{
"time" : ISODate("2022-06-17T23:59:04.659Z"),
"value" : 8
},
{
"time" : ISODate("2022-06-17T23:59:50.706Z"),
"value" : 6
},
///// MAX 200 Documents in nested array
{
"time" : ISODate("2022-06-17T23:59:59.342Z"),
"value" : 4
},
{
"time" : ISODate("2022-06-17T23:59:59.915Z"),
"value" : 12
}
],
"version" : 0
}
{
"_id" : ObjectId("62ad1548a7d67adf4943d2f7"),
"data" : ObjectId("62629d178cbdac1e9938fb9e"),
"day" : ISODate("2022-06-17T00:00:00.000Z"),
"first" : ISODate("2022-06-17T23:32:04.659Z"),
"last" : ISODate("2022-06-17T23:40:59.915Z"),
"nSamples" : 97,
"samples" : [
{
"time" : ISODate("2022-06-17T23:32:04.659Z"),
"value" : 2
},
{
"time" : ISODate("2022-06-17T23:33:50.706Z"),
"value" : 65
},
///// MAX 200 Documents in nested array
{
"time" : ISODate("2022-06-17T23:36:59.342Z"),
"value" : 45
},
{
"time" : ISODate("2022-06-17T23:40:59.915Z"),
"value" : 22
}
],
"version" : 0
}
Is it possible to get this structure ?
{
"_id" : ObjectId("62ad1548a7d67adf4943d2f7"),
"data" : ObjectId("6262a3048cbdacd24c38fbc4"),
"day" : ISODate("2022-06-17T00:00:00.000Z"),
"first" : ISODate("2022-06-17T23:59:04.659Z"),
"last" : ISODate("2022-06-17T23:59:59.915Z"),
"nSamples" : 97,
"samples" : [
// Less documents
{
"time" : ISODate("2022-06-17T23:56:00.000Z"),
"value" : 7
},
{
"time" : ISODate("2022-06-17T23:58:00.000Z"),
"value" : 6
},
{
"time" : ISODate("2022-06-17T23:59:00.000Z"),
"value" : 4
},
],
"version" : 0
}
{
"_id" : ObjectId("62ad1548a7d67adf4943d2f7"),
"data" : ObjectId("62629d178cbdac1e9938fb9e"),
"day" : ISODate("2022-06-17T00:00:00.000Z"),
"first" : ISODate("2022-06-17T23:59:04.659Z"),
"last" : ISODate("2022-06-17T23:59:59.915Z"),
"nSamples" : 97,
"samples" : [
// Less docs
{
"time" : ISODate("2022-06-17T23:32:00.000Z"),
"value" : 2
},
{
"time" : ISODate("2022-06-17T23:33:00.000Z"),
"value" : 65
},
{
"time" : ISODate("2022-06-17T23:36:00.000Z"),
"value" : 45
},
{
"time" : ISODate("2022-06-17T23:37:00.000Z"),
"value" : 22
}
],
"version" : 0
}
So far, I first start by a $match
for selecting values and add a "fromDate" value for searching:
$match: {
$and: [
{
data: {"$in": [ObjectId("62629d178cbdac1e9938fb9e"), ObjectId("62629d178cbdac1e9938fb9f"), ObjectId("62629d178cbdac1e9938fb9d")]} ,
},
{
first: {
$gte: new Date('2022-07-08T08:10:22.525+00:00')
}
}
],
}
Then by using the $unwind
aggregate on samples
$unwind: {
path: "$samples",
preserveNullAndEmptyArrays: true
}
And then grouping the values with an interval:
$group: {
_id: {
"$toDate": {
"$subtract": [
{ "$toLong": "$samples.time" },
{ "$mod": [ { "$toLong": "$samples.time" }, 1000 * 60 * 1 ] } // 1 for 1 minute could be replaced by 5, 10 etc...
]
}
},
values: {
$avg: "$samples.value"
}
}
The problem here is that it does not take in consideration that I have multiple "data"... How can I group them by data as well ? Is it possible to keep the same structure and having "less root documents" ?
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
data:image/s3,"s3://crabby-images/d5906/d59060df4059a6cc364216c4d63ceec29ef7fe66" alt="扫码二维码加入Web技术交流群"
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(1)
继续查询的一个选项是:
查看如何在 Playground示例
One option to continue your query is:
See how it works on the playground example