Subsampling data stored in SQL for plotting
Suppose you have a program that logs (timestamp, stock_price) to an SQL database every 30 seconds, and you want to generate plots of the stock price over various timescales. If you plot measurements over a 1-hour range, it's OK to use all 120 samples taken during that time. However, if you want to plot price over a 1-year range, you obviously don't want to pull over 1 million samples out of the database. It would be better to pull some representative subset of the samples out of the database.
This reminds me of the Level of Detail technique in computer graphics -- as you move farther from a 3d model, a lower-fidelity version of the model can be used.
Are there common techniques for representing Level of Detail information in a database, or for quickly querying an evenly spaced subset of data (e.g. give me 100 evenly spaced samples from January 2009)?
The solution I've come up with so far is to include a level_of_detail column in the database table. If level_of_detail=0, the row holds a single instantaneous sample. If level_of_detail=n, the row contains an average of the last (sample_interval*(2^n)) seconds of data, and there are 1/(2^n) as many rows at this level. The table has an index on (level_of_detail, timestamp), and when you want to generate a plot, you calculate an appropriate level_of_detail value based on the number of samples you want and query with that constraint. Disadvantages are:
- For N samples, the table needs to store 2*N rows
- The client must know to specify an appropriate level_of_detail constraint
- Some process needs to be responsible for building the averaged rows as samples are added to the table
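For concreteness, the scheme would look roughly like this (a sketch only; the samples table and its ts/price column names are placeholders I'm using here, and the bucketing expressions assume PostgreSQL date functions -- other databases would need a different bucketing expression):

-- Level 0 holds raw 30-second samples; level n holds averages over 30 * 2^n seconds.
CREATE TABLE samples (
    level_of_detail integer   NOT NULL,  -- 0 = raw sample, n = average over 30 * 2^n seconds
    ts              timestamp NOT NULL,  -- bucket timestamp (here, the start of the averaged interval)
    price           numeric   NOT NULL,  -- instantaneous price, or the interval average
    PRIMARY KEY (level_of_detail, ts)
);

-- Build level 1 by averaging level 0 rows into 60-second buckets; the same
-- pattern builds level n from level n-1. Some background process would run
-- this as new samples arrive.
INSERT INTO samples (level_of_detail, ts, price)
SELECT 1,
       to_timestamp(floor(extract(epoch FROM ts) / 60) * 60),
       avg(price)
FROM samples
WHERE level_of_detail = 0
GROUP BY to_timestamp(floor(extract(epoch FROM ts) / 60) * 60)
ON CONFLICT (level_of_detail, ts) DO NOTHING;

-- To plot about 100 points over one year, pick the largest n with
-- 30 * 2^n <= range_seconds / 100; that is n = 13 here (~128 rows).
SELECT ts, price
FROM samples
WHERE level_of_detail = 13
  AND ts >= '2009-01-01' AND ts < '2010-01-01'
ORDER BY ts;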
Comments (1)
For SQL Server, you could use ntile. This orders the dataset and then splits it into N different groups, returning 1 for the first group and N for the last group. This would return exactly 100 rows. Here's a copy of the test data if you're interested in trying the query:
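The test data and the full query aren't reproduced above, but a query along these lines illustrates the idea (a sketch only, using an assumed samples(ts, price) table; NTILE requires SQL Server 2005 or later):

-- Assign each row in the range to one of 100 buckets by timestamp order,
-- then average each bucket; this returns exactly 100 rows as long as the
-- range contains at least 100 samples.
WITH bucketed AS (
    SELECT ts,
           price,
           NTILE(100) OVER (ORDER BY ts) AS bucket
    FROM samples
    WHERE ts >= '2009-01-01' AND ts < '2009-02-01'
)
SELECT MIN(ts)    AS bucket_start,
       AVG(price) AS avg_price
FROM bucketed
GROUP BY bucket
ORDER BY bucket_start;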