SQL 确定最小连续访问天数？

发布于 2024-07-28 23:20:06 字数 649 浏览 13 评论 0原文

以下用户历史记录表包含给定用户访问网站的每一天的一条记录（在 24 小时 UTC 时间段内）。它有数千条记录，但每个用户每天只有一条记录。如果用户当天没有访问该网站，则不会生成任何记录。

Id      UserId   CreationDate
------  ------   ------------
750997      12   2009-07-07 18:42:20.723
750998      15   2009-07-07 18:42:20.927
751000      19   2009-07-07 18:42:22.283

我正在寻找的是对该表的 SQL 查询具有良好的性能，它告诉我哪些用户 ID 连续 (n) 天访问该网站而没有错过一天。

换句话说，有多少用户在此表中拥有 (n) 条具有连续日期（前天或后天）日期的记录？如果序列中缺少任何一天，序列就会被破坏，并应从 1 重新开始；我们正在寻找在此处连续停留天数且没有间断的用户。

当然，此查询与特定 Stack Overflow 徽章之间的任何相似之处纯属巧合。.:)

原文

The following User History table contains one record for every day a given user has accessed a website (in a 24 hour UTC period). It has many thousands of records, but only one record per day per user. If the user has not accessed the website for that day, no record will be generated.

Id      UserId   CreationDate
------  ------   ------------
750997      12   2009-07-07 18:42:20.723
750998      15   2009-07-07 18:42:20.927
751000      19   2009-07-07 18:42:22.283

What I'm looking for is a SQL query on this table with good performance, that tells me which userids have accessed the website for (n) continuous days without missing a day.

In other words, how many users have (n) records in this table with sequential (day-before, or day-after) dates? If any day is missing from the sequence, the sequence is broken and should restart again at 1; we're looking for users who have achieved a continuous number of days here with no gaps.

Any resemblance between this query and a particular Stack Overflow badge is purely coincidental, of course.. :)

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

凤舞天涯 2024-08-04 23:20:06

怎么样（请确保前面的语句以分号结尾）：

WITH numberedrows
     AS (SELECT ROW_NUMBER() OVER (PARTITION BY UserID 
                                       ORDER BY CreationDate)
                - DATEDIFF(day,'19000101',CreationDate) AS TheOffset,
                CreationDate,
                UserID
         FROM   tablename)
SELECT MIN(CreationDate),
       MAX(CreationDate),
       COUNT(*) AS NumConsecutiveDays,
       UserID
FROM   numberedrows
GROUP  BY UserID,
          TheOffset

这个想法是，如果我们有天数列表（作为数字）和 row_number，那么错过的天数会使这两个列表之间的偏移量稍微增加大。所以我们正在寻找一个具有一致偏移的范围。

你可以在最后使用“ORDER BY NumConsecutiveDays DESC”，或者说“HAVING count(*) > 14”作为阈值......

我还没有测试过这个——只是把它写在我的脑海里。希望能在 SQL2005 及更高版本中工作。

...并且对表名（UserID，CreationDate）上的索引会有很大帮助

编辑：原来Offset是一个保留字，所以我使用了TheOffset。

编辑：使用 COUNT(*) 的建议非常有效 - 我应该首先这样做，但并没有真正考虑。以前它使用 datediff(day, min(CreationDate), max(CreationDate)) 代替。

抢

How about (and please make sure the previous statement ended with a semi-colon):

WITH numberedrows
     AS (SELECT ROW_NUMBER() OVER (PARTITION BY UserID 
                                       ORDER BY CreationDate)
                - DATEDIFF(day,'19000101',CreationDate) AS TheOffset,
                CreationDate,
                UserID
         FROM   tablename)
SELECT MIN(CreationDate),
       MAX(CreationDate),
       COUNT(*) AS NumConsecutiveDays,
       UserID
FROM   numberedrows
GROUP  BY UserID,
          TheOffset

The idea being that if we have list of the days (as a number), and a row_number, then missed days make the offset between these two lists slightly bigger. So we're looking for a range that has a consistent offset.

You could use "ORDER BY NumConsecutiveDays DESC" at the end of this, or say "HAVING count(*) > 14" for a threshold...

I haven't tested this though - just writing it off the top of my head. Hopefully works in SQL2005 and on.

...and would be very much helped by an index on tablename(UserID, CreationDate)

Edited: Turns out Offset is a reserved word, so I used TheOffset instead.

Edited: The suggestion to use COUNT(*) is very valid - I should've done that in the first place but wasn't really thinking. Previously it was using datediff(day, min(CreationDate), max(CreationDate)) instead.

Rob

SQL 确定最小连续访问天数？

如果你对这篇内容有疑问，欢迎到本站社区发帖提问 参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（19）

关于作者

相关话题

热门标签

推荐作者

雪花的坚持

温柔一刀

扛起拖把扫天下

北方的韩爷

绝對不後悔。

青衫负雪

友情链接

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。