PostgreSQL 窗口函数：按列分组而不排序（~ Python itertools.groupby）

发布于 2024-10-18 13:28:32 字数 713 浏览 1 评论 0原文

我需要根据列对 PostgreSQL 中的表进行分区，而不进行排序 &使结果独一无二；基本上我想要实现的是在 PostgreSQL 中重现 Python 的 itertools.groupby() 行为。

给定包含两列的表：

我想按第二列中的值对其进行分区（同时保留现有顺序），最终得到以下结果：

我尝试使用窗口函数来实现这一点，使用 ROW_NUMBER( ) 和 LAG() 将当前行与前一行进行比较，看看它是否已更改。在这种情况下的问题是我还需要一个每次值变化时都会递增的变量。

原文

I need to partition a table in PostgreSQL based on a column without sorting & making the result unique; Basically what I am trying to achieve is to reproduce the itertools.groupby() behavior from Python in PostgreSQL.

Given the table containing two columns:

I want to partition it by the value in the second (whilst preserving the existing order), to end up with this:

I tried to achieve that with window functions, using a combination of ROW_NUMBER() and LAG() to compare the current row with the previous to see if it has changed. The problem in that case was that I would also need a variable that increments every time the value changes.

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

雨轻弹 2024-10-25 13:28:32

试试这个：

WITH T1 AS
(
    SELECT
        id,
        grp,
        LAG(grp) OVER (ORDER BY id) IS DISTINCT FROM grp AS changes
    FROM yourtable
)
SELECT id, grp, SUM(changes::int) OVER (ORDER BY id) FROM T1

Try this:

WITH T1 AS
(
    SELECT
        id,
        grp,
        LAG(grp) OVER (ORDER BY id) IS DISTINCT FROM grp AS changes
    FROM yourtable
)
SELECT id, grp, SUM(changes::int) OVER (ORDER BY id) FROM T1

回复收藏 0 原文