在Teradata SQL中使用Union删除重复行

发布于 2025-01-22 02:22:38 字数 902 浏览 3 评论 0原文

我正在使用Teradata SQL使用Union提取数据。

SEL CAST(a.dttm AS DATE), count(a.cs) FROM  cin.cell a
LEFT JOIN cin.comm c ON a.cs_sk = c.cs_sk
LEFT JOIN CIN.CID d ON a.cn_cd = d.CN_CD
WHERE CAST(a.dttm AS DATE) >= CURRENT_DATE-10
GROUP BY 1
UNION 
SEL CAST(a.dttm AS DATE), count(a.cs) FROM  cin_ps.cell a
LEFT JOIN cin_ps.comm c ON a.cs_sk = c.cs_sk
LEFT JOIN CIN_ps.CID d ON a.cn_cd = d.CN_CD
WHERE CAST(a.dttm AS DATE) >= CURRENT_DATE-10
GROUP BY 1

但是我在第一列中得到了重复的行,如下所示,对于任何第一组表或第二组表当前结果,任何特定日期都没有行

N.  PROCESSED_DTTM  Count(cs)
1   4/8/2022    40
2   4/8/2022    66
3   4/9/2022    49
4   4/9/2022    71
5   4/10/2022   117
6   4/10/2022   1430
7   4/11/2022   261
8   4/11/2022   841

必需的结果:

N.  PROCESSED_DTTM  Count(cs)
1   4/8/2022    106
2   4/9/2022    120
5   4/10/2022   1547
7   4/11/2022   1102

I'm using Teradata sql to extract data using UNION.

SEL CAST(a.dttm AS DATE), count(a.cs) FROM  cin.cell a
LEFT JOIN cin.comm c ON a.cs_sk = c.cs_sk
LEFT JOIN CIN.CID d ON a.cn_cd = d.CN_CD
WHERE CAST(a.dttm AS DATE) >= CURRENT_DATE-10
GROUP BY 1
UNION 
SEL CAST(a.dttm AS DATE), count(a.cs) FROM  cin_ps.cell a
LEFT JOIN cin_ps.comm c ON a.cs_sk = c.cs_sk
LEFT JOIN CIN_ps.CID d ON a.cn_cd = d.CN_CD
WHERE CAST(a.dttm AS DATE) >= CURRENT_DATE-10
GROUP BY 1

but I'm getting duplicate rows in first column as below Please note, there might be the case there is no row for any particular day for any first set of table or second set of table

Current result:

N.  PROCESSED_DTTM  Count(cs)
1   4/8/2022    40
2   4/8/2022    66
3   4/9/2022    49
4   4/9/2022    71
5   4/10/2022   117
6   4/10/2022   1430
7   4/11/2022   261
8   4/11/2022   841

Required results:

N.  PROCESSED_DTTM  Count(cs)
1   4/8/2022    106
2   4/9/2022    120
5   4/10/2022   1547
7   4/11/2022   1102

如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。

扫码二维码加入Web技术交流群

发布评论

需要 登录 才能够评论, 你可以免费 注册 一个本站的账号。

评论(1

还不是爱你 2025-01-29 02:22:38

您没有重复,您在这两套方面都有非唯一的结果。如果要将联合封装在子查询中并选择“不同”,则将收到相同的数据集。您要做的是使用计数列上的总和汇总数据:

SELECT  PROCESSED_DTTM,  
        SUM([Count(cs)]) [Count(cs)]
FROM
(
    SEL       CAST(a.dttm AS DATE) PROCESSED_DTTM, 
              count(a.cs) [Count(cs)] 
    FROM      cin.cell a
    LEFT JOIN cin.comm c ON a.cs_sk = c.cs_sk
    LEFT JOIN CIN.CID d ON a.cn_cd = d.CN_CD
    WHERE     CAST(a.dttm AS DATE) >= CURRENT_DATE-10
    GROUP BY  1
    UNION ALL
    SEL       CAST(a.dttm AS DATE), count(a.cs) 
    FROM      cin_ps.cell a
    LEFT JOIN cin_ps.comm c ON a.cs_sk = c.cs_sk
    LEFT JOIN CIN_ps.CID d ON a.cn_cd = d.CN_CD
    WHERE     CAST(a.dttm AS DATE) >= CURRENT_DATE-10
    GROUP BY  1
) AS TBL1
GROUP BY PROCESSED_DTTM

You're not getting duplicates, you are have non-unique results in both sets. If you were to encapsulate the union in a subquery and select distinct you will receive the same set of data. What you want to do is aggregate the data using SUM on the Count column:

SELECT  PROCESSED_DTTM,  
        SUM([Count(cs)]) [Count(cs)]
FROM
(
    SEL       CAST(a.dttm AS DATE) PROCESSED_DTTM, 
              count(a.cs) [Count(cs)] 
    FROM      cin.cell a
    LEFT JOIN cin.comm c ON a.cs_sk = c.cs_sk
    LEFT JOIN CIN.CID d ON a.cn_cd = d.CN_CD
    WHERE     CAST(a.dttm AS DATE) >= CURRENT_DATE-10
    GROUP BY  1
    UNION ALL
    SEL       CAST(a.dttm AS DATE), count(a.cs) 
    FROM      cin_ps.cell a
    LEFT JOIN cin_ps.comm c ON a.cs_sk = c.cs_sk
    LEFT JOIN CIN_ps.CID d ON a.cn_cd = d.CN_CD
    WHERE     CAST(a.dttm AS DATE) >= CURRENT_DATE-10
    GROUP BY  1
) AS TBL1
GROUP BY PROCESSED_DTTM
~没有更多了~
我们使用 Cookies 和其他技术来定制您的体验包括您的登录状态等。通过阅读我们的 隐私政策 了解更多相关信息。 单击 接受 或继续使用网站,即表示您同意使用 Cookies 和您的相关数据。
原文