在 SQL 或 CODE 中更好地对聚合数据进行分组（就性能而言）

发布于 2024-08-10 18:37:56 字数 1588 浏览 9 评论 0原文

我想征求关于通过连接字符串来生成聚合数据的意见。如果我有一个列聚合，但我想在聚合列中连接，哪个在性能方面更快？执行一条 SQL，然后在代码中聚合。或者选择主要数据然后一次查询一个。

例如：

TABLE_A        
COL_A_1         COL_A_2
A               a
B               b
C               c

TABLE_B
COL_B_1         COL_B_2
a               Text1
a               Text2
a               Text3
b               Text4
b               Text5

表在 COL_A_2 = COL_B_1 处连接，聚合列为 COL_B_2。

选项1（执行一个SQL）

SELECT TABLE_1.COL_A_1, TABLE_1.COL_A_2, TABLE_2.COL_B_2 
FROM TABLE_A 
LEFT OUTER JOIN TABLE_B ON TABLE_A.COL_A_2 = TABLE_2.COL_B_1
ORDER BY TABLE_1.COL_A_1

然后在代码中，仅循环结果集并聚合COL_B_1。（例如使用Java）

String oldColA1 = "";
InfoEntity currInfo = null;
for (InfoEntity info : infoList) {
  if (!oldColA1.equals(info.colA1)) {
    currInfo = info;
  }

  if (currInfo.colB2 == null || currInfo.colB2.equals("")) {
    currInfo.colB2 = info.colB2;
  } else {
    currInfo.colB2 += info.colB2;
  }
  oldColA1 = info.colA1;
}

选项2（执行多个SQL）

SELECT TABLE_1.COL_A_1, TABLE_1.COL_A_2
FROM TABLE_A 
ORDER BY TABLE_1.COL_A_1

然后在代码中选择每个（例如Java）

for (InfoEntity info : infoList) {
  // Select TableB entity based on info.colA2
  ...
  tableBList = dao.selectTableB(info.colA2);
  ...
  for (TableBEntity b : tableBList) {
    info.colB2 += b.colB2;
  }
}

我通常执行选项1，因为我认为进行多个SQL选择可能成本很高。但我对此并不确定。但是还有其他方法可以进行这种聚合吗？我已经在网上搜索过，在标准 SQL 中没有字符串连接聚合。另外，TableA 上的说明过于简化，通常 TableA 是多个表连接在一起的复杂查询，TableB 也是如此。

原文

I would like to ask for opinion on making aggregate data by concatenating strings. If I have a column aggregate but I want to concatenate then in an aggregate column, which is faster in terms of performance? Doing one SQL then just aggregating then in the CODE. Or selecting the main data then querying one at a time.

For Example:

TABLE_A        
COL_A_1         COL_A_2
A               a
B               b
C               c

TABLE_B
COL_B_1         COL_B_2
a               Text1
a               Text2
a               Text3
b               Text4
b               Text5

The table is joined at COL_A_2 = COL_B_1, the aggregate column is COL_B_2.

Option 1 (Doing one SQL)

SELECT TABLE_1.COL_A_1, TABLE_1.COL_A_2, TABLE_2.COL_B_2 
FROM TABLE_A 
LEFT OUTER JOIN TABLE_B ON TABLE_A.COL_A_2 = TABLE_2.COL_B_1
ORDER BY TABLE_1.COL_A_1

Then in the code, just loop for the result set and aggregate the COL_B_1. (ex using Java)

String oldColA1 = "";
InfoEntity currInfo = null;
for (InfoEntity info : infoList) {
  if (!oldColA1.equals(info.colA1)) {
    currInfo = info;
  }

  if (currInfo.colB2 == null || currInfo.colB2.equals("")) {
    currInfo.colB2 = info.colB2;
  } else {
    currInfo.colB2 += info.colB2;
  }
  oldColA1 = info.colA1;
}

Option 2 (Doing several SQL)

SELECT TABLE_1.COL_A_1, TABLE_1.COL_A_2
FROM TABLE_A 
ORDER BY TABLE_1.COL_A_1

Then select for each in code (Ex Java)

for (InfoEntity info : infoList) {
  // Select TableB entity based on info.colA2
  ...
  tableBList = dao.selectTableB(info.colA2);
  ...
  for (TableBEntity b : tableBList) {
    info.colB2 += b.colB2;
  }
}

I usually do option 1, because I think making multiple SQL select maybe costly. But I am not sure about this. But is there another way on doing this kind of aggregation. I already search the net and in standard SQL there is no string concatenation aggregation. Also, the illustration on TableA is over simplified, usually TableA is a complex query of multiple table joined together and so is TableB.

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

扎心 2024-08-17 18:37:56

我通常选择选项 1，因为我认为进行多个 SQL 选择的成本很高。

你是对的——数据库的访问应该尽可能少。

不过，组串联是可能的 - MySQL 有 GROUP_CONCAT，自 4.1 起。对于 SQL Server，您可以使用：

SELECT @out = COALESCE(@out + ',' + t.column, t.column)
  FROM TABLE t

检查此链接了解一个应该在 Oracle 9i+ 中运行的示例。

I usually do option 1, because I think making multiple SQL select is costly.

You're correct - trips to the database should be as few as possible.

Group concatenation is possible though - MySQL has GROUP_CONCAT, since 4.1. With SQL Server, you can use:

SELECT @out = COALESCE(@out + ',' + t.column, t.column)
  FROM TABLE t

Check this link for an example that should work in Oracle 9i+.

回复收藏 0 原文

~没有更多了~

关于作者

情丝乱

暂无简介

文章

25 人气

关注发私信

友情链接

文江博客

在 SQL 或 CODE 中更好地对聚合数据进行分组（就性能而言）

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（1）

关于作者

相关话题

热门标签

推荐作者

知足的幸福

我一向站在原地

慕烟庭风

秉忠贞之诚守退让之实

小兔几

mb_3y7WUgWY

友情链接

在 SQL 或 CODE 中更好地对聚合数据进行分组（就性能而言）

如果你对这篇内容有疑问，欢迎到本站社区发帖提问 参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（1）

关于作者

相关话题

热门标签

推荐作者

知足的幸福

我一向站在原地

慕烟庭风

秉忠贞之诚 守退让之实

小兔几

mb_3y7WUgWY

友情链接

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

秉忠贞之诚守退让之实