
How can I get the average of each group in a table, in bash (awk?)

Posted on 2025-02-13 08:14:18

假面具 2025-02-20 08:14:18

It's fairly simple to do this in awk:

awk '{sum[$1]+=$2; count[$1]++} END {for(key in sum) print key ": " sum[key]/count[key]}' input_file

Output for your sample file:

grp1: 2
grp2: 6.5
grp4: 9

Explanation:

{sum[$1]+=$2; count[$1]++} : for every line of your input file, we update 2 associative arrays, both keyed by the 1st field
- count stores the number of times the 1st field has been encountered
- sum stores the running total of the 2nd field for that group
END {for(key in sum) print key ": " sum[key]/count[key]} : once the whole file has been read, we print every group together with sum/count for that group
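For readability, the same program can be written out over several lines. This is only a sketch: it feeds in the sample rows shown in the other answer on this page via printf rather than reading input_file, and the final sort is an addition that is not part of the answer above, used here so the output order is predictable.

```shell
# Sample rows piped in with printf; replace with your own input_file.
averages=$(printf '%s\n' 'grp1 1' 'grp1 3' 'grp2 5' 'grp2 8' 'grp4 9' | awk '
  {
    sum[$1] += $2   # running total of field 2 for this group
    count[$1]++     # number of rows seen for this group
  }
  END {
    # for-in visits keys in an unspecified order, hence the sort below
    for (key in sum)
      print key ": " sum[key] / count[key]
  }' | sort)

printf '%s\n' "$averages"
```

With the sample rows this prints grp1: 2, grp2: 6.5 and grp4: 9, one per line.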

小伙你站住 2025-02-20 08:14:18

Given:

cat file
grp1 1
grp1 3
grp2 5
grp2 8
grp4 9
    

awk '{d[$1]+=$2; cnt[$1]++} END{for (e in d) print e, d[e] / cnt[e]}' file

Prints:

grp1 2
grp2 6.5
grp4 9

Or, if you want them all printed in floating-point format:

awk '{d[$1]+=$2; cnt[$1]++} END{for (e in d) printf("%s %0.2f\n", e, d[e] / cnt[e])}' file

Prints:

grp1 2.00
grp2 6.50
grp4 9.00

Note that associative arrays in awk do not maintain order, so the groups may be printed in a different order from the one in which they appear in the file.
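If the original order matters, one possible workaround is to record each group the first time it appears and loop over that list in the END block. This is a sketch under assumptions, not part of the answer above: the order array and the n counter are names introduced here for illustration, and the sample rows are supplied through a shell variable instead of file.

```shell
# Sample data from the answer above, held in a variable for this sketch.
input='grp1 1
grp1 3
grp2 5
grp2 8
grp4 9'

ordered=$(printf '%s\n' "$input" | awk '
  # First time a group is seen: remember its position (order, n are illustrative names).
  !($1 in cnt) { order[++n] = $1 }
  { d[$1] += $2; cnt[$1]++ }
  END {
    # Walk the groups in first-seen order instead of using for-in.
    for (i = 1; i <= n; i++)
      printf "%s %0.2f\n", order[i], d[order[i]] / cnt[order[i]]
  }')

printf '%s\n' "$ordered"
```

For the sample data this prints grp1 2.00, grp2 6.50 and grp4 9.00, in the same order as the input. If alphabetical order is enough, piping the original command through sort is a simpler alternative.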
