循环循环与组的列

发布于 2025-02-13 23:08:19 字数 1039 浏览 3 评论 0原文

我有此数据集，

age salary gender
44  3000   M
32  4555   F
45  6000   M
50  4200   F
43  5000   F
23  1700   M

我想通过使用Numby进行性别循环，并获得年龄/薪金组的最大/最小值，我做到了：

import pandas as pd
import numby as np
data = pd.read_excel("file")
var = ["age","salary","gender"]
dat = data[var]
column_list = dat.columns.values.tolist()
resAll = pd.DataFrame()
for col in column_list:
            minn = np.min(dat[col])
            maxx = np.max(dat[col])
            res = [[str(col), ' '],
                   ['min', str(minn)],
                   ['max',str(maxx)]
                  ]
            resNum = pd.DataFrame(res)
            resNum.columns = ['0','1']
print(resNum)

输出

0       1
age
max     50
min     23
salary  
max     6000
min     1700
gender 
max f
min m

我希望能够按性别组成！有帮助吗？这样，知道列的数量不是固定的大小。

value       male   female      
age
max         45     50
min         23     32
salary  
max         6000   5000
min         1700   4200

原文

I have this data set

age salary gender
44  3000   M
32  4555   F
45  6000   M
50  4200   F
43  5000   F
23  1700   M

I want to loop through each column and get max/min value for age/salary group by gender using numby, i did this:

import pandas as pd
import numby as np
data = pd.read_excel("file")
var = ["age","salary","gender"]
dat = data[var]
column_list = dat.columns.values.tolist()
resAll = pd.DataFrame()
for col in column_list:
            minn = np.min(dat[col])
            maxx = np.max(dat[col])
            res = [[str(col), ' '],
                   ['min', str(minn)],
                   ['max',str(maxx)]
                  ]
            resNum = pd.DataFrame(res)
            resNum.columns = ['0','1']
print(resNum)

output

0       1
age
max     50
min     23
salary  
max     6000
min     1700
gender 
max f
min m

I want to be able to do that grouped by gender! any help ? like this, knowing that number of columns is not fixed size.

value       male   female      
age
max         45     50
min         23     32
salary  
max         6000   5000
min         1700   4200

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

北笙凉宸 2025-02-20 23:08:19

数据框有一个方法Dricest（），该可输出每列的描述性统计信息。可以在按列分组后使用它来分解每个组的统计数据。如果您只需要最小值和最大值，则需要删除其余的统计数据。

df = df.groupby('gender').describe()

如果您只想保持最小并最大放下剩余的时间，请使用掉落。

df = df.drop(columns=['count', '25%', '75%', '50%', 'mean', 'std'], level=1, axis=1)

dataframes have a method describe() which outputs the descriptive statistics for each column. It can be used after grouping by a column to break down the stats for each group. If you only need the min and max, you need to drop the rest of stats.

df = df.groupby('gender').describe()

if you want to keep just the min and max and drop the rest, use drop.

df = df.drop(columns=['count', '25%', '75%', '50%', 'mean', 'std'], level=1, axis=1)

回复收藏 0 原文

~没有更多了~

关于作者

夜清冷一曲。

暂无简介

文章

27 人气

关注发私信

櫻之舞

文章 0 评论 0

关注

弥枳

文章 0 评论 0

关注

m2429

文章 0 评论 0

关注

寻找一个思念的角度

文章 0 评论 0

关注

野却迷人

文章 0 评论 0

关注

我怀念的。

文章 0 评论 0

友情链接

文江博客

循环循环与组的列

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（1）

关于作者

相关话题

热门标签

推荐作者

櫻之舞

弥枳

m2429

寻找一个思念的角度

野却迷人

我怀念的。

友情链接

循环循环与组的列

如果你对这篇内容有疑问，欢迎到本站社区发帖提问 参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（1）

关于作者

相关话题

热门标签

推荐作者

櫻之舞

弥枳

m2429

寻找一个思念的角度

野却迷人

我怀念的。

友情链接

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。