计算 r 中的独特因素
我想知道在记录的每个出生日期出生的独特水坝的数量。我的数据框与此类似:
dam <- c("2A11","2A11","2A12","2A12","2A12","4D23","4D23","1X23")
bdate <- c("2009-10-01","2009-10-01","2009-10-01","2009-10-01",
"2009-10-01","2009-10-03","2009-10-03","2009-10-03")
mydf <- data.frame(dam,bdate)
mydf
# dam bdate
# 1 2A11 2009-10-01
# 2 2A11 2009-10-01
# 3 2A12 2009-10-01
# 4 2A12 2009-10-01
# 5 2A12 2009-10-01
# 6 4D23 2009-10-03
# 7 4D23 2009-10-03
# 8 1X23 2009-10-03
我使用了aggregate(dam ~ bdate, data=mydf, FUN=length)
,但它计算了在特定日期出生的所有水坝
bdate dam
1 2009-10-01 5
2 2009-10-03 3
,相反,我需要有这样的事情:
mydf2
bdate dam
1 2009-10-01 2
2 2009-10-03 2
非常感谢您的帮助!
I would like to know the number of unique dams which gave birth on each of the birth dates recorded. My data frame is similar to this one:
dam <- c("2A11","2A11","2A12","2A12","2A12","4D23","4D23","1X23")
bdate <- c("2009-10-01","2009-10-01","2009-10-01","2009-10-01",
"2009-10-01","2009-10-03","2009-10-03","2009-10-03")
mydf <- data.frame(dam,bdate)
mydf
# dam bdate
# 1 2A11 2009-10-01
# 2 2A11 2009-10-01
# 3 2A12 2009-10-01
# 4 2A12 2009-10-01
# 5 2A12 2009-10-01
# 6 4D23 2009-10-03
# 7 4D23 2009-10-03
# 8 1X23 2009-10-03
I used aggregate(dam ~ bdate, data=mydf, FUN=length)
but it counts all the dams that gave birth on a particular date
bdate dam
1 2009-10-01 5
2 2009-10-03 3
Instead, I need to have something like this:
mydf2
bdate dam
1 2009-10-01 2
2 2009-10-03 2
Your help is very much appreciated!
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(4)
怎么样:
What about:
您还可以先对数据运行
unique
:然后您也可以使用
table
而不是aggregate
,尽管输出略有不同。You could also run
unique
on the data first:Then you could also use
table
instead ofaggregate
, though the output is a little different.这只是如何思考问题以及如何解决问题的方法之一的示例。
This is just an example of how to think of the problem and one of the approaches on how to solve it.
在dplyr中,您可以使用
n_distinct
:In dplyr you can use
n_distinct
: