如何使用dplyr在一列中总结组的唯一值?
目前,我有以下代码:
categories <- df %>% #this is a very large df but that should not matter to my question
group_by(category, subcategory, IV_type) %>%
summarise(n = n())
生成以下DF:
category <- c('a','a','a','a','b','b','b','c','c')
subcategory <- c(1,1,2,3,4,4,5,6,7)
N <- c(21,13,7,9,11,17,19,23,27)
type <- c('nom', 'ord', 'nom', 'scale', 'nom', 'scale', 'nom', 'scale', 'scale')
categories <- data.frame(category, subcategory, N, type)
但是,我想获得此数据框架:
category1 <- c('a','a','a','b','b','c','c')
subcategory1 <- c(1,2,3,4,5,6,7)
N1 <- c(34,7,9,28,19,23,27)
type1 <- c('nom, ord', 'nom', 'scale', 'nom, scale', 'nom', 'scale', 'scale')
categories1 <- data.frame(category1, subcategory1, N1, type1)
我的尝试:
categories <- df %>%
group_by(category, subcategory) %>%
summarise(n = n(), unique_types = unique(type))
不幸的是,这会引发错误。有人知道我如何实现这一目标吗?
At the moment I have the following code:
categories <- df %>% #this is a very large df but that should not matter to my question
group_by(category, subcategory, IV_type) %>%
summarise(n = n())
Which produces the following df:
category <- c('a','a','a','a','b','b','b','c','c')
subcategory <- c(1,1,2,3,4,4,5,6,7)
N <- c(21,13,7,9,11,17,19,23,27)
type <- c('nom', 'ord', 'nom', 'scale', 'nom', 'scale', 'nom', 'scale', 'scale')
categories <- data.frame(category, subcategory, N, type)
However, I would like to obtain this dataframe:
category1 <- c('a','a','a','b','b','c','c')
subcategory1 <- c(1,2,3,4,5,6,7)
N1 <- c(34,7,9,28,19,23,27)
type1 <- c('nom, ord', 'nom', 'scale', 'nom, scale', 'nom', 'scale', 'scale')
categories1 <- data.frame(category1, subcategory1, N1, type1)
my try:
categories <- df %>%
group_by(category, subcategory) %>%
summarise(n = n(), unique_types = unique(type))
Unfortunately, this throws an error. Does anyone know how I can accomplish this?
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。

绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(1)
您可以使用以下内容:
You can use the following: