Creating a data frame with commas

Posted on 2025-01-11 15:17:13

Is it possible to make a data frame containing a column with "multiple elements"?

For instance - given the following data:

a = sample(c(1,-1), size=2, replace=TRUE, prob=c(0.5, 0.5))
b = sample(c(1,-1), size=3, replace=TRUE, prob=c(0.5, 0.5))
c = sample(c(1,-1), size=4, replace=TRUE, prob=c(0.5, 0.5))

#some random numbers
d = rexp(3,5)

#some random letters
e = "g"

#id column
n_id = 1:3

Can all this be combined into a single data frame (4 columns, 3 rows)? I tried to do this the regular way:

answer = data.frame(a, b, c, d, e, n_id)

But I get this error:

Error in data.frame(a, b, c, d, e, n_id) : 
  arguments imply differing number of rows: 2, 3, 4, 1

Is it possible to do this in R? I am trying to get something like this:

[Image: desired output table]

Thank you!

Comments (2)

青芜 2025-01-18 15:17:13
library(data.table)
data.table(n_id = n_id, a = list(a, b, c), d = d, e = e)

    n_id           a          d      e
   <int>      <list>      <num> <char>
1:     1       -1, 1 0.01357525      g
2:     2    -1,-1, 1 0.34263042      g
3:     3  1, 1,-1, 1 0.08830073      g

You can also do this with the tidyverse:

library(tibble)
tibble(n_id = n_id, a = list(a, b, c), d = d, e = e)

   n_id a              d e    
  <int> <list>     <dbl> <chr>
1     1 <dbl [2]> 0.0136 g    
2     2 <dbl [3]> 0.343  g    
3     3 <dbl [4]> 0.0883 g   

Note that in both approaches a is a list-column: each row holds a whole vector, so rows can hold vectors of different lengths.
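
For readers who want to stay in base R, here is a minimal sketch of the same list-column idea, plus a variant that stores each vector as a comma-separated string. It is not part of the answer above. The names df_list and df_chr and the set.seed() call are introduced only for illustration, and the string variant assumes that a character column is acceptable for later use.

```r
set.seed(1)  # fixed seed, used here only to make the sketch reproducible

a <- sample(c(1, -1), size = 2, replace = TRUE)
b <- sample(c(1, -1), size = 3, replace = TRUE)
c <- sample(c(1, -1), size = 4, replace = TRUE)
d <- rexp(3, 5)
e <- "g"
n_id <- 1:3

# I() keeps the list as a single column instead of letting data.frame()
# split it into one column per list element
df_list <- data.frame(n_id = n_id, a = I(list(a, b, c)), d = d, e = e)

# Number of elements stored in each row of the list-column
lengths(df_list$a)

# Alternative: collapse each vector into one comma-separated string
df_chr <- data.frame(
  n_id = n_id,
  a = vapply(list(a, b, c), paste, character(1), collapse = ","),
  d = d,
  e = e
)
```

The list-column version keeps the values numeric, so df_list$a[[3]] returns the original vector c. The string version prints closer to the requested layout, but the values have to be split and converted back before they can be used in calculations.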

煮茶煮酒煮时光 2025-01-18 15:17:13

You can use this code:

a = sample(c(1,-1), size=2, replace=TRUE, prob=c(0.5, 0.5))
b = sample(c(1,-1), size=3, replace=TRUE, prob=c(0.5, 0.5))
c = sample(c(1,-1), size=4, replace=TRUE, prob=c(0.5, 0.5))

# some random numbers
d = rexp(3, 5)

# some random letters
e = "g"

# build the data frame by hand: a plain list given data.frame attributes
df = list(a=a, b=b, c=c, d=d, e=e)
attributes(df) = list(names = names(df),
                      row.names = 1:max(lengths(df)),
                      class = 'data.frame')
df

With output:

     a    b  c          d    e
1   -1    1 -1 0.05939183    g
2    1    1 -1 0.01683215 <NA>
3 <NA>   -1  1 0.59068018 <NA>
4 <NA> <NA>  1       <NA> <NA>
Warning message:
In format.data.frame(if (omit) x[seq_len(n0), , drop = FALSE] else x,  :
  corrupt data frame: columns will be truncated or padded with NAs

It gives a warning because a data frame is not meant to hold columns of different lengths: R flags the object as corrupt and pads the shorter columns with NA only when printing.
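
If NA padding is what you want, a variation that avoids the corrupt object is to pad every vector explicitly before building the data frame. The following is a minimal sketch, not the code from the answer above. The names cols, padded and df_padded are hypothetical, and the values are fixed illustrative numbers, not the sampled ones.

```r
# Illustrative values only; in practice these would be the vectors a to e
cols <- list(
  a = c(-1, 1),
  b = c(1, 1, -1),
  c = c(-1, -1, 1, 1),
  d = c(0.06, 0.02, 0.59),
  e = "g"
)

# Length of the longest vector
n <- max(lengths(cols))

# Assigning a larger length to a vector extends it with NA
padded <- lapply(cols, function(x) {
  length(x) <- n
  x
})

# All columns now have the same length, so this is a valid data frame
df_padded <- as.data.frame(padded)
df_padded
```

Unlike the attribute-based approach, the padding here is stored in the data itself, so functions such as nrow() and subsetting behave as expected. Whether NA-padded columns are a meaningful representation depends on the analysis.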
