选择每隔 n 行的数据框并添加到 R 中的数据框列表

发布于 2025-01-13 10:06:56 字数 686 浏览 0 评论 0原文

我目前拥有以下数据框，并尝试开发一个包含 5 个唯一数据框的列表，其中包含原始df的每第5行。有没有办法选择每隔 5 行并将其添加到列表中的新数据框？使用for循环还是lapply？

df
X1 X2 X3     X4 X5
1  0 0 1.501990  0
2  0 0 1.883904  0
3  0 0 1.333195  0
4  0 0 0.000000  0
5  0 0 2.136760  0
6  0 0 2.186790  0
7  0 0 1.269592  0
8  0 0 1.458405  0
9  0 0 1.816493  0
10 0 0 0.000000  0
11 0 0 2.190029  0
12 0 0 0.000000  0
13 0 0 1.460534  0
14 0 0 1.470776  0
15 0 0 1.675406  0
16 0 0 1.842470  0
17 0 0 1.937999  0
18 0 0 0.000000  0
19 0 0 1.649926  0
20 0 0 2.067902  0

例如，第一个数据帧将由第1、6、11和16行组成，而下一个数据帧将从第2行开始，并继续向下移动df< /代码>?

原文

I currently have the below data frame and am trying to develop a list of 5 unique data frames containing every 5th row of the original df. Is there a way to select every other 5th row and add it to a new data frame in a list? Either using a for loop or lapply?

df
X1 X2 X3     X4 X5
1  0 0 1.501990  0
2  0 0 1.883904  0
3  0 0 1.333195  0
4  0 0 0.000000  0
5  0 0 2.136760  0
6  0 0 2.186790  0
7  0 0 1.269592  0
8  0 0 1.458405  0
9  0 0 1.816493  0
10 0 0 0.000000  0
11 0 0 2.190029  0
12 0 0 0.000000  0
13 0 0 1.460534  0
14 0 0 1.470776  0
15 0 0 1.675406  0
16 0 0 1.842470  0
17 0 0 1.937999  0
18 0 0 0.000000  0
19 0 0 1.649926  0
20 0 0 2.067902  0

For example, the first data frame would consist of the 1st, 6th, 11th, and 16th row, while the next would start with the 2nd row and carry on down the rows of the df?

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

瀞厅☆埖开 2025-01-20 10:06:56

使用 split 和 1:5 创建具有 5 行间隔的数据帧。

split(df, 1:5)

输出

使用 split 和 1:5 创建具有 5 行间隔的数据帧。
split(df, 1:5)

输出
1`
   X1 X2 X3       X4 X5
1   1  0  0 1.501990  0
6   6  0  0 2.186790  0
11 11  0  0 2.190029  0
16 16  0  0 1.842470  0

使用 split 和 1:5 创建具有 5 行间隔的数据帧。
split(df, 1:5)

输出
2`
   X1 X2 X3       X4 X5
2   2  0  0 1.883904  0
7   7  0  0 1.269592  0
12 12  0  0 0.000000  0
17 17  0  0 1.937999  0

使用 split 和 1:5 创建具有 5 行间隔的数据帧。
split(df, 1:5)

输出
3`
   X1 X2 X3       X4 X5
3   3  0  0 1.333195  0
8   8  0  0 1.458405  0
13 13  0  0 1.460534  0
18 18  0  0 0.000000  0

使用 split 和 1:5 创建具有 5 行间隔的数据帧。
split(df, 1:5)

输出
4`
   X1 X2 X3       X4 X5
4   4  0  0 0.000000  0
9   9  0  0 1.816493  0
14 14  0  0 1.470776  0
19 19  0  0 1.649926  0

使用 split 和 1:5 创建具有 5 行间隔的数据帧。
split(df, 1:5)

输出
5`
   X1 X2 X3       X4 X5
5   5  0  0 2.136760  0
10 10  0  0 0.000000  0
15 15  0  0 1.675406  0
20 20  0  0 2.067902  0

dplyr::group_split 的替代方案是：

group_split(df, rep(1:5, nrow(df)/5), .keep = F)

data

df <- structure(list(X1 = 1:20, X2 = c(0L, 0L, 0L, 0L, 0L, 0L, 0L, 
0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L), X3 = c(0L, 
0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 
0L, 0L, 0L), X4 = c(1.50199, 1.883904, 1.333195, 0, 2.13676, 
2.18679, 1.269592, 1.458405, 1.816493, 0, 2.190029, 0, 1.460534, 
1.470776, 1.675406, 1.84247, 1.937999, 0, 1.649926, 2.067902), 
    X5 = c(0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 
    0L, 0L, 0L, 0L, 0L, 0L, 0L)), class = "data.frame", row.names = c(NA, 
-20L))

Use split with 1:5 to create dataframes with a 5-row interval.

split(df, 1:5)

output

Use split with 1:5 to create dataframes with a 5-row interval.
split(df, 1:5)

output
1`
   X1 X2 X3       X4 X5
1   1  0  0 1.501990  0
6   6  0  0 2.186790  0
11 11  0  0 2.190029  0
16 16  0  0 1.842470  0

Use split with 1:5 to create dataframes with a 5-row interval.
split(df, 1:5)

output
2`
   X1 X2 X3       X4 X5
2   2  0  0 1.883904  0
7   7  0  0 1.269592  0
12 12  0  0 0.000000  0
17 17  0  0 1.937999  0

Use split with 1:5 to create dataframes with a 5-row interval.
split(df, 1:5)

output
3`
   X1 X2 X3       X4 X5
3   3  0  0 1.333195  0
8   8  0  0 1.458405  0
13 13  0  0 1.460534  0
18 18  0  0 0.000000  0

Use split with 1:5 to create dataframes with a 5-row interval.
split(df, 1:5)

output
4`
   X1 X2 X3       X4 X5
4   4  0  0 0.000000  0
9   9  0  0 1.816493  0
14 14  0  0 1.470776  0
19 19  0  0 1.649926  0

Use split with 1:5 to create dataframes with a 5-row interval.
split(df, 1:5)

output
5`
   X1 X2 X3       X4 X5
5   5  0  0 2.136760  0
10 10  0  0 0.000000  0
15 15  0  0 1.675406  0
20 20  0  0 2.067902  0

An alternative with dplyr::group_split is:

group_split(df, rep(1:5, nrow(df)/5), .keep = F)

data

df <- structure(list(X1 = 1:20, X2 = c(0L, 0L, 0L, 0L, 0L, 0L, 0L, 
0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L), X3 = c(0L, 
0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 
0L, 0L, 0L), X4 = c(1.50199, 1.883904, 1.333195, 0, 2.13676, 
2.18679, 1.269592, 1.458405, 1.816493, 0, 2.190029, 0, 1.460534, 
1.470776, 1.675406, 1.84247, 1.937999, 0, 1.649926, 2.067902), 
    X5 = c(0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 0L, 
    0L, 0L, 0L, 0L, 0L, 0L, 0L)), class = "data.frame", row.names = c(NA, 
-20L))

回复收藏 0 原文

吐个泡泡 2025-01-20 10:06:56

一个可能的解决方案，基于dplyr::group_split：

library(dplyr)

df %>% 
  mutate(id = 0:(nrow(df)-1) %% 4) %>% 
  group_by(id) %>% 
  group_split(.keep = F) %>% 
  as.list

#> [[1]]
#> # A tibble: 5 × 5
#>      X1    X2    X3    X4    X5
#>   <int> <int> <int> <dbl> <int>
#> 1     1     0     0  1.50     0
#> 2     5     0     0  2.14     0
#> 3     9     0     0  1.82     0
#> 4    13     0     0  1.46     0
#> 5    17     0     0  1.94     0
#> 
#> [[2]]
#> # A tibble: 5 × 5
#>      X1    X2    X3    X4    X5
#>   <int> <int> <int> <dbl> <int>
#> 1     2     0     0  1.88     0
#> 2     6     0     0  2.19     0
#> 3    10     0     0  0        0
#> 4    14     0     0  1.47     0
#> 5    18     0     0  0        0
#> 
#> [[3]]
#> # A tibble: 5 × 5
#>      X1    X2    X3    X4    X5
#>   <int> <int> <int> <dbl> <int>
#> 1     3     0     0  1.33     0
#> 2     7     0     0  1.27     0
#> 3    11     0     0  2.19     0
#> 4    15     0     0  1.68     0
#> 5    19     0     0  1.65     0
#> 
#> [[4]]
#> # A tibble: 5 × 5
#>      X1    X2    X3    X4    X5
#>   <int> <int> <int> <dbl> <int>
#> 1     4     0     0  0        0
#> 2     8     0     0  1.46     0
#> 3    12     0     0  0        0
#> 4    16     0     0  1.84     0
#> 5    20     0     0  2.07     0

A possible solution, based on dplyr::group_split:

library(dplyr)

df %>% 
  mutate(id = 0:(nrow(df)-1) %% 4) %>% 
  group_by(id) %>% 
  group_split(.keep = F) %>% 
  as.list

#> [[1]]
#> # A tibble: 5 × 5
#>      X1    X2    X3    X4    X5
#>   <int> <int> <int> <dbl> <int>
#> 1     1     0     0  1.50     0
#> 2     5     0     0  2.14     0
#> 3     9     0     0  1.82     0
#> 4    13     0     0  1.46     0
#> 5    17     0     0  1.94     0
#> 
#> [[2]]
#> # A tibble: 5 × 5
#>      X1    X2    X3    X4    X5
#>   <int> <int> <int> <dbl> <int>
#> 1     2     0     0  1.88     0
#> 2     6     0     0  2.19     0
#> 3    10     0     0  0        0
#> 4    14     0     0  1.47     0
#> 5    18     0     0  0        0
#> 
#> [[3]]
#> # A tibble: 5 × 5
#>      X1    X2    X3    X4    X5
#>   <int> <int> <int> <dbl> <int>
#> 1     3     0     0  1.33     0
#> 2     7     0     0  1.27     0
#> 3    11     0     0  2.19     0
#> 4    15     0     0  1.68     0
#> 5    19     0     0  1.65     0
#> 
#> [[4]]
#> # A tibble: 5 × 5
#>      X1    X2    X3    X4    X5
#>   <int> <int> <int> <dbl> <int>
#> 1     4     0     0  0        0
#> 2     8     0     0  1.46     0
#> 3    12     0     0  0        0
#> 4    16     0     0  1.84     0
#> 5    20     0     0  2.07     0

回复收藏 0 原文

~没有更多了~