通过使用 R 中的 lag() 添加到上面的行来填充列

发布于 2025-01-11 09:36:30 字数 631 浏览 0 评论 0原文

我想用不断添加到上面行的值填充现有列。这在 Excel 中很容易，但我还没有找到在 R 中自动化它的好方法。

如果我们在 Excel 中有 2 列，A 和 B，我们希望单元格 B2 =B1+A2，单元格 B3 = B2 +A3。我怎样才能在 R 中做到这一点？

#example dataframe
df <- data.frame(A = 0:9, B = c(50,0,0,0,0,0,0,0,0,0))
#desired output
desired <- data.frame(A = 0:9, B = c("NA",51,53,56,60,65,71,78,86,95))

我尝试使用 lag() 函数，但它没有给出正确的输出。

df <- df %>%
mutate(B = B + lag(A))

所以我做了一个有效的 for 循环，但我觉得有更好的解决方案。

for(i in 2:nrow(df)){
df$B[i] <- df$B[i-1] + df$A[i]
}

最终，我想在整个数据帧的每 n 行上迭代此函数，本质上是让求和重置每 n 行。（任何有关如何做到这一点的提示将不胜感激！）

原文

I want to populate an existing column with values that continually add onto the row above.
This is easy in Excel, but I haven't figured out a good way to automate it in R.

If we had 2 columns in Excel, A and B, we want cell B2 to =B1+A2, and cell B3 would = B2+A3. How can I do this in R?

#example dataframe
df <- data.frame(A = 0:9, B = c(50,0,0,0,0,0,0,0,0,0))
#desired output
desired <- data.frame(A = 0:9, B = c("NA",51,53,56,60,65,71,78,86,95))

I tried using the lag() function, but it didn't give the correct output.

df <- df %>%
mutate(B = B + lag(A))

So I made a for loop that works, but I feel like there's a better solution.

for(i in 2:nrow(df)){
df$B[i] <- df$B[i-1] + df$A[i]
}

Eventually, I want to iterate this function over every n rows of the whole dataframe, essentially so the summation resets every n rows. (any tips on how to do that would be greatly appreciated!)

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

路还长，别太狂 2025-01-18 09:36:31

这可能接近您的需要，并使用 tidyverse。具体来说，它使用来自 purrr 的 accumulate。

假设您想将每 n 行重置为零，您也可以提前使用 group_by。

目前尚不完全清楚您希望如何处理第一行；在这里，它只会使用第一个 B 值并忽略第一个 A 值，该值看起来与您在帖子中的内容类似。

n <- 5

library(tidyverse)

df %>%
  group_by(grp = ceiling(row_number() / n)) %>%
  mutate(B = accumulate(A[-1], sum, .init = B[1]))

输出

       A     B   grp
   <int> <dbl> <dbl>
 1     0    50     1
 2     1    51     1
 3     2    53     1
 4     3    56     1
 5     4    60     1
 6     5     0     2
 7     6     6     2
 8     7    13     2
 9     8    21     2
10     9    30     2

This might be close to what you need, and uses tidyverse. Specifically, it uses accumulate from purrr.

Say you want to reset to zero every n rows, you can also use group_by ahead of time.

It was not entirely clear how you'd like to handle the first row; here, it will just use the first B value and ignore the first A value, which looked similar to what you had in the post.

n <- 5

library(tidyverse)

df %>%
  group_by(grp = ceiling(row_number() / n)) %>%
  mutate(B = accumulate(A[-1], sum, .init = B[1]))

Output

       A     B   grp
   <int> <dbl> <dbl>
 1     0    50     1
 2     1    51     1
 3     2    53     1
 4     3    56     1
 5     4    60     1
 6     5     0     2
 7     6     6     2
 8     7    13     2
 9     8    21     2
10     9    30     2

回复收藏 0 原文

月牙弯弯 2025-01-18 09:36:31

cumsum() 可用于获取您需要的结果。

df$B <- cumsum(df$B + df$A)
df
   A  B
1  0 50
2  1 51
3  2 53
4  3 56
5  4 60
6  5 65
7  6 71
8  7 78
9  8 86
10 9 95

cumsum() can be used to get the result you need.

df$B <- cumsum(df$B + df$A)
df
   A  B
1  0 50
2  1 51
3  2 53
4  3 56
5  4 60
6  5 65
7  6 71
8  7 78
9  8 86
10 9 95

回复收藏 0 原文

~没有更多了~

关于作者

甜宝宝

暂无简介

文章

26 人气

关注发私信

友情链接

文江博客

通过使用 R 中的 lag() 添加到上面的行来填充列

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（2）

关于作者

相关话题

热门标签

推荐作者

紫罗兰の梦幻

-2134

liuxuanli

意中人

○愚か者の日

xxhui

友情链接

通过使用 R 中的 lag() 添加到上面的行来填充列

如果你对这篇内容有疑问，欢迎到本站社区发帖提问 参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（2）

关于作者

相关话题

热门标签

推荐作者

紫罗兰の梦幻

-2134

liuxuanli

意中人

○愚か者の日

xxhui

友情链接

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。