在 R 中使用 FFT 确定 IID 和的密度函数

发布于 2025-01-14 21:36:29 字数 1274 浏览 1 评论 0原文

目标是通过以下随机变量之一的密度函数计算 n 个 IID 随机变量之和的密度函数：

通过 fft 将密度函数转换为特征函数
将特征函数提升到 n
变换所得特征函数通过 fft(inverse=TRUE) 进入感兴趣的密度函数

下面是我对此的天真的尝试：

sum_of_n <- function(density, n, xstart, xend, power_of_2)
{
  
  x <- seq(from=xstart, to=xend, by=(xend-xstart)/(2^power_of_2-1))
  y <- density(x)
  fft_y <- fft(y)
  fft_sum_of_y <- (fft_y ^ n) 
  sum_of_y <- Re(fft(fft_sum_of_y, inverse=TRUE))
  return(sum_of_y)
  
}

在上面，密度是任意密度函数：例如

density <- function(x){return(dgamma(x = x, shape = 2, rate = 1))}

n 表示被求和的 IID 随机变量的数量。 xstart 和 xend 是随机变量的近似支持的开始和结束。 power_of_2 是所使用的数值向量的 2 长度的幂。据我了解，二次幂的长度提高了 fft 算法的效率。

我至少部分理解为什么上述内容通常不能按预期工作。首先，值本身将无法正确缩放，因为 fft(inverse=TRUE) 默认情况下不会标准化。然而，我发现当我除以向量的长度时，这些值仍然不正确，即

sum_of_y <- sum_of_y / length(sum_of_y)

基于我对 fft 的有限理解是归一化计算。其次，由于执行 fft 时发生的零频率偏移（如果我错了，请有人纠正我），所得矢量将异相。例如，我尝试使用 pracma 的 fftshift 和 ifftshift，但它们似乎无法正确解决此问题。对于对称分布（例如正态分布），这并不难解决，因为相移通常恰好是一半，因此类似的操作

sum_of_y <- c(sum_of_y[(length(y)/2+1):length(y)], sum_of_y[1:(length(y)/2)])

可以作为校正。然而，对于像上面的伽玛分布这样的不对称分布，这种方法会失败。

总之，是否对上述代码进行了调整，从而为 IID 和产生适当缩放和适当移动的最终密度函数？

原文

The goal is to compute the density function of a sum of n IID random variables via the density function of one of these random variables by:

Transforming the density function into the characteristic function via fft
Raise the characteristic function to the n
Transform the resulting characteristic function into the density function of interest via fft(inverse=TRUE)

The below is my naive attempt at this:

sum_of_n <- function(density, n, xstart, xend, power_of_2)
{
  
  x <- seq(from=xstart, to=xend, by=(xend-xstart)/(2^power_of_2-1))
  y <- density(x)
  fft_y <- fft(y)
  fft_sum_of_y <- (fft_y ^ n) 
  sum_of_y <- Re(fft(fft_sum_of_y, inverse=TRUE))
  return(sum_of_y)
  
}

In the above, density is an arbitrary density function: for example

density <- function(x){return(dgamma(x = x, shape = 2, rate = 1))}

n indicates the number of IID random variables being summed. xstart and xend are the start and end of the approximate support of the random variable. power_of_2 is the power of 2 length for the numeric vectors used. As I understand things, lengths of powers of two increase the efficiency of the fft algorithm.

I understand at least partially why the above does not work as intended in general. Firstly, the values themselves will not be scaled correctly, as fft(inverse=TRUE) does not normalize by default. However, I find that the values are still not correct when I divide by the length of the vector i.e.

sum_of_y <- sum_of_y / length(sum_of_y)

which based on my admittedly limited understanding of fft is the normalizing calculation. Secondly, the resulting vector will be out of phase due to (someone correct me on this if I am wrong) the shifting of the zero frequency that occurs when fft is performed. I have tried to use, for example, pracma's fftshift and ifftshift, but they do not appear to address this problem correctly. For symmetric distributions e.g. normal, this is not difficult to address since the phase shift is typically exactly half, so that an operation like

sum_of_y <- c(sum_of_y[(length(y)/2+1):length(y)], sum_of_y[1:(length(y)/2)])

works as a correction. However, for asymmetric distributions like the gamma distribution above this fails.

In conclusion, are there adjustments to the code above that will result in an appropriately scaled and appropriately shifted final density function for the IID sum?

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

故事未完 2025-01-21 21:36:29

dsumf <- function(d, n, a, b, k) {
  x <- seq(a, b, length.out = k)
  p <- d(x)
  s <- sum(p)
  data.frame(x = x, d = Re(fft(fft(p/s)^n, TRUE))*s/k)
}

# use fft to approximate the density of `sum(rgamma(5, 2, 1))`
dsum <- dsumf(\(x) dgamma(x, 2, 1), 5, 0, 32, 2^10)
# plot the approximation against the true density
plot(dsum$x, dgamma(dsum$x, 10, 1), type = "l", col = "blue",
     xlab = "x", ylab = "density")
lines(dsum$x, dsum$d, lty = 2, col = "orange")
legend(
  "topright",
  legend = c("True density", "FFT approximation"),
  col = c("blue", "orange"),
  lty = c(1, 2)
)

dsumf <- function(d, n, a, b, k) {
  x <- seq(a, b, length.out = k)
  p <- d(x)
  s <- sum(p)
  data.frame(x = x, d = Re(fft(fft(p/s)^n, TRUE))*s/k)
}

# use fft to approximate the density of `sum(rgamma(5, 2, 1))`
dsum <- dsumf(\(x) dgamma(x, 2, 1), 5, 0, 32, 2^10)
# plot the approximation against the true density
plot(dsum$x, dgamma(dsum$x, 10, 1), type = "l", col = "blue",
     xlab = "x", ylab = "density")
lines(dsum$x, dsum$d, lty = 2, col = "orange")
legend(
  "topright",
  legend = c("True density", "FFT approximation"),
  col = c("blue", "orange"),
  lty = c(1, 2)
)

回复收藏 0 原文

~没有更多了~