修改常用的 hpfilter 函数以忽略 na's

发布于 2024-12-12 13:33:56 字数 1069 浏览 1 评论 0原文

我是一个新的 R 用户，试图快速学习，但我自己无法破解这个问题。我主要处理经济时间序列 - 因此，尝试以 xts 多列格式维护我的数据集，例如：

> head(USDATAq)
         tq   ngdp    rgdp  profit
1947 Q1   0  237.2  1770.7    20.7
1947 Q2   1  240.4  1768.0    23.9
1947 Q3   2  244.5  1766.5    23.8
1947 Q4   3  254.3  1793.3    25.5
1948 Q1   4  260.3  1821.8    29.4
1948 Q2   5  267.3  1855.3    31.2

我应用 hpfilter 函数进行过滤。在本网站的其他地方，我发现此实现使用 coredata 函数将 hpfilter 应用于 xts 对象：

hpfilter <- function(x, lambda=2){
  eye <- diag(length(x))
  dcrossprod <- crossprod(diff(eye, lag=1, d=2))
  coredata(x) <- solve(eye + lambda * dcrossprod, coredata(x))
  return(x)
}

我的问题是：

如何修改该函数，以便它将与具有以下变量的变量一起使用NA 观测值（目前，如果有 NA，它会计算整个日期范围的 NA）？

我可以将数据集作为 na.omit(USDATAq) 传递，这可以工作，但这会将数据集中的所有变量限制为最小观测值。但是，不同的变量在不同的日期之前可用，随后是 NA。我希望最终将该函数应用到循环或映射中的数据集的每一列，以便该函数使用该系列的所有可用观察结果返回每个过滤后的系列。

原文

I'm a new R user, trying to learn quickly, but I couldn't crack this myself. I work mostly with economic time series – hence, try to maintain my dataset in xts multi-column format, e.g.:

> head(USDATAq)
         tq   ngdp    rgdp  profit
1947 Q1   0  237.2  1770.7    20.7
1947 Q2   1  240.4  1768.0    23.9
1947 Q3   2  244.5  1766.5    23.8
1947 Q4   3  254.3  1793.3    25.5
1948 Q1   4  260.3  1821.8    29.4
1948 Q2   5  267.3  1855.3    31.2

I apply the hpfilter function for filtering. Elsewhere on this site, I found this implementation which uses the coredata function to apply hpfilter to xts objects:

hpfilter <- function(x, lambda=2){
  eye <- diag(length(x))
  dcrossprod <- crossprod(diff(eye, lag=1, d=2))
  coredata(x) <- solve(eye + lambda * dcrossprod, coredata(x))
  return(x)
}

My question is:

How can I modify the function so that it will work with variables having NA observations (at present, it calculates NA for the entire date range if there is any NA)?

I can pass on the dataset as na.omit(USDATAq), which works, but this curtails all variables in dataset to the minimum observations. But, different variables are available until different dates, followed by NA's. I would like to eventually apply the function to every column of the dataset in a loop or mapply, so that the function returns each filtered series using all available observations of that series.

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

゛时过境迁 2024-12-19 13:33:56

谢谢@ran2。我按照你的建议工作并设法解决了问题——但是，以一种相当不优雅的方式。首先，我无法让任何“apply family”函数在 xts 对象上正常工作，维持其结构。对于按列应用程序，使用 apply(x, MARGIN=2,..) 进行普通应用显示出希望，但在“coredata”语句处停滞。 lapply 等产生了损坏的列表。

然后我进入了for循环。但由于 x<-na.omit(x) 改变了变量的长度，因此它无法替换循环内的原始变量。

> for(i in 1:ncol(USDATAq)) {
+ USDATAq[,i]<-hpfilter(USDATAq[,i])
+ }

NextMethod(.Generic) 中的错误：
要替换的项目数量不是替换长度的倍数

因此，我不得不向 hpfilter 添加不合时宜的代码，以将结果“合并”回原始值（带有 NA），然后返回变量。此合并按日期（因此，长度）匹配 2 个变量，将 NA 填充到结果中。然后，这个结果可以在循环中替换原来的结果。总之，我必须将 hpfilter 修改为：

hpfilter <- function(x,lambda=2){
y<-na.omit(x)
eye <- diag(length(y))
coredata(y) <- solve(eye + lambda * crossprod(diff(eye, lag=1, d=2)), coredata(y))
xy<-merge(x,y) 
return(xy[,2])
}

然后使用上面的循环，最终获得无错误的结果。不过，我对 R 的了解还很初级，可能有更简单的方法可以做到这一点。但至少，我现在可以继续。感谢所有人为我指明了正确的方向。我仍然欢迎对上面的代码进行进一步更正。

Thanks @ran2. I worked on your suggestion and managed to solve the problem – but, in a rather inelegant way. First of all, I could not get any of the 'apply family' functions to work correctly on an xts object, maintaining its structure. Plain apply with the apply(x, MARGIN=2,..) for column-wise application showed promise, but stalled at the 'coredata' statement. lapply etc yielded mangled lists.

I then went to the for loop. But because the x<-na.omit(x) changes the length of the variable, it cannot replace the original within loop.

> for(i in 1:ncol(USDATAq)) {
+ USDATAq[,i]<-hpfilter(USDATAq[,i])
+ }

Error in NextMethod(.Generic) :
number of items to replace is not a multiple of replacement length

So, I had to add unseemly code to hpfilter to ‘merge’ the result back to the original (with NA) and then return the variable. This merging matches the 2 variables by date (hence, length) filling NA’s into the result. Then, this result can replace the original in a loop. In conclusion, I had to modify hpfilter to:

hpfilter <- function(x,lambda=2){
y<-na.omit(x)
eye <- diag(length(y))
coredata(y) <- solve(eye + lambda * crossprod(diff(eye, lag=1, d=2)), coredata(y))
xy<-merge(x,y) 
return(xy[,2])
}

and then use the loop above, to finally get error free results. My knowledge of R is so rudimentary, though, that there probably are easier ways to do this. But, at least, I can proceed now. Thanks to all for pointing me in the right direction. I'd still welcome further corrections to my code above.

回复收藏 0 原文

木緿 2024-12-19 13:33:56

我认为你走在正确的道路上。为什么不在这个函数中添加 na.omit 呢？就在创建眼睛矩阵之前？ x<-na.omit(x)。然后你所要做的就是将单变量系列传递给它而不是整个 data.frames。换句话说：保持函数不变，添加 na.omit 并将其与 lapply （或任何形式的 apply 系列（sapply、tapply、lapply）组合）你最好的。

回复收藏 0 原文

素年丶 2024-12-19 13:33:56

对于动物园对象，使用 attributes() 而不是 coredata() 会稍微干净一些，然后您可以直接合并回动物园对象。（我还没有对 xts 对象尝试过这个）：

hpfilter <- function(x,lambda=1600){
    y<-na.omit(x)
    eye <- diag(length(y))
    result <- solve(eye+lambda*crossprod(diff(eye,lag=1,d=2)),y)
    attributes(result) <- attributes(y)
    return(result)
}

With zoo objects, it is marginally cleaner using attributes() rather than coredata() then you can merge straight back into the zoo object. (I haven't tried this for xts objects):

hpfilter <- function(x,lambda=1600){
    y<-na.omit(x)
    eye <- diag(length(y))
    result <- solve(eye+lambda*crossprod(diff(eye,lag=1,d=2)),y)
    attributes(result) <- attributes(y)
    return(result)
}

回复收藏 0 原文

~没有更多了~