Interpolating with na.approx: how does it do that?

Posted on 2024-10-16 06:44:25

I am doing some light un-suppression of employment data, and I stumbled on the na.approx approach in the zoo package. The data represent the state and local shares of total government employment, and I figured a rough estimate would be to look at the trend in the split between state and local government. The two shares should add to one.

        State %      Local %
2001    na           na
2002    na           na
2003    na           na
2004    0.118147539  0.881852461
2005    0.114500321  0.885499679
2006    0.117247083  0.882752917
2007    0.116841331  0.883158669

I use the spline variant, na.spline, which allows the estimation of leading NAs:

library(zoo)
z <- zoo(DF2, 1:7)  # DF2 holds the two share columns shown above
d <- na.spline(z, na.rm = FALSE, maxgap = Inf)

Which gives the output:

State % Local %
0.262918013 0.737081987
0.182809891 0.817190109
0.137735231 0.862264769
0.118147539 0.881852461
0.114500321 0.885499679
0.117247083 0.882752917
0.116841331 0.883158669

Great, right? The part that amazes me is that the approximated NA values sum to 1 (which is what I want, but unexpected!), even though the documentation for na.approx says that it handles each column separately. Am I missing something? My money's on my misreading the documentation.
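
The observation can be checked directly. The following is a minimal sketch, assuming the data are entered as a plain numeric matrix; the names `m` and `row_sums` and the column labels `State` and `Local` are illustrative and not from the original post. It fills each column with na.spline and then sums across each row.

```r
library(zoo)

# Rebuild the table from the question as a numeric matrix.
# The column names are illustrative labels, not from the original data frame.
m <- cbind(
  State = c(NA, NA, NA, 0.118147539, 0.114500321, 0.117247083, 0.116841331),
  Local = c(NA, NA, NA, 0.881852461, 0.885499679, 0.882752917, 0.883158669)
)
z <- zoo(m, 1:7)

# Fill the leading NAs; each column is fitted without reference to the other.
d <- na.spline(z, na.rm = FALSE, maxgap = Inf)

# Sum across each row of the filled series and compare with 1,
# using all.equal() so that floating-point rounding is tolerated.
row_sums <- rowSums(coredata(d))
print(d)
print(all.equal(unname(row_sums), rep(1, 7)))
```

One likely explanation, offered as a reading of the result rather than of the zoo source: a cubic spline fit is a linear function of the input values. Fitting the two columns separately and adding the results is therefore the same as fitting their sum, and the sum is constant at 1 in every observed row. On that reading the columns really are handled one at a time, and the row sums are preserved as a side effect.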
