当列在 NA 结束时插入 NA 值

Interpolate NA values when column ends on NA

我有一列数值数据以 NA 结尾,也以 NA 结尾:

df <- data.frame(
  Diam_av = c(12.3, 13, 15.5, NA, NA, NA, NA, 13.7, NA, NA, NA, 9.98, 4,0, 8.76, NA, NA, NA)
)

我想插入缺失值。这与 zoo 的函数 na.approx 一起工作,只要有正边界值可以从中插入,但如果在我的情况下,其中一个边界值是 NA(在列 Daim_av:

library(zoo)
df %>%
  mutate(Diam_intpl = na.approx(Diam_av))
Error: Problem with `mutate()` input `Diam_intpl`.
x Input `Diam_intpl` can't be recycled to size 18.
ℹ Input `Diam_intpl` is `na.approx(Diam_av)`.
ℹ Input `Diam_intpl` must be size 18 or 1, not 15. 

知道如何 exclude/neutralize 列最终 NA 值吗?

添加na.rm=F以删除错误消息。添加 rule=2 以从最后一个非 NA 值中获取值。

df %>%
  mutate(Diam_intpl = na.approx(Diam_av, na.rm=F),
         Diam_intpl2 = na.approx(Diam_av, na.rm=F, rule=2))

   Diam_av Diam_intpl Diam_intpl2
1    12.30      12.30       12.30
2    13.00      13.00       13.00
3    15.50      15.50       15.50
4       NA      15.14       15.14
5       NA      14.78       14.78
6       NA      14.42       14.42
7       NA      14.06       14.06
8    13.70      13.70       13.70
9       NA      12.77       12.77
10      NA      11.84       11.84
11      NA      10.91       10.91
12    9.98       9.98        9.98
13    4.00       4.00        4.00
14    0.00       0.00        0.00
15    8.76       8.76        8.76
16      NA         NA        8.76
17      NA         NA        8.76
18      NA         NA        8.76

如果我理解的好,你可以把NA换成imputeTS::na_interpolation(),有很多选择:

library(imputeTS)
df$interpolated <- na_interpolation(df,option = 'linear')$Diam_av

  Diam_av interpolated
1    12.30        12.30
2    13.00        13.00
3    15.50        15.50
4       NA        15.14
5       NA        14.78
6       NA        14.42
7       NA        14.06
8    13.70        13.70
9       NA        12.77
10      NA        11.84
11      NA        10.91
12    9.98         9.98
13    4.00         4.00
14    0.00         0.00
15    8.76         8.76
16      NA         8.76
17      NA         8.76
18      NA         8.76