R: Calculate time differences (groupwise)

I have the following sample data:

library(dplyr)  # provides %>% and mutate()

df <- data.frame(id = c(1,1,2,2,3,3),
                 times = c("2021-05-20 07:10:20", "2021-05-20 07:13:20", "2021-05-20 07:20:20", "2021-05-20 07:30:20", "2021-05-20 07:05:20", "2021-05-20 07:07:20"),
                 var1 = c("A", "B", "A", "B", "A", "B")) %>%
      mutate(times = as.POSIXct(times, format = "%Y-%m-%d %H:%M:%OS"))

My goal is to add an extra column containing the time difference between A and B for each ID (in seconds or minutes). Can someone help me?

It should look like this:

  id               times var1 duration_in_sec
1  1 2021-05-20 07:10:20    A              NA
2  1 2021-05-20 07:13:20    B             180
3  2 2021-05-20 07:20:20    A              NA
4  2 2021-05-20 07:30:20    B             600
5  3 2021-05-20 07:05:20    A              NA
6  3 2021-05-20 07:07:20    B             120

You can use the following solution:

library(dplyr)

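# lag() returns the previous row's value within each id, so this relies on
# the rows being ordered by time (A before B) inside every group
df %>% 
  group_by(id) %>%
  mutate(diff_time = difftime(times, lag(times), units = "secs"))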

# A tibble: 6 x 4
# Groups:   id [3]
     id times               var1  diff_time
  <dbl> <dttm>              <chr> <drtn>   
1     1 2021-05-20 07:10:20 A      NA secs 
2     1 2021-05-20 07:13:20 B     180 secs 
3     2 2021-05-20 07:20:20 A      NA secs 
4     2 2021-05-20 07:30:20 B     600 secs 
5     3 2021-05-20 07:05:20 A      NA secs 
6     3 2021-05-20 07:07:20 B     120 secs 
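
The lag-based approach depends on row order. If the rows are not guaranteed to come as A followed by B within each id, one possible variant is to look up the A timestamp explicitly. The following is only a sketch under some assumptions that are not in the question: each id has at most one A row, the result column is named duration_in_sec to match the desired output, and the timestamps are parsed as UTC.

```r
library(dplyr)

# Sample data from the question, rebuilt so the block runs on its own
# (tz = "UTC" is an assumption made here for reproducibility)
df <- data.frame(
  id = c(1, 1, 2, 2, 3, 3),
  times = as.POSIXct(
    c("2021-05-20 07:10:20", "2021-05-20 07:13:20",
      "2021-05-20 07:20:20", "2021-05-20 07:30:20",
      "2021-05-20 07:05:20", "2021-05-20 07:07:20"),
    format = "%Y-%m-%d %H:%M:%S", tz = "UTC"
  ),
  var1 = c("A", "B", "A", "B", "A", "B")
)

res <- df %>%
  group_by(id) %>%
  mutate(
    # For B rows: seconds elapsed since the group's (first) A row.
    # For all other rows: NA. A group without an A row also yields NA.
    duration_in_sec = if_else(
      var1 == "B",
      as.numeric(difftime(times, times[var1 == "A"][1], units = "secs")),
      NA_real_
    )
  ) %>%
  ungroup()

print(res)
```

Wrapping difftime() in as.numeric() gives a plain numeric column instead of a <drtn> column, which may be easier to use in later arithmetic or when exporting.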

Using data.table:

library(data.table)
setDT(df)[, diff_time := difftime(times, shift(times), units = 'secs'), id]
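
Since the question also allows minutes, the same idea can be sketched with units = "mins". This is a minimal, self-contained example; the column name duration_in_min, the explicit setorder() call, and the UTC time zone are assumptions added here, not part of the original answer.

```r
library(data.table)

# Sample data from the question, rebuilt as a data.table
# (tz = "UTC" is an assumption made here for reproducibility)
dt <- data.table(
  id = c(1, 1, 2, 2, 3, 3),
  times = as.POSIXct(
    c("2021-05-20 07:10:20", "2021-05-20 07:13:20",
      "2021-05-20 07:20:20", "2021-05-20 07:30:20",
      "2021-05-20 07:05:20", "2021-05-20 07:07:20"),
    format = "%Y-%m-%d %H:%M:%S", tz = "UTC"
  ),
  var1 = c("A", "B", "A", "B", "A", "B")
)

# Sort by id and time so shift() always refers to the earlier timestamp
setorder(dt, id, times)

# Difference to the previous row within each id, as a plain number of minutes
dt[, duration_in_min := as.numeric(difftime(times, shift(times), units = "mins")),
   by = id]

print(dt)
```

For the sample data the B rows come out as 3, 10 and 2 minutes, and the A rows as NA.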