r - 将 dplyr::group_by 与 purrr::pmap 结合使用

Question

我有以下数据框：

df <- data.frame(a = c(1:20),
             b = c(2:21),
             c = as.factor(c(rep(1,5), rep(2,10), rep(3,5))))

并且我想执行以下操作：

df1 <- df %>% group_by(c) %>% mutate(a = lead(b))

但最初我有很多变量，我需要将 lead() 函数与 group_by() 结合应用于多个变量。我正在尝试 purrr::pmap() 来实现这一点：

df2 <- pmap(list(df[,1],df[,2],df[,3]), function(x,y,z) group_by(z) %>% lead(y))

不幸的是，这会导致错误：

 Error in UseMethod("group_by_") : 
  no applicable method for 'group_by_' applied to an object of class "c('integer', 'numeric')"

Answer 1

您可以使用 mutate_at 和 funs() 的命名参数来执行此操作，这会创建新列而不是覆盖它们。请注意，这对 a 没有任何作用，但您可以根据需要重命名此后的列。

df <- data.frame(
  a = c(1:20),
  b = c(2:21),
  b2 = 3:22,
  b3 = 4:23,
  c = as.factor(c(rep(1, 5), rep(2, 10), rep(3, 5)))
)

library(tidyverse)
df %>%
  group_by(c) %>%
  mutate_at(vars(starts_with("b")), funs(lead = lead(.)))
#> # A tibble: 20 x 8
#> # Groups:   c [3]
#>        a     b    b2    b3 c     b_lead b2_lead b3_lead
#>    <int> <int> <int> <int> <fct>  <int>   <int>   <int>
#>  1     1     2     3     4 1          3       4       5
#>  2     2     3     4     5 1          4       5       6
#>  3     3     4     5     6 1          5       6       7
#>  4     4     5     6     7 1          6       7       8
#>  5     5     6     7     8 1         NA      NA      NA
#>  6     6     7     8     9 2          8       9      10
#>  7     7     8     9    10 2          9      10      11
#>  8     8     9    10    11 2         10      11      12
#>  9     9    10    11    12 2         11      12      13
#> 10    10    11    12    13 2         12      13      14
#> 11    11    12    13    14 2         13      14      15
#> 12    12    13    14    15 2         14      15      16
#> 13    13    14    15    16 2         15      16      17
#> 14    14    15    16    17 2         16      17      18
#> 15    15    16    17    18 2         NA      NA      NA
#> 16    16    17    18    19 3         18      19      20
#> 17    17    18    19    20 3         19      20      21
#> 18    18    19    20    21 3         20      21      22
#> 19    19    20    21    22 3         21      22      23
#> 20    20    21    22    23 3         NA      NA      NA

由 reprex package (v0.2.0) 创建于 2018-09-07。

r - 将 dplyr::group_by 与 purrr::pmap 结合使用

r - use dplyr::group_by in combination with purrr::pmap

group-by

r

pmap

dataframe

purrr