'mutate' 在 R 的 tidyverse 中通过单个 fn 调用添加两列

Question

这是一个 R 版本 3.4.4 的问题

投票函数 voteOnBase，接受 2 个参数和 returns 一个 2 元素列表：WINNER 和 VOTE.COUNT。我想用它来将这两列添加到 notVotedYet，一个 tibble。以下代码运行正确。

 library(tidyverse)

 withVotes <- notVotedYet %>%
    group_by(BASE) %>%
    mutate(WINNER     = voteOnBase(BASE, CODES)[[1]],
           VOTE.COUNT = voteOnBase(BASE, CODES)[[2]])

但是，它会针对相同的输入调用 voteOnBase 两次。如何消除额外的函数调用但仍添加相同的两列？

Answer 1

没有一些示例数据和输出就不容易回答，但我建议将 voteOnBase() 写到 return 一个小标题，而不是列表。然后您可以将结果存储在列表列中并使用 unnest().

创建列

为了说明：这里有一个函数，square_it()，它和你的一样，有 2 个参数和 returns 2 个元素 - 但作为小标题中的列。

square_it <- function(x, y) {
  tibble(x = x^2, y = y^2)
}

我们可以使用 iris 数据集来传递参数。我们使用 pmap() 来指定变量和函数。列表列名为 sq:

iris %>% 
  as_tibble() %>% 
  mutate(sq = pmap(list(Sepal.Length, Sepal.Width), square_it))

# A tibble: 150 x 6
   Sepal.Length Sepal.Width Petal.Length Petal.Width Species sq              
          <dbl>       <dbl>        <dbl>       <dbl> <fct>   <list>          
 1          5.1         3.5          1.4         0.2 setosa  <tibble [1 x 2]>
 2          4.9         3            1.4         0.2 setosa  <tibble [1 x 2]>
 3          4.7         3.2          1.3         0.2 setosa  <tibble [1 x 2]>
 4          4.6         3.1          1.5         0.2 setosa  <tibble [1 x 2]>
 5          5           3.6          1.4         0.2 setosa  <tibble [1 x 2]>
 6          5.4         3.9          1.7         0.4 setosa  <tibble [1 x 2]>
 7          4.6         3.4          1.4         0.3 setosa  <tibble [1 x 2]>
 8          5           3.4          1.5         0.2 setosa  <tibble [1 x 2]>
 9          4.4         2.9          1.4         0.2 setosa  <tibble [1 x 2]>
10          4.9         3.1          1.5         0.1 setosa  <tibble [1 x 2]>
# ... with 140 more rows

只需将 %>% unnest(sq) 添加到该代码，即可生成列 x 和 y:

iris %>% 
  as_tibble() %>% 
  mutate(sq = pmap(list(Sepal.Length, Sepal.Width), square_it)) %>%
  unnest(sq)

# A tibble: 150 x 7
   Sepal.Length Sepal.Width Petal.Length Petal.Width Species     x     y
          <dbl>       <dbl>        <dbl>       <dbl> <fct>   <dbl> <dbl>
 1          5.1         3.5          1.4         0.2 setosa   26.0 12.2 
 2          4.9         3            1.4         0.2 setosa   24.0  9   
 3          4.7         3.2          1.3         0.2 setosa   22.1 10.2 
 4          4.6         3.1          1.5         0.2 setosa   21.2  9.61
 5          5           3.6          1.4         0.2 setosa   25   13.0 
 6          5.4         3.9          1.7         0.4 setosa   29.2 15.2 
 7          4.6         3.4          1.4         0.3 setosa   21.2 11.6 
 8          5           3.4          1.5         0.2 setosa   25   11.6 
 9          4.4         2.9          1.4         0.2 setosa   19.4  8.41
10          4.9         3.1          1.5         0.1 setosa   24.0  9.61
# ... with 140 more rows

Answer 2

您可能想使用 group_map :

library(dplyr)
useless_dupes <- function(x){list(x1=x, x2=x)}

mtcars %>%
  group_by(cyl) %>%
  group_map(~as_tibble(useless_dupes(.$disp)))
#> # A tibble: 32 x 3
#> # Groups:   cyl [3]
#>      cyl    x1    x2
#>    <dbl> <dbl> <dbl>
#>  1     4 108   108  
#>  2     4 147.  147. 
#>  3     4 141.  141. 
#>  4     4  78.7  78.7
#>  5     4  75.7  75.7
#>  6     4  71.1  71.1
#>  7     4 120.  120. 
#>  8     4  79    79  
#>  9     4 120.  120. 
#> 10     4  95.1  95.1
#> # ... with 22 more rows

'mutate' 在 R 的 tidyverse 中通过单个 fn 调用添加两列

'mutate' to add two columns with a single fn-call in tidyverse in R

r

processing-efficiency

dplyr

magrittr