使用 R 和 Table 函数，我可以使用更大数据集中的 4 个特定列来创建一个二乘二的频率 table

Question

我有几个专栏

 Location|Yes M & M Peanuts| No M& M Peanuts | Yes M & M Almond| No M& M Almond|Location
               5                 10                 20             6                 NYC

我想使用 table 函数或更方便的方法将这些列转换为

              Yes | No
M & M Peanuts  5    10
M & M Almond   20    6

更新示例

df2 <- structure(list(`Yes M & M Peanuts` = 5L, `No M & M Peanuts` = 10L, 
                      `Yes M & M Almond` = 20L, `No M & M Almond` = 6L, "Location" = "NYC"), class = "data.frame", 
                 row.names = c(NA, 
                               -1L))

Answer 1

这可以通过 pivot_longer 轻松完成，指定 names_pattern 提取值 (.value) 部分以进入列 'Yes'，'No' 和另一列 'grp' 提取列名的后缀部分。然后，'grp' 列可以用 column_to_rownames

转换为行名

library(dplyr)
library(tidyr)
library(tibble)
df1 %>% 
  pivot_longer(cols = everything(), names_to = c(".value", "grp"),
        names_pattern = "(Yes|No)\s*(.*)") %>%
  column_to_rownames('grp')

-输出

#               Yes No
#M & M Peanuts   5 10
#M & M Almond   20  6

在更新的 post 中使用 OP 的第二个数据集，我们需要指定没有 'Location'

的 cols

df2 %>% 
  pivot_longer(cols = -Location, names_to = c(".value", "grp"),
    names_pattern = "(Yes|No)\s*(.*)") %>%
  column_to_rownames('grp')
#              Location Yes No
#M & M Peanuts      NYC   5 10
#M & M Almond       NYC  20  6

数据

df1 <- structure(list(`Yes M & M Peanuts` = 5L, `No M & M Peanuts` = 10L, 
    `Yes M & M Almond` = 20L, `No M & M Almond` = 6L), class = "data.frame", 
    row.names = c(NA, 
-1L))

使用 R 和 Table 函数，我可以使用更大数据集中的 4 个特定列来创建一个二乘二的频率 table

Using R and Table function can i use 4 specific columns out of a larger dataset to create a two by two frequency table

r

frequency-table

数据