如何将唯一行值更改为 R 中数据框中的另一组唯一行值？

Question

我有一个数据框，其中包含来自在线测试参与者的实验结果。在数据文件中，每个独特的参与者都通过在实验结束时提供给他们的随机生成的代码来识别。由于通过看起来像乱码的随机代码来识别每个人很麻烦，我想用 Participant_1、Participant_2 等可读标签替换这些代码

所以我想我需要一段代码来识别数据文件中每个唯一的随机代码，并将它们一个一个地替换为参与者标签。但我想不通，如有任何帮助，我们将不胜感激。

这是一段代码，显示了我的输出与我想要的输出。请注意，每个参与者都回答了不同数量的问题，因此这不能作为解析它们的简单方法。

Participant_Identifiers <- c(rep("QHDKWEFHWKHFFH", 4), rep("WHWIHFJNWFKWF", 7), rep("HEIFFFBBKQLSD", 3))

Participant_Scores <- c(20, 30, 59, 20, 47, 84, 21, 90,54,78,90,97)

df <- data.frame("Participant_Identifiers" = c(rep("QHDKWEFHWKHFFH", 4), rep("WHWIHFJNWFKWF", 7), rep("HEIFFFBBKQLSD", 3)), 
                  "Participant_Scores" = c(20, 30, 59, 20, 47, 84, 21, 90,54,78,90,97, 35, 67))

df

df_I_want <- data.frame("Participant_Identifiers" = c(rep("Participant_1", 4), rep("Participant_2", 7), rep("Participant_3", 3)), 
                       "Participant_Scores" = c(20, 30, 59, 20, 47, 84, 21, 90,54,78,90,97, 35, 67))

df_I_want

Answer 1

您可以执行以下操作：

# example 
ano <- replicate(5, paste(letters[sample(1:25, 12, replace = TRUE)], collapse = ""))
df <- data.frame(pat = sample(ano, 15, replace = TRUE), var = runif(15), 
  stringsAsFactors = FALSE )

# 1. create another data frame with the id you want 
patu <- unique(df$pat)
df_id <- data.frame(pat = patu, id = paste0("Participant_", seq_along(patu)))

# 2. merge with your df 
res <- merge(df, df_id)

Answer 2

您可以将 match 与 unique

一起使用

df$new_col <- paste0("Participant_", match(df$Participant_Identifiers, 
                                     unique(df$Participant_Identifiers)))

或者由于 Participant_Identifiers 是因子，您可以将它们转换为整数

df$new_col <- paste0("Participant_", as.integer(df$Participant_Identifiers))

如何将唯一行值更改为 R 中数据框中的另一组唯一行值？

How do I change unique row values into another set of unique row values in a data frame in R?

row

r

dataframe

data-wrangling