R:将字符变量编码为数字

R: Encode character variables into numeric

在 R 代码中,我有一个字符变量 var,其值为 "AA"、"AB"、"AC" 等

str(var)
chr [1:17003] "AA" "AA" "AA" "AA" "AB" "AB" ...

如何将它转换为数字变量,以便 "AA" 被编码为,例如1,"AB" - 为 2,等等

您可以将字符串转换为因子,然后再转换为数值。

x <- c("AA", "AB", "AB", "AC", "AA", "XY")
as.numeric(as.factor(x))
# [1] 1 2 2 3 1 4

或者,您可以使用 matchunique:

match(x, unique(x))
# [1] 1 2 2 3 1 4

you can use them by directly converting them into factors with labeling.

x$Country = factor(x$Country,
               levels = c('AA', 'AB', 'AC'),
               labels = c(1, 2, 3))