Word Term Matrix

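I would like to create a word matrix from some tweets. Every word that appears in the tweets should become a new variable, and for each tweet only the columns for the words that occur in its text should be set to 1.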

x <- data.frame("Tweet" = c("hi all","I need help"), "N" = 1, "Reaction" = c("Happy", "Sad"), stringsAsFactors = FALSE)

I would like to paste the expected output, but honestly I don't know how to do that, sorry.

You can do it like this:

library(tm)

x <- data.frame("Tweet" = c("hi all","I need help"), "N" = 1, "Reaction" = c("Happy", "Sad"), stringsAsFactors = FALSE)

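# Build a corpus with one document per tweet
corp <- VCorpus(VectorSource(x$Tweet))
# Lower wordLengths from its default of c(3, Inf) so short words such as "hi" and "I" are kept
dtm <- DocumentTermMatrix(corp, control = list(wordLengths = c(1, Inf)))
# Bind the term columns back onto the tweet text and the reaction
data.frame(Tweet = x$Tweet, as.matrix(dtm), Reaction = x$Reaction)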

            Tweet all help hi i need Reaction
1      hi all   1    0  1 0    0    Happy
2 I need help   0    1  0 1    1      Sad
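
If you would rather not depend on tm, roughly the same table can be built in base R. This is only a minimal sketch, and it rests on some assumptions of mine: tweets are lower-cased and split on whitespace only (no punctuation or stop-word handling), and each cell is a 0/1 presence flag rather than a count. The names `tokens`, `vocab`, `m` and `result` are just illustrative.

```r
x <- data.frame("Tweet" = c("hi all", "I need help"), "N" = 1,
                "Reaction" = c("Happy", "Sad"), stringsAsFactors = FALSE)

# Assumption: lower-case and split on whitespace; punctuation is not handled
tokens <- strsplit(tolower(x$Tweet), "\\s+")

# Every distinct word becomes one column
vocab <- sort(unique(unlist(tokens)))

# One row per tweet: 1 if the word occurs in that tweet, 0 otherwise
m <- do.call(rbind, lapply(tokens, function(tok) as.integer(vocab %in% tok)))
colnames(m) <- vocab

result <- data.frame(Tweet = x$Tweet, m, Reaction = x$Reaction)
result
```

For real tweets with punctuation, hashtags or repeated words, the tm approach above is likely the more robust choice, since this sketch does no cleaning at all.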