Sentence to Word Table with R
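I have some sentences, and I want to split each one into its words so that every sentence becomes one row of a table. The problem is that the words of the shorter sentences get repeated (recycled) to fill the row out to the length of the longest sentence, which I don't want. Regardless of how long a sentence is, I want its row to contain each of its words only once.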
sentence <- c("case sweden", "meeting minutes ht board meeting st march now also attachment added agenda today s board meeting", "draft meeting minutes board meeting final meeting minutes ht board meeting rd april")
sentence <- cbind(sentence)
word_table <- do.call(rbind, strsplit(as.character(sentence), " "))
test <- cbind(sentence, word_table)
This is what I get now,
This is what I want,
I mean, without the repetition.
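The repetition comes from `rbind()` recycling shorter vectors to the length of the longest one. A minimal sketch of that behaviour, using made-up vectors `short` and `long` (hypothetical names, not part of the original data):

```r
# rbind() recycles the shorter vector until it matches the longer one.
short <- c("case", "sweden")
long  <- c("draft", "meeting", "minutes", "board")

m <- rbind(short, long)

# Row 1 repeats its two words to fill four columns:
# "case" "sweden" "case" "sweden"
print(m)
```

This is the same effect seen in `word_table`, only on a smaller scale.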
Solution from rawr:
sentence <- c("case sweden", "meeting minutes ht board meeting st march now also attachment added agenda today s board meeting", "draft meeting minutes board meeting final meeting minutes ht board meeting rd april")
dd <- read.table(text = paste(sentence, collapse = '\n'), fill = TRUE)
test <- cbind(sentence, dd)
Or, stripping any newlines embedded in the sentences first,
cc <- read.table(text = paste(gsub('\n', '', sentence), collapse = '\n'), fill = TRUE)
test1 <- cbind(sentence, cc)
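For comparison, here is a sketch of a different approach that does not go through `read.table()`: pad each word vector with `NA` up to the length of the longest sentence before binding, so there is nothing left to recycle. This is not rawr's method; the names `words`, `max_len`, `padded` and `test2` are introduced here only for illustration. It may be worth considering for larger inputs, since `read.table()` decides the number of columns from the first five lines of input.

```r
sentence <- c("case sweden",
              "meeting minutes ht board meeting",
              "draft meeting minutes")

# One character vector of words per sentence.
words <- strsplit(sentence, " ", fixed = TRUE)

# Length of the longest sentence, in words.
max_len <- max(lengths(words))

# Extending a vector with `length<-` fills the new slots with NA.
padded <- lapply(words, function(w) {
  length(w) <- max_len
  w
})

# All rows now have equal length, so rbind() does not recycle.
word_table <- do.call(rbind, padded)
test2 <- data.frame(sentence, word_table, stringsAsFactors = FALSE)
print(test2)
```

Shorter sentences end in `NA` cells instead of repeated words; replace them with `""` afterwards if empty strings are preferred.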
Thanks.