在 R 中使用停用词 "tr" 时分析土耳其语文本的问题
Problem with Analysing Turkish Text while using stopwords "tr" with R
我正在用 R 分析土耳其语文本。但是在使用停用词时出现问题"tr"
虽然,在指示 link 中,土耳其语用 "tr" 表示,但它仍然无法识别它。
这是错误:
错误:语言 "tr" 在源 "snowball" 中不可用。有关支持的语言的详细信息,请参阅 stopwords_getlanguages
。
如有任何帮助,我们将不胜感激。
你快到了。您只需要更改 stopwords::stopwords
从哪里获取语言的 source
。
tldr:
对于 运行 您需要的代码:
stopwords::stopwords("tr", source = "stopwords-iso")
[1] "acaba" "acep" "adamakıllı" "adeta" "ait" "altmýþ" ...
解释:
这些是默认源中可用的语言 = "snowball"
stopwords::stopwords_getlanguages(source = "snowball")
[1] "da" "de" "en" "es" "fi" "fr" "hu" "ir" "it" "nl" "no" "pt" "ro" "ru" "sv"
要获取土耳其语,您只需将来源更改为 source = "stopwords-iso"
。您可以在下面看到此来源中可用的所有停用词。
stopwords::stopwords_getlanguages(source = "stopwords-iso")
[1] "af" "ar" "hy" "eu" "bn" "br" "bg" "ca" "zh" "hr" "cs" "da" "nl" "en" "eo" "et" "fi" "fr" "gl" "de" "el" "ha" "he" "hi" "hu" "id" "ga"
[28] "it" "ja" "ko" "ku" "la" "lt" "lv" "ms" "mr" "no" "fa" "pl" "pt" "ro" "ru" "sk" "sl" "so" "st" "es" "sw" "sv" "th" "tl" "tr" "uk" "ur"
[55] "vi" "yo" "zu"
这意味着 运行 您需要的代码:
stopwords::stopwords("tr", source = "stopwords-iso")
[1] "acaba" "acep" "adamakıllı" "adeta" "ait" "altmýþ" ...
我正在用 R 分析土耳其语文本。但是在使用停用词时出现问题"tr" 虽然,在指示 link 中,土耳其语用 "tr" 表示,但它仍然无法识别它。
这是错误:
错误:语言 "tr" 在源 "snowball" 中不可用。有关支持的语言的详细信息,请参阅 stopwords_getlanguages
。
如有任何帮助,我们将不胜感激。
你快到了。您只需要更改 stopwords::stopwords
从哪里获取语言的 source
。
tldr:
对于 运行 您需要的代码:
stopwords::stopwords("tr", source = "stopwords-iso")
[1] "acaba" "acep" "adamakıllı" "adeta" "ait" "altmýþ" ...
解释:
这些是默认源中可用的语言 = "snowball"
stopwords::stopwords_getlanguages(source = "snowball")
[1] "da" "de" "en" "es" "fi" "fr" "hu" "ir" "it" "nl" "no" "pt" "ro" "ru" "sv"
要获取土耳其语,您只需将来源更改为 source = "stopwords-iso"
。您可以在下面看到此来源中可用的所有停用词。
stopwords::stopwords_getlanguages(source = "stopwords-iso")
[1] "af" "ar" "hy" "eu" "bn" "br" "bg" "ca" "zh" "hr" "cs" "da" "nl" "en" "eo" "et" "fi" "fr" "gl" "de" "el" "ha" "he" "hi" "hu" "id" "ga"
[28] "it" "ja" "ko" "ku" "la" "lt" "lv" "ms" "mr" "no" "fa" "pl" "pt" "ro" "ru" "sk" "sl" "so" "st" "es" "sw" "sv" "th" "tl" "tr" "uk" "ur"
[55] "vi" "yo" "zu"
这意味着 运行 您需要的代码:
stopwords::stopwords("tr", source = "stopwords-iso")
[1] "acaba" "acep" "adamakıllı" "adeta" "ait" "altmýþ" ...