NLTK 找不到 Stanford 词性标注器模型文件

NLTK can't find the Stanford POS tagger model file

我正在尝试使用 NLTK 中的 StanfordPOSTagger。我下载了 Stanford POS full tagger。我设置了

CLASSPATH=/home/waheeb/Stanford_Tools/stanford-postagger-full-2015-12-09  /stanford-postagger.jar
STANFORD_MODELS=home/waheeb/Stanford_Tools/stanford-postagger-full-2015-12-09/models

当我在 python 中键入以下内容时:

>>> from nltk.tag import StanfordPOSTagger
>>> st = StanfordPOSTagger('english-bidirectional-distsim.tagger')

我收到以下错误:

Traceback (most recent call last):
File "<stdin>", line 1, in <module>
File "/home/waheeb/anaconda2/lib/python2.7/site-packages/nltk/tag /stanford.py", line 136, in __init__
super(StanfordPOSTagger, self).__init__(*args, **kwargs)
File "/home/waheeb/anaconda2/lib/python2.7/site-packages/nltk/tag/stanford.py", line 56, in __init__
env_vars=('STANFORD_MODELS',), verbose=verbose)
File "/home/waheeb/anaconda2/lib/python2.7/site-packages /nltk/internals.py", line 573, in find_file
file_names, url, verbose))
File "/home/waheeb/anaconda2/lib/python2.7/site-packages/nltk/internals.py", line 567, in find_file_iter
raise LookupError('\n\n%s\n%s\n%s' % (div, msg, div))

查找错误:

=========================================================================
NLTK was unable to find the english-bidirectional-distsim.tagger file!
Use software specific configuration paramaters or set the TANFORD_MODELS  environment variable.
==========================================================================

这是为什么??

您忘记在调用 python 脚本之前在命令行中使用 export。 IE。

alvas@ubi:~$ export STANFORDTOOLSDIR=$HOME
alvas@ubi:~$ export CLASSPATH=$STANFORDTOOLSDIR/stanford-postagger-full-2015-12-09/stanford-postagger.jar
alvas@ubi:~$ export STANFORD_MODELS=$STANFORDTOOLSDIR/stanford-postagger-full-2015-12-09/models
alvas@ubi:~$ python

有关详细信息,请参阅 https://gist.github.com/alvations/e1df0ba227e542955a8a


类似的问题包括:

  • Setting NLTK with Stanford NLP (both StanfordNERTagger and StanfordPOSTagger) for Spanish
  • Error using Stanford POS Tagger in NLTK Python
  • Can't make Stanford POS tagger working in nltk
  • trouble importing stanford pos tagger into nltk
  • Stanford Parser and NLTK