在 meta-toolkit 的 config.toml 中配置默认链
Configure default chain in config.toml in meta-toolkit
我想在 meta-toolkit 中使用默认过滤器链配置 config.toml 文件,但要指定每个过滤器。我计划对一些过滤器进行修改,因此我想要一个基线过滤器链。
我试过以下属性:
[[analyzers]]
method = "ngram-word"
ngram = 1
[[analyzers.filter]]
type = "icu-tokenizer"
[[analyzers.filter]]
type = "lowercase"
[[analyzers.filter]]
type = "alpha"
[[analyzers.filter]]
type = "length"
min = 2
max = 35
[[analyzers.filter]]
type = "list"
filename = "../data/lemur-stopwords.txt"
[[analyzers.filter]]
type = "porter2-stemmer"
[[analyzers.filter]]
type = "empty-sentence"
我得到:
token_stream_exception: what(): file required for list_filter config
属性应该是文件而不是文件名。
[[analyzers.filter]]
type = "list"
file = "../data/lemur-stopwords.txt"
我想在 meta-toolkit 中使用默认过滤器链配置 config.toml 文件,但要指定每个过滤器。我计划对一些过滤器进行修改,因此我想要一个基线过滤器链。
我试过以下属性:
[[analyzers]]
method = "ngram-word"
ngram = 1
[[analyzers.filter]]
type = "icu-tokenizer"
[[analyzers.filter]]
type = "lowercase"
[[analyzers.filter]]
type = "alpha"
[[analyzers.filter]]
type = "length"
min = 2
max = 35
[[analyzers.filter]]
type = "list"
filename = "../data/lemur-stopwords.txt"
[[analyzers.filter]]
type = "porter2-stemmer"
[[analyzers.filter]]
type = "empty-sentence"
我得到:
token_stream_exception: what(): file required for list_filter config
属性应该是文件而不是文件名。
[[analyzers.filter]]
type = "list"
file = "../data/lemur-stopwords.txt"