How to set annotations to treat labels as nouns in the spaCy library, Python

I have this tokenized sentence:

[x] moved to [y] in [z].

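How can I set the annotations so that [x] and [y] are treated as nouns and [z] as a datetime? I looked at https://spacy.io/usage/linguistic-features#native-tokenizer-additions but either did not find what I wanted or missed it.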

You can set the POS with a tokenizer special case (https://spacy.io/usage/linguistic-features#special-cases):

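# `nlp` is assumed to be an already loaded spaCy pipeline
orth = "[z]"
# Keep "[z]" as a single token and attach a tag to it
nlp.tokenizer.add_special_case(orth, [{"ORTH": orth, "TAG": "NUM"}])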

(Honestly, it is a bit odd to have the tokenizer set tags, but for now the feature exists.)
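
On spaCy v3 and later, tokenizer special cases accept only ORTH and NORM, so setting a tag there raises an error. A rough equivalent, offered as a sketch rather than the one right way, is to let the special case keep each placeholder as a single token and assign the POS afterwards with the attribute_ruler component. This assumes spaCy v3 is installed and uses a blank English pipeline. Mapping [z] to NUM simply mirrors the tag used in the special case above, since the universal POS set has no dedicated datetime tag.

```python
import spacy

# Assumption: spaCy v3+, blank English pipeline (no trained model required)
nlp = spacy.blank("en")

# Special cases keep each bracketed placeholder together as one token
for placeholder in ("[x]", "[y]", "[z]"):
    nlp.tokenizer.add_special_case(placeholder, [{"ORTH": placeholder}])

# The attribute ruler assigns token attributes from Matcher-style patterns
ruler = nlp.add_pipe("attribute_ruler")
ruler.add(patterns=[[{"ORTH": {"IN": ["[x]", "[y]"]}}]], attrs={"POS": "NOUN"})
ruler.add(patterns=[[{"ORTH": "[z]"}]], attrs={"POS": "NUM"})

doc = nlp("[x] moved to [y] in [z].")
pos_by_token = {token.text: token.pos_ for token in doc}
for text, pos in pos_by_token.items():
    print(text, pos)
```

If a trained pipeline is used instead of a blank one, it probably already contains an attribute_ruler, in which case nlp.get_pipe("attribute_ruler") would be the place to add the patterns.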