注释语料库时如何解释 synaxnet 的输出

How to interpret the output of a synaxnet when annotating a corpus

我使用预训练的 syntaxnet 模型（即使用 Parse McParseface）对语料库进行了注释。我在理解输出时遇到问题。输出中有两个指标。那些用于 POS 标记和依赖项解析？如果是，哪一个是词性标注性能，哪个是依赖解析性能？

这是输出：

INFO:tensorflow:Total processed documents: 21710 INFO:tensorflow:num correct tokens: 454150 INFO:tensorflow:total tokens: 560993 INFO:tensorflow:Seconds elapsed in evaluation: 1184.63, eval metric: 80.95% INFO:tensorflow:Processed 206 documents INFO:tensorflow:Total processed documents: 21710 INFO:tensorflow:num correct tokens: 291851 INFO:tensorflow:total tokens: 504496 INFO:tensorflow:Seconds elapsed in evaluation: 1193.17, eval metric: 57.85%

如果您正在使用 https://github.com/tensorflow/models/blob/master/syntaxnet/syntaxnet/demo.sh 那么第一个指标是 POS 标签的准确性，第二个指标是 UAS。只有当您输入的 conll 数据包含黄金 POS 标签和黄金依赖项时，它们才有意义。