如何使用 LUIS 短语列表来提高模式识别？

How can you use LUIS Phrase Lists to boost Pattern recognition?

azure-language-understanding

这个问题最初是作为对与 LUIS 模式相关的另一个问题 () 的评论而提出的。将其迁移到自己的 post 以使社区更容易搜索信息。

"How do you get [LUIS] phrase list to work properly? I have never been able to get it to recognize the values in the actual entity. I've always had to create a list entity and add synonyms to the normalized values."

总结

LUIS 中的模式首先取决于能够检测到实体以便正确匹配模式。
使用简单实体和短语列表来提高对该简单实体的检测。

短语列表用于 "boost" 另一个实体的信号，但至少在某些情况下依赖于您尝试增强信号的实体有足够的数据来识别它。

所以假设您正确地定义了一个 简单的 Apple 实体 ，并且 LUIS 正确地检测值，例如 Golden Delicious 、Granny Smith、Cosmic，等等。它甚至可以正确地检测到一个全新的（人造的）苹果作物，Flamingo，即使你没有明确地将 Flamingo 值标记为 Apple话语中的实体。

LUIS 使用基于话语上下文的机器学习来识别，"Oh yeah, normally an Apple entity can occur here, so Flamingo must be an Apple." Flamingo 也是一个英文单词，因此 LUIS 已经 "background knowledge" 这个单词（例如，可以根据英语知道它是一个名词）。

但是，如果一所大学创建了一个英文中不存在的新苹果作物名称，例如 Jirio，那会怎样呢？ LUIS 没有这个词的背景知识，就像 Flamingo 一样。因此，您可以创建一个 短语列表 来帮助将 Jirio 正确识别为 Apple。

我可以创建一个新的 Apples 短语列表，并包含很多值，例如 Golden Delicious、Granny Smith、Jirio 等。当 LUIS 看起来：“我不知道 Jirio 是什么，但我知道 Golden Delicious 是什么。根据 [=25]，Jirio 看起来与 Golden Delicious 是一回事=] 短语列表，因此更有可能将 Jirio 标记为 Apple 实体。

也有帮助的相关 Whosebug 帖子（带屏幕截图！）:

How to Improve Entity Detection in your LUIS App

如何使用 LUIS 短语列表来提高模式识别？

How can you use LUIS Phrase Lists to boost Pattern recognition?

azure-language-understanding