IBM Speech To Text - Get alternative with highest confidence and keyword found as first result

Question

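I'm using IBM Speech to Text. The results are okay, but I'm wondering why they are not sorted by highest confidence. Is there a parameter that returns them in that order, so that I can simply pick the first option? Ideally, a result would also be returned only if the passed keywords were found.

There is a max_alternatives parameter that defaults to 1, but even when I specify it explicitly, more than one alternative is returned.

I'm currently sorting the response manually, and I don't need a code example for that.

JSON example: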

   "result": {
        "result_index": 0,
        "results": [
            {
                "final": true,
                "alternatives": [
                    {
                        "transcript": "l\u00f6schen es tut echte betroffen ",
                        "confidence": 0.71
                    }
                ],
                "keywords_result": {}
            },
            {
                "final": true,
                "alternatives": [
                    {
                        "transcript": "sie sp\u00fcren dass eine \u00e4ra zu ende ",
                        "confidence": 0.91
                    }
                ],
                "keywords_result": {}
            },
            {
                "final": true,
                "alternatives": [
                    {
                        "transcript": "auto fahre eins zwei drei vier ",
                        "confidence": 0.95
                    }
                ],
                "keywords_result": {
                    "auto": [
                        {
                            "start_time": 6.31,
                            "end_time": 7.19,
                            "confidence": 0.99,
                            "normalized_text": "auto"
                        }
                    ]
                }
            }
        ]
    },
...

Answer 1

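The problem was end_of_phrase_silence_time. Whenever a pause of the default 0.8 seconds of silence is detected, the speech is split into an additional phrase. So what I was seeing were not competing recognition alternatives, but the separate phrases that actually occur in the recording. Each entry in "results" is one phrase, and its "alternatives" list holds a single entry, which matches max_alternatives being 1. See the end_of_phrase_silence_time parameter.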
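
Since each entry in "results" is a phrase rather than a competing alternative, sorting by confidence is probably not what you want. A minimal sketch of the other approach is to join the phrases in order and collect the keyword hits. It works on a response shaped like the JSON in the question; the names merge_phrases and sample are made up for illustration and are not part of any IBM SDK.

```python
def merge_phrases(response):
    """Join the top alternative of every phrase and gather keyword hits.

    Assumes the response layout shown in the question: a "results" list
    whose items each carry "alternatives" and "keywords_result".
    """
    transcripts = []
    keywords = {}
    for phrase in response.get("results", []):
        alternatives = phrase.get("alternatives", [])
        if alternatives:
            # The first alternative is the only one when max_alternatives is 1.
            transcripts.append(alternatives[0]["transcript"].strip())
        for word, hits in phrase.get("keywords_result", {}).items():
            keywords.setdefault(word, []).extend(hits)
    return " ".join(transcripts), keywords


# Trimmed copy of the response from the question.
sample = {
    "result_index": 0,
    "results": [
        {
            "final": True,
            "alternatives": [
                {"transcript": "l\u00f6schen es tut echte betroffen ", "confidence": 0.71}
            ],
            "keywords_result": {},
        },
        {
            "final": True,
            "alternatives": [
                {"transcript": "sie sp\u00fcren dass eine \u00e4ra zu ende ", "confidence": 0.91}
            ],
            "keywords_result": {},
        },
        {
            "final": True,
            "alternatives": [
                {"transcript": "auto fahre eins zwei drei vier ", "confidence": 0.95}
            ],
            "keywords_result": {
                "auto": [
                    {
                        "start_time": 6.31,
                        "end_time": 7.19,
                        "confidence": 0.99,
                        "normalized_text": "auto",
                    }
                ]
            },
        },
    ],
}

full_text, found = merge_phrases(sample)
print(full_text)
print(sorted(found))
```

Alternatively, passing a larger end_of_phrase_silence_time with the recognition request should keep short pauses from splitting the audio, so a recording like this one may come back as a single entry in "results".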

websphere

speech-to-text

ibm-speech-to-text