将语法与 SpeechRecognition 包一起使用时出现运行时错误
Runtime error when using grammars with the SpeechRecognition package
所以我尝试使用 speech_recognition.recognize_sphinx()
的语法,但是,我收到以下错误:
RuntimeError: Decoder_set_fsg returned -1
这是我的代码:
Main.py
# Dependencies:
import speech_recognition as sr
# Collect audio sample
r = sr.Recognizer()
print('Please say "perquisition":')
with sr.Microphone() as source:
audio_en = r.listen(source)
print('Processing...')
# Attempt to convert the speech to text
try:
print(r.recognize_sphinx(audio_en, grammar='perquisition.gram'))
except sr.UnknownValueError:
print("Sphinx could not understand audio")
except sr.RequestError as e:
print("Sphinx error; {0}".format(e))
perquisition.gram
#JSGF V1.0;
grammar perquisition;
// Grammar rule names should be [a-zA-Z0-9] only!
public <perquisition> = ( perquisition );
有什么想法吗?
这里发生的一些事情掩盖了潜在的错误。
speech_recognition.recognize_sphinx()
只是一些 CMUsphinx 命令的包装器,可以在第 746 行 here 找到。这个确切的问题有点混乱,所以我们将专注于下面的代码片段:
# Dependencies
import speech_recognition as sr
import os
import pocketsphinx as ps
# Manually point to the grammar file
grammar = 'search.gram'
try:
# Point to the model files
language_directory = os.path.join(os.path.dirname(os.path.realpath(__file__)), "pocketsphinx-data", "en-US")
acoustic_parameters_directory = os.path.join(language_directory, "acoustic-model")
language_model_file = os.path.join(language_directory, "language-model.lm.bin")
phoneme_dictionary_file = os.path.join(language_directory, "pronounciation-dictionary.dict")
# Create a decoder object with our custom parameters
config = ps.Decoder.default_config()
config.set_string("-hmm",
acoustic_parameters_directory) # set the path of the hidden Markov model (HMM) parameter files
config.set_string("-lm", language_model_file)
config.set_string("-dict", phoneme_dictionary_file)
config.set_string("-logfn", os.devnull) # <--- Prevents you from seeing the actual bug!!!
decoder = ps.Decoder(config)
# Convert grammar
grammar_path = os.path.abspath(os.path.dirname(grammar))
grammar_name = os.path.splitext(os.path.basename(grammar))[0]
fsg_path = "{0}/{1}.fsg".format(grammar_path, grammar_name)
if not os.path.exists(fsg_path): # create FSG grammar if not available
jsgf = ps.Jsgf(grammar)
rule = jsgf.get_rule("{0}.{0}".format(grammar_name))
fsg = jsgf.build_fsg(rule, decoder.get_logmath(), 7.5)
fsg.writefile(fsg_path)
print('Successful JSFG to FSG conversion!!!')
# Pass the fsg file into the decoder
decoder.set_fsg(grammar_name, fsg) # <--- BUG IS HERE!!!
except Exception as e:
print('Ach no! {0}'.format(e))
finally:
os.remove('search.fsg') # Remove again to help prove that the grammar to fsg conversion isn't at fault
运行 那个代码我们找到了错误弹出的行,而且日志信息已被关闭!启用它后,大量文本会被转储到终端中,这可能会很麻烦。在这种情况下,重新打开它们以发现...
会更有用
...
ERROR: "fsg_search.c", line 141: The word 'perquisition' is missing in the dictionary
...
现在我们有所进展。这给我们留下了两个选择之一。首先我们可以扫描字典(pocketsphinx.get_model_path()+'/cmudict-en-us.dict'
或类似的)来确定一个词是否存在。然后我们可以决定是简单地忽略这个词,还是将它添加到词典中。
添加到词典中并不一定是直截了当的...取决于它与词典中其他单词的相似程度,您可能能够摆脱它。否则,您还必须重新训练模型。可以找到关于如何执行此操作的更好的解释 here。享受吧。
所以我尝试使用 speech_recognition.recognize_sphinx()
的语法,但是,我收到以下错误:
RuntimeError: Decoder_set_fsg returned -1
这是我的代码:
Main.py
# Dependencies:
import speech_recognition as sr
# Collect audio sample
r = sr.Recognizer()
print('Please say "perquisition":')
with sr.Microphone() as source:
audio_en = r.listen(source)
print('Processing...')
# Attempt to convert the speech to text
try:
print(r.recognize_sphinx(audio_en, grammar='perquisition.gram'))
except sr.UnknownValueError:
print("Sphinx could not understand audio")
except sr.RequestError as e:
print("Sphinx error; {0}".format(e))
perquisition.gram
#JSGF V1.0;
grammar perquisition;
// Grammar rule names should be [a-zA-Z0-9] only!
public <perquisition> = ( perquisition );
有什么想法吗?
这里发生的一些事情掩盖了潜在的错误。
speech_recognition.recognize_sphinx()
只是一些 CMUsphinx 命令的包装器,可以在第 746 行 here 找到。这个确切的问题有点混乱,所以我们将专注于下面的代码片段:
# Dependencies
import speech_recognition as sr
import os
import pocketsphinx as ps
# Manually point to the grammar file
grammar = 'search.gram'
try:
# Point to the model files
language_directory = os.path.join(os.path.dirname(os.path.realpath(__file__)), "pocketsphinx-data", "en-US")
acoustic_parameters_directory = os.path.join(language_directory, "acoustic-model")
language_model_file = os.path.join(language_directory, "language-model.lm.bin")
phoneme_dictionary_file = os.path.join(language_directory, "pronounciation-dictionary.dict")
# Create a decoder object with our custom parameters
config = ps.Decoder.default_config()
config.set_string("-hmm",
acoustic_parameters_directory) # set the path of the hidden Markov model (HMM) parameter files
config.set_string("-lm", language_model_file)
config.set_string("-dict", phoneme_dictionary_file)
config.set_string("-logfn", os.devnull) # <--- Prevents you from seeing the actual bug!!!
decoder = ps.Decoder(config)
# Convert grammar
grammar_path = os.path.abspath(os.path.dirname(grammar))
grammar_name = os.path.splitext(os.path.basename(grammar))[0]
fsg_path = "{0}/{1}.fsg".format(grammar_path, grammar_name)
if not os.path.exists(fsg_path): # create FSG grammar if not available
jsgf = ps.Jsgf(grammar)
rule = jsgf.get_rule("{0}.{0}".format(grammar_name))
fsg = jsgf.build_fsg(rule, decoder.get_logmath(), 7.5)
fsg.writefile(fsg_path)
print('Successful JSFG to FSG conversion!!!')
# Pass the fsg file into the decoder
decoder.set_fsg(grammar_name, fsg) # <--- BUG IS HERE!!!
except Exception as e:
print('Ach no! {0}'.format(e))
finally:
os.remove('search.fsg') # Remove again to help prove that the grammar to fsg conversion isn't at fault
运行 那个代码我们找到了错误弹出的行,而且日志信息已被关闭!启用它后,大量文本会被转储到终端中,这可能会很麻烦。在这种情况下,重新打开它们以发现...
会更有用...
ERROR: "fsg_search.c", line 141: The word 'perquisition' is missing in the dictionary
...
现在我们有所进展。这给我们留下了两个选择之一。首先我们可以扫描字典(pocketsphinx.get_model_path()+'/cmudict-en-us.dict'
或类似的)来确定一个词是否存在。然后我们可以决定是简单地忽略这个词,还是将它添加到词典中。
添加到词典中并不一定是直截了当的...取决于它与词典中其他单词的相似程度,您可能能够摆脱它。否则,您还必须重新训练模型。可以找到关于如何执行此操作的更好的解释 here。享受吧。