使用 pocketsphinx 未获得所需的输出

Question

我的问题是我正在使用音频文件并将其转换为文本

我的音频文件包含 "HI HELLO" 但我得到的输出为 针对印度的卖空者 我不知道怎么做？

我使用的代码如下。

import sys,os


  def decodeSpeech(hmmd,lmdir,dictp,wavfile):
    """
    Decodes a speech file
    """

    try:
        import pocketsphinx as ps
        import sphinxbase

    except:
        print """Pocket sphinx and sphixbase is not installed
        in your system. Please install it with package manager.
        """

    speechRec = ps.Decoder(hmm = hmmd, lm = lmdir, dict = dictp)
    wavFile = file(wavfile,'rb')
    wavFile.seek(44)
    speechRec.decode_raw(wavFile)
    result = speechRec.get_hyp()

    return result[0]

if __name__ == "__main__":
    hmdir = "/usr/share/pocketsphinx/model/hmm/wsj1"
    lmd = "/usr/share/pocketsphinx/model/lm/wsj/wlist5o.3e-7.vp.tg.lm.DMP"
    dictd = "/usr/share/pocketsphinx/model/lm/wsj/wlist5o.dic"
    wavfile = sys.argv[1]
    recognised = decodeSpeech(hmdir,lmd,dictd,wavfile)

    print "%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%"
    print recognised
    print "%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%"

Answer 1

您输入的文件格式有误。确保它是 16khz 16bit 单声道 PCM 文件。

此外，您使用的是旧的 pocketsphinx。确保使用 http://github.com/cmusphinx/pocketsphinx-python

使用 pocketsphinx 未获得所需的输出

Not getting the desired output using pocketsphinx

python

pocketsphinx