在 Python 中将语音转换为文本时出错

Question

我正在尝试使用语音识别库将语音转换为文本。但是当我运行代码显示关于音频类型的值错误时，我试图将文件格式更改为多种音频格式，例如：“PCM、WAV、AIFF、AIFF-C、Mp3、Mp4、FLAC， WebM，wav ...”通过重命名文件扩展名。但是，它仍然显示相同的错误。

错误：

ValueError: Audio file could not be read as PCM WAV, AIFF/AIFF-C, or Native FLAC; check if file is corrupted or in another format

代码：

import speech_recognition as sr
filename = "hello.mp3"
r = sr.Recognizer()
with sr.AudioFile(filename) as source:
   audio = r.record(source)
   text = r.recognize_google(audio)
print(text)

Answer 1

我认为重命名文件扩展名对您没有帮助，您应该使用文件转换器来确保音频数据以另一种格式正确编码。尝试使用 SoundConverter

在 Python 中将语音转换为文本时出错

Error when Converting Speech to Text in Python

python

audio

speech-recognition

speech-to-text

python-3.x