Google Speech-to-Text JupyterLab notebook 脚本 运行 在本地使用 Google Cloud SDK
Google Speech-to-Text JupyterLab notebook script run locally using Google Cloud SDK
我有以下 Python 脚本,它 运行 在 Google JupyterLab notebook 上没问题,但在本地使用 Google Cloud SDK 时不行:
from google.cloud import speech_v1p1beta1
def speech_to_text(audio_file):
client = speech_v1p1beta1.SpeechClient()
enable_word_time_offsets = True
enable_word_confidence = True
enable_automatic_punctuation = True
language_code = 'en-US'
config = {
'enable_word_confidence': enable_word_confidence,
'enable_word_time_offsets': enable_word_time_offsets,
'enable_automatic_punctuation': enable_automatic_punctuation,
'language_code': language_code
}
audio = {'uri': audio_file}
operation = client.long_running_recognize (config, audio)
response = client.recognize(config, audio)
result = response.results[0]
alternative = result.alternatives[0]
print(alternative)
speech_to_text('gs://my-bucket/my-folder/my-subfolder/my-audio-file.flac')
但是,当我尝试在虚拟环境中使用 Google Cloud SDK 在本地 运行 此脚本(WIN10,Python 3.8)时,我收到以下错误消息:
Traceback (most recent call last):
File "my-speech-to-text-script.py", line 32, in <module>
speech_to_text('gs://my-bucket/my-folder/my-subfolder/my-audio-file.flac')
File "my-speech-to-text-script.py", line 25, in speech_to_text
operation = client.long_running_recognize (config, audio)
TypeError: long_running_recognize() takes from 1 to 2 positional arguments but 3 were given
我按照本教程设置了虚拟环境 https://cloud.google.com/python/setup#windows 然后 运行 pip install google-cloud-speech
我做错了什么?
我通过更新我的代码解决了这个问题,我的代码和你的一样,可能是基于 Speech-to-Text 库的旧版本。
重要变化:
operation = client.long_running_recognize(request={"config":config, "audio":audio})
解决了我的问题,非常感谢。这是现在可以使用的代码:
from google.cloud import speech_v1p1beta1
def speech_to_text(audio_file):
client = speech_v1p1beta1.SpeechClient()
enable_word_time_offsets = True
enable_word_confidence = True
enable_automatic_punctuation = True
language_code = "en-US"
config = {
"enable_word_confidence": enable_word_confidence,
"enable_word_time_offsets": enable_word_time_offsets,
"enable_automatic_punctuation": enable_automatic_punctuation,
"language_code": language_code
}
audio = {"uri": audio_file}
operation = client.long_running_recognize(request={"config":config, "audio":audio})
response = client.recognize(request={"config":config, "audio":audio})
result = response.results[0]
alternative = result.alternatives[0]
print(alternative)
speech_to_text('gs://my-bucket/my-folder/my-subfolder/my-audio-file.flac')
我有以下 Python 脚本,它 运行 在 Google JupyterLab notebook 上没问题,但在本地使用 Google Cloud SDK 时不行:
from google.cloud import speech_v1p1beta1
def speech_to_text(audio_file):
client = speech_v1p1beta1.SpeechClient()
enable_word_time_offsets = True
enable_word_confidence = True
enable_automatic_punctuation = True
language_code = 'en-US'
config = {
'enable_word_confidence': enable_word_confidence,
'enable_word_time_offsets': enable_word_time_offsets,
'enable_automatic_punctuation': enable_automatic_punctuation,
'language_code': language_code
}
audio = {'uri': audio_file}
operation = client.long_running_recognize (config, audio)
response = client.recognize(config, audio)
result = response.results[0]
alternative = result.alternatives[0]
print(alternative)
speech_to_text('gs://my-bucket/my-folder/my-subfolder/my-audio-file.flac')
但是,当我尝试在虚拟环境中使用 Google Cloud SDK 在本地 运行 此脚本(WIN10,Python 3.8)时,我收到以下错误消息:
Traceback (most recent call last):
File "my-speech-to-text-script.py", line 32, in <module>
speech_to_text('gs://my-bucket/my-folder/my-subfolder/my-audio-file.flac')
File "my-speech-to-text-script.py", line 25, in speech_to_text
operation = client.long_running_recognize (config, audio)
TypeError: long_running_recognize() takes from 1 to 2 positional arguments but 3 were given
我按照本教程设置了虚拟环境 https://cloud.google.com/python/setup#windows 然后 运行 pip install google-cloud-speech
我做错了什么?
我通过更新我的代码解决了这个问题,我的代码和你的一样,可能是基于 Speech-to-Text 库的旧版本。
重要变化:
operation = client.long_running_recognize(request={"config":config, "audio":audio})
解决了我的问题,非常感谢。这是现在可以使用的代码:
from google.cloud import speech_v1p1beta1
def speech_to_text(audio_file):
client = speech_v1p1beta1.SpeechClient()
enable_word_time_offsets = True
enable_word_confidence = True
enable_automatic_punctuation = True
language_code = "en-US"
config = {
"enable_word_confidence": enable_word_confidence,
"enable_word_time_offsets": enable_word_time_offsets,
"enable_automatic_punctuation": enable_automatic_punctuation,
"language_code": language_code
}
audio = {"uri": audio_file}
operation = client.long_running_recognize(request={"config":config, "audio":audio})
response = client.recognize(request={"config":config, "audio":audio})
result = response.results[0]
alternative = result.alternatives[0]
print(alternative)
speech_to_text('gs://my-bucket/my-folder/my-subfolder/my-audio-file.flac')