['kenlm/build/bin/build_binary', '-a', '255', '-q', '8', '-v', 'trie', 'lm_filtered.arpa', '/content/lm.binary ']' 返回非零退出状态 1

['kenlm/build/bin/build_binary', '-a', '255', '-q', '8', '-v', 'trie', 'lm_filtered.arpa', '/content/lm.binary']' returned non-zero exit status 1

在构建 lm binay 以创建 scorer doe deepspeech 模型的过程中,我一次又一次地收到以下错误

subprocess.CalledProcessError: Command '['/content/kenlm/build/bin/build_binary', '-a', '255', '-q', '8', '-v', 'trie', '/content/lm_filtered.arpa', '/content/lm.binary']' returned non-zero exit status 1.

我使用的命令如下

!python /content/DeepSpeech/data/lm/generate_lm.py \
--input_txt /content/transcripts.txt \
--output_dir /content/scorer/ \
--top_k 50000 \
--kenlm_bins /content/kenlm/build/bin/ \
--arpa_order 5 --max_arpa_memory "95%" --arpa_prune "0|0|1" \
--binary_a_bits 255 --binary_q_bits 8 --binary_type trie

以下对我有用 前往

DeepSpeech -> data -> lm -> generate_lm.py

现在在里面找到如下代码栈

subprocess.check_call(
        [
            os.path.join(args.kenlm_bins, "build_binary"),
            "-a",
            str(args.binary_a_bits),
            "-q",
            str(args.binary_q_bits),
            "-v",
            args.binary_type,
            filtered_path,
            binary_path,
        ]

通过在其中添加“-s”标志来调整代码,如下所示

subprocess.check_call(
    [
        os.path.join(args.kenlm_bins, "build_binary"),
        "-a",
        str(args.binary_a_bits),
        "-q",
        str(args.binary_q_bits),
        "-v",
        args.binary_type,
        filtered_path,
        binary_path,
        "-s"
    ]

现在你的命令 运行 没问题