Kaldi: qsub 的输出是: qsub: illegal -c value "" when trying to 运行 the Common Voice recipe
Kaldi: Output of qsub was: qsub: illegal -c value "" when trying to run the Common Voice recipe
我正在尝试 运行 Kaldi 的 Common Voice 食谱 (kaldi/egs/commonvoice/s5/run.sh
) 在我的计算机上(即不在集群上)。它崩溃并显示错误消息 Output of qsub was: qsub: illegal -c value ""
。可能是什么问题?
具体来说,这是整个错误堆栈:
[...]
Succeeded in formatting LM: 'data/local/lm.gz'
steps/make_mfcc.sh --cmd queue.pl --mem 2G --nj 20 data/valid_train exp/make_mfcc/valid_train mfcc
utils/validate_data_dir.sh: Successfully validated data-directory data/valid_train
steps/make_mfcc.sh: [info]: no segments file exists: assuming wav.scp indexed by utterance.
queue.pl: Error submitting jobs to queue (return status was 512)
queue log file is exp/make_mfcc/valid_train/q/make_mfcc_valid_train.log, command was qsub -v PATH -cwd -S /bin/bash -j y -l arch=*64* -o exp/make_mfcc/valid_train/q/make_mfcc_valid_train.log -l mem_free=2G,ram_free=2G -t 1:20 /home/ubuntu/kaldi/egs/commonvoice/s5/exp/make_mfcc/valid_train/q/make_mfcc_valid_train.sh >>exp/make_mfcc/valid_train/q/make_mfcc_valid_train.log 2>&1
Output of qsub was: qsub: illegal -c value ""
usage: qsub [-a date_time] [-A account_string] [-b secs]
[-c [ none | { enabled | periodic | shutdown |
depth=<int> | dir=<path> | interval=<minutes>}... ]
[-C directive_prefix] [-d path] [-D path]
[-e path] [-h] [-I] [-j oe] [-k {oe}] [-l resource_list] [-m n|{abe}]
[-M user_list] [-N jobname] [-o path] [-p priority] [-P proxy_user] [-q queue]
[-r y|n] [-S path] [-t number_to_submit] [-T type] [-u user_list] [-w] path
[-W otherattributes=value...] [-v variable_list] [-V ] [-x] [-X] [-z] [script]
我运行 Common Voice 配方如下:
## Retrieving Kaldi source code
git clone https://github.com/kaldi-asr/kaldi.git
cd kaldi
export KALDI_GIT_ROOT=`pwd`
echo $KALDI_GIT_ROOT
## Compile Kaldi tools
cd $KALDI_GIT_ROOT/tools
sudo apt-get install -y zlib1g-dev automake autoconf libtool libatlas3-base subversion
make -j 24
## Compile Kaldi
cd $KALDI_GIT_ROOT/src
./configure
make -j 24
## Common Voice
cd $KALDI_GIT_ROOT/tools
sudo ./install_srilm.sh
# required by install_sequitur.sh
curl https://bootstrap.pypa.io/get-pip.py -o get-pip.py
sudo python get-pip.py
sudo apt-get install -y swig python-setuptools python-dev
sudo pip install numpy
sudo ./extras/install_sequitur.sh
# For qsub
sudo apt-get install -y torque-server torque-client torque-mom torque-pam
# Start Kaldi training on Common Voice
sudo apt install sox libsox-fmt-mp3 # run.sh will convert Common Voice MP3s to WAV using sox
cd $KALDI_GIT_ROOT/egs/commonvoice/s5
source $KALDI_GIT_ROOT/tools/env.sh
./run.sh
我使用 Ubuntu 16.04.4 LTS。
问题是 qsub
是错误的:应该使用 Sun Grid Engine, and not the one used in the Torque program 中包含的。
sudo apt-get remove torque-server torque-client torque-mom torque-pam
sudo apt-get install gridengine-master gridengine-client gridengine-exec
请注意,如果Sun Grid Engine can be tricky to configure并且如果一个人使用一台计算机,那么通过它并没有多大优势。
要在没有 Sun Grid Engine 的情况下使用 Common Voice 训练 Kaldi,可以将 queue.pl
替换为 run.pl
in /kaldi/egs/commonvoice/s5/cmd.sh
:
export train_cmd="run.pl --mem 2G"
export decode_cmd="run.pl --mem 4G"
export mkgraph_cmd="run.pl --mem 8G"
我正在尝试 运行 Kaldi 的 Common Voice 食谱 (kaldi/egs/commonvoice/s5/run.sh
) 在我的计算机上(即不在集群上)。它崩溃并显示错误消息 Output of qsub was: qsub: illegal -c value ""
。可能是什么问题?
具体来说,这是整个错误堆栈:
[...]
Succeeded in formatting LM: 'data/local/lm.gz'
steps/make_mfcc.sh --cmd queue.pl --mem 2G --nj 20 data/valid_train exp/make_mfcc/valid_train mfcc
utils/validate_data_dir.sh: Successfully validated data-directory data/valid_train
steps/make_mfcc.sh: [info]: no segments file exists: assuming wav.scp indexed by utterance.
queue.pl: Error submitting jobs to queue (return status was 512)
queue log file is exp/make_mfcc/valid_train/q/make_mfcc_valid_train.log, command was qsub -v PATH -cwd -S /bin/bash -j y -l arch=*64* -o exp/make_mfcc/valid_train/q/make_mfcc_valid_train.log -l mem_free=2G,ram_free=2G -t 1:20 /home/ubuntu/kaldi/egs/commonvoice/s5/exp/make_mfcc/valid_train/q/make_mfcc_valid_train.sh >>exp/make_mfcc/valid_train/q/make_mfcc_valid_train.log 2>&1
Output of qsub was: qsub: illegal -c value ""
usage: qsub [-a date_time] [-A account_string] [-b secs]
[-c [ none | { enabled | periodic | shutdown |
depth=<int> | dir=<path> | interval=<minutes>}... ]
[-C directive_prefix] [-d path] [-D path]
[-e path] [-h] [-I] [-j oe] [-k {oe}] [-l resource_list] [-m n|{abe}]
[-M user_list] [-N jobname] [-o path] [-p priority] [-P proxy_user] [-q queue]
[-r y|n] [-S path] [-t number_to_submit] [-T type] [-u user_list] [-w] path
[-W otherattributes=value...] [-v variable_list] [-V ] [-x] [-X] [-z] [script]
我运行 Common Voice 配方如下:
## Retrieving Kaldi source code
git clone https://github.com/kaldi-asr/kaldi.git
cd kaldi
export KALDI_GIT_ROOT=`pwd`
echo $KALDI_GIT_ROOT
## Compile Kaldi tools
cd $KALDI_GIT_ROOT/tools
sudo apt-get install -y zlib1g-dev automake autoconf libtool libatlas3-base subversion
make -j 24
## Compile Kaldi
cd $KALDI_GIT_ROOT/src
./configure
make -j 24
## Common Voice
cd $KALDI_GIT_ROOT/tools
sudo ./install_srilm.sh
# required by install_sequitur.sh
curl https://bootstrap.pypa.io/get-pip.py -o get-pip.py
sudo python get-pip.py
sudo apt-get install -y swig python-setuptools python-dev
sudo pip install numpy
sudo ./extras/install_sequitur.sh
# For qsub
sudo apt-get install -y torque-server torque-client torque-mom torque-pam
# Start Kaldi training on Common Voice
sudo apt install sox libsox-fmt-mp3 # run.sh will convert Common Voice MP3s to WAV using sox
cd $KALDI_GIT_ROOT/egs/commonvoice/s5
source $KALDI_GIT_ROOT/tools/env.sh
./run.sh
我使用 Ubuntu 16.04.4 LTS。
问题是 qsub
是错误的:应该使用 Sun Grid Engine, and not the one used in the Torque program 中包含的。
sudo apt-get remove torque-server torque-client torque-mom torque-pam
sudo apt-get install gridengine-master gridengine-client gridengine-exec
请注意,如果Sun Grid Engine can be tricky to configure并且如果一个人使用一台计算机,那么通过它并没有多大优势。
要在没有 Sun Grid Engine 的情况下使用 Common Voice 训练 Kaldi,可以将 queue.pl
替换为 run.pl
in /kaldi/egs/commonvoice/s5/cmd.sh
:
export train_cmd="run.pl --mem 2G"
export decode_cmd="run.pl --mem 4G"
export mkgraph_cmd="run.pl --mem 8G"