get_egs

aligner.multiprocessing.get_egs(config, ali_dir, valid_uttlist, train_subset_uttlist)[source]

Multiprocessing function that gets training examples for the neural net

See:

for more details on the Kaldi binary this runs.

Also see https://github.com/kaldi-asr/kaldi/blob/master/egs/wsj/s5/steps/online/nnet2/get_egs.sh for the original bash script that this function was based on.

Parameters:
nnet_dir : str

Directory of nnet training

egs_dir : str

Directory where examples will go

training_dir : str

Directory of combined “split” files (e.g. utt2spk, cmvn.scp)

split_dir : str

Directory of training data split into the number of jobs

ali_dir : str

Directory of previous alignment

ivector_dir : str

Directory of the extracted i-vectors

feats : str

Path to features

valid_uttlist : str

Path to list of valid utterances

train_subset_uttlist : str

Path to subset list of training utterances

config : NnetBasicConfig

Configuration object for training

num_jobs : int

The number of processes to use in calculation