AcousticCorpusMixin
- class montreal_forced_aligner.corpus.acoustic_corpus.AcousticCorpusMixin(audio_directory=None, **kwargs)
Bases: CorpusMixin, FeatureConfigMixin
Mixin class for acoustic corpora
- Parameters:
audio_directory (str) – Extra directory to look for audio files
See also
CorpusMixin
For corpus parsing parameters
FeatureConfigMixin
For feature generation parameters
- Variables:
- calc_cmvn()
Calculate CMVN statistics for speakers
See also
- featbin/compute-cmvn-stats.cc
Relevant Kaldi binary
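To illustrate what per-speaker CMVN statistics involve, here is a minimal pure-Python sketch. It is not MFA's or Kaldi's implementation: the function names and the dictionary layout mapping speakers to frames are assumptions made for the example. It accumulates a per-dimension mean and variance over all frames of a speaker, then normalizes a frame with those statistics.

```python
import math


def cmvn_stats(frames):
    """Return per-dimension (means, variances) over a list of feature frames."""
    count = len(frames)
    dim = len(frames[0])
    means = [sum(f[d] for f in frames) / count for d in range(dim)]
    variances = [
        sum((f[d] - means[d]) ** 2 for f in frames) / count for d in range(dim)
    ]
    return means, variances


def apply_cmvn(frame, means, variances, floor=1e-10):
    """Normalize one frame to zero mean and unit variance per dimension."""
    return [
        (x - m) / math.sqrt(max(v, floor))
        for x, m, v in zip(frame, means, variances)
    ]


# Hypothetical toy data: speaker ID -> frames pooled over that speaker's utterances
speaker_frames = {
    "speaker_a": [[1.0, 10.0], [3.0, 14.0]],
    "speaker_b": [[0.0, 0.0], [2.0, 4.0], [4.0, 8.0]],
}
speaker_stats = {spk: cmvn_stats(frames) for spk, frames in speaker_frames.items()}
normalized = apply_cmvn([3.0, 14.0], *speaker_stats["speaker_a"])
```

Pooling frames by speaker rather than by utterance gives more stable statistics when individual utterances are short.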
- calc_fmllr(iteration=None)
Multiprocessing function that computes speaker adaptation transforms via feature-space Maximum Likelihood Linear Regression (fMLLR).
See also
CalcFmllrFunction
Multiprocessing helper function for each job
AcousticCorpusMixin.calc_fmllr_arguments
Job method for generating arguments for the helper function
- align_fmllr.sh
Reference Kaldi script
- train_sat.sh
Reference Kaldi script
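An fMLLR transform is an affine map applied to each feature vector of a speaker. The sketch below shows only how such a transform is applied, not how it is estimated. The function name, the nested-list matrix layout, and the toy values are assumptions for illustration and are not taken from MFA.

```python
def apply_fmllr(frame, transform):
    """Apply an affine transform W = [A | b] to one feature frame.

    `transform` has one row per output dimension; each row holds the
    linear weights followed by a bias term.
    """
    extended = list(frame) + [1.0]  # the appended 1.0 multiplies the bias column
    return [sum(w * x for w, x in zip(row, extended)) for row in transform]


# Hypothetical per-speaker transform for 2-dimensional features
speaker_transforms = {
    "speaker_a": [[1.0, 0.0, 0.5], [0.0, 2.0, -1.0]],
}
adapted = apply_fmllr([3.0, 4.0], speaker_transforms["speaker_a"])
```

Because the transform acts on features rather than on model parameters, a single acoustic model can serve every speaker once the features have been adapted.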
- calc_fmllr_arguments(iteration=None)
Generate Job arguments for CalcFmllrFunction
- Returns:
Arguments for processing
- Return type:
list[CalcFmllrArguments]
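The "one arguments object per job" pattern used by the `*_arguments` methods can be sketched as follows. Everything here is hypothetical: `ToyJobArguments`, `toy_job_arguments`, and the round-robin split by speaker are stand-ins chosen for the example and do not reflect the real fields of CalcFmllrArguments or how MFA assigns work to jobs.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class ToyJobArguments:
    """Hypothetical stand-in for a per-job arguments object."""

    job_id: int
    speaker_ids: List[str]
    iteration: Optional[int] = None


def toy_job_arguments(speaker_ids, num_jobs, iteration=None):
    """Split speakers round-robin and build one arguments object per job."""
    return [
        ToyJobArguments(job_id, speaker_ids[job_id::num_jobs], iteration)
        for job_id in range(num_jobs)
    ]


args = toy_job_arguments(["s1", "s2", "s3", "s4", "s5"], num_jobs=2, iteration=3)
```

Each worker then receives one self-contained arguments object, which keeps the helper function free of shared state.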
- compute_vad()
Compute Voice Activity Detection features over the corpus
See also
ComputeVadFunction
Multiprocessing helper function for each job
AcousticCorpusMixin.compute_vad_arguments
Job method for generating arguments for the helper function
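For readers unfamiliar with voice activity detection, the following is a simplified energy-based sketch. It is an assumption-laden illustration rather than the logic of ComputeVadFunction: the function name, parameter names, and default values are hypothetical, and production VAD typically also smooths decisions over neighboring frames.

```python
def energy_vad(log_energies, threshold=5.0, mean_scale=0.5):
    """Mark a frame as speech (1) when its log energy exceeds a cutoff.

    The cutoff is a fixed threshold plus a fraction of the mean log energy,
    so it adapts to the overall loudness of the recording.
    """
    mean_energy = sum(log_energies) / len(log_energies)
    cutoff = threshold + mean_scale * mean_energy
    return [1 if e > cutoff else 0 for e in log_energies]


# Hypothetical per-frame log energies: quiet, loud, loud, quiet
decisions = energy_vad([2.0, 12.0, 14.0, 4.0])
```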
- compute_vad_arguments()
Generate Job arguments for ComputeVadFunction
- Returns:
Arguments for processing
- Return type:
list[VadArguments]
- final_feature_arguments()
Generate Job arguments for MfccFunction
- Returns:
Arguments for processing
- Return type:
list[MfccArguments]
- get_feat_dim()
Calculate the feature dimension for the corpus
- Returns:
Dimension of feature vectors
- Return type:
int
- load_reference_alignments(reference_directory)
Load reference alignments to use in alignment evaluation from a directory
- Parameters:
reference_directory (Path) – Directory containing reference alignments
- mfcc()
Multiprocessing function that converts sound files into MFCCs.
See the Kaldi feat page for an overview of feature generation in Kaldi.
See also
MfccFunction
Multiprocessing helper function for each job
AcousticCorpusMixin.mfcc_arguments
Job method for generating arguments for the helper function
- make_mfcc.sh
Reference Kaldi script
- mfcc_arguments()
Generate Job arguments for MfccFunction
- Returns:
Arguments for processing
- Return type:
list[MfccArguments]
- property no_transcription_files
List of sound files without text files
- property transcriptions_without_wavs
List of text files without sound files
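The two properties above report unpaired files. A rough sketch of how such a check could work is given below, assuming that sound and text files are paired when they share a directory and a file stem. The function name and the extension lists are assumptions for the example, and MFA's own corpus loading may pair files differently.

```python
from pathlib import PurePosixPath


def find_unpaired(paths, audio_exts=(".wav", ".flac"),
                  text_exts=(".lab", ".txt", ".TextGrid")):
    """Split files into sound files lacking text and text files lacking sound.

    Files are paired when they share a parent directory and a stem.
    """
    audio, text = {}, {}
    for p in map(PurePosixPath, paths):
        key = (p.parent, p.stem)
        if p.suffix in audio_exts:
            audio[key] = p
        elif p.suffix in text_exts:
            text[key] = p
    no_transcription = sorted(str(audio[k]) for k in audio.keys() - text.keys())
    no_sound = sorted(str(text[k]) for k in text.keys() - audio.keys())
    return no_transcription, no_sound


# Hypothetical corpus listing
files = ["spk1/a.wav", "spk1/a.lab", "spk1/b.wav", "spk2/c.TextGrid"]
missing_text, missing_sound = find_unpaired(files)
```

Checking both lists before training or alignment is a quick way to catch misnamed files, since unpaired files cannot contribute utterances.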