LdaTrainer#

class montreal_forced_aligner.acoustic_modeling.LdaTrainer(subset=10000, num_leaves=2500, max_gaussians=15000, lda_dimension=40, uses_splices=True, splice_left_context=3, splice_right_context=3, random_prune=4.0, boost_silence=1.0, power=0.25, **kwargs)[source]#

Bases: TriphoneTrainer

Triphone trainer

Parameters:

subset (int) – Number of utterances to use, defaults to 10000
num_leaves (int) – Number of states in the decision tree, defaults to 2500
max_gaussians (int) – Number of gaussians in the decision tree, defaults to 15000
lda_dimension (int) – Dimensionality of the LDA matrix
uses_splices (bool) – Flag to use spliced and LDA calculation
splice_left_context (int or None) – Number of frames to splice on the left for calculating LDA
splice_right_context (int or None) – Number of frames to splice on the right for calculating LDA
random_prune (float) – This is approximately the ratio by which we will speed up the LDA and MLLT calculations via randomized pruning