TrainableAligner
- class montreal_forced_aligner.acoustic_modeling.trainer.TrainableAligner(training_configuration=None, phone_set_type=None, model_version=None, subset_word_count=3, minimum_utterance_length=2, **kwargs)
Bases: TranscriberMixin, TopLevelMfaWorker, ModelExporterMixin
Train acoustic model
- Parameters:
See also
CorpusAligner
For dictionary and corpus parsing parameters and alignment parameters
TopLevelMfaWorker
For top-level parameters
ModelExporterMixin
For model export parameters
- Variables:
param_dict (dict[str, Any]) – Parameters to pass to training blocks
final_identifier (str) – Identifier of the final training block
current_subset (int) – Current training block’s subset
current_acoustic_model (AcousticModel) – Acoustic model to use in aligning, based on previous training block
training_configs (dict[str, AcousticModelTrainingMixin]) – Training blocks
- add_config(train_type, params)
Add a trainer to the pipeline
- Parameters:
- Raises:
ConfigError – If an invalid train_type is specified
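The behavior described for add_config (register a training block, reject an unknown train_type) can be illustrated with a small stand-alone sketch. Nothing here is MFA's actual code: `PipelineSketch`, the `KNOWN_TRAIN_TYPES` set, the `type_index` identifier scheme, and the local `ConfigError` class are all hypothetical stand-ins chosen for illustration.

```python
from typing import Any, Dict


class ConfigError(Exception):
    """Stand-in for the ConfigError named above (not MFA's own class)."""


# Hypothetical set of block types; the real set is defined by MFA.
KNOWN_TRAIN_TYPES = {"monophone", "triphone", "lda", "sat"}


class PipelineSketch:
    """Toy pipeline that stores training blocks in insertion order."""

    def __init__(self) -> None:
        self.training_configs: Dict[str, Dict[str, Any]] = {}
        self.final_identifier: str = ""

    def add_config(self, train_type: str, params: Dict[str, Any]) -> str:
        """Register a block and return its identifier."""
        if train_type not in KNOWN_TRAIN_TYPES:
            raise ConfigError(f"Invalid training type: {train_type}")
        # Count existing blocks of this type so repeated types get
        # unique identifiers (assumed naming scheme: "<type>_<n>").
        prefix = train_type + "_"
        index = 1 + sum(1 for key in self.training_configs if key.startswith(prefix))
        identifier = f"{train_type}_{index}"
        self.training_configs[identifier] = dict(params)
        # The most recently added block is treated as the final one.
        self.final_identifier = identifier
        return identifier
```

In this sketch, adding "sat" twice yields two separate blocks, and an unrecognized type raises before the pipeline is modified.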
- align()
Multiprocessing function that aligns based on the current model.
See also
AlignFunction
Multiprocessing helper function for each job
AlignMixin.align_arguments
Job method for generating arguments for the helper function
- align_si.sh
Reference Kaldi script
- align_fmllr.sh
Reference Kaldi script
- property align_options
Alignment options
- property alignment_model_path
Current alignment model path
- property configuration
Configuration for the worker
- property data_directory
Current data directory based on the trainer’s subset
- export_files(output_directory, output_format=None, include_original_text=False)
Export a TextGrid file for every sound file in the dataset
- Parameters:
- export_model(output_model_path)
Export an acoustic model to the specified path
- Parameters:
output_model_path (str) – Path to save acoustic model
- property meta
Metadata about the final round of training
- property model_path
Current model path
- property num_current_utterances
Number of utterances in the current subset
- classmethod parse_parameters(config_path=None, args=None, unknown_args=None)
Parse configuration parameters from a config file and command line arguments
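As a rough illustration of combining config-file values with command line arguments, the sketch below merges a dictionary (standing in for an already-loaded config file) with leftover `--key value` pairs. The function name `merge_parameters`, the `--key value` convention, and the rule that command line values override file values are assumptions made for this example; the precedence MFA applies is not stated here.

```python
from typing import Any, Dict, List, Optional


def merge_parameters(
    file_params: Optional[Dict[str, Any]] = None,
    unknown_args: Optional[List[str]] = None,
) -> Dict[str, Any]:
    """Merge config-file parameters with leftover '--key value' pairs."""
    # Start from a copy of the file values so the input is not mutated.
    params: Dict[str, Any] = dict(file_params or {})
    args = list(unknown_args or [])
    i = 0
    while i < len(args) - 1:
        if args[i].startswith("--"):
            # Assumed precedence: the command line overrides the file.
            # Values are kept as strings; no type conversion is attempted.
            params[args[i][2:]] = args[i + 1]
            i += 2
        else:
            i += 1
    return params
```

A call such as `merge_parameters({"beam": 10}, ["--beam", "100"])` replaces the file's `beam` value with the string given on the command line; a real parser would also convert types.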
- transition_acc_arguments()
Generate Job arguments for TransitionAccArguments
- Returns:
Arguments for processing
- Return type:
list[TransitionAccArguments]
- property working_directory
Working directory
- property working_log_directory
Current log directory