Pretrained models#

The command for interacting with MFA models is mfa model. The subcommands allow for inspecting currently saved pretrained models, downloading ones from MFA’s model repo, and saving models you have trained to be used with a simple name rather than the full path each time.

Following installation of MFA, mfa model list acoustic will not list any models. If you want to download the default English model trained on LibriSpeech, you can run mfa model download acoustic english_us_arpa. At which point, the previous list command will output “english_us_arpa” as an option. When referring to an acoustic model in another MFA command, rather than the full path to the acoustic model, you can now supply just english_us_arpa and MFA will resolve it to the saved path.

Similarly, if you train a new model, you can run mfa model save acoustic /path/where/the/model/was/saved.zip, then this model will be available via saved in the future. The name defaults to whatever the archive is called without the directory or extension. You can modify this name with the --name NEWNAME option

There are a number of pretrained models for aligning and generating pronunciation dictionaries. The command for downloading these is mfa model download <model_type> where model_type is one of acoustic, g2p, or dictionary.

Note

Please see the MFA Models site for information and statistics about various models.

Command reference#

mfa model#

Inspect, download, and save pretrained MFA models and dictionaries

mfa model [OPTIONS] COMMAND [ARGS]...

Options

-h, --help#: Show this message and exit.

add_words#

Add words from one pronunciation dictionary to another pronunciation dictionary, so long as the new pronunciations do not contain any new phones

mfa model add_words [OPTIONS] DICTIONARY_PATH NEW_PRONUNCIATIONS_PATH

Options

-h, --help#: Show this message and exit.

-p, --profile <profile>#: Configuration profile to use, defaults to “global”

-t, --temporary_directory <temporary_directory>#: Set the default temporary directory, default is /home/docs/Documents/MFA

-j, --num_jobs <num_jobs>#: Set the number of processes to use by default, defaults to 3

--clean, --no_clean#: Remove files from previous runs, default is False

--final_clean, --no_final_clean#: Remove temporary files at the end of run, default is False

-v, --verbose, -nv, --no_verbose#: Output debug messages, default is False

-q, --quiet, -nq, --no_quiet#: Suppress all output messages (overrides verbose), default is False

--overwrite, --no_overwrite#: Overwrite output files when they exist, default is False

--use_mp, --no_use_mp#: Turn on/off multiprocessing. Multiprocessing is recommended will allow for faster executions.

--use_threading, --no_use_threading#: Use threading library rather than multiprocessing library. Multiprocessing is recommended will allow for faster executions.

-d, --debug, -nd, --no_debug#: Run extra steps for debugging issues, default is False

--use_postgres, --no_use_postgres#: Use postgres instead of sqlite for extra functionality, default is False

--single_speaker#: Single speaker mode creates multiprocessing splits based on utterances rather than speakers. This mode also disables speaker adaptation equivalent to --uses_speaker_adaptation false.

--textgrid_cleanup, --cleanup_textgrids, --no_textgrid_cleanup, --no_cleanup_textgrids#: Turn on/off post-processing of TextGrids that cleans up silences and recombines compound words and clitics.

Arguments

DICTIONARY_PATH#: Required argument

NEW_PRONUNCIATIONS_PATH#: Required argument

download#

Download pretrained models from the MFA repository. If no model names are specified, the list of all downloadable models of the given model type will be printed.

mfa model download [OPTIONS]
                   {acoustic|dictionary|g2p|ivector|language_model|tokenizer}
                   [MODEL_NAME]...

Options

--github_token <github_token>#: Personal access token to use for requests to GitHub to increase rate limit.

--version <version>#: Specific version of model to download rather than the latest.

-f, --force, --ignore_cache#: Flag to ignore existing downloaded models and force a re-download.

-h, --help#: Show this message and exit.

Arguments

MODEL_TYPE#: Required argument

MODEL_NAME#: Optional argument(s)

inspect#

Inspect a model and print out its metadata.

mfa model inspect [OPTIONS]
                  {acoustic|dictionary|g2p|ivector|language_model|tokenizer}
                  MODEL

Options

-h, --help#: Show this message and exit.

Arguments

MODEL_TYPE#: Required argument

MODEL#: Required argument

list#

List of locally saved models.

mfa model list [OPTIONS]
               {acoustic|dictionary|g2p|ivector|language_model|tokenizer}

Options

-h, --help#: Show this message and exit.

Arguments

MODEL_TYPE#: Required argument

save#

Save a model to pretrained folder for later use

Parameters#

path: Path: Path to model
model_type: str: Type of model

mfa model save [OPTIONS]
               {acoustic|dictionary|g2p|ivector|language_model|tokenizer} PATH

Options

--name <name>#: Name to use as reference (defaults to the name of the zip file).

--overwrite, --no_overwrite#: Overwrite output files when they exist, default is False

-h, --help#: Show this message and exit.

Arguments

MODEL_TYPE#: Required argument

PATH#: Required argument