All commands#

The mfa command line utility has several subcommands, which are listed below grouped by general domain.

Preparation#

Command

Description

Link

mfa validate

Validate a corpus

Validating data

Forced Alignment#

Command

Description

Link

mfa align

Perform forced alignment with a pretrained model

Align with an acoustic model (mfa align)

mfa train

Train an acoustic model and export resulting alignment

Train a new acoustic model (mfa train)

mfa adapt

Adapt a pretrained acoustic model on a new dataset

Adapt acoustic model to new data (mfa adapt)

mfa train_dictionary

Estimate pronunciation probabilities from aligning a corpus

Add probabilities to a dictionary (mfa train_dictionary)

Corpus creation#

Command

Description

Link

mfa create_segments

Use voice activity detection to create segments

Segment transcribed files (mfa segment)

mfa train_ivector

Train an ivector extractor for speaker classification

Train an ivector extractor (mfa train_ivector)

mfa diarize_speakers

Use ivector extractor to classify files or cluster them

Speaker diarization (mfa diarize_speakers)

mfa transcribe

Generate transcriptions using an acoustic model, dictionary, and language model

Transcribe audio files (mfa transcribe)

mfa train_lm

Train a language model from a text corpus or from an existing language model

Train a new language model (mfa train_lm)

mfa anchor

Run the Anchor annotator utility (if installed) for editing and managing corpora

Anchor annotator (mfa anchor)

Other utilities#

Command

Description

Link

mfa model

Inspect/list/download/save models

Pretrained models

mfa configure

Configure MFA to use customized defaults for command line arguments

Configuration

mfa history

List previous MFA commands run locally

Grapheme-to-phoneme#

Command

Description

Link

mfa g2p

Use a G2P model to generate a pronunciation dictionary

Generate pronunciations for words (mfa g2p)

mfa train_g2p

Train a G2P model from a pronunciation dictionary

Train a new G2P model (mfa train_g2p)