Dictionary format

Non-probabilistic format

Dictionaries should be specified in the following format:

WORDA PHONEA PHONEB
WORDA PHONEC
WORDB PHONEB PHONEC

where each line is a word with a transcription separated by white space. Each phone in the transcription should be separated by white space as well.

A dictionary for English that has good coverage is the lexicon derived from the LibriSpeech corpus (LibriSpeech lexicon). This lexicon uses the Arpabet transcription format (like the CMU Pronouncing Dictionary).

The Prosodylab-aligner has two preconstructed dictionaries as well, one for English (Prosodylab-aligner English dictionary) and one for Quebec French (Prosodylab-aligner French dictionary), also see Available pronunciation dictionaries for a list of supported dictionaries.

Note

See the page on Generating a dictionary for how to use G2P models to generate a dictionary from our pretrained models or how to generate pronunciation dictionaries from orthographies.

Dictionaries with pronunciation probability

Dictionaries can be parsed with pronunciation probabilities, usually as the output of Modeling pronunciation probabilities.

The format for this dictionary format is:

WORDA 1.0 PHONEA PHONEB
WORDA 0.3 PHONEC
WORDB 1.0 PHONEB PHONEC

Note

The most likely probability for a word is set to 1.0 in the algorithm implemented in Modeling pronunciation probabilities. While this means that the sum of probabilities per word is greater than 1, it does not penalize words for having multiple pronunciations, and these probabilities are converted to log costs in the eventual weighted FST.

Non-speech annotations

There are two special phones that can be used for annotations that are not speech, sil and spn. The sil phone is used to model silence, and the spn phone is used to model unknown words. If you have annotations for non-speech vocalizations that are similar to silence like breathing or exhalation, you can use the sil phone to align those. You can use the spn phone to align annotations like laughter, coughing, etc.

{LG} spn
{SL} sil