Prior to running the aligner, make sure the following are set up:
- A pronunciation dictionary for your language should specify the pronunciations of orthographic transcriptions.
- The sound files to align.
- Orthographic annotations in .lab files for individual sound files (Prosodylab-aligner format) or in TextGrid intervals for longer sound files (TextGrid format).
The sound files and the orthographic annotations should be contained in one directory structured as follows:
+-- textgrid_corpus_directory | --- recording1.wav | --- recording1.TextGrid | --- recording2.wav | --- recording2.TextGrid | --- ... +-- prosodylab_corpus_directory | +-- speaker1 | --- recording1.wav | --- recording1.lab | --- recording2.wav | --- recording2.lab | +-- speaker2 | --- recording3.wav | --- recording3.lab | --- ...
A collection of preprocessing scripts to get various corpora of other formats is available in the MFA-reorganization-scripts repository.
For details on how to organize each of these three components, see below.
- Dictionary format
- Sound files
- Data formats