Corpus¶
- class aligner.corpus.Corpus(directory, output_directory, use_speaker_information=True, speaker_characters=0, num_jobs=3, debug=False, ignore_exceptions=False)[source]¶
Class that stores information about the dataset to align.
Corpus objects have a number of mappings from either utterances or speakers to various properties, and mappings between utterances and speakers.
See http://kaldi-asr.org/doc/data_prep.html for more information about the files that are created by this class.
- Parameters:
- directorystr
Directory of the dataset to align
- output_directorystr
Directory to store generated data for the Kaldi binaries
- mfcc_configMfccConfig
Configuration object for how to calculate MFCCs
- speaker_charactersint, optional
Number of characters in the filenames to count as the speaker ID, if not specified, speaker IDs are generated from directory names
- num_jobsint, optional
Number of processes to use, defaults to 3
- Raises:
- CorpusError
Raised if the specified corpus directory does not exist
- SampleRateError
Raised if the wav files in the dataset do not share a consistent sample rate
Attributes
grouped_cmvngrouped_featgrouped_segmentsgrouped_spk2uttgrouped_utt2spkgrouped_wavmfcc_directorymfcc_log_directorynum_utterancessplit_directoryword_setMethods
create_mfccs()find_best_groupings()get_feat_dim()get_wav_duration(utt)get_word_frquency(dictionary)grouped_text([dictionary])grouped_text_int(dictionary)grouped_utt2fst(dictionary[, num_frequent_words])initialize_corpus(dictionary[, skip_input])parse_mfcc_logs()speaker_utterance_info()write()