SoundFile
- class montreal_forced_aligner.db.SoundFile(**kwargs)
Bases: Base
Database class for storing information about sound files.
- Parameters:
file (File) – Root file
sound_file_path (Path) – Path to the audio file
format (str) – Format of the audio file (flac, wav, mp3, etc.)
sample_rate (int) – Sample rate of the audio file
duration (float) – Duration of audio file
num_channels (int) – Number of channels in the audio file
sox_string (str) – String that Kaldi will use to process the sound file
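To illustrate how the metadata fields above relate to an actual audio file, here is a minimal sketch that derives `sample_rate`, `duration`, and `num_channels` from a WAV file using only Python's standard `wave` module. The helper `probe_wav` and the returned dictionary are hypothetical and are not part of Montreal Forced Aligner; the sketch only mirrors the parameter names listed above and does not show how the library populates them.

```python
import tempfile
import wave
from pathlib import Path


def probe_wav(path):
    """Hypothetical helper (not an MFA function): read basic WAV metadata."""
    with wave.open(str(path), "rb") as wav:
        sample_rate = wav.getframerate()
        num_channels = wav.getnchannels()
        # Duration in seconds is the frame count divided by the sample rate.
        duration = wav.getnframes() / sample_rate
    return {
        "sound_file_path": Path(path),
        "format": "wav",
        "sample_rate": sample_rate,
        "duration": duration,
        "num_channels": num_channels,
    }


# Write half a second of 16-bit mono silence at 16 kHz, then probe it.
with tempfile.TemporaryDirectory() as tmp:
    wav_path = Path(tmp) / "example.wav"
    with wave.open(str(wav_path), "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)  # 2 bytes per sample = 16-bit audio
        wav.setframerate(16000)
        wav.writeframes(b"\x00\x00" * 8000)
    info = probe_wav(wav_path)

print(info["sample_rate"], info["duration"], info["num_channels"])
```

The `sox_string` field is left out of the sketch because its exact form is not described here.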
- load_audio(begin=0, end=None)
Load a normalized waveform for acoustic processing/visualization.
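The general idea of loading a bounded, normalized segment can be sketched as follows. This is not the library's implementation: the function `slice_and_normalize` is hypothetical, `begin` and `end` are assumed to be offsets in seconds, `end=None` is assumed to mean "through the end of the file", and peak normalization is only one possible reading of "normalized".

```python
import math


def slice_and_normalize(samples, sample_rate, begin=0.0, end=None):
    """Hypothetical helper (not an MFA function).

    Assumes begin/end are offsets in seconds and that end=None means
    "through the end of the file". Scales the segment so its peak
    absolute amplitude is 1.0.
    """
    start = int(round(begin * sample_rate))
    stop = len(samples) if end is None else int(round(end * sample_rate))
    segment = samples[start:stop]
    peak = max((abs(s) for s in segment), default=0.0)
    if peak == 0:
        # Silent or empty segment: nothing to scale.
        return list(segment)
    return [s / peak for s in segment]


# One second of a 440 Hz tone at half amplitude, sampled at 8 kHz.
sample_rate = 8000
tone = [
    0.5 * math.sin(2 * math.pi * 440 * n / sample_rate)
    for n in range(sample_rate)
]

# Keep the span from 0.25 s to 0.75 s and peak-normalize it.
segment = slice_and_normalize(tone, sample_rate, begin=0.25, end=0.75)
print(len(segment))
```

With the method documented above, the equivalent call would presumably look like `sound_file.load_audio(begin=0.25, end=0.75)` on an existing `SoundFile` instance; the return type and the exact normalization are not stated in this reference.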