speaker_name (str) – Speaker name
file_name (str) – File name
begin (float, optional) – Begin timestamp
end (float, optional) – End timestamp
channel (int, optional) – Sound file channel
text (str, optional) – Utterance text
oovs (set[str]) – Set of words not found in a look up