SimpleTokenizer#

class montreal_forced_aligner.tokenization.simple.SimpleTokenizer(word_break_markers, punctuation, clitic_markers, compound_markers, brackets, laughter_word='[laughter]', oov_word='<unk>', bracketed_word='<bracketed>', cutoff_word='<cutoff>', ignore_case=True, use_g2p=False, clitic_set=None, grapheme_set=None, word_table=None)[source]#

Bases: object