Segmentation options#
Parameter |
Default value |
Notes |
---|---|---|
energy_threshold |
5.5 |
Energy threshold above which a frame will be counted as voiced |
energy_mean_scale |
0.5 |
Proportion of the mean energy of the file that should be added to the energy_threshold |
max_segment_length |
30 |
Maximum length of segments before they do not get merged |
min_pause_duration |
0.05 |
Minimum unvoiced duration to split speech segments |
Default segmentation config file#
energy_threshold: 5.5
energy_mean_scale: 0.5
max_segment_length: 30
min_pause_duration: 0.05