Affiliation:
1. Department of Speech Pathology and Audiology, and National Center for Voice and Speech The University of Iowa Iowa City and The Recording and Research Center The Denver Center for the Performing Arts Denver, CO
Abstract
Voice perturbation measures, such as jitter and shimmer, depend on accurate extraction of fundamental frequency (F
o
) and amplitude of various waveform types. The extraction method directly affects the accuracy of the measures, particularly if several waveform types (with or without formant structure) are under consideration and if noise and modulation are present in the signal. For frequency perturbation, high precision is defined here as the ability to extract F
o
to ±0.01% under conditions of noise and modulation. Three F
o
-extraction methods and their software implementations are discussed and compared. The methods are cycle-to-cycle waveform matching, zero-crossing and peak-picking. Interpolation between samples is added to make the extractions more accurate and reliable. The sensitivity of the methods to different parameters such as sampling frequency, mean F
o
, signal-to-noise ratio, frequency modulation, and amplitude modulation are explored.
Publisher
American Speech Language Hearing Association
Subject
Speech and Hearing,Linguistics and Language,Language and Linguistics
Cited by
132 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献