An Acoustic Analysis of Fluctuations for Inter- and Intra-Speaker Variability in Speech Sounds

Author:

Kaur Jasdeep1,Juglan Kailash Chandra1,Sharma Kush2,Sharma Vishal3

Affiliation:

1. Department of Physics, School of Chemical Engineering and Physical Sciences Lovely Professional University, Phagwara, Punjab, India

2. Data Manager, School of Community Medicine and Public Health, Postgraduate Institute of Medical Education and Research, Chandigarh, India

3. Institute of Forensic Science and Criminology, Panjab University, Chandigarh, India

Abstract

Background: Variation in the speech of speakers is a crucial issue for the forensic system. The main reason behind incorrect speaker identification is greater intra-speaker fluctuation. In the forensic state of play, a lot of research has been carried out on speaker identification. However inter variations and intra fluctuations in speakers for the Punjabi language is still a grey area. Aims and Objectives: Our aim is to study acoustic analysis of fluctuations for inter and intra speaker variability in speech sounds. In our study, we will consider Punjabi vowel with consonants. The Statistical methods will be applied to analyze the data; firstly, the Shapiro-Wilk test will be checked for normality and then Levene’s Test to assess the equality of variances. Materials and Method: Five vowels were selected with different consonants. They were combined to make meaningful words. Then these meaningful words were embedded in sentences. Ten speakers participated voluntarily. All are students of A.S College at Khanna in Punjab. The individuals were aged between 20-22 years with no hearing or speech disorder. The voice samples were recorded with help of good quality microphone and by Goldwave software in the sound proof lab.Samples were introduced directly into PRAAT software by the use of a Sony microphone and with sampling rate of 44100 Hz frequency. Acoustic Analysis has been done with help of Goldwave software in form of spectrograms. Results and Conclusion: Each formant shows a different value for inter variations and inter speaker fluctuations. F1 and F2 shows lesser speaker variation than the high-frequency region in F3 and F4, so we can say that in comparison with the lower part, high-frequency regions are more valuable. The assumptions for TWO-WAY ANOVA is violated and hence, we have used the non-parametric Friedman Test and performed its Post hoc analysis. From Posthoc analysis, we can say that F1 and F2 (p >0.05) and F2 and F3 (p>0.05) gave the same type of results. Hence, from the results of these statistical tests, we can conclude that F1 is recommended over F2, F3, and F4. As the frequency of F1 is high as well as in line with the results of statistical tests. Because we prefer more variation among frequencies so that we can easily distinguish different speakers and it would be more beneficial for inter variations and intra fluctuations.

Publisher

Medknow

Reference27 articles.

1. Inter and intra speaker variability in fundamental voice frequency;Atkinson;J Acoust Soc Am,1976

2. Acoustic analysis of whispery voice disguise in Chinese;Zhang;J Acoust Soc Am,2017

3. Voice spectrograms as a function of age, voice disguise, and voice imitation;Endres;J Acoust Soc Am,1971

4. Speaker identification by long-term spectra under normal and distorted speech conditions;Hollien;J Acoust Soc Am,1977

5. Effects of selected vocal disguises upon spectrographic speaker identification;Riech;J Acoust Soc Am,1976

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3