Professor In-Ho Bae of the Department of Speech-Language Therapy at Kosin University published a paper in the Journal of Voice (SCI) in 2021.

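Conducted jointly with the Department of Otorhinolaryngology at Pusan National University Hospital, the study covered 2,863 Korean speakers. It compared cepstral analyses based on voice tasks and on voiced-segment extraction as methods for discriminating dysphonic voices from normal voices. The paper, "Comparison of cepstral analysis based on voiced-segment extraction and voice tasks for discriminating dysphonic and normophonic Korean speakers", was published in 2021 in the Journal of Voice (SCI), volume 35, issue 2.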

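Professor Bae spent many years in clinical practice at the Department of Otorhinolaryngology, Pusan National University Yangsan Hospital, and is now an assistant professor in the Department of Speech-Language Therapy at Kosin University. Professor Bae currently serves as vice president of the Busan-Ulsan-Gyeongnam chapter of the Korean Association of Speech-Language Pathologists, as a director of the Korean Association of Speech-Language Pathologists, and as a director of the Korean Speech-Language & Hearing Association, contributing to the academic development of speech therapy and to advancing the rights and interests of speech-language pathologists.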

Kim, G. H., Bae, I. H., Park, H. J., & Lee, Y. W. (2021). Comparison of cepstral analysis based on voiced-segment extraction and voice tasks for discriminating dysphonic and normophonic Korean speakers. Journal of Voice, 35(2), 328.e11–328.e22.


Comparison of cepstral analysis based on voiced-segment extraction and voice tasks for discriminating dysphonic and normophonic Korean speakers

Geun-Hyo Kim*, In-Ho Bae†, Hee-June Park‡, Yeon-Woo Lee*

Department of Otorhinolaryngology—Head and Neck Surgery and Biomedical Research Institute, Pusan National University Hospital, Busan, South Korea

Department of Otorhinolaryngology—Head and Neck Surgery, Pusan National University Yangsan Hospital, Yangsan, Gyeongsangnam-do, South Korea

Department of Speech and Hearing Therapy, Catholic University of Pusan, Busan, South Korea

Objectives

This study investigated whether there are differences in the discriminatory power of cepstral analysis according to the voiced-segment extraction method and voice tasks used for identifying dysphonic and normophonic Korean individuals.

Materials and Methods

A total of 2,863 subjects (2,595 subjects with and 268 subjects without dysphonia) were included in this study. The 3-second sustained vowel (SV) /a/ and one sentence of “Sanchaek” were edited and analyzed using Praat scripts. Cepstral analyses (cepstral peak prominence [CPP], smoothed cepstral peak prominence [CPPS], and low/high spectral ratio [LHRatio]) were performed using three voice tasks, namely, SV, continuous speech (CS), and extracted continuous speech (EXT) samples. Additionally, auditory-perceptual (A-P) assessments were performed by three speech-language pathologists.
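To make the idea behind CPP concrete, the following is a minimal sketch, not the Praat procedure used in the study. It assumes a simplified definition: the real cepstrum of the log-power spectrum, a peak search over quefrencies corresponding to 60 to 330 Hz, and a straight-line trend fitted from 1 ms upward. The function name, the frequency range, and the synthetic test signals are all illustrative assumptions, and the resulting values are not comparable to those reported in the paper.

```python
import numpy as np

def cepstral_peak_prominence(frame, sample_rate, f0_min=60.0, f0_max=330.0):
    """Simplified CPP: height of the cepstral peak above a linear trend line.

    Assumption: this is an illustrative variant, not Praat's algorithm.
    """
    # Log-power spectrum (dB) of the Hann-windowed frame.
    windowed = frame * np.hanning(len(frame))
    power = np.abs(np.fft.rfft(windowed)) ** 2
    log_power_db = 10.0 * np.log10(power + 1e-12)

    # Real cepstrum: inverse transform of the log-power spectrum.
    cepstrum = np.fft.irfft(log_power_db)
    quefrency = np.arange(len(cepstrum)) / sample_rate  # seconds

    # Search for the peak where a plausible pitch period would fall.
    lo = int(np.ceil(sample_rate / f0_max))
    hi = int(np.floor(sample_rate / f0_min))
    peak = lo + int(np.argmax(cepstrum[lo:hi + 1]))

    # Linear trend over the cepstrum from 1 ms up to the longest period.
    start = int(0.001 * sample_rate)
    slope, intercept = np.polyfit(
        quefrency[start:hi + 1], cepstrum[start:hi + 1], 1
    )
    return cepstrum[peak] - (slope * quefrency[peak] + intercept)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    sample_rate, n = 16000, 2048
    # Hypothetical signals: a 100 Hz pulse train versus white noise.
    periodic = np.zeros(n)
    periodic[::160] = 1.0
    periodic += 0.01 * rng.standard_normal(n)
    noise = rng.standard_normal(n)
    print("periodic:", cepstral_peak_prominence(periodic, sample_rate))
    print("noise:   ", cepstral_peak_prominence(noise, sample_rate))
```

A strongly periodic signal produces a clear cepstral peak at its pitch period, so its prominence is larger than that of a noise-like signal. That contrast is why CPP tends to be lower in dysphonic voices.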

Results

Significant differences were found between dysphonic and normophonic voice groups for all cepstral parameters, except the LHRatio_EXT. Cepstral measurements of both SV and CS were highly correlated with A-P ratings. Furthermore, the diagnostic predictive power of CPP and CPPS for CS using the area under the receiver operating characteristic curve (AUC) was >0.919, the positive likelihood ratio (LR+) was ≥6.85, and the negative likelihood ratio (LR−) was ≤0.23. Additionally, for EXT, the AUC was >0.816, LR+ was 3.10, and LR− was ≤0.33.
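For readers less familiar with these diagnostic measures, the sketch below shows how LR+, LR−, and AUC are conventionally defined. It uses the standard textbook formulas rather than the authors' analysis code. The function names and all numbers in the example are hypothetical and are not taken from the study.

```python
def likelihood_ratios(sensitivity, specificity):
    """Return (LR+, LR-) from sensitivity and specificity (both in 0..1)."""
    lr_pos = sensitivity / (1.0 - specificity)   # how much a positive test raises the odds
    lr_neg = (1.0 - sensitivity) / specificity   # how much a negative test lowers the odds
    return lr_pos, lr_neg

def auc_from_scores(positive_scores, negative_scores):
    """AUC as the probability that a random positive case scores higher
    than a random negative case (ties count as one half)."""
    wins = 0.0
    for p in positive_scores:
        for q in negative_scores:
            if p > q:
                wins += 1.0
            elif p == q:
                wins += 0.5
    return wins / (len(positive_scores) * len(negative_scores))

if __name__ == "__main__":
    # Hypothetical cut-off performance, for illustration only.
    lr_pos, lr_neg = likelihood_ratios(sensitivity=0.80, specificity=0.90)
    print(f"LR+ = {lr_pos:.2f}, LR- = {lr_neg:.2f}")

    # Hypothetical scores where higher means "more likely dysphonic".
    # Because CPP is typically lower in dysphonic voices, a CPP-based
    # score would need its sign flipped before being used this way.
    print("AUC =", auc_from_scores([3.0, 4.0, 5.0], [1.0, 2.0, 3.5]))
```

An LR+ well above 1 and an LR− well below 1 both indicate a test that shifts the probability of dysphonia meaningfully. An AUC of 0.5 corresponds to chance-level discrimination and 1.0 to perfect separation.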

Conclusion

Both CS and EXT can predict dysphonia relatively well (AUC > 0.816). EXT showed lower predictability than the original sample (CS) analysis. Subsequent studies should implement voiced-segment extraction methods using various algorithms.

Keywords: Acoustic analysis, Auditory-perceptual ratings, Dysphonia, Praat