Machine Learning Applied to GRBAS Voice Quality Assessment

Xie, Zheng ORCID: 0000-0001-8649-6235, Gadepalli, Chaitanya, Farideh, Jalalinajafabadi, Cheetham, Barry M.G. and Homer, Jarrod J. (2018) Machine Learning Applied to GRBAS Voice Quality Assessment. Advances in Science, Technology and Engineering Systems Journal, 3 (6). pp. 329-338. ISSN 2415-6698

PDF (Version of Record) - Published Version
Available under License Creative Commons Attribution Share Alike.
617kB

Official URL: http://dx.doi.org/10.25046/aj030641

Abstract

Voice problems are routinely assessed in hospital voice clinics by speech and language therapists (SLTs) who are highly skilled in making audio-perceptual evaluations of voice quality. The evaluations are often presented numerically in the form of five-dimensional ‘GRBAS’ scores. Computerised voice quality assessment may be carried out using digital signal processing (DSP) techniques which process recorded segments of a patient’s voice to measure certain acoustic features such as periodicity, jitter and shimmer. However, these acoustic features are often not obviously related to the GRBAS scores, which are widely recognised and understood by clinicians. This paper investigates the use of machine learning (ML) for mapping acoustic feature measurements to the more familiar GRBAS scores. Training the ML algorithms requires accurate and reliable GRBAS assessments of a representative set of voice recordings, together with the corresponding acoustic feature measurements. Such ‘reference’ GRBAS assessments were obtained in this work by engaging a number of highly trained SLTs as raters to independently score each voice recording. The consistency of the scoring is clearly of interest, and it is possible to measure this consistency and take it into account when computing the reference scores, thus increasing their accuracy and reliability. The properties of well-known techniques for measuring consistency, such as intra-class correlation (ICC) and the Cohen and Fleiss Kappas, are studied and compared for the purposes of this paper. Two basic ML techniques, K-nearest neighbour regression and multiple linear regression, were evaluated for producing the required GRBAS scores by computer. Both were found to produce reasonably accurate scores in a repeated cross-validation test.
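As an illustration of the kind of consistency measure the abstract mentions, the following is a minimal sketch of the unweighted Cohen Kappa for two raters. It is not taken from the paper: the function name `cohen_kappa`, the example scores, and the assumption that each GRBAS dimension is scored on an integer 0 to 3 scale are all hypothetical choices made here, and the paper may well use a weighted variant better suited to ordinal scores.

```python
from collections import Counter


def cohen_kappa(rater_a, rater_b):
    """Unweighted Cohen Kappa for two raters scoring the same items.

    kappa = (p_o - p_e) / (1 - p_e), where p_o is the observed agreement
    and p_e is the agreement expected by chance from each rater's own
    category frequencies.
    """
    if len(rater_a) != len(rater_b) or len(rater_a) == 0:
        raise ValueError("both raters must score the same non-empty set of items")

    n = len(rater_a)

    # Observed agreement: fraction of items given identical scores.
    p_o = sum(a == b for a, b in zip(rater_a, rater_b)) / n

    # Chance agreement: sum over categories of the product of the two
    # raters' marginal proportions for that category.
    count_a = Counter(rater_a)
    count_b = Counter(rater_b)
    p_e = sum(count_a[c] * count_b[c] for c in count_a) / (n * n)

    if p_e == 1.0:
        # Both raters used one and the same category for every item.
        return 1.0
    return (p_o - p_e) / (1.0 - p_e)


# Hypothetical scores for one GRBAS dimension (assumed 0-3 scale),
# given by two raters to the same eight recordings.
scores_a = [0, 1, 2, 3, 1, 2, 0, 3]
scores_b = [0, 1, 2, 2, 1, 3, 0, 3]
print(round(cohen_kappa(scores_a, scores_b), 3))  # prints 0.667
```

In this example the raters agree on six of eight recordings (p_o = 0.75), while chance agreement is p_e = 0.25, giving a Kappa of 2/3. The unweighted form treats a disagreement of one scale point the same as a disagreement of three, which is one reason a weighted Kappa or ICC may be preferred for ordinal scores.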

Summary Table

Item Type:	Article
Creators (Authors or editors):	Xie, Zheng (Email: zxie2@uclan.ac.uk, ORCID: https://orcid.org/0000-0001-8649-6235); Gadepalli, Chaitanya; Farideh, Jalalinajafabadi; Cheetham, Barry M.G.; Homer, Jarrod J.
Uncontrolled Keywords:	Voice quality assessment; GRBAS; Consistency measures; Cohen Kappa; Fleiss Kappa; Intra-class correlation; Feature detection; Machine learning
Subjects:	B - Subjects allied to medicine > B610 - Audiology; G - Mathematical Sciences > G300 - Statistics; G - Mathematical Sciences > G310 - Applied statistics; H - Engineering > H600 - Electronic & electrical engineering; I - Computer science > I400 - Artificial intelligence; I - Computer science > I460 - Machine learning; I - Computer science > I510 - Health technologies
Schools:	School of Engineering and Computing > Engineering, Construction, Maths and Physics
Related URLs:	Publisher
ID Code:	25743
Depositing User ID:	Zheng Xie
Date Deposited:	14 Jan 2019 11:04
Last Modified:	19 Jun 2025 17:46
