Automatic speech and singing classification in ambulatory recordings for normal and disordered voices
Author(s) -
Andrew J. Ortiz,
Laura E. Toles,
Katherine L. Marks,
Silvia Capobianco,
Daryush D. Mehta,
Robert E. Hillman,
Jarrad H. Van Stan
Publication year - 2019
Publication title -
the journal of the acoustical society of america
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.619
H-Index - 187
eISSN - 1520-8524
pISSN - 0001-4966
DOI - 10.1121/1.5115804
Subject(s) - ambulatory , audiology , singing , speech recognition , set (abstract data type) , classifier (uml) , accelerometer , logistic regression , computer science , acoustics , medicine , artificial intelligence , machine learning , physics , programming language , operating system
Ambulatory voice monitoring is a promising tool for investigating phonotraumatic vocal hyperfunction (PVH), associated with the development of vocal fold lesions. Since many patients with PVH are professional vocalists, a classifier was developed to better understand phonatory mechanisms during speech and singing. Twenty singers with PVH and 20 matched healthy controls were monitored with a neck-surface accelerometer-based ambulatory voice monitor. An expert-labeled ground truth data set was used to train a logistic regression on 15 subject-pairs with fundamental frequency and autocorrelation peak amplitude as input features. Overall classification accuracy of 94.2% was achieved on the held-out test set.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom