Perceived Audio Quality of Sounds Degraded by Nonlinear Distortions and Single-Ended Assessment Using HASQI
Author(s) -
Paul Kendrick,
et al.
Publication year - 2015
Publication title -
journal of the audio engineering society
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.234
H-Index - 60
ISSN - 1549-4950
DOI - 10.17743/jaes.2015.0068
Subject(s) - sound quality , quality (philosophy) , nonlinear system , computer science , quality assessment , acoustics , speech recognition , reliability engineering , engineering , evaluation methods , physics , quantum mechanics
For field recordings and user generated content recorded on phones, tablets, and other mobile devices nonlinear distortions caused by clipping and limiting at pre-amplification stages, and dynamic range control (DRC) are common causes of poor audio quality. A single-ended method to detect these distortions and predict perceived degradation in speech, music, and soundscapes has been developed. This was done by training an ensemble of decision trees. During training, both clean and distorted audio was available and so the perceived quality could be gauged using HASQI (Hearing Aid Sound Quality Index). The new single-ended method can correctly predict HASQI from distorted samples to an accuracy of ±0.19 (95% confidence interval) using a quality range between 0.0 and 1.0. The method also has potential for estimating HASQI when other types of degradations are present. Subsequent perceptual tests validated the method for music and soundscapes. For the average mean opinion score for perceived audio quality on a scale from 0 to 1, the single ended method could estimate it within ±0.33.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom