
Forensic Automatic Speaker Recognition Based on Likelihood Ratio Using Acoustic-phonetic Features Measured Automatically
Author(s) -
Huapeng Wang,
Cuiling Zhang
Publication year - 2015
Publication title -
journal of forensic science and medicine
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.132
H-Index - 6
eISSN - 2455-0094
pISSN - 2349-5014
DOI - 10.4103/2349-5014.169617
Subject(s) - speech recognition , computer science , speaker recognition , speaker diarisation , pattern recognition (psychology) , artificial intelligence
Forensic speaker recognition is experiencing a remarkable paradigm shift in terms of the evaluation framework and presentation of voice evidence. This paper proposes a new method of forensic automatic speaker recognition using the likelihood ratio framework to quantify the strength of voice evidence. The proposed method uses a reference database to calculate the within- and between-speaker variability. Some acoustic-phonetic features are extracted automatically using the software VoiceSauce. The effectiveness of the approach was tested using two Mandarin databases: A mobile telephone database and a landline database. The experiment's results indicate that these acoustic-phonetic features do have some discriminating potential and are worth trying in discrimination. The automatic acoustic-phonetic features have acceptable discriminative performance and can provide more reliable results in evidence analysis when fused with other kind of voice features