Premium
Composite Machine Learning Algorithm for Material Sourcing ,
Author(s) -
Casale Amanda,
Dettman Josh
Publication year - 2020
Publication title -
journal of forensic sciences
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.715
H-Index - 96
eISSN - 1556-4029
pISSN - 0022-1198
DOI - 10.1111/1556-4029.14436
Subject(s) - computer science , algorithm , open source , composite number , random forest , trace (psycholinguistics) , data mining , artificial intelligence , pattern recognition (psychology) , machine learning , software , linguistics , philosophy , programming language
This study developed a composite machine learning algorithm for attribution of materials of forensic interest (like ammonium nitrate) to original sources. k ‐nearest neighbor and random forest models were used for source elimination and classification, respectively, in a two‐step, composite algorithm based on particle color, size/shape, and trace element concentration features. Novel approaches for simulation to supplement within‐source reference features based on empirically measured multi‐lot analyses, an improved hold‐one‐lot‐out method for cross‐validation, an assessment of the likelihood of the presence of a reference sample, fusion of the source probabilities from the respective classification models, and the calculation of metrics for assessing ensemble sourcing performance are described. Excellent sourcing predictions were obtained; the sourcing algorithm identified the correct source as the top choice 89% of the time, and the correct source was identified to be an average of 2.7 times more likely than the most likely incorrect source.