z-logo
Premium
Evaluating the weight of evidence by using quantitative short tandem repeat data in DNA mixtures
Author(s) -
Tvedebrink Torben,
Eriksen Poul Svante,
Mogensen Helle Smidt,
Morling Niels
Publication year - 2010
Publication title -
journal of the royal statistical society: series c (applied statistics)
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.205
H-Index - 72
eISSN - 1467-9876
pISSN - 0035-9254
DOI - 10.1111/j.1467-9876.2010.00722.x
Subject(s) - statistics , matching (statistics) , mathematics , allele , multivariate normal distribution , similarity (geometry) , dna profiling , microsatellite , multivariate statistics , biology , dna , genetics , computer science , artificial intelligence , gene , image (mathematics)
Summary.  The evaluation of results from mixtures of deoxyribonucleic acid (DNA) from two or more people in crime case investigations may be improved by taking not only the qualitative but also the quantitative part of the results into consideration. We present a statistical likelihood approach to assess the probability of observed peak heights and peak areas information for a pair of profiles matching the DNA mixture. Furthermore, we demonstrate how to incorporate this probability in the evaluation of the weight of the evidence by a likelihood ratio approach. Our model is based on a multivariate normal distribution of peak areas for assessing the weight of the evidence. On the basis of data from analyses of controlled experiments with mixed DNA samples, we exploited the linear relationship between peak heights and peak areas, and the linear relationships of the means and variances of the measurements. Furthermore, the contribution from one individual's allele to the mean area of this allele is assumed to be proportional to the average of peak height measurements of alleles, where the individual is the only contributor. For shared alleles in mixed DNA samples, it is possible to observe only the cumulative peak heights and areas. Complying with this latent structure, we used the EM algorithm to impute the missing variables on the basis of a compound symmetry model. The measurements were subject to intralocus and interlocus correlations not depending on the actual alleles of the DNA profiles. Owing to factorization of the likelihood, properties of the normal distribution and use of auxiliary variables, an ordinary implementation of the EM algorithm solved the missing data problem.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here