On the statistical assessment of similarities in DNA sequences
Author(s) -
J. Reich,
Heinz Drabsch,
Astrid Däumler
Publication year - 1984
Publication title -
Nucleic Acids Research
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 9.008
H-Index - 537
eISSN - 1362-4954
pISSN - 0305-1048
DOI - 10.1093/nar/12.13.5529
Subject(s) - biology , similarity (geometry) , dna , base pair , coincidence , simple (philosophy) , statistics , genetics , base (topology) , statistical analysis , standard deviation , computational biology , mathematics , statistical physics , artificial intelligence , computer science , physics , medicine , mathematical analysis , philosophy , alternative medicine , epistemology , pathology , image (mathematics)
The statistical behavior of the similarity score for unrelated DNA sequences, calculated either by letter-by-letter comparison or from various forms of optimal alignment, was studied. Natural DNA sequences from a database and truly random sequences were found to show the same statistical behavior in terms of such scores. This makes it possible to adopt a simple criterion for rejecting fortuitous similarity, based on the mean and standard deviation of chance scores. The expected values of these quantities depend on chain length, gap penalty, and probability of letter coincidence, and may be calculated from formulae given in the paper.
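The idea of judging a score against the mean and standard deviation of chance scores can be illustrated for the simplest case, the ungapped letter-by-letter comparison. The sketch below is not the paper's formulae, which are not reproduced in this record. It assumes a plain binomial model in which each of n positions matches independently with probability p, so the chance score has mean n·p and standard deviation sqrt(n·p·(1−p)). The function names, the uniform four-letter alphabet (p = 0.25), and the simulation parameters are all illustrative assumptions, and gapped alignment scores are not covered.

```python
import math
import random


def match_count(a, b):
    """Letter-by-letter score: number of aligned positions where a and b agree."""
    return sum(x == y for x, y in zip(a, b))


def chance_mean_sd(n, p):
    """Mean and standard deviation of the match count under the assumed
    binomial model: n independent positions, each matching with probability p."""
    return n * p, math.sqrt(n * p * (1.0 - p))


def z_score(score, n, p):
    """How many chance standard deviations the observed score lies above the chance mean."""
    mean, sd = chance_mean_sd(n, p)
    return (score - mean) / sd


def random_sequence(n, rng, alphabet="ACGT"):
    """Random sequence with equiprobable letters (an assumption; real base
    composition is usually not uniform)."""
    return "".join(rng.choice(alphabet) for _ in range(n))


def simulate_chance_scores(n, trials, seed=0):
    """Empirical mean and standard deviation of scores between unrelated random sequences."""
    rng = random.Random(seed)
    scores = [
        match_count(random_sequence(n, rng), random_sequence(n, rng))
        for _ in range(trials)
    ]
    mean = sum(scores) / trials
    var = sum((s - mean) ** 2 for s in scores) / (trials - 1)
    return mean, math.sqrt(var)


if __name__ == "__main__":
    n, p = 200, 0.25
    print("binomial model:", chance_mean_sd(n, p))
    print("simulation:    ", simulate_chance_scores(n, trials=2000))
    # An observed score of 80 matches out of 200 positions, in chance SD units:
    print("z for score 80:", z_score(80, n, p))
```

A score would then be treated as fortuitous unless its z value exceeds some chosen cutoff. The cutoff itself is a judgment call that this record does not specify.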