Premium
Fitting Markov chain models to discrete state series such as DNA sequences
Author(s) -
Avery P. J.,
Henderson D. A.
Publication year - 1999
Publication title -
journal of the royal statistical society: series c (applied statistics)
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.205
H-Index - 72
eISSN - 1467-9876
pISSN - 0035-9254
DOI - 10.1111/1467-9876.00139
Subject(s) - markov chain , series (stratigraphy) , contingency table , deviance (statistics) , variable order markov model , mathematics , markov model , multinomial distribution , additive markov chain , markov process , markov chain mixing time , statistical physics , computer science , algorithm , statistics , paleontology , biology , physics
Discrete state series such as DNA sequences can often be modelled by Markov chains. The analysis of such series is discussed in the context of log‐linear models. The data produce contingency tables with similar margins due to the dependence of the observations. However, despite the unusual structure of the tables, the analysis is equivalent to that for data from multinomial sampling. The reason why the standard number of degrees of freedom is correct is explained by using theoretical arguments and the asymptotic distribution of the deviance is verified empirically. Problems involved with fitting high order Markov chain models, such as reduced power and computational expense, are also discussed.