z-logo
open-access-imgOpen Access
Dirichlet mixtures: a method for improved detection of weak but significant protein sequence homology
Author(s) -
Kimmen Sjölander,
Kevin Karplus,
Michael Brown,
Richard Hughey,
Anders Krogh,
Shahzad I. Mian,
David Haussler
Publication year - 1996
Publication title -
bioinformatics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 3.599
H-Index - 390
eISSN - 1367-4811
pISSN - 1367-4803
DOI - 10.1093/bioinformatics/12.4.327
Subject(s) - dirichlet distribution , generalization , hierarchical dirichlet process , homology (biology) , position (finance) , mixture model , markov chain , mathematics , latent dirichlet allocation , computer science , amino acid , algorithm , statistics , biology , artificial intelligence , topic model , mathematical analysis , genetics , boundary value problem , finance , economics
We present a method for condensing the information in multiple alignments of proteins into a mixture of Dirichlet densities over amino acid distributions. Dirichlet mixture densities are designed to be combined with observed amino acid frequencies to form estimates of expected amino acid probabilities at each position in a profile, hidden Markov model or other statistical model. These estimates give a statistical model greater generalization capacity, so that remotely related family members can be more reliably recognized by the model. This paper corrects the previously published formula for estimating these expected probabilities, and contains complete derivations of the Dirichlet mixture formulas, methods for optimizing the mixtures to match particular databases, and suggestions for efficient implementation.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom