Premium
Computational tools for protein–DNA interactions
Author(s) -
Kauffman Christopher,
Karypis George
Publication year - 2011
Publication title -
wiley interdisciplinary reviews: data mining and knowledge discovery
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.506
H-Index - 47
eISSN - 1942-4795
pISSN - 1942-4787
DOI - 10.1002/widm.48
Subject(s) - identification (biology) , computer science , domain (mathematical analysis) , data science , computational biology , dna binding site , dna , key (lock) , machine learning , biology , genetics , mathematics , promoter , gene , mathematical analysis , gene expression , botany , computer security
Interactions between deoxyribonucleic acid (DNA) and proteins are central to living systems, and characterizing how and when they occur would greatly enhance our understanding of working genomes. We review the computational problems associated with protein–DNA interactions and the various methods used to solve them. A wide range of topics is covered including physics‐based models for direct and indirect recognition, identification of transcription‐factor‐binding sites, and methods to predict DNA‐binding proteins. Our goal is to introduce this important problem domain to data mining researchers by identifying the key issues and challenges inherent to the area as well as provide directions for fruitful future research. © 2011 Wiley Periodicals, Inc. This article is categorized under: Algorithmic Development > Biological Data Mining Technologies > Machine Learning