Customized strategies for discovering distant ncRNA homologs
Author(s) -
Axel Mosig,
Lijuan Zhu,
Peter F. Stadler
Publication year - 2009
Publication title -
briefings in functional genomics and proteomics
Language(s) - English
Resource type - Journals
eISSN - 1477-4062
pISSN - 1473-9550
DOI - 10.1093/bfgp/elp035
Subject(s) - biology , computational biology , focus (optics) , sequence (biology) , conserved sequence , non coding rna , homology (biology) , sequence alignment , computer science , rna , base sequence , genetics , peptide sequence , gene , physics , optics
A large fraction of non-coding RNAs is short and/or poorly conserved in sequence. Most of the longer examples, furthermore, consist of a collection of conserved structural motifs rather than a coherent globally conserved secondary structure. As a consequence, the conceptually simple problem of homology search becomes a complex and technically demanding task. Despite the best efforts of databases such as Rfam, the situation is complicated further by the sparsity of information in many--in particular prokaryotic--RNA families. In this contribution, we review recent efforts to customize sequence-based search tools for ncRNA applications. In particular, semi-global alignments and the development of methods for fragmented pattern search have brought significant practical advances. Current developments in this area focus on the integration of fragmented sequence pattern search with search algorithms for secondary structure patterns. We focus here, in particular, on strategies that can be successful in the 'twilight zone' where generic approaches from blast to infernal to start to fail.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom