z-logo
open-access-imgOpen Access
Empirical Transition Probability Indexing Sparse-Coding Belief Propagation (ETPI-SCoBeP) Genome Sequence Alignment
Author(s) -
Aminmohammad Roozgard,
Nafise Barzigar,
Shuang Wang,
Xiaoqian Jiang,
Samuel Cheng
Publication year - 2014
Publication title -
cancer informatics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.606
H-Index - 31
ISSN - 1176-9351
DOI - 10.4137/cin.s13887
Subject(s) - computer science , preprocessor , scalability , search engine indexing , robustness (evolution) , belief propagation , coding (social sciences) , sequence (biology) , human genome , genome , data mining , theoretical computer science , artificial intelligence , algorithm , decoding methods , mathematics , genetics , biology , statistics , database , gene
The advance in human genome sequencing technology has significantly reduced the cost of data generation and overwhelms the computing capability of sequence analysis. Efficiency, efficacy, and scalability remain challenging in sequence alignment, which is an important and foundational operation for genome data analysis. In this paper, we propose a two-stage approach to tackle this problem. In the preprocessing step, we match blocks of reference and target sequences based on the similarities between their empirical transition probability distributions using belief propagation. We then conduct a refined match using our recently published sparse-coding belief propagation (SCoBeP) technique. Our experimental results demonstrated robustness in nucleotide sequence alignment, and our results are competitive to those of the SOAP aligner and the BWA algorithm. Moreover, compared to SCoBeP alignment, the proposed technique can handle sequences of much longer lengths.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom