Multiple sequence alignment in parallel on a workstation cluster
Author(s) -
Justin Ebedes,
Amitava Datta
Publication year - 2004
Publication title -
bioinformatics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 3.599
H-Index - 390
eISSN - 1367-4811
pISSN - 1367-4803
DOI - 10.1093/bioinformatics/bth055
Subject(s) - speedup , computer science , workstation , multiple sequence alignment , sequence (biology) , alignment free sequence analysis , parallel computing , sequence alignment , message passing interface , set (abstract data type) , software , cluster (spacecraft) , tree (set theory) , computer cluster , distance matrix , algorithm , theoretical computer science , message passing , peptide sequence , distributed computing , programming language , mathematics , biology , operating system , combinatorics , genetics , gene
Multiple sequence alignment is the NP-hard problem of aligning three or more DNA or amino acid sequences in an optimal way so as to match as many characters as possible from the set of sequences. The popular sequence alignment program ClustalW uses the classical method of approximating a sequence alignment, by first computing a distance matrix and then constructing a guide tree to show the evolutionary relationship of the sequences. We show that parallelizing the ClustalW algorithm can result in significant speedup. We used a cluster of workstations using C and message passing interface for our implementation. Experimental results show that speedup of over 5.5 on six processors is obtainable for most inputs.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom