MUSCLE: multiple sequence alignment with high accuracy and high throughput



Open Access


Author(s) - R. C. Edgar

Publication year - 2004

Publication title - Nucleic Acids Research

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 9.008

H-Index - 537

eISSN - 1362-4954

pISSN - 0305-1048

DOI - 10.1093/nar/gkh340

Subject(s) - benchmark (surveying), multiple sequence alignment, biology, computer science, rank (graph theory), sequence alignment, source code, tree (set theory), mathematics, combinatorics, biochemistry, geodesy, peptide sequence, gene, geography, operating system

We describe MUSCLE, a new computer program for creating multiple alignments of protein sequences. Elements of the algorithm include fast distance estimation using kmer counting, progressive alignment using a new profile function we call the log-expectation score, and refinement using tree-dependent restricted partitioning. The speed and accuracy of MUSCLE are compared with T-Coffee, MAFFT and CLUSTALW on four test sets of reference alignments: BAliBASE, SABmark, SMART and a new benchmark, PREFAB. MUSCLE achieves the highest, or joint highest, rank in accuracy on each of these sets. Without refinement, MUSCLE achieves average accuracy statistically indistinguishable from T-Coffee and MAFFT, and is the fastest of the tested methods for large numbers of sequences, aligning 5000 sequences of average length 350 in 7 min on a current desktop computer. The MUSCLE program, source code and PREFAB test data are freely available at http://www.drive5.com/muscle.
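The first step named in the abstract, distance estimation by kmer counting, can be illustrated with a minimal Python sketch. It is not MUSCLE's implementation: the abstract does not give the distance formula, so the function names, the choice of k = 3, and the normalisation (shared kmers divided by the number of kmers in the shorter sequence) are all assumptions made for illustration.

```python
from collections import Counter
from itertools import combinations


def kmer_counts(seq: str, k: int = 3) -> Counter:
    """Count every overlapping kmer in a sequence."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def kmer_distance(a: str, b: str, k: int = 3) -> float:
    """Alignment-free distance: 1 - (shared kmers / kmers in the shorter sequence).

    Illustrative definition only; MUSCLE's own formula is not stated in the abstract.
    """
    ca, cb = kmer_counts(a, k), kmer_counts(b, k)
    # Counter intersection keeps the minimum count of each kmer present in both.
    shared = sum((ca & cb).values())
    possible = min(len(a), len(b)) - k + 1
    if possible <= 0:
        # A sequence shorter than k has no kmers; treat the pair as maximally distant.
        return 1.0
    return 1.0 - shared / possible


def distance_matrix(seqs: list[str], k: int = 3) -> list[list[float]]:
    """Symmetric pairwise distance matrix with a zero diagonal."""
    n = len(seqs)
    dist = [[0.0] * n for _ in range(n)]
    for i, j in combinations(range(n), 2):
        dist[i][j] = dist[j][i] = kmer_distance(seqs[i], seqs[j], k)
    return dist
```

Because no pairwise alignment is computed, each comparison costs time roughly linear in the sequence lengths, which is why this kind of estimate suits large inputs. A matrix such as the one returned by `distance_matrix` would then feed a guide-tree construction step before progressive alignment.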
