z-logo
open-access-imgOpen Access
Improving the quality of protein similarity network clustering algorithms using the network edge weight distribution
Author(s) -
Leonard Apeltsin,
John H. Morris,
Patricia C. Babbitt,
Thomas E. Ferrin
Publication year - 2010
Publication title -
bioinformatics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 3.599
H-Index - 390
eISSN - 1367-4811
pISSN - 1367-4803
DOI - 10.1093/bioinformatics/btq655
Subject(s) - cluster analysis , computer science , data mining , correlation clustering , cure data clustering algorithm , fuzzy clustering , heuristic , similarity (geometry) , artificial intelligence , algorithm , image (mathematics)
Clustering protein sequence data into functionally specific families is a difficult but important problem in biological research. One useful approach for tackling this problem involves representing the sequence dataset as a protein similarity network, and afterwards clustering the network using advanced graph analysis techniques. Although a multitude of such network clustering algorithms have been developed over the past few years, comparing algorithms is often difficult because performance is affected by the specifics of network construction. We investigate an important aspect of network construction used in analyzing protein superfamilies and present a heuristic approach for improving the performance of several algorithms.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom