
Analysis of protein complexes through model‐based biclustering of label‐free quantitative AP‐MS data
Author(s) -
Choi Hyungwon,
Kim Sinae,
Gingras AnneClaude,
Nesvizhskii Alexey I
Publication year - 2010
Publication title -
molecular systems biology
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 8.523
H-Index - 148
ISSN - 1744-4292
DOI - 10.1038/msb.2010.41
Subject(s) - biclustering , cluster analysis , context (archaeology) , computational biology , data mining , biology , similarity (geometry) , computer science , artificial intelligence , fuzzy clustering , cure data clustering algorithm , image (mathematics) , paleontology
Affinity purification followed by mass spectrometry (AP‐MS) has become a common approach for identifying protein–protein interactions (PPIs) and complexes. However, data analysis and visualization often rely on generic approaches that do not take advantage of the quantitative nature of AP‐MS. We present a novel computational method, nested clustering , for biclustering of label‐free quantitative AP‐MS data. Our approach forms bait clusters based on the similarity of quantitative interaction profiles and identifies submatrices of prey proteins showing consistent quantitative association within bait clusters. In doing so, nested clustering effectively addresses the problem of overrepresentation of interactions involving baits proteins as compared with proteins only identified as preys. The method does not require specification of the number of bait clusters, which is an advantage against existing model‐based clustering methods. We illustrate the performance of the algorithm using two published intermediate scale human PPI data sets, which are representative of the AP‐MS data generated from mammalian cells. We also discuss general challenges of analyzing and interpreting clustering results in the context of AP‐MS data.