Assembly of an Interactive Correlation Network for the Arabidopsis Genome Using a Novel Heuristic Clustering Algorithm
Author(s) -
Marek Mutwil,
Björn Usadel,
Moritz Schuݶtte,
Ann E. Loraine,
Oliver Ebenhöh,
Staffan Persson
Publication year - 2009
Publication title -
plant physiology
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 3.554
H-Index - 312
eISSN - 1532-2548
pISSN - 0032-0889
DOI - 10.1104/pp.109.145318
Subject(s) - cluster analysis , heuristic , arabidopsis , visualization , computer science , node (physics) , genome , computational biology , graph drawing , gene ontology , cluster (spacecraft) , complex network , clustering coefficient , gene regulatory network , biological network , data mining , biology , gene , artificial intelligence , genetics , mutant , gene expression , structural engineering , world wide web , engineering , programming language
A vital quest in biology is comprehensible visualization and interpretation of correlation relationships on a genome scale. Such relationships may be represented in the form of networks, which usually require disassembly into smaller manageable units, or clusters, to facilitate interpretation. Several graph-clustering algorithms that may be used to visualize biological networks are available. However, only some of these support weighted edges, and none provides good control of cluster sizes, which is crucial for comprehensible visualization of large networks. We constructed an interactive coexpression network for the Arabidopsis (Arabidopsis thaliana) genome using a novel Heuristic Cluster Chiseling Algorithm (HCCA) that supports weighted edges and that may control average cluster sizes. Comparative clustering analyses demonstrated that the HCCA performed as well as, or better than, the commonly used Markov, MCODE, and k-means clustering algorithms. We mapped MapMan ontology terms onto coexpressed node vicinities of the network, which revealed transcriptional organization of previously unrelated cellular processes. We further explored the predictive power of this network through mutant analyses and identified six new genes that are essential to plant growth. We show that the HCCA-partitioned network constitutes an ideal "cartographic" platform for visualization of correlation networks. This approach rapidly provides network partitions with relative uniform cluster sizes on a genome-scale level and may thus be used for correlation network layouts also for other species.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom