Assembly of an Interactive Correlation Network for the Arabidopsis Genome Using a Novel Heuristic Clustering Algorithm
Assembly of an Interactive Correlation Network for the Arabidopsis Genome Using a Novel Heuristic Clustering Algorithm
Abstract A vital quest in biology is comprehensible visualization and interpretation of correlation relationships on a genome scale. Such relationships may be represented in the form of networks, which usually require disassembly into smaller manageable units, or clusters, to facilitate interpretation. Several graph-clustering algorithms that may be used to visualize biological networks are available. However, only some of these support weighted edges, and none provides good control of cluster sizes, which is crucial for comprehensible visualization of large networks. We constructed an interactive coexpression network for the Arabidopsis (Arabidopsis thaliana) genome using a novel Heuristic Cluster Chiseling Algorithm (HCCA) that supports weighted edges and that may control average cluster sizes. Comparative clustering analyses demonstrated that the HCCA performed as well as, or better than, the commonly used Markov, MCODE, and k-means clustering algorithms. We mapped MapMan ontology terms onto coexpressed node vicinities of the network, which revealed transcriptional organization of previously unrelated cellular processes. We further explored the predictive power of this network through mutant analyses and identified six new genes that are essential to plant growth. We show that the HCCA-partitioned network constitutes an ideal “cartographic” platform for visualization of correlation networks. This approach rapidly provides network partitions with relative uniform cluster sizes on a genome-scale level and may thus be used for correlation network layouts also for other species.
- Max Planck Institute of Molecular Plant Physiology Germany
- Max Planck Society Germany
- University of Aberdeen United Kingdom
- University of North Carolina at Chapel Hill United States
QP Physiology, 570, Arabidopsis Proteins, Gene Expression Profiling, QK, Arabidopsis, 610, QP, QK Botany, Phenotype, Gene Expression Regulation, Plant, Mutation, Cluster Analysis, Algorithms, Genome, Plant, Phylogeny, Software
QP Physiology, 570, Arabidopsis Proteins, Gene Expression Profiling, QK, Arabidopsis, 610, QP, QK Botany, Phenotype, Gene Expression Regulation, Plant, Mutation, Cluster Analysis, Algorithms, Genome, Plant, Phylogeny, Software
6 Research products, page 1 of 1
- 2017IsRelatedTo
- 2017IsRelatedTo
citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).172 popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.Top 1% influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).Top 10% impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.Top 1%
