Modulated Modularity Clustering as an Exploratory Tool for Functional Genomic Inference
Modulated Modularity Clustering as an Exploratory Tool for Functional Genomic Inference
In recent years, the advent of high-throughput assays, coupled with their diminishing cost, has facilitated a systems approach to biology. As a consequence, massive amounts of data are currently being generated, requiring efficient methodology aimed at the reduction of scale. Whole-genome transcriptional profiling is a standard component of systems-level analyses, and to reduce scale and improve inference clustering genes is common. Since clustering is often the first step toward generating hypotheses, cluster quality is critical. Conversely, because the validation of cluster-driven hypotheses is indirect, it is critical that quality clusters not be obtained by subjective means. In this paper, we present a new objective-based clustering method and demonstrate that it yields high-quality results. Our method, modulated modularity clustering (MMC), seeks community structure in graphical data. MMC modulates the connection strengths of edges in a weighted graph to maximize an objective function (called modularity) that quantifies community structure. The result of this maximization is a clustering through which tightly-connected groups of vertices emerge. Our application is to systems genetics, and we quantitatively compare MMC both to the hierarchical clustering method most commonly employed and to three popular spectral clustering approaches. We further validate MMC through analyses of human and Drosophila melanogaster expression data, demonstrating that the clusters we obtain are biologically meaningful. We show MMC to be effective and suitable to applications of large scale. In light of these features, we advocate MMC as a standard tool for exploration and hypothesis generation.
- North Carolina Agricultural and Technical State University United States
- North Carolina State University United States
Models, Genetic, Gene Expression Profiling, Systems Biology, Genomics, QH426-470, Drosophila melanogaster, Genetic Techniques, Data Interpretation, Statistical, Multigene Family, Databases, Genetic, Genetics, Animals, Cluster Analysis, Humans, Computer Simulation, Lymphocytes, Research Article
Models, Genetic, Gene Expression Profiling, Systems Biology, Genomics, QH426-470, Drosophila melanogaster, Genetic Techniques, Data Interpretation, Statistical, Multigene Family, Databases, Genetic, Genetics, Animals, Cluster Analysis, Humans, Computer Simulation, Lymphocytes, Research Article
38 Research products, page 1 of 4
- 2017IsRelatedTo
- 2017IsRelatedTo
- 2017IsRelatedTo
- 2018IsRelatedTo
- 2017IsRelatedTo
- 2017IsRelatedTo
- 2017IsRelatedTo
- 2017IsRelatedTo
- 2017IsRelatedTo
- 2017IsRelatedTo
chevron_left - 1
- 2
- 3
- 4
chevron_right
citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).111 popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.Top 10% influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).Top 10% impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.Top 1%
