Systematic Identification of Functional Plant Modules through the Integration of Complementary Data Sources
Abstract A major challenge is to unravel how genes interact and are regulated to exert specific biological functions. The integration of genome-wide functional genomics data, followed by the construction of gene networks, provides a powerful approach to identify functional gene modules. Large-scale expression data, functional gene annotations, experimental protein-protein interactions, and transcription factor-target interactions were integrated to delineate modules in Arabidopsis (Arabidopsis thaliana). The different experimental input data sets showed little overlap, demonstrating the advantage of combining multiple data types to study gene function and regulation. In the set of 1,563 modules covering 13,142 genes, most modules displayed strong coexpression, but functional and cis-regulatory coherence was less prevalent. Highly connected hub genes showed a significant enrichment toward embryo lethality and evidence for cross talk between different biological processes. Comparative analysis revealed that 58% of the modules showed conserved coexpression across multiple plants. Using module-based functional predictions, 5,562 genes were annotated, and an evaluation experiment disclosed that, based on 197 recently experimentally characterized genes, 38.1% of these functions could be inferred through the module context. Examples of confirmed genes of unknown function related to cell wall biogenesis, xylem and phloem pattern formation, cell cycle, hormone stimulus, and circadian rhythm highlight the potential to identify new gene functions. The module-based predictions offer new biological hypotheses for functionally unknown genes in Arabidopsis (1,701 genes) and six other plant species (43,621 genes). Furthermore, the inferred modules provide new insights into the conservation of coexpression and coregulation as well as a starting point for comparative functional annotation.
- Ghent University Belgium
COMPARATIVE GENOMICS, Arabidopsis Proteins, SCALE DATA SETS, Arabidopsis, Biology and Life Sciences, GENE-COEXPRESSION NETWORK, Computational Biology, CIS-REGULATORY ELEMENTS, Molecular Sequence Annotation, Genes, Plant, BINDING-SITES, SECONDARY CELL-WALL, PROTEIN-PROTEIN INTERACTIONS, Gene Expression Regulation, Plant, Databases, Genetic, ARABIDOPSIS-THALIANA, Gene Regulatory Networks, TRANSCRIPTION FACTOR, HYPOTHESIS GENERATION
