dagLogo: An R/Bioconductor package for identifying and visualizing differential amino acid group usage in proteomics data
Sequence logos are widely used as graphical representations of conserved nucleic acid and protein motifs. Because of the complexity of the amino acid (AA) alphabet, rich post-translational modification, and the diverse subcellular localization of proteins, few versatile tools are available for effectively identifying and visualizing protein motifs. In addition, various reduced AA alphabets based on physicochemical, structural, or functional properties have been valuable in the study of protein alignment, folding, structure prediction, and evolution. However, there is a lack of tools for applying reduced AA alphabets to the identification and visualization of statistically significant motifs. To fill this gap, we developed an R/Bioconductor package, dagLogo, which has several advantages over existing tools. First, dagLogo accepts input sets in various formats and provides comprehensive options for building optimal background models. Second, it implements different reduced AA alphabets to group AAs with similar properties. Third, dagLogo provides statistical and visual solutions for differential AA (or AA group) usage analysis of both large and small data sets. Case studies showed that dagLogo can better identify and visualize conserved protein sequence patterns from different types of inputs and can potentially reveal biological patterns that could be missed by other logo generators.
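The idea of differential AA group usage can be illustrated with a minimal Python sketch. It is not dagLogo's implementation or API: the charge-based grouping, the function names `group_counts` and `fisher_enrichment_p`, and the toy sequences are all assumptions made for illustration. The sketch maps each residue to a group from a reduced alphabet, counts group usage at one aligned position in a foreground and a background set, and applies a one-sided Fisher's exact test for enrichment.

```python
from math import comb

# Hypothetical reduced alphabet: amino acids grouped by side-chain charge.
CHARGE_GROUPS = {
    "positive": "KRH",
    "negative": "DE",
    "neutral": "ACFGILMNPQSTVWY",
}
AA_TO_GROUP = {aa: g for g, aas in CHARGE_GROUPS.items() for aa in aas}


def group_counts(seqs, pos):
    """Count how often each AA group occurs at aligned position `pos`."""
    counts = {g: 0 for g in CHARGE_GROUPS}
    for s in seqs:
        counts[AA_TO_GROUP[s[pos]]] += 1
    return counts


def fisher_enrichment_p(k, n, K, N):
    """One-sided Fisher's exact test for enrichment.

    k of n foreground sequences and K of N background sequences carry
    the group at this position. Returns P(X >= k) under the
    hypergeometric null with fixed margins.
    """
    total = n + N          # all sequences
    in_group = k + K       # all sequences carrying the group
    denom = comb(total, n)
    upper = min(n, in_group)
    return sum(
        comb(in_group, x) * comb(total - in_group, n - x)
        for x in range(k, upper + 1)
    ) / denom


# Toy aligned peptides (made up for illustration only).
foreground = ["AKS", "GRT", "LKS", "VRT"]
background = ["ADS", "GET", "LKS", "VAT", "ILS", "MNT", "PQS", "FDT"]

fg = group_counts(foreground, 1)
bg = group_counts(background, 1)
p_positive = fisher_enrichment_p(
    fg["positive"], len(foreground), bg["positive"], len(background)
)
print(fg, bg, p_positive)
```

In this toy case, all four foreground peptides carry a positively charged residue at the middle position, against one of eight background peptides, so the test returns a small p-value. A real analysis would repeat the test for every position and group and correct for multiple testing.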
- National Cancer Institute United States
- Technical University of Munich Germany
- University of Massachusetts Medical School United States
- Duke University United States
- University of Lausanne Switzerland
Proteomics, Bioinformatics, Science, Amino Acid Motifs, Humans, Position-Specific Scoring Matrices, Amino Acids, Databases, Protein, Molecular Biology, Conserved Sequence, amino acids, Q, and Proteins, R, Computational Biology, Proteins, protein motifs, sequence logos, amino acid alphabet, Medicine, Peptides, Sequence Alignment, Algorithms, Software, Research Article
