A Deep Neural Network for Predicting and Engineering Alternative Polyadenylation
A Deep Neural Network for Predicting and Engineering Alternative Polyadenylation
Alternative polyadenylation (APA) is a major driver of transcriptome diversity in human cells. Here, we use deep learning to predict APA from DNA sequence alone. We trained our model (APARENT, APA REgression NeT) on isoform expression data from over 3 million APA reporters. APARENT's predictions are highly accurate when tasked with inferring APA in synthetic and human 3'UTRs. Visualizing features learned across all network layers reveals that APARENT recognizes sequence motifs known to recruit APA regulators, discovers previously unknown sequence determinants of 3' end processing, and integrates these features into a comprehensive, interpretable, cis-regulatory code. We apply APARENT to forward engineer functional polyadenylation signals with precisely defined cleavage position and isoform usage and validate predictions experimentally. Finally, we use APARENT to quantify the impact of genetic variants on APA. Our approach detects pathogenic variants in a wide range of disease contexts, expanding our understanding of the genetic origins of disease.
- University of Mary United States
- University of Washington United States
RNA Cleavage, Base Sequence, Models, Genetic, Gene Expression, Polyadenylation, Deep Learning, HEK293 Cells, Mutagenesis, Databases, Genetic, Humans, Synthetic Biology, RNA, Messenger, RNA-Seq, Transcriptome, 3' Untranslated Regions
RNA Cleavage, Base Sequence, Models, Genetic, Gene Expression, Polyadenylation, Deep Learning, HEK293 Cells, Mutagenesis, Databases, Genetic, Humans, Synthetic Biology, RNA, Messenger, RNA-Seq, Transcriptome, 3' Untranslated Regions
3 Research products, page 1 of 1
- 2019IsRelatedTo
- 2020IsRelatedTo
- 2019IsRelatedTo
citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).183 popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.Top 1% influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).Top 10% impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.Top 1%
