Promoter prediction in eukaryotes using soft computing techniques
Promoter prediction in eukaryotes using soft computing techniques
In molecular biology, in silico identification of eukaryotic promoters is a challenging task. Currently available classifiers generate either poor sensitivity or specificity. In this paper, we propose a support vector machine classifier, referred to as PSVM, to recognize the human pol.II promoters using markov model for extracting features representing k-mer frequency, along with features representing other transcription signals such as TATA box, GC box etc. This classifier is trained using data set comprising 1862 promoters and 1759 non promoters in human genome and takes only 12 parameters to classify a given sequence as promoter or not. Among the 20 verified promoters in human chromosome 22, PSVM recognizes 18. Also it successfully identifies all the 14 well annotated exons of human chromosome 22 as non promoters. When 90% of data is used to train PSVM, it yields a sensitivity of 93.55% and specificity of 98.86% which are significantly better than previously reported results and also those of online promoter prediction tools such as NNPP, ProScan, and TSSG. Thus, k-mer frequency represented by markov model of order k, TATA box, GC box, CAAT box, Init box, and CpG island can be a valuable combination of features for predicting eukaryotic pol.II promoters.
14 Research products, page 1 of 2
- 2022IsAmongTopNSimilarDocuments
- 2010IsAmongTopNSimilarDocuments
- 2001IsAmongTopNSimilarDocuments
- 2003IsAmongTopNSimilarDocuments
- 2022IsAmongTopNSimilarDocuments
- 2021IsAmongTopNSimilarDocuments
- 2003IsAmongTopNSimilarDocuments
- 2010IsAmongTopNSimilarDocuments
chevron_left - 1
- 2
chevron_right
citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).0 popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.Average influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).Average impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.Average
