Literature DB >> 22499686

MicroRNA transcription start site prediction with multi-objective feature selection.

Malay Bhattacharyya1, Lars Feuerbach, Tapas Bhadra, Thomas Lengauer, Sanghamitra Bandyopadhyay.   

Abstract

MicroRNAs (miRNAs) are non-coding, short (21-23nt) regulators of protein-coding genes that are generally transcribed first into primary miRNA (pri-miR), followed by the generation of precursor miRNA (pre-miR). This finally leads to the production of the mature miRNA. A large amount of information is available on the pre- and mature miRNAs. However, very little is known about the pri-miRs, due to a lack of knowledge about their transcription start sites (TSSs). Based on the genomic loci, miRNAs can be categorized into two types --intragenic (intra-miR) and intergenic (inter-miR). While it is already an established fact that intra-miRs are commonly transcribed in conjunction with their host genes, the transcription machinery of inter-miRs is poorly understood. Although it is assumed that miRNA promoters are similar in structure to gene promoters, since both are transcribed by RNA polymerase II (Pol II), computational validations exhibit poor performance of gene promoter prediction methods on miRNAs. In this paper, we concentrate on the problem of TSS prediction for miRNAs. The present study begins with the identification of positive and negative promoter samples from recently published data stemming from RNA-sequencing studies. From these samples of experimentally validated miRNA TSSs, a number of standard sequence features are extracted. Furthermore, to account for potential footprints related to promoter regulation by CpG dinucleotide targeted DNA methylation, a number of novel features are defined. We develop a support vector machine (SVM) with RBF kernel for the prediction of miRNA TSSs trained on human miRNA promoters. A novel feature reduction technique based on archived multi-objective simulated annealing (AMOSA) identifies the final set of features. The resulting model trained on miRNA promoters shows improved performance over the one trained on protein-coding gene promoters in terms of classification accuracy, sensitivity and specificity. Results are also reported for a completely independent biologically validated test set. In a part of the investigation, the proposed approach is used to predict protein-coding gene TSSs. It shows a significantly improved performance when compared to previously published gene TSS prediction methods.

Entities:  

Mesh:

Substances:

Year:  2012        PMID: 22499686     DOI: 10.2202/1544-6115.1743

Source DB:  PubMed          Journal:  Stat Appl Genet Mol Biol        ISSN: 1544-6115


  10 in total

Review 1.  Small Genetic Circuits and MicroRNAs: Big Players in Polymerase II Transcriptional Control in Plants.

Authors:  Molly Megraw; Jason S Cumbie; Maria G Ivanchenko; Sergei A Filichkin
Journal:  Plant Cell       Date:  2016-02-11       Impact factor: 11.277

Review 2.  An overview of microRNAs.

Authors:  Scott M Hammond
Journal:  Adv Drug Deliv Rev       Date:  2015-05-12       Impact factor: 15.470

3.  Describing a Transcription Factor Dependent Regulation of the MicroRNA Transcriptome.

Authors:  Uri Rozovski; Inbal Hazan-Halevy; George Calin; David Harris; Ping Li; Zhiming Liu; Michael J Keating; Zeev Estrov
Journal:  J Vis Exp       Date:  2016-06-15       Impact factor: 1.355

4.  DNA methylation patterns facilitate the identification of microRNA transcription start sites: a brain-specific study.

Authors:  Tapas Bhadra; Malay Bhattacharyya; Lars Feuerbach; Thomas Lengauer; Sanghamitra Bandyopadhyay
Journal:  PLoS One       Date:  2013-06-24       Impact factor: 3.240

5.  Regulation of miRNA processing and miRNA mediated gene repression in cancer.

Authors:  Sarah Bajan; Gyorgy Hutvagner
Journal:  Microrna       Date:  2014

Review 6.  Methods of MicroRNA Promoter Prediction and Transcription Factor Mediated Regulatory Network.

Authors:  Yuming Zhao; Fang Wang; Su Chen; Jun Wan; Guohua Wang
Journal:  Biomed Res Int       Date:  2017-06-05       Impact factor: 3.411

7.  Predicting Presynaptic and Postsynaptic Neurotoxins by Developing Feature Selection Technique.

Authors:  Hua Tang; Yunchun Yang; Chunmei Zhang; Rong Chen; Po Huang; Chenggang Duan; Ping Zou
Journal:  Biomed Res Int       Date:  2017-02-12       Impact factor: 3.411

8.  Comparison of five supervised feature selection algorithms leading to top features and gene signatures from multi-omics data in cancer.

Authors:  Tapas Bhadra; Saurav Mallik; Neaj Hasan; Zhongming Zhao
Journal:  BMC Bioinformatics       Date:  2022-04-28       Impact factor: 3.307

9.  Large-scale investigation of human TF-miRNA relations based on coexpression profiles.

Authors:  Chia-Hung Chien; Yi-Fan Chiang-Hsieh; Ann-Ping Tsou; Shun-Long Weng; Wen-Chi Chang; Hsien-Da Huang
Journal:  Biomed Res Int       Date:  2014-06-09       Impact factor: 3.411

10.  miRT: a database of validated transcription start sites of human microRNAs.

Authors:  Malay Bhattacharyya; Manali Das; Sanghamitra Bandyopadhyay
Journal:  Genomics Proteomics Bioinformatics       Date:  2012-09-29       Impact factor: 7.691

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.