Literature DB >> 33471337

Prediction of Rice Transcription Start Sites Using TransPrise: A Novel Machine Learning Approach.

Stepan Pachganov1, Khalimat Murtazalieva2, Alexei Zarubin3, Tatiana Taran4, Duane Chartier5, Tatiana V Tatarinova6,7,8,9.   

Abstract

As the interest in genetic resequencing increases, so does the need for effective mathematical, computational, and statistical approaches. One of the difficult problems in genome annotation is determination of precise positions of transcription start sites. In this paper, we present TransPrise-an efficient deep learning tool for predicting positions of eukaryotic transcription start sites. TransPrise offers significant improvement over existing promoter-prediction methods. To illustrate this, we compared predictions of TransPrise with the TSSPlant approach for well-annotated genome of Oryza sativa. Using a computer with a graphics processing unit, the run time of TransPrise is 250 min on a genome of 374 Mb long.We provide the full basis for the comparison and encourage users to freely access a set of our computational tools to facilitate and streamline their own analyses. The ready-to-use Docker image with all the necessary packages, models, and code as well as the source code of the TransPrise algorithm are available at http://compubioverne.group/ . The source code is ready to use and to be customized to predict TSS in any eukaryotic organism.

Entities:  

Keywords:  Machine learning; Rice; TransPrise; Transcription start site

Year:  2021        PMID: 33471337     DOI: 10.1007/978-1-0716-1068-8_17

Source DB:  PubMed          Journal:  Methods Mol Biol        ISSN: 1064-3745


  47 in total

1.  Genome-wide discovery of cis-elements in promoter sequences using gene expression.

Authors:  Maxim Troukhan; Tatiana Tatarinova; John Bouck; Richard B Flavell; Nickolai N Alexandrov
Journal:  OMICS       Date:  2009-04

Review 2.  Exploring the function of genetic variants in the non-coding genomic regions: approaches for identifying human regulatory variants affecting gene expression.

Authors:  Mulin Jun Li; Bin Yan; Pak Chung Sham; Junwen Wang
Journal:  Brief Bioinform       Date:  2014-06-10       Impact factor: 11.622

3.  Large-scale production of pertussis vaccine by serial subculture technic.

Authors:  Z Csizér; I Joó
Journal:  Zentralbl Bakteriol Orig A       Date:  1972-07

4.  Whole-genome discovery of transcription factor binding sites by network-level conservation.

Authors:  Moshe Pritsker; Yir-Chung Liu; Michael A Beer; Saeed Tavazoie
Journal:  Genome Res       Date:  2003-12-12       Impact factor: 9.043

5.  Genome-wide association study of 107 phenotypes in Arabidopsis thaliana inbred lines.

Authors:  Susanna Atwell; Yu S Huang; Bjarni J Vilhjálmsson; Glenda Willems; Matthew Horton; Yan Li; Dazhe Meng; Alexander Platt; Aaron M Tarone; Tina T Hu; Rong Jiang; N Wayan Muliyati; Xu Zhang; Muhammad Ali Amer; Ivan Baxter; Benjamin Brachi; Joanne Chory; Caroline Dean; Marilyne Debieu; Juliette de Meaux; Joseph R Ecker; Nathalie Faure; Joel M Kniskern; Jonathan D G Jones; Todd Michael; Adnane Nemri; Fabrice Roux; David E Salt; Chunlao Tang; Marco Todesco; M Brian Traw; Detlef Weigel; Paul Marjoram; Justin O Borevitz; Joy Bergelson; Magnus Nordborg
Journal:  Nature       Date:  2010-03-24       Impact factor: 49.962

6.  MAKER2: an annotation pipeline and genome-database management tool for second-generation genome projects.

Authors:  Carson Holt; Mark Yandell
Journal:  BMC Bioinformatics       Date:  2011-12-22       Impact factor: 3.307

7.  Cis-motifs upstream of the transcription and translation initiation sites are effectively revealed by their positional disequilibrium in eukaryote genomes using frequency distribution curves.

Authors:  Kenneth W Berendzen; Kurt Stüber; Klaus Harter; Dierk Wanke
Journal:  BMC Bioinformatics       Date:  2006-11-30       Impact factor: 3.169

8.  Nucleotide diversity analysis highlights functionally important genomic regions.

Authors:  Tatiana V Tatarinova; Evgeny Chekalin; Yuri Nikolsky; Sergey Bruskin; Dmitry Chebotarov; Kenneth L McNally; Nickolai Alexandrov
Journal:  Sci Rep       Date:  2016-10-24       Impact factor: 4.379

9.  Nucleotide patterns aiding in prediction of eukaryotic promoters.

Authors:  Martin Triska; Victor Solovyev; Ancha Baranova; Alexander Kel; Tatiana V Tatarinova
Journal:  PLoS One       Date:  2017-11-15       Impact factor: 3.240

10.  Combining RNA-seq data and homology-based gene prediction for plants, animals and fungi.

Authors:  Jens Keilwagen; Frank Hartung; Michael Paulini; Sven O Twardziok; Jan Grau
Journal:  BMC Bioinformatics       Date:  2018-05-30       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.