Literature DB >> 25217576

Improved rat genome gene prediction by integration of ESTs with RNA-Seq information.

Liping Li1, Enguo Chen2, Chun Yang2, Jun Zhu2, Pushkala Jayaraman2, Jeffrey De Pons2, Catherine C Kaczorowski2, Howard J Jacob1, Andrew S Greene2, Matthew R Hodges2, Allen W Cowley2, Mingyu Liang2, Haiming Xu2, Pengyuan Liu1, Yan Lu1.   

Abstract

MOTIVATION: RNA-Seq (also called whole-transcriptome sequencing) is an emerging technology that uses the capabilities of next-generation sequencing to detect and quantify entire transcripts. One of its important applications is the improvement of existing genome annotations. RNA-Seq provides rapid, comprehensive and cost-effective tools for the discovery of novel genes and transcripts compared with expressed sequence tag (EST), which is instrumental in gene discovery and gene sequence determination. The rat is widely used as a laboratory disease model, but has a less well-annotated genome as compared with humans and mice. In this study, we incorporated deep RNA-Seq data from three rat tissues-bone marrow, brain and kidney-with EST data to improve the annotation of the rat genome.
RESULTS: Our analysis identified 32 197 transcripts, including 13 461 known transcripts, 13 934 novel isoforms and 4802 new genes, which almost doubled the numbers of transcripts in the current public rat genome database (rn5). Comparisons of our predicted protein-coding gene sets with those in public datasets suggest that RNA-Seq significantly improves genome annotation and identifies novel genes and isoforms in the rat. Importantly, the large majority of novel genes and isoforms are supported by direct evidence of RNA-Seq experiments. These predicted genes were integrated into the Rat Genome Database (RGD) and can serve as an important resource for functional studies in the research community.
AVAILABILITY AND IMPLEMENTATION: The predicted genes are available at http://rgd.mcw.edu.
© The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Substances:

Year:  2014        PMID: 25217576      PMCID: PMC4296142          DOI: 10.1093/bioinformatics/btu608

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  17 in total

1.  BLAT--the BLAST-like alignment tool.

Authors:  W James Kent
Journal:  Genome Res       Date:  2002-04       Impact factor: 9.043

Review 2.  Current methods of gene prediction, their strengths and weaknesses.

Authors:  Catherine Mathé; Marie-France Sagot; Thomas Schiex; Pierre Rouzé
Journal:  Nucleic Acids Res       Date:  2002-10-01       Impact factor: 16.971

Review 3.  Progress and prospects in rat genetics: a community view.

Authors:  Timothy J Aitman; John K Critser; Edwin Cuppen; Anna Dominiczak; Xose M Fernandez-Suarez; Jonathan Flint; Dominique Gauguier; Aron M Geurts; Michael Gould; Peter C Harris; Rikard Holmdahl; Norbert Hubner; Zsuzsanna Izsvák; Howard J Jacob; Takashi Kuramoto; Anne E Kwitek; Anna Marrone; Tomoji Mashimo; Carol Moreno; John Mullins; Linda Mullins; Tomas Olsson; Michal Pravenec; Lela Riley; Kathrin Saar; Tadao Serikawa; James D Shull; Claude Szpirer; Simon N Twigger; Birger Voigt; Kim Worley
Journal:  Nat Genet       Date:  2008-05       Impact factor: 38.330

4.  Narrowing a region on rat chromosome 13 that protects against hypertension in Dahl SS-13BN congenic strains.

Authors:  Carol Moreno; Jan M Williams; Limin Lu; Mingyu Liang; Jozef Lazar; Howard J Jacob; Allen W Cowley; Richard J Roman
Journal:  Am J Physiol Heart Circ Physiol       Date:  2011-01-21       Impact factor: 4.733

5.  A gene atlas of the mouse and human protein-encoding transcriptomes.

Authors:  Andrew I Su; Tim Wiltshire; Serge Batalov; Hilmar Lapp; Keith A Ching; David Block; Jie Zhang; Richard Soden; Mimi Hayakawa; Gabriel Kreiman; Michael P Cooke; John R Walker; John B Hogenesch
Journal:  Proc Natl Acad Sci U S A       Date:  2004-04-09       Impact factor: 11.205

Review 6.  RNA-Seq: a revolutionary tool for transcriptomics.

Authors:  Zhong Wang; Mark Gerstein; Michael Snyder
Journal:  Nat Rev Genet       Date:  2009-01       Impact factor: 53.242

7.  AUGUSTUS at EGASP: using EST, protein and genomic alignments for improved gene prediction in the human genome.

Authors:  Mario Stanke; Ana Tzvetkova; Burkhard Morgenstern
Journal:  Genome Biol       Date:  2006-08-07       Impact factor: 13.583

8.  Annotating genomes with massive-scale RNA sequencing.

Authors:  France Denoeud; Jean-Marc Aury; Corinne Da Silva; Benjamin Noel; Odile Rogier; Massimo Delledonne; Michele Morgante; Giorgio Valle; Patrick Wincker; Claude Scarpelli; Olivier Jaillon; François Artiguenave
Journal:  Genome Biol       Date:  2008-12-16       Impact factor: 13.583

9.  TopHat: discovering splice junctions with RNA-Seq.

Authors:  Cole Trapnell; Lior Pachter; Steven L Salzberg
Journal:  Bioinformatics       Date:  2009-03-16       Impact factor: 6.937

10.  Gene prediction in eukaryotes with a generalized hidden Markov model that uses hints from external sources.

Authors:  Mario Stanke; Oliver Schöffmann; Burkhard Morgenstern; Stephan Waack
Journal:  BMC Bioinformatics       Date:  2006-02-09       Impact factor: 3.169

View more
  2 in total

1.  Intersection of toxicogenomics and high throughput screening in the Tox21 program: an NIEHS perspective.

Authors:  B Alex Merrick; Richard S Paules; Raymond R Tice
Journal:  Int J Biotechnol       Date:  2015

2.  CircPLEKHM3 acts as a tumor suppressor through regulation of the miR-9/BRCA1/DNAJB6/KLF4/AKT1 axis in ovarian cancer.

Authors:  Lei Zhang; Qing Zhou; Qiongzi Qiu; Ling Hou; Mengting Wu; Jia Li; Xufan Li; Bingjian Lu; Xiaodong Cheng; Pengyuan Liu; Weiguo Lu; Yan Lu
Journal:  Mol Cancer       Date:  2019-10-17       Impact factor: 27.401

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.