Literature DB >> 15805498

Closing in on the C. elegans ORFeome by cloning TWINSCAN predictions.

Chaochun Wei1, Philippe Lamesch, Manimozhiyan Arumugam, Jennifer Rosenberg, Ping Hu, Marc Vidal, Michael R Brent.   

Abstract

The genome of Caenorhabditis elegans was the first animal genome to be sequenced. Although considerable effort has been devoted to annotating it, the standard WormBase annotation contains thousands of predicted genes for which there is no cDNA or EST evidence. We hypothesized that a more complete experimental annotation could be obtained by creating a more accurate gene-prediction program and then amplifying and sequencing predicted genes. Our approach was to adapt the TWINSCAN gene prediction system to C. elegans and C. briggsae and to improve its splice site and intron-length models. The resulting system has 60% sensitivity and 58% specificity in exact prediction of open reading frames (ORFs), and hence, proteins-the best results we are aware of any multicellular organism. We then attempted to amplify, clone, and sequence 265 TWINSCAN-predicted ORFs that did not overlap WormBase gene annotations. The success rate was 55%, adding 146 genes that were completely absent from WormBase to the ORF clone collection (ORFeome). The same procedure had a 7% success rate on 90 Worm Base "predicted" genes that do not overlap TWINSCAN predictions. These results indicate that the accuracy of WormBase could be significantly increased by replacing its partially curated predicted genes with TWINSCAN predictions. The technology described in this study will continue to drive the C. elegans ORFeome toward completion and contribute to the annotation of the three Caenorhabditis species currently being sequenced. The results also suggest that this technology can significantly improve our knowledge of the "parts list" for even the best-studied model organisms.

Entities:  

Mesh:

Year:  2005        PMID: 15805498      PMCID: PMC1074372          DOI: 10.1101/gr.3329005

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  21 in total

Review 1.  Recent advances in gene structure prediction.

Authors:  Michael R Brent; Roderic Guigó
Journal:  Curr Opin Struct Biol       Date:  2004-06       Impact factor: 6.809

2.  Analysis of compositionally biased regions in sequence databases.

Authors:  J C Wootton; S Federhen
Journal:  Methods Enzymol       Date:  1996       Impact factor: 1.600

3.  Protein interaction mapping in C. elegans using proteins involved in vulval development.

Authors:  A J Walhout; R Sordella; X Lu; J L Hartley; G F Temple; M A Brasch; N Thierry-Mieg; M Vidal
Journal:  Science       Date:  2000-01-07       Impact factor: 47.728

4.  Prediction of complete gene structures in human genomic DNA.

Authors:  C Burge; S Karlin
Journal:  J Mol Biol       Date:  1997-04-25       Impact factor: 5.469

5.  WormBase: a multi-species resource for nematode biology and genomics.

Authors:  Todd W Harris; Nansheng Chen; Fiona Cunningham; Marcela Tello-Ruiz; Igor Antoshechkin; Carol Bastiani; Tamberlyn Bieri; Darin Blasiar; Keith Bradnam; Juancarlos Chan; Chao-Kung Chen; Wen J Chen; Paul Davis; Eimear Kenny; Ranjana Kishore; Daniel Lawson; Raymond Lee; Hans-Michael Muller; Cecilia Nakamura; Philip Ozersky; Andrei Petcherski; Anthony Rogers; Aniko Sabo; Erich M Schwarz; Kimberly Van Auken; Qinghua Wang; Richard Durbin; John Spieth; Paul W Sternberg; Lincoln D Stein
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

6.  Gene prediction and verification in a compact genome with numerous small introns.

Authors:  Aaron E Tenney; Randall H Brown; Charles Vaske; Jennifer K Lodge; Tamara L Doering; Michael R Brent
Journal:  Genome Res       Date:  2004-10-12       Impact factor: 9.043

Review 7.  Genome sequence of the nematode C. elegans: a platform for investigating biology.

Authors: 
Journal:  Science       Date:  1998-12-11       Impact factor: 47.728

8.  A phylogeny of caenorhabditis reveals frequent loss of introns during nematode evolution.

Authors:  Soochin Cho; Suk-Won Jin; Adam Cohen; Ronald E Ellis
Journal:  Genome Res       Date:  2004-07       Impact factor: 9.043

9.  Identification of rat genes by TWINSCAN gene prediction, RT-PCR, and direct sequencing.

Authors:  Jia Qian Wu; David Shteynberg; Manimozhiyan Arumugam; Richard A Gibbs; Michael R Brent
Journal:  Genome Res       Date:  2004-04       Impact factor: 9.043

10.  Eval: a software package for analysis of genome annotations.

Authors:  Evan Keibler; Michael R Brent
Journal:  BMC Bioinformatics       Date:  2003-10-17       Impact factor: 3.169

View more
  21 in total

1.  Computational and experimental approaches double the number of known introns in the pathogenic yeast Candida albicans.

Authors:  Quinn M Mitrovich; Brian B Tuch; Christine Guthrie; Alexander D Johnson
Journal:  Genome Res       Date:  2007-03-09       Impact factor: 9.043

2.  Quantitative measures for the management and comparison of annotated genomes.

Authors:  Karen Eilbeck; Barry Moore; Carson Holt; Mark Yandell
Journal:  BMC Bioinformatics       Date:  2009-02-23       Impact factor: 3.169

3.  The use of Saccharomyces cerevisiae proteomic libraries to identify RNA-modifying proteins.

Authors:  Jane E Jackman; Elizabeth J Grayhack; Eric M Phizicky
Journal:  Methods Mol Biol       Date:  2008

4.  Multigenome DNA sequence conservation identifies Hox cis-regulatory elements.

Authors:  Steven G Kuntz; Erich M Schwarz; John A DeModena; Tristan De Buysscher; Diane Trout; Hiroaki Shizuya; Paul W Sternberg; Barbara J Wold
Journal:  Genome Res       Date:  2008-11-03       Impact factor: 9.043

5.  Massively parallel sequencing of the polyadenylated transcriptome of C. elegans.

Authors:  Ladeana W Hillier; Valerie Reinke; Philip Green; Martin Hirst; Marco A Marra; Robert H Waterston
Journal:  Genome Res       Date:  2009-01-30       Impact factor: 9.043

6.  More than 9,000,000 unique genes in human gut bacterial community: estimating gene numbers inside a human body.

Authors:  Xing Yang; Lu Xie; Yixue Li; Chaochun Wei
Journal:  PLoS One       Date:  2009-06-29       Impact factor: 3.240

7.  Meeting report: a workshop on Best Practices in Genome Annotation.

Authors:  Ramana Madupu; Lauren M Brinkac; Jennifer Harrow; Laurens G Wilming; Ulrike Böhme; Philippe Lamesch; Linda I Hannick
Journal:  Database (Oxford)       Date:  2010-02-18       Impact factor: 3.451

8.  TWS1, a Novel Small Protein, Regulates Various Aspects of Seed and Plant Development.

Authors:  Elisa Fiume; Virginie Guyon; Carine Remoué; Enrico Magnani; Martine Miquel; Damaris Grain; Loïc Lepiniec
Journal:  Plant Physiol       Date:  2016-09-09       Impact factor: 8.340

9.  A collection of 10,096 indica rice full-length cDNAs reveals highly expressed sequence divergence between Oryza sativa indica and japonica subspecies.

Authors:  Xiaohui Liu; Tingting Lu; Shuliang Yu; Ying Li; Yuchen Huang; Tao Huang; Lei Zhang; Jingjie Zhu; Qiang Zhao; Danlin Fan; Jie Mu; Yingying Shangguan; Qi Feng; Jianping Guan; Kai Ying; Yu Zhang; Zhixin Lin; Zongxiu Sun; Qian Qian; Yuping Lu; Bin Han
Journal:  Plant Mol Biol       Date:  2007-05-24       Impact factor: 4.076

10.  Genomic DNA sequence comparison between two inbred soybean cyst nematode biotypes facilitated by massively parallel 454 micro-bead sequencing.

Authors:  Sadia Bekal; J P Craig; M E Hudson; T L Niblack; L L Domier; K N Lambert
Journal:  Mol Genet Genomics       Date:  2008-05       Impact factor: 3.291

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.