Literature DB >> 11242119

Open-reading-frame sequence tags (OSTs) support the existence of at least 17,300 genes in C. elegans.

J Reboul1, P Vaglio, N Tzellas, N Thierry-Mieg, T Moore, C Jackson, T Shin-i, Y Kohara, D Thierry-Mieg, J Thierry-Mieg, H Lee, J Hitti, L Doucette-Stamm, J L Hartley, G F Temple, M A Brasch, J Vandenhaute, P E Lamesch, D E Hill, M Vidal.   

Abstract

The genome sequences of Caenorhabditis elegans, Drosophila melanogaster and Arabidopsis thaliana have been predicted to contain 19,000, 13,600 and 25,500 genes, respectively. Before this information can be fully used for evolutionary and functional studies, several issues need to be addressed. First, the gene number estimates obtained in silico and not yet supported by any experimental data need to be verified. For example, it seems biologically paradoxical that C. elegans would have 50% more genes than Drosophilia. Second, intron/exon predictions need to be tested experimentally. Third, complete sets of open reading frames (ORFs), or "ORFeomes," need to be cloned into various expression vectors. To address these issues simultaneously, we have designed and applied to C. elegans the following strategy. Predicted ORFs are amplified by PCR from a highly representative cDNA library using ORF-specific primers, cloned by Gateway recombination cloning and then sequenced to generate ORF sequence tags (OSTs) as a way to verify identity and splicing. In a sample (n=1,222) of the nearly 10,000 genes predicted ab initio (that is, for which no expressed sequence tag (EST) is available so far), at least 70% were verified by OSTs. We also observed that 27% of these experimentally confirmed genes have a structure different from that predicted by GeneFinder. We now have experimental evidence that supports the existence of at least 17,300 genes in C. elegans. Hence we suggest that gene counts based primarily on ESTs may underestimate the number of genes in human and in other organisms.

Entities:  

Mesh:

Year:  2001        PMID: 11242119     DOI: 10.1038/85913

Source DB:  PubMed          Journal:  Nat Genet        ISSN: 1061-4036            Impact factor:   38.330


  58 in total

1.  The new biology. Genomics fosters a "systems approach" and collaborations between academic, government, and industry scientists.

Authors:  N A Eckardt
Journal:  Plant Cell       Date:  2001-04       Impact factor: 11.277

2.  A protein-protein interaction map of the Caenorhabditis elegans 26S proteasome.

Authors:  A Davy; P Bello; N Thierry-Mieg; P Vaglio; J Hitti; L Doucette-Stamm; D Thierry-Mieg; J Reboul; S Boulton; A J Walhout; O Coux; M Vidal
Journal:  EMBO Rep       Date:  2001-09       Impact factor: 8.807

3.  Identification of potential interaction networks using sequence-based searches for conserved protein-protein interactions or "interologs".

Authors:  L R Matthews; P Vaglio; J Reboul; H Ge; B P Davis; J Garrels; S Vincent; M Vidal
Journal:  Genome Res       Date:  2001-12       Impact factor: 9.043

4.  A question of size: the eukaryotic proteome and the problems in defining it.

Authors:  Paul M Harrison; Anuj Kumar; Ning Lang; Michael Snyder; Mark Gerstein
Journal:  Nucleic Acids Res       Date:  2002-03-01       Impact factor: 16.971

5.  Structural characterization of the human proteome.

Authors:  Arne Müller; Robert M MacCallum; Michael J E Sternberg
Journal:  Genome Res       Date:  2002-11       Impact factor: 9.043

6.  WormBase: a multi-species resource for nematode biology and genomics.

Authors:  Todd W Harris; Nansheng Chen; Fiona Cunningham; Marcela Tello-Ruiz; Igor Antoshechkin; Carol Bastiani; Tamberlyn Bieri; Darin Blasiar; Keith Bradnam; Juancarlos Chan; Chao-Kung Chen; Wen J Chen; Paul Davis; Eimear Kenny; Ranjana Kishore; Daniel Lawson; Raymond Lee; Hans-Michael Muller; Cecilia Nakamura; Philip Ozersky; Andrei Petcherski; Anthony Rogers; Aniko Sabo; Erich M Schwarz; Kimberly Van Auken; Qinghua Wang; Richard Durbin; John Spieth; Paul W Sternberg; Lincoln D Stein
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

7.  A first version of the Caenorhabditis elegans Promoterome.

Authors:  Denis Dupuy; Qian-Ru Li; Bart Deplancke; Mike Boxem; Tong Hao; Philippe Lamesch; Reynaldo Sequerra; Stephanie Bosak; Lynn Doucette-Stamm; Ian A Hope; David E Hill; Albertha J M Walhout; Marc Vidal
Journal:  Genome Res       Date:  2004-10       Impact factor: 9.043

8.  Human ORFeome version 1.1: a platform for reverse proteomics.

Authors:  Jean-François Rual; Tomoko Hirozane-Kishikawa; Tong Hao; Nicolas Bertin; Siming Li; Amélie Dricot; Ning Li; Jennifer Rosenberg; Philippe Lamesch; Pierre-Olivier Vidalain; Tracey R Clingingsmith; James L Hartley; Dominic Esposito; David Cheo; Troy Moore; Blake Simmons; Reynaldo Sequerra; Stephanie Bosak; Lynn Doucette-Stamm; Christian Le Peuch; Jean Vandenhaute; Michael E Cusick; Joanna S Albala; David E Hill; Marc Vidal
Journal:  Genome Res       Date:  2004-10       Impact factor: 9.043

9.  Toward improving Caenorhabditis elegans phenome mapping with an ORFeome-based RNAi library.

Authors:  Jean-François Rual; Julian Ceron; John Koreth; Tong Hao; Anne-Sophie Nicot; Tomoko Hirozane-Kishikawa; Jean Vandenhaute; Stuart H Orkin; David E Hill; Sander van den Heuvel; Marc Vidal
Journal:  Genome Res       Date:  2004-10       Impact factor: 9.043

10.  C. elegans ORFeome version 3.1: increasing the coverage of ORFeome resources with improved gene predictions.

Authors:  Philippe Lamesch; Stuart Milstein; Tong Hao; Jennifer Rosenberg; Ning Li; Reynaldo Sequerra; Stephanie Bosak; Lynn Doucette-Stamm; Jean Vandenhaute; David E Hill; Marc Vidal
Journal:  Genome Res       Date:  2004-10       Impact factor: 9.043

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.