Literature DB >> 12552088

Comparison of mouse and human genomes followed by experimental verification yields an estimated 1,019 additional genes.

Roderic Guigo1, Emmanouil T Dermitzakis, Pankaj Agarwal, Chris P Ponting, Genis Parra, Alexandre Reymond, Josep F Abril, Evan Keibler, Robert Lyle, Catherine Ucla, Stylianos E Antonarakis, Michael R Brent.   

Abstract

A primary motivation for sequencing the mouse genome was to accelerate the discovery of mammalian genes by using sequence conservation between mouse and human to identify coding exons. Achieving this goal proved challenging because of the large proportion of the mouse and human genomes that is apparently conserved but apparently does not code for protein. We developed a two-stage procedure that exploits the mouse and human genome sequences to produce a set of genes with a much higher rate of experimental verification than previously reported prediction methods. RT-PCR amplification and direct sequencing applied to an initial sample of mouse predictions that do not overlap previously known genes verified the regions flanking one intron in 139 predictions, with verification rates reaching 76%. On average, the confirmed predictions show more restricted expression patterns than the mouse orthologs of known human genes, and two-thirds lack homologs in fish genomes, demonstrating the sensitivity of this dual-genome approach to hard-to-find genes. We verified 112 previously unknown homologs of known proteins, including two homeobox proteins relevant to developmental biology, an aquaporin, and a homolog of dystrophin. We estimate that transcription and splicing can be verified for >1,000 gene predictions identified by this method that do not overlap known genes. This is likely to constitute a significant fraction of the previously unknown, multiexon mammalian genes.

Entities:  

Mesh:

Year:  2003        PMID: 12552088      PMCID: PMC298740          DOI: 10.1073/pnas.0337561100

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  39 in total

1.  RefSeq and LocusLink: NCBI gene-centered resources.

Authors:  K D Pruitt; D R Maglott
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

2.  PipMaker--a web server for aligning two genomic DNA sequences.

Authors:  S Schwartz; Z Zhang; K A Frazer; A Smit; C Riemer; J Bouck; R Gibbs; R Hardison; W Miller
Journal:  Genome Res       Date:  2000-04       Impact factor: 9.043

3.  T-Coffee: A novel method for fast and accurate multiple sequence alignment.

Authors:  C Notredame; D G Higgins; J Heringa
Journal:  J Mol Biol       Date:  2000-09-08       Impact factor: 5.469

4.  The conserved exon method for gene finding.

Authors:  V Bafna; D H Huson
Journal:  Proc Int Conf Intell Syst Mol Biol       Date:  2000

5.  Estimate of human gene number provided by genome-wide analysis using Tetraodon nigroviridis DNA sequence.

Authors:  H Roest Crollius; O Jaillon; A Bernot; C Dasilva; L Bouneau; C Fischer; C Fizames; P Wincker; P Brottier; F Quétier; W Saurin; J Weissenbach
Journal:  Nat Genet       Date:  2000-06       Impact factor: 38.330

6.  The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000.

Authors:  A Bairoch; R Apweiler
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

7.  Using GeneWise in the Drosophila annotation experiment.

Authors:  E Birney; R Durbin
Journal:  Genome Res       Date:  2000-04       Impact factor: 9.043

8.  Isolation and expression of novel human glutamate carboxypeptidases with N-acetylated alpha-linked acidic dipeptidase and dipeptidyl peptidase IV activity.

Authors:  M N Pangalos; J M Neefs; M Somers; P Verhasselt; M Bekkers; L van der Helm; E Fraiponts; D Ashton; R D Gordon
Journal:  J Biol Chem       Date:  1999-03-26       Impact factor: 5.157

9.  Human and mouse gene structure: comparative analysis and application to exon prediction.

Authors:  S Batzoglou; L Pachter; J P Mesirov; B Berger; E S Lander
Journal:  Genome Res       Date:  2000-07       Impact factor: 9.043

10.  Regenerating motor neurons express Nna1, a novel ATP/GTP-binding protein related to zinc carboxypeptidases.

Authors:  A Harris; J I Morgan; M Pecot; A Soumare; A Osborne; H D Soares
Journal:  Mol Cell Neurosci       Date:  2000-11       Impact factor: 4.314

View more
  55 in total

1.  Gene structure conservation aids similarity based gene prediction.

Authors:  Irmtraud M Meyer; Richard Durbin
Journal:  Nucleic Acids Res       Date:  2004-02-04       Impact factor: 16.971

2.  Comparative gene prediction in human and mouse.

Authors:  Genís Parra; Pankaj Agarwal; Josep F Abril; Thomas Wiehe; James W Fickett; Roderic Guigó
Journal:  Genome Res       Date:  2003-01       Impact factor: 9.043

3.  GlimmerM, Exonomy and Unveil: three ab initio eukaryotic genefinders.

Authors:  William H Majoros; Mihaela Pertea; Corina Antonescu; Steven L Salzberg
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

4.  ETOPE: Evolutionary test of predicted exons.

Authors:  Anton Nekrutenko; Wen-Yu Chung; Wen-Hsiung Li
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

5.  Accurate identification of novel human genes through simultaneous gene prediction in human, mouse, and rat.

Authors:  Colin Dewey; Jia Qian Wu; Simon Cawley; Marina Alexandersson; Richard Gibbs; Lior Pachter
Journal:  Genome Res       Date:  2004-04       Impact factor: 9.043

6.  Novel RNAs identified from an in-depth analysis of the transcriptome of human chromosomes 21 and 22.

Authors:  Dione Kampa; Jill Cheng; Philipp Kapranov; Mark Yamanaka; Shane Brubaker; Simon Cawley; Jorg Drenkow; Antonio Piccolboni; Stefan Bekiranov; Gregg Helt; Hari Tammana; Thomas R Gingeras
Journal:  Genome Res       Date:  2004-03       Impact factor: 9.043

7.  Numerous novel annotations of the human genome sequence supported by a 5'-end-enriched cDNA collection.

Authors:  Betina M Porcel; Olivier Delfour; Vanina Castelli; Veronique De Berardinis; Lucie Friedlander; Corinne Cruaud; Abel Ureta-Vidal; Claude Scarpelli; Patrick Wincker; Vincent Schächter; William Saurin; Gabor Gyapay; Marcel Salanoubat; Jean Weissenbach
Journal:  Genome Res       Date:  2004-02-12       Impact factor: 9.043

Review 8.  Chromosomal evolution in Rodentia.

Authors:  S A Romanenko; P L Perelman; V A Trifonov; A S Graphodatsky
Journal:  Heredity (Edinb)       Date:  2011-11-16       Impact factor: 3.821

9.  The other side of comparative genomics: genes with no orthologs between the cow and other mammalian species.

Authors:  Raffaele Mazza; Francesco Strozzi; Andrea Caprera; Paolo Ajmone-Marsan; John L Williams
Journal:  BMC Genomics       Date:  2009-12-14       Impact factor: 3.969

10.  Targeted discovery of novel human exons by comparative genomics.

Authors:  Adam Siepel; Mark Diekhans; Brona Brejová; Laura Langton; Michael Stevens; Charles L G Comstock; Colleen Davis; Brent Ewing; Shelly Oommen; Christopher Lau; Hung-Chun Yu; Jianfeng Li; Bruce A Roe; Phil Green; Daniela S Gerhard; Gary Temple; David Haussler; Michael R Brent
Journal:  Genome Res       Date:  2007-11-07       Impact factor: 9.043

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.