Literature DB >> 12529305

Leveraging the mouse genome for gene prediction in human: from whole-genome shotgun reads to a global synteny map.

Paul Flicek1, Evan Keibler, Ping Hu, Ian Korf, Michael R Brent.   

Abstract

The availability of draft sequences for both the mouse and human genomes makes it possible, for the first time, to annotate whole mammalian genomes using comparative methods. TWINSCAN is a gene-prediction system that combines the methods of single-genome predictors like GENSCAN with information derived from genome comparison, thereby improving accuracy. Because TWINSCAN uses genomic sequence only, it is less biased toward highly and/or ubiquitously expressed genes than GENEWISE, GENOMESCAN, and other methods based on evidence derived from transcripts. We show that TWINSCAN improves gene prediction in human using intermediate products from various stages of the sequencing and analysis of the mouse genome, from low-redundancy, whole-genome shotgun reads to the draft assembly and the synteny map. TWINSCAN improves on the prior state of the art even when alignments from only 1X coverage of the mouse genome are available. Gene prediction accuracy improves steadily from 1X through 3X, more slowly from 3X to 4X, and relatively little thereafter. The assembly and the synteny map greatly speed the computations, however. Our human annotation using the mouse assembly is conservative, predicting only 25,622 genes, and appears to be one of the best de novo annotations of the human genome to date.

Entities:  

Mesh:

Year:  2003        PMID: 12529305      PMCID: PMC430948          DOI: 10.1101/gr.830003

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  29 in total

1.  PipMaker--a web server for aligning two genomic DNA sequences.

Authors:  S Schwartz; Z Zhang; K A Frazer; A Smit; C Riemer; J Bouck; R Gibbs; R Hardison; W Miller
Journal:  Genome Res       Date:  2000-04       Impact factor: 9.043

2.  Shotgun sample sequence comparisons between mouse and human genomes.

Authors:  J B Bouck; M L Metzker; R A Gibbs
Journal:  Nat Genet       Date:  2000-05       Impact factor: 38.330

3.  GeneID in Drosophila.

Authors:  G Parra; E Blanco; R Guigó
Journal:  Genome Res       Date:  2000-04       Impact factor: 9.043

4.  Using GeneWise in the Drosophila annotation experiment.

Authors:  E Birney; R Durbin
Journal:  Genome Res       Date:  2000-04       Impact factor: 9.043

5.  Prediction of complete gene structures in human genomic DNA.

Authors:  C Burge; S Karlin
Journal:  J Mol Biol       Date:  1997-04-25       Impact factor: 5.469

6.  Comparative analysis of 1196 orthologous mouse and human full-length mRNA and protein sequences.

Authors:  W Makałowski; J Zhang; M S Boguski
Journal:  Genome Res       Date:  1996-09       Impact factor: 9.043

7.  Comparative sequence analysis of a gene-rich cluster at human chromosome 12p13 and its syntenic region in mouse chromosome 6.

Authors:  M A Ansari-Lari; J C Oeltjen; S Schwartz; Z Zhang; D M Muzny; J Lu; J H Gorrell; A C Chinault; J W Belmont; W Miller; R A Gibbs
Journal:  Genome Res       Date:  1998-01       Impact factor: 9.043

8.  Large-scale comparative sequence analysis of the human and murine Bruton's tyrosine kinase loci reveals conserved regulatory domains.

Authors:  J C Oeltjen; T M Malley; D M Muzny; W Miller; R A Gibbs; J W Belmont
Journal:  Genome Res       Date:  1997-04       Impact factor: 9.043

9.  Evolutionary parameters of the transcribed mammalian genome: an analysis of 2,820 orthologous rodent and human sequences.

Authors:  W Makalowski; M S Boguski
Journal:  Proc Natl Acad Sci U S A       Date:  1998-08-04       Impact factor: 11.205

10.  Comparative sequence of human and mouse BAC clones from the mnd2 region of chromosome 2p13.

Authors:  W Jang; A Hua; S V Spilson; W Miller; B A Roe; M H Meisler
Journal:  Genome Res       Date:  1999-01       Impact factor: 9.043

View more
  39 in total

1.  Identification and characterization of multi-species conserved sequences.

Authors:  Elliott H Margulies; Mathieu Blanchette; David Haussler; Eric D Green
Journal:  Genome Res       Date:  2003-12       Impact factor: 9.043

2.  Computational gene prediction using multiple sources of evidence.

Authors:  Jonathan E Allen; Mihaela Pertea; Steven L Salzberg
Journal:  Genome Res       Date:  2004-01       Impact factor: 9.043

3.  GeneWise and Genomewise.

Authors:  Ewan Birney; Michele Clamp; Richard Durbin
Journal:  Genome Res       Date:  2004-05       Impact factor: 9.043

4.  Iterative gene prediction and pseudogene removal improves genome annotation.

Authors:  Marijke J van Baren; Michael R Brent
Journal:  Genome Res       Date:  2006-05       Impact factor: 9.043

5.  Prediction of small, noncoding RNAs in bacteria using heterogeneous data.

Authors:  Brian Tjaden
Journal:  J Math Biol       Date:  2007-03-13       Impact factor: 2.259

6.  Comparative analysis of methylthioalkylmalate synthase (MAM) gene family and flanking DNA sequences in Brassica oleracea and Arabidopsis thaliana.

Authors:  Muqiang Gao; Genyi Li; Daniel Potter; W Richard McCombie; Carlos F Quiros
Journal:  Plant Cell Rep       Date:  2006-01-24       Impact factor: 4.570

Review 7.  The bioinformatics challenges in comparative analysis of cereal genomes-an overview.

Authors:  M Bellgard; Jia Ye; T Gojobori; R Appels
Journal:  Funct Integr Genomics       Date:  2004-02-10       Impact factor: 3.410

8.  Comparative analysis of a transposon-rich Brassica oleracea BAC clone with its corresponding sequence in A. thaliana.

Authors:  Muqiang Gao; Genyi Li; W Richard McCombie; Carlos F Quiros
Journal:  Theor Appl Genet       Date:  2005-10-18       Impact factor: 5.699

9.  Targeted discovery of novel human exons by comparative genomics.

Authors:  Adam Siepel; Mark Diekhans; Brona Brejová; Laura Langton; Michael Stevens; Charles L G Comstock; Colleen Davis; Brent Ewing; Shelly Oommen; Christopher Lau; Hung-Chun Yu; Jianfeng Li; Bruce A Roe; Phil Green; Daniela S Gerhard; Gary Temple; David Haussler; Michael R Brent
Journal:  Genome Res       Date:  2007-11-07       Impact factor: 9.043

10.  Gene prediction and verification in a compact genome with numerous small introns.

Authors:  Aaron E Tenney; Randall H Brown; Charles Vaske; Jennifer K Lodge; Tamara L Doering; Michael R Brent
Journal:  Genome Res       Date:  2004-10-12       Impact factor: 9.043

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.