Literature DB >> 17989246

Targeted discovery of novel human exons by comparative genomics.

Adam Siepel1, Mark Diekhans, Brona Brejová, Laura Langton, Michael Stevens, Charles L G Comstock, Colleen Davis, Brent Ewing, Shelly Oommen, Christopher Lau, Hung-Chun Yu, Jianfeng Li, Bruce A Roe, Phil Green, Daniela S Gerhard, Gary Temple, David Haussler, Michael R Brent.   

Abstract

A complete and accurate set of human protein-coding gene annotations is perhaps the single most important resource for genomic research after the human-genome sequence itself, yet the major gene catalogs remain incomplete and imperfect. Here we describe a genome-wide effort, carried out as part of the Mammalian Gene Collection (MGC) project, to identify human genes not yet in the gene catalogs. Our approach was to produce gene predictions by algorithms that rely on comparative sequence data but do not require direct cDNA evidence, then to test predicted novel genes by RT-PCR. We have identified 734 novel gene fragments (NGFs) containing 2188 exons with, at most, weak prior cDNA support. These NGFs correspond to an estimated 563 distinct genes, of which >160 are completely absent from the major gene catalogs, while hundreds of others represent significant extensions of known genes. The NGFs appear to be predominantly protein-coding genes rather than noncoding RNAs, unlike novel transcribed sequences identified by technologies such as tiling arrays and CAGE. They tend to be expressed at low levels and in a tissue-specific manner, and they are enriched for roles in motor activity, cell adhesion, connective tissue, and central nervous system development. Our results demonstrate that many important genes and gene fragments have been missed by traditional approaches to gene discovery but can be identified by their evolutionary signatures using comparative sequence data. However, they suggest that hundreds-not thousands-of protein-coding genes are completely missing from the current gene catalogs.

Entities:  

Mesh:

Year:  2007        PMID: 17989246      PMCID: PMC2099585          DOI: 10.1101/gr.7128207

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  61 in total

1.  Gene ontology: tool for the unification of biology. The Gene Ontology Consortium.

Authors:  M Ashburner; C A Ball; J A Blake; D Botstein; H Butler; J M Cherry; A P Davis; K Dolinski; S S Dwight; J T Eppig; M A Harris; D P Hill; L Issel-Tarver; A Kasarskis; S Lewis; J C Matese; J E Richardson; M Ringwald; G M Rubin; G Sherlock
Journal:  Nat Genet       Date:  2000-05       Impact factor: 38.330

Review 2.  MicroRNAs: genomics, biogenesis, mechanism, and function.

Authors:  David P Bartel
Journal:  Cell       Date:  2004-01-23       Impact factor: 41.582

3.  Aligning multiple genomic sequences with the threaded blockset aligner.

Authors:  Mathieu Blanchette; W James Kent; Cathy Riemer; Laura Elnitski; Arian F A Smit; Krishna M Roskin; Robert Baertsch; Kate Rosenbloom; Hiram Clawson; Eric D Green; David Haussler; Webb Miller
Journal:  Genome Res       Date:  2004-04       Impact factor: 9.043

4.  'Touchdown' PCR to circumvent spurious priming during gene amplification.

Authors:  R H Don; P T Cox; B J Wainwright; K Baker; J S Mattick
Journal:  Nucleic Acids Res       Date:  1991-07-25       Impact factor: 16.971

5.  3,400 new expressed sequence tags identify diversity of transcripts in human brain.

Authors:  M D Adams; A R Kerlavage; C Fields; J C Venter
Journal:  Nat Genet       Date:  1993-07       Impact factor: 38.330

6.  Finishing the euchromatic sequence of the human genome.

Authors: 
Journal:  Nature       Date:  2004-10-21       Impact factor: 49.962

7.  The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC).

Authors:  Daniela S Gerhard; Lukas Wagner; Elise A Feingold; Carolyn M Shenmen; Lynette H Grouse; Greg Schuler; Steven L Klein; Susan Old; Rebekah Rasooly; Peter Good; Mark Guyer; Allison M Peck; Jeffery G Derge; David Lipman; Francis S Collins; Wonhee Jang; Steven Sherry; Mike Feolo; Leonie Misquitta; Eduardo Lee; Kirill Rotmistrovsky; Susan F Greenhut; Carl F Schaefer; Kenneth Buetow; Tom I Bonner; David Haussler; Jim Kent; Mark Kiekhaus; Terry Furey; Michael Brent; Christa Prange; Kirsten Schreiber; Nicole Shapiro; Narayan K Bhat; Ralph F Hopkins; Florence Hsie; Tom Driscoll; M Bento Soares; Tom L Casavant; Todd E Scheetz; Michael J Brown-stein; Ted B Usdin; Shiraki Toshiyuki; Piero Carninci; Yulan Piao; Dawood B Dudekula; Minoru S H Ko; Koichi Kawakami; Yutaka Suzuki; Sumio Sugano; C E Gruber; M R Smith; Blake Simmons; Troy Moore; Richard Waterman; Stephen L Johnson; Yijun Ruan; Chia Lin Wei; S Mathavan; Preethi H Gunaratne; Jiaqian Wu; Angela M Garcia; Stephen W Hulyk; Edwin Fuh; Ye Yuan; Anna Sneed; Carla Kowis; Anne Hodgson; Donna M Muzny; John McPherson; Richard A Gibbs; Jessica Fahey; Erin Helton; Mark Ketteman; Anuradha Madan; Stephanie Rodrigues; Amy Sanchez; Michelle Whiting; Anup Madari; Alice C Young; Keith D Wetherby; Steven J Granite; Peggy N Kwong; Charles P Brinkley; Russell L Pearson; Gerard G Bouffard; Robert W Blakesly; Eric D Green; Mark C Dickson; Alex C Rodriguez; Jane Grimwood; Jeremy Schmutz; Richard M Myers; Yaron S N Butterfield; Malachi Griffith; Obi L Griffith; Martin I Krzywinski; Nancy Liao; Ryan Morin; Ryan Morrin; Diana Palmquist; Anca S Petrescu; Ursula Skalska; Duane E Smailus; Jeff M Stott; Angelique Schnerch; Jacqueline E Schein; Steven J M Jones; Robert A Holt; Agnes Baross; Marco A Marra; Sandra Clifton; Kathryn A Makowski; Stephanie Bosak; Joel Malek
Journal:  Genome Res       Date:  2004-10       Impact factor: 9.043

8.  Systematic recovery and analysis of full-ORF human cDNA clones.

Authors:  Agnes Baross; Yaron S N Butterfield; Shaun M Coughlin; Thomas Zeng; Malachi Griffith; Obi L Griffith; Anca S Petrescu; Duane E Smailus; Jaswinder Khattra; Helen L McDonald; Sheldon J McKay; Michelle Moksa; Robert A Holt; Marco A Marra
Journal:  Genome Res       Date:  2004-10       Impact factor: 9.043

9.  Large-scale RT-PCR recovery of full-length cDNA clones.

Authors:  Jia Qian Wu; Angela M Garcia; Steven Hulyk; Anna Sneed; Carla Kowis; Ye Yuan; David Steffen; John D McPherson; Preethi H Gunaratne; Richard A Gibbs
Journal:  Biotechniques       Date:  2004-04       Impact factor: 1.993

10.  Identification of rat genes by TWINSCAN gene prediction, RT-PCR, and direct sequencing.

Authors:  Jia Qian Wu; David Shteynberg; Manimozhiyan Arumugam; Richard A Gibbs; Michael R Brent
Journal:  Genome Res       Date:  2004-04       Impact factor: 9.043

View more
  27 in total

1.  The evolution of epitype.

Authors:  Richard B Meagher
Journal:  Plant Cell       Date:  2010-06-15       Impact factor: 11.277

2.  Comparative assessment of methods for aligning multiple genome sequences.

Authors:  Xiaoyu Chen; Martin Tompa
Journal:  Nat Biotechnol       Date:  2010-05-23       Impact factor: 54.908

3.  Accurate inference of transcription factor binding from DNA sequence and chromatin accessibility data.

Authors:  Roger Pique-Regi; Jacob F Degner; Athma A Pai; Daniel J Gaffney; Yoav Gilad; Jonathan K Pritchard
Journal:  Genome Res       Date:  2010-11-24       Impact factor: 9.043

4.  Spectral dictionaries: Integrating de novo peptide sequencing with database search of tandem mass spectra.

Authors:  Sangtae Kim; Nitin Gupta; Nuno Bandeira; Pavel A Pevzner
Journal:  Mol Cell Proteomics       Date:  2008-08-14       Impact factor: 5.911

5.  Sequencing and comparative analysis of a conserved syntenic segment in the Solanaceae.

Authors:  Ying Wang; Adam Diehl; Feinan Wu; Julia Vrebalov; James Giovannoni; Adam Siepel; Steven D Tanksley
Journal:  Genetics       Date:  2008-08-24       Impact factor: 4.562

6.  Darwinian alchemy: Human genes from noncoding DNA.

Authors:  Adam Siepel
Journal:  Genome Res       Date:  2009-10       Impact factor: 9.043

7.  Detection of nonneutral substitution rates on mammalian phylogenies.

Authors:  Katherine S Pollard; Melissa J Hubisz; Kate R Rosenbloom; Adam Siepel
Journal:  Genome Res       Date:  2009-10-26       Impact factor: 9.043

8.  mGene: accurate SVM-based gene finding with an application to nematode genomes.

Authors:  Gabriele Schweikert; Alexander Zien; Georg Zeller; Jonas Behr; Christoph Dieterich; Cheng Soon Ong; Petra Philips; Fabio De Bona; Lisa Hartmann; Anja Bohlen; Nina Krüger; Sören Sonnenburg; Gunnar Rätsch
Journal:  Genome Res       Date:  2009-06-29       Impact factor: 9.043

9.  PHAST and RPHAST: phylogenetic analysis with space/time models.

Authors:  Melissa J Hubisz; Katherine S Pollard; Adam Siepel
Journal:  Brief Bioinform       Date:  2010-12-21       Impact factor: 11.622

10.  A ranking-based scoring function for peptide-spectrum matches.

Authors:  Ari M Frank
Journal:  J Proteome Res       Date:  2009-05       Impact factor: 4.466

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.