
Genome annotation past, present, and future: how to define an ORF at each locus.

Michael R Brent.

Abstract

Driven by competition, automation, and technology, the genomics community has far exceeded its ambition to sequence the human genome by 2005. By analyzing mammalian genomes, we have shed light on the history of our DNA sequence, determined that alternatively spliced RNAs and retroposed pseudogenes are incredibly abundant, and glimpsed the apparently huge number of non-coding RNAs that play significant roles in gene regulation. Ultimately, genome science is likely to provide comprehensive catalogs of these elements. However, the methods we have been using for most of the last 10 years will not yield even one complete open reading frame (ORF) for every gene, which is the first plateau on the long climb toward a comprehensive catalog. These strategies (sequencing randomly selected cDNA clones, aligning protein sequences identified in other organisms, sequencing more genomes, and manual curation) will have to be supplemented by large-scale amplification and sequencing of specific predicted mRNAs. The steady improvements in gene prediction that have occurred over the last 10 years have increased the efficacy of this approach and decreased its cost. In this Perspective, I review the state of gene prediction roughly 10 years ago, summarize the progress that has been made since, argue that the primary ORF identification methods we have relied on so far are inadequate, and recommend a path toward completing the Catalog of Protein Coding Genes, Version 1.0.

Year:  2005        PMID: 16339376     DOI: 10.1101/gr.3866105

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043



1.  A beginner's guide to eukaryotic genome annotation. (Review)

Authors:  Mark Yandell; Daniel Ence
Journal:  Nat Rev Genet       Date:  2012-04-18       Impact factor: 53.242

2.  Translation Initiation Site Profiling Reveals Widespread Synthesis of Non-AUG-Initiated Protein Isoforms in Yeast.

Authors:  Amy R Eisenberg; Andrea L Higdon; Ina Hollerer; Alexander P Fields; Irwin Jungreis; Paige D Diamond; Manolis Kellis; Marko Jovanovic; Gloria A Brar
Journal:  Cell Syst       Date:  2020-07-24       Impact factor: 10.304

3.  Retinoic acid regulation of eye and testis-specific transcripts within a complex locus.

Authors:  Pragnya Das; Timothy J Doyle; Donglin Liu; Jaspreet Kochar; Kwan Hee Kim; Melissa B Rogers
Journal:  Mech Dev       Date:  2006-10-28       Impact factor: 1.882

4.  Discovery of functional elements in 12 Drosophila genomes using evolutionary signatures.

Authors:  Alexander Stark; Michael F Lin; Pouya Kheradpour; Jakob S Pedersen; Leopold Parts; Joseph W Carlson; Madeline A Crosby; Matthew D Rasmussen; Sushmita Roy; Ameya N Deoras; J Graham Ruby; Julius Brennecke; Emily Hodges; Angie S Hinrichs; Anat Caspi; Benedict Paten; Seung-Won Park; Mira V Han; Morgan L Maeder; Benjamin J Polansky; Bryanne E Robson; Stein Aerts; Jacques van Helden; Bassem Hassan; Donald G Gilbert; Deborah A Eastman; Michael Rice; Michael Weir; Matthew W Hahn; Yongkyu Park; Colin N Dewey; Lior Pachter; W James Kent; David Haussler; Eric C Lai; David P Bartel; Gregory J Hannon; Thomas C Kaufman; Michael B Eisen; Andrew G Clark; Douglas Smith; Susan E Celniker; William M Gelbart; Manolis Kellis
Journal:  Nature       Date:  2007-11-08       Impact factor: 49.962

5.  GAPP: A Proteogenomic Software for Genome Annotation and Global Profiling of Post-translational Modifications in Prokaryotes.

Authors:  Jia Zhang; Ming-Kun Yang; Honghui Zeng; Feng Ge
Journal:  Mol Cell Proteomics       Date:  2016-09-14       Impact factor: 5.911

6.  Revisiting the protein-coding gene catalog of Drosophila melanogaster using 12 fly genomes.

Authors:  Michael F Lin; Joseph W Carlson; Madeline A Crosby; Beverley B Matthews; Charles Yu; Soo Park; Kenneth H Wan; Andrew J Schroeder; L Sian Gramates; Susan E St Pierre; Margaret Roark; Kenneth L Wiley; Rob J Kulathinal; Peili Zhang; Kyl V Myrick; Jerry V Antone; Susan E Celniker; William M Gelbart; Manolis Kellis
Journal:  Genome Res       Date:  2007-11-07       Impact factor: 9.043

7.  Experimental determination of translational starts using peptide mass mapping and tandem mass spectrometry within the proteome of Mycobacterium tuberculosis.

Authors:  Stuart C G Rison; Jens Mattow; Peter R Jungblut; Neil G Stoker
Journal:  Microbiology (Reading)       Date:  2007-02       Impact factor: 2.777

8.  GATExplorer: genomic and transcriptomic explorer; mapping expression probes to gene loci, transcripts, exons and ncRNAs.

Authors:  Alberto Risueño; Celia Fontanillo; Marcel E Dinger; Javier De Las Rivas
Journal:  BMC Bioinformatics       Date:  2010-04-29       Impact factor: 3.169

9.  Trends in genome dynamics among major orders of insects revealed through variations in protein families.

Authors:  Nadav Rappoport; Michal Linial
Journal:  BMC Genomics       Date:  2015-08-07       Impact factor: 3.969

10.  "Genes".

Authors:  Sonja J Prohaska; Peter F Stadler
Journal:  Theory Biosci       Date:  2008-03-05       Impact factor: 1.919
