Literature DB >> 11242120

Homology-based annotation yields 1,042 new candidate genes in the Drosophila melanogaster genome.

S Gopal1, M Schroeder, U Pieper, A Sczyrba, G Aytekin-Kurban, S Bekiranov, J E Fajardo, N Eswar, R Sanchez, A Sali, T Gaasterland.   

Abstract

The approach to annotating a genome critically affects the number and accuracy of genes identified in the genome sequence. Genome annotation based on stringent gene identification is prone to underestimate the complement of genes encoded in a genome. In contrast, over-prediction of putative genes followed by exhaustive computational sequence, motif and structural homology search will find rarely expressed, possibly unique, new genes at the risk of including non-functional genes. We developed a two-stage approach that combines the merits of stringent genome annotation with the benefits of over-prediction. First we identify plausible genes regardless of matches with EST, cDNA or protein sequences from the organism (stage 1). In the second stage, proteins predicted from the plausible genes are compared at the protein level with EST, cDNA and protein sequences, and protein structures from other organisms (stage 2). Remote but biologically meaningful protein sequence or structure homologies provide supporting evidence for genuine genes. The method, applied to the Drosophila melanogaster genome, validated 1,042 novel candidate genes after filtering 19,410 plausible genes, of which 12,124 matched the original 13,601 annotated genes. This annotation strategy is applicable to genomes of all organisms, including human.

Entities:  

Mesh:

Substances:

Year:  2001        PMID: 11242120     DOI: 10.1038/85922

Source DB:  PubMed          Journal:  Nat Genet        ISSN: 1061-4036            Impact factor:   38.330


  16 in total

1.  MODBASE, a database of annotated comparative protein structure models.

Authors:  Ursula Pieper; Narayanan Eswar; Ashley C Stuart; Valentin A Ilyin; Andrej Sali
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

2.  Genome-wide analysis of core cell cycle genes in Arabidopsis.

Authors:  Klaas Vandepoele; Jeroen Raes; Lieven De Veylder; Pierre Rouzé; Stephane Rombauts; Dirk Inzé
Journal:  Plant Cell       Date:  2002-04       Impact factor: 11.277

3.  A question of size: the eukaryotic proteome and the problems in defining it.

Authors:  Paul M Harrison; Anuj Kumar; Ning Lang; Michael Snyder; Mark Gerstein
Journal:  Nucleic Acids Res       Date:  2002-03-01       Impact factor: 16.971

4.  Genomic shotgun array: a procedure linking large-scale DNA sequencing with regional transcript mapping.

Authors:  Ling-Hui Li; Jian-Chiuan Li; Yung-Feng Lin; Chung-Yen Lin; Chung-Yung Chen; Shih-Feng Tsai
Journal:  Nucleic Acids Res       Date:  2004-02-11       Impact factor: 16.971

5.  Transcriptional Profile during Deoxycholate-Induced Sporulation in a Clostridium perfringens Isolate Causing Foodborne Illness.

Authors:  Mayo Yasugi; Daisuke Okuzaki; Ritsuko Kuwana; Hiromu Takamatsu; Masaya Fujita; Mahfuzur R Sarker; Masami Miyake
Journal:  Appl Environ Microbiol       Date:  2016-05-02       Impact factor: 4.792

6.  The utility of geometrical and chemical restraint information extracted from predicted ligand-binding sites in protein structure refinement.

Authors:  Michal Brylinski; Seung Yup Lee; Hongyi Zhou; Jeffrey Skolnick
Journal:  J Struct Biol       Date:  2010-09-17       Impact factor: 2.867

7.  Comparative characterization, expression pattern and function analysis of the 12-oxo-phytodienoic acid reductase gene family in rice.

Authors:  Wenyan Li; Feng Zhou; Bing Liu; Dongru Feng; Yanming He; Kangbiao Qi; Hongbin Wang; Jinfa Wang
Journal:  Plant Cell Rep       Date:  2011-01-20       Impact factor: 4.570

8.  Assessing the Drosophila melanogaster and Anopheles gambiae genome annotations using genome-wide sequence comparisons.

Authors:  Olivier Jaillon; Carole Dossat; Ralph Eckenberg; Karin Eiglmeier; Béatrice Segurens; Jean-Marc Aury; Charles W Roth; Claude Scarpelli; Paul T Brey; Jean Weissenbach; Patrick Wincker
Journal:  Genome Res       Date:  2003-07       Impact factor: 9.043

9.  Genomic analyses reveal a conserved glutathione homeostasis pathway in the invertebrate chordate Ciona intestinalis.

Authors:  Gerardo M Nava; David Y Lee; Javier H Ospina; Shi-Ying Cai; H Rex Gaskins
Journal:  Physiol Genomics       Date:  2009-05-26       Impact factor: 3.107

10.  Systematic discovery of new genes in the Saccharomyces cerevisiae genome.

Authors:  Marco M Kessler; Qiandong Zeng; Sarah Hogan; Robin Cook; Arturo J Morales; Guillaume Cottarel
Journal:  Genome Res       Date:  2003-02       Impact factor: 9.043

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.