Literature DB >> 11726928

Computational identification of promoters and first exons in the human genome.

R V Davuluri1, I Grosse, M Q Zhang.   

Abstract

The identification of promoters and first exons has been one of the most difficult problems in gene-finding. We present a set of discriminant functions that can recognize structural and compositional features such as CpG islands, promoter regions and first splice-donor sites. We explain the implementation of the discriminant functions into a decision tree that constitutes a new program called FirstEF. By using different models to predict CpG-related and non-CpG-related first exons, we showed by cross-validation that the program could predict 86% of the first exons with 17% false positives. We also demonstrated the prediction accuracy of FirstEF at the genome level by applying it to the finished sequences of human chromosomes 21 and 22 as well as by comparing the predictions with the locations of the experimentally verified first exons. Finally, we present the analysis of the predicted first exons for all of the 24 chromosomes of the human genome.

Entities:  

Mesh:

Year:  2001        PMID: 11726928     DOI: 10.1038/ng780

Source DB:  PubMed          Journal:  Nat Genet        ISSN: 1061-4036            Impact factor:   38.330


  155 in total

1.  A question of size: the eukaryotic proteome and the problems in defining it.

Authors:  Paul M Harrison; Anuj Kumar; Ning Lang; Michael Snyder; Mark Gerstein
Journal:  Nucleic Acids Res       Date:  2002-03-01       Impact factor: 16.971

Review 2.  In silico identification of metazoan transcriptional regulatory regions.

Authors:  Wyeth W Wasserman; William Krivan
Journal:  Naturwissenschaften       Date:  2003-03-27

Review 3.  Computational approaches to identify promoters and cis-regulatory elements in plant genomes.

Authors:  Stephane Rombauts; Kobe Florquin; Magali Lescot; Kathleen Marchal; Pierre Rouzé; Yves van de Peer
Journal:  Plant Physiol       Date:  2003-07       Impact factor: 8.340

4.  Refined annotation of the Arabidopsis genome by complete expressed sequence tag mapping.

Authors:  Wei Zhu; Shannon D Schlueter; Volker Brendel
Journal:  Plant Physiol       Date:  2003-06       Impact factor: 8.340

Review 5.  Current methods of gene prediction, their strengths and weaknesses.

Authors:  Catherine Mathé; Marie-France Sagot; Thomas Schiex; Pierre Rouzé
Journal:  Nucleic Acids Res       Date:  2002-10-01       Impact factor: 16.971

6.  Large scale study of protein domain distribution in the context of alternative splicing.

Authors:  Shuo Liu; Russ B Altman
Journal:  Nucleic Acids Res       Date:  2003-08-15       Impact factor: 16.971

Review 7.  Regulation of alternative RNA splicing by exon definition and exon sequences in viral and mammalian gene expression.

Authors:  Zhi-Ming Zheng
Journal:  J Biomed Sci       Date:  2004 May-Jun       Impact factor: 8.410

8.  The multiassembly problem: reconstructing multiple transcript isoforms from EST fragment mixtures.

Authors:  Yi Xing; Alissa Resch; Christopher Lee
Journal:  Genome Res       Date:  2004-02-12       Impact factor: 9.043

9.  Epigenome scans and cancer genome sequencing converge on WNK2, a kinase-independent suppressor of cell growth.

Authors:  Chibo Hong; K Scott Moorefield; Peter Jun; Kenneth D Aldape; Samir Kharbanda; Heidi S Phillips; Joseph F Costello
Journal:  Proc Natl Acad Sci U S A       Date:  2007-06-19       Impact factor: 11.205

10.  β3-chimaerin, a novel member of the chimaerin Rac-GAP family.

Authors:  Lautaro Zubeldia-Brenner; Alvaro Gutierrez-Uzquiza; Laura Barrio-Real; Hongbin Wang; Marcelo G Kazanietz; Federico Coluccio Leskow
Journal:  Mol Biol Rep       Date:  2014-01-16       Impact factor: 2.316

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.