Vertebrate gene predictions and the problem of large genes.

Jun Wang, ShengTing Li, Yong Zhang, HongKun Zheng, Zhao Xu, Jia Ye, Jun Yu, Gane Ka-Shu Wong.

Abstract

To find unknown protein-coding genes, annotation pipelines use a combination of ab initio gene prediction and similarity to experimentally confirmed genes or proteins. Here, we show that although the ab initio predictions have an intrinsically high false-positive rate, they also have a consistently low false-negative rate. The incorporation of similarity information is meant to reduce the false-positive rate, but in doing so it increases the false-negative rate. The crucial variable is gene size (including introns): genes of the most extreme sizes, especially very large genes, are most likely to be incorrectly predicted.

Year: 2003 PMID: 12951575 DOI: 10.1038/nrg1160

Source DB: PubMed Journal: Nat Rev Genet ISSN: 1471-0056 Impact factor: 53.242

Cited by (29 in total)

1. Evaluation of five ab initio gene prediction programs for the discovery of maize genes.

Authors: Hong Yao; Ling Guo; Yan Fu; Lisa A Borsuk; Tsui-Jung Wen; David S Skibbe; Xiangqin Cui; Brian E Scheffler; Jun Cao; Scott J Emrich; Daniel A Ashlock; Patrick S Schnable
Journal: Plant Mol Biol Date: 2005-02 Impact factor: 4.076

2. Long non-coding RNA and chromatin remodeling. (Review)

Authors: Pei Han; Ching-Pin Chang
Journal: RNA Biol Date: 2015-07-15 Impact factor: 4.652

3. Emerging evidence for functional peptides encoded by short open reading frames. (Review)

Authors: Shea J Andrews; Joseph A Rothnagel
Journal: Nat Rev Genet Date: 2014-02-11 Impact factor: 53.242

4. Hundreds of putatively functional small open reading frames in Drosophila.

Authors: Emmanuel Ladoukakis; Vini Pereira; Emile G Magny; Adam Eyre-Walker; Juan Pablo Couso
Journal: Genome Biol Date: 2011-11-25 Impact factor: 13.583

5. Evolutionary expansion, gene structure, and expression of the rice wall-associated kinase gene family.

Authors: Shibo Zhang; Calvin Chen; Lei Li; Ling Meng; Jaswinder Singh; Ning Jiang; Xing-Wang Deng; Zheng-Hui He; Peggy G Lemaux
Journal: Plant Physiol Date: 2005-11 Impact factor: 8.340

6. Small open reading frames: current prediction techniques and future prospect. (Review)

Authors: Haoyu Cheng; Wai Soon Chan; Zhixiu Li; Dan Wang; Song Liu; Yaoqi Zhou
Journal: Curr Protein Pept Sci Date: 2011-09 Impact factor: 3.272

7. New members of the neurexin superfamily: multiple rodent homologues of the human CASPR5 gene.

Authors: Walther Traut; Dieter Weichenhan; Heinz Himmelbauer; Heinz Winking
Journal: Mamm Genome Date: 2006-07-14 Impact factor: 2.957

8. A large number of novel coding small open reading frames in the intergenic regions of the Arabidopsis thaliana genome are transcribed and/or under purifying selection.

Authors: Kousuke Hanada; Xu Zhang; Justin O Borevitz; Wen-Hsiung Li; Shin-Han Shiu
Journal: Genome Res Date: 2007-03-29 Impact factor: 9.043

9. Signals of recent positive selection in a worldwide sample of human populations.

Authors: Joseph K Pickrell; Graham Coop; John Novembre; Sridhar Kudaravalli; Jun Z Li; Devin Absher; Balaji S Srinivasan; Gregory S Barsh; Richard M Myers; Marcus W Feldman; Jonathan K Pritchard
Journal: Genome Res Date: 2009-03-23 Impact factor: 9.043

10. Identification of human HK genes and gene expression regulation study in cancer from transcriptomics data analysis.

Authors: Meili Chen; Jingfa Xiao; Zhang Zhang; Jingxing Liu; Jiayan Wu; Jun Yu
Journal: PLoS One Date: 2013-01-31 Impact factor: 3.240