Literature DB >> 12951575

Vertebrate gene predictions and the problem of large genes.

Jun Wang1, ShengTing Li, Yong Zhang, HongKun Zheng, Zhao Xu, Jia Ye, Jun Yu, Gane Ka-Shu Wong.   

Abstract

To find unknown protein-coding genes, annotation pipelines use a combination of ab initio gene prediction and similarity to experimentally confirmed genes or proteins. Here, we show that although the ab initio predictions have an intrinsically high false-positive rate, they also have a consistently low false-negative rate. The incorporation of similarity information is meant to reduce the false-positive rate, but in doing so it increases the false-negative rate. The crucial variable is gene size (including introns)--genes of the most extreme sizes, especially very large genes, are most likely to be incorrectly predicted.

Mesh:

Year:  2003        PMID: 12951575     DOI: 10.1038/nrg1160

Source DB:  PubMed          Journal:  Nat Rev Genet        ISSN: 1471-0056            Impact factor:   53.242


  29 in total

1.  Evaluation of five ab initio gene prediction programs for the discovery of maize genes.

Authors:  Hong Yao; Ling Guo; Yan Fu; Lisa A Borsuk; Tsui-Jung Wen; David S Skibbe; Xiangqin Cui; Brian E Scheffler; Jun Cao; Scott J Emrich; Daniel A Ashlock; Patrick S Schnable
Journal:  Plant Mol Biol       Date:  2005-02       Impact factor: 4.076

Review 2.  Long non-coding RNA and chromatin remodeling.

Authors:  Pei Han; Ching-Pin Chang
Journal:  RNA Biol       Date:  2015-07-15       Impact factor: 4.652

Review 3.  Emerging evidence for functional peptides encoded by short open reading frames.

Authors:  Shea J Andrews; Joseph A Rothnagel
Journal:  Nat Rev Genet       Date:  2014-02-11       Impact factor: 53.242

4.  Hundreds of putatively functional small open reading frames in Drosophila.

Authors:  Emmanuel Ladoukakis; Vini Pereira; Emile G Magny; Adam Eyre-Walker; Juan Pablo Couso
Journal:  Genome Biol       Date:  2011-11-25       Impact factor: 13.583

5.  Evolutionary expansion, gene structure, and expression of the rice wall-associated kinase gene family.

Authors:  Shibo Zhang; Calvin Chen; Lei Li; Ling Meng; Jaswinder Singh; Ning Jiang; Xing-Wang Deng; Zheng-Hui He; Peggy G Lemaux
Journal:  Plant Physiol       Date:  2005-11       Impact factor: 8.340

Review 6.  Small open reading frames: current prediction techniques and future prospect.

Authors:  Haoyu Cheng; Wai Soon Chan; Zhixiu Li; Dan Wang; Song Liu; Yaoqi Zhou
Journal:  Curr Protein Pept Sci       Date:  2011-09       Impact factor: 3.272

7.  New members of the neurexin superfamily: multiple rodent homologues of the human CASPR5 gene.

Authors:  Walther Traut; Dieter Weichenhan; Heinz Himmelbauer; Heinz Winking
Journal:  Mamm Genome       Date:  2006-07-14       Impact factor: 2.957

8.  A large number of novel coding small open reading frames in the intergenic regions of the Arabidopsis thaliana genome are transcribed and/or under purifying selection.

Authors:  Kousuke Hanada; Xu Zhang; Justin O Borevitz; Wen-Hsiung Li; Shin-Han Shiu
Journal:  Genome Res       Date:  2007-03-29       Impact factor: 9.043

9.  Signals of recent positive selection in a worldwide sample of human populations.

Authors:  Joseph K Pickrell; Graham Coop; John Novembre; Sridhar Kudaravalli; Jun Z Li; Devin Absher; Balaji S Srinivasan; Gregory S Barsh; Richard M Myers; Marcus W Feldman; Jonathan K Pritchard
Journal:  Genome Res       Date:  2009-03-23       Impact factor: 9.043

10.  Identification of human HK genes and gene expression regulation study in cancer from transcriptomics data analysis.

Authors:  Meili Chen; Jingfa Xiao; Zhang Zhang; Jingxing Liu; Jiayan Wu; Jun Yu
Journal:  PLoS One       Date:  2013-01-31       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.