Literature DB >> 34928390

Foster thy young: enhanced prediction of orphan genes in assembled genomes.

Jing Li1,2,3, Urminder Singh1,2,4, Priyanka Bhandary1,2,4, Jacqueline Campbell5, Zebulun Arendsee1,2,4, Arun S Seetharam6, Eve Syrkin Wurtele1,2,3,4.   

Abstract

Proteins encoded by newly-emerged genes ('orphan genes') share no sequence similarity with proteins in any other species. They provide organisms with a reservoir of genetic elements to quickly respond to changing selection pressures. Here, we systematically assess the ability of five gene prediction pipelines to accurately predict genes in genomes according to phylostratal origin. BRAKER and MAKER are existing, popular ab initio tools that infer gene structures by machine learning. Direct Inference is an evidence-based pipeline we developed to predict gene structures from alignments of RNA-Seq data. The BIND pipeline integrates ab initio predictions of BRAKER and Direct inference; MIND combines Direct Inference and MAKER predictions. We use highly-curated Arabidopsis and yeast annotations as gold-standard benchmarks, and cross-validate in rice. Each pipeline under-predicts orphan genes (as few as 11 percent, under one prediction scenario). Increasing RNA-Seq diversity greatly improves prediction efficacy. The combined methods (BIND and MIND) yield best predictions overall, BIND identifying 68% of annotated orphan genes, 99% of ancient genes, and give the highest sensitivity score regardless dataset in Arabidopsis. We provide a light weight, flexible, reproducible, and well-documented solution to improve gene prediction.
© The Author(s) 2021. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Year:  2022        PMID: 34928390      PMCID: PMC9023268          DOI: 10.1093/nar/gkab1238

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   19.160


  91 in total

1.  Domain combinations in archaeal, eubacterial and eukaryotic proteomes.

Authors:  G Apic; J Gough; S A Teichmann
Journal:  J Mol Biol       Date:  2001-07-06       Impact factor: 5.469

2.  Using native and syntenically mapped cDNA alignments to improve de novo gene finding.

Authors:  Mario Stanke; Mark Diekhans; Robert Baertsch; David Haussler
Journal:  Bioinformatics       Date:  2008-01-24       Impact factor: 6.937

Review 3.  Venomics: integrative venom proteomics and beyond.

Authors:  Juan J Calvete
Journal:  Biochem J       Date:  2017-02-20       Impact factor: 3.857

Review 4.  Emerging and evolving concepts in gene essentiality.

Authors:  Giulia Rancati; Jason Moffat; Athanasios Typas; Norman Pavelka
Journal:  Nat Rev Genet       Date:  2017-10-16       Impact factor: 53.242

5.  The conversion of 3' UTRs into coding regions.

Authors:  Michael G Giacomelli; Adam S Hancock; Joanna Masel
Journal:  Mol Biol Evol       Date:  2006-11-10       Impact factor: 16.240

6.  Phylogenetic patterns of emergence of new genes support a model of frequent de novo evolution.

Authors:  Rafik Neme; Diethard Tautz
Journal:  BMC Genomics       Date:  2013-02-21       Impact factor: 3.969

7.  The sequence read archive.

Authors:  Rasko Leinonen; Hideaki Sugawara; Martin Shumway
Journal:  Nucleic Acids Res       Date:  2010-11-09       Impact factor: 16.971

8.  De novo gene birth.

Authors:  Stephen Branden Van Oss; Anne-Ruxandra Carvunis
Journal:  PLoS Genet       Date:  2019-05-23       Impact factor: 5.917

9.  A rice gene of de novo origin negatively regulates pathogen-induced defense response.

Authors:  Wenfei Xiao; Hongbo Liu; Yu Li; Xianghua Li; Caiguo Xu; Manyuan Long; Shiping Wang
Journal:  PLoS One       Date:  2009-02-25       Impact factor: 3.240

10.  Nematode orphan genes are adopted by conserved regulatory networks and find a home in ecology.

Authors:  Melanie G Mayer; Ralf J Sommer
Journal:  Worm       Date:  2015-08-24
View more
  4 in total

1.  Identification of novel PHD-finger genes in pepper by genomic re-annotation and comparative analyses.

Authors:  Ji-Yoon Guk; Min-Jeong Jang; Seungill Kim
Journal:  BMC Plant Biol       Date:  2022-04-20       Impact factor: 5.260

Review 2.  Research Advances and Prospects of Orphan Genes in Plants.

Authors:  Mingliang Jiang; Xiaonan Li; Xiangshu Dong; Ye Zu; Zongxiang Zhan; Zhongyun Piao; Hong Lang
Journal:  Front Plant Sci       Date:  2022-07-08       Impact factor: 6.627

Review 3.  Origins, evolution, and physiological implications of de novo genes in yeast.

Authors:  Saurin B Parikh; Carly Houghton; S Branden Van Oss; Aaron Wacholder; Anne-Ruxandra Carvunis
Journal:  Yeast       Date:  2022-08-24       Impact factor: 3.325

4.  De novo assembly, annotation, and comparative analysis of 26 diverse maize genomes.

Authors:  Matthew B Hufford; Arun S Seetharam; Margaret R Woodhouse; Kapeel M Chougule; Shujun Ou; Jianing Liu; William A Ricci; Tingting Guo; Andrew Olson; Yinjie Qiu; Rafael Della Coletta; Silas Tittes; Asher I Hudson; Alexandre P Marand; Sharon Wei; Zhenyuan Lu; Bo Wang; Marcela K Tello-Ruiz; Rebecca D Piri; Na Wang; Dong Won Kim; Yibing Zeng; Christine H O'Connor; Xianran Li; Amanda M Gilbert; Erin Baggs; Ksenia V Krasileva; John L Portwood; Ethalinda K S Cannon; Carson M Andorf; Nancy Manchanda; Samantha J Snodgrass; David E Hufnagel; Qiuhan Jiang; Sarah Pedersen; Michael L Syring; David A Kudrna; Victor Llaca; Kevin Fengler; Robert J Schmitz; Jeffrey Ross-Ibarra; Jianming Yu; Jonathan I Gent; Candice N Hirsch; Doreen Ware; R Kelly Dawe
Journal:  Science       Date:  2021-08-06       Impact factor: 47.728

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.