Literature DB >> 35588743

Mixing genome annotation methods in a comparative analysis inflates the apparent number of lineage-specific genes.

Caroline M Weisman1, Andrew W Murray2, Sean R Eddy3.   

Abstract

Comparisons of genomes of different species are used to identify lineage-specific genes, those genes that appear unique to one species or clade. Lineage-specific genes are often thought to represent genetic novelty that underlies unique adaptations. Identification of these genes depends not only on genome sequences, but also on inferred gene annotations. Comparative analyses typically use available genomes that have been annotated using different methods, increasing the risk that orthologous DNA sequences may be erroneously annotated as a gene in one species but not another, appearing lineage specific as a result. To evaluate the impact of such "annotation heterogeneity," we identified four clades of species with sequenced genomes with more than one publicly available gene annotation, allowing us to compare the number of lineage-specific genes inferred when differing annotation methods are used to those resulting when annotation method is uniform across the clade. In these case studies, annotation heterogeneity increases the apparent number of lineage-specific genes by up to 15-fold, suggesting that annotation heterogeneity is a substantial source of potential artifact.
Copyright © 2022 The Authors. Published by Elsevier Inc. All rights reserved.

Entities:  

Keywords:  genome annotation; lineage-specific genes; novel genes; orphan genes; taxonomically restricted genes

Mesh:

Year:  2022        PMID: 35588743      PMCID: PMC9346927          DOI: 10.1016/j.cub.2022.04.085

Source DB:  PubMed          Journal:  Curr Biol        ISSN: 0960-9822            Impact factor:   10.900


  25 in total

1.  Orphans as taxonomically restricted and ecologically important genes.

Authors:  G A Wilson; N Bertrand; Y Patel; J B Hughes; E J Feil; D Field
Journal:  Microbiology       Date:  2005-08       Impact factor: 2.777

Review 2.  Open questions in the study of de novo genes: what, how and why.

Authors:  Aoife McLysaght; Laurence D Hurst
Journal:  Nat Rev Genet       Date:  2016-07-25       Impact factor: 53.242

3.  WormBase: Annotating many nematode genomes.

Authors:  Kevin Howe; Paul Davis; Michael Paulini; Mary Ann Tuli; Gary Williams; Karen Yook; Richard Durbin; Paul Kersey; Paul W Sternberg
Journal:  Worm       Date:  2012-01-01

4.  Phylogenetic patterns of emergence of new genes support a model of frequent de novo evolution.

Authors:  Rafik Neme; Diethard Tautz
Journal:  BMC Genomics       Date:  2013-02-21       Impact factor: 3.969

5.  Proto-genes and de novo gene birth.

Authors:  Anne-Ruxandra Carvunis; Thomas Rolland; Ilan Wapinski; Michael A Calderwood; Muhammed A Yildirim; Nicolas Simonis; Benoit Charloteaux; César A Hidalgo; Justin Barbette; Balaji Santhanam; Gloria A Brar; Jonathan S Weissman; Aviv Regev; Nicolas Thierry-Mieg; Michael E Cusick; Marc Vidal
Journal:  Nature       Date:  2012-07-19       Impact factor: 49.962

6.  VectorBase: an updated bioinformatics resource for invertebrate vectors and other organisms related with human diseases.

Authors:  Gloria I Giraldo-Calderón; Scott J Emrich; Robert M MacCallum; Gareth Maslen; Emmanuel Dialynas; Pantelis Topalis; Nicholas Ho; Sandra Gesing; Gregory Madey; Frank H Collins; Daniel Lawson
Journal:  Nucleic Acids Res       Date:  2014-12-15       Impact factor: 16.971

7.  Young Genes are Highly Disordered as Predicted by the Preadaptation Hypothesis of De Novo Gene Birth.

Authors:  Benjamin A Wilson; Scott G Foy; Rafik Neme; Joanna Masel
Journal:  Nat Ecol Evol       Date:  2017-04-24       Impact factor: 15.460

8.  De novo gene birth.

Authors:  Stephen Branden Van Oss; Anne-Ruxandra Carvunis
Journal:  PLoS Genet       Date:  2019-05-23       Impact factor: 5.917

9.  Many, but not all, lineage-specific genes can be explained by homology detection failure.

Authors:  Caroline M Weisman; Andrew W Murray; Sean R Eddy
Journal:  PLoS Biol       Date:  2020-11-02       Impact factor: 8.029

Review 10.  New genes from non-coding sequence: the role of de novo protein-coding genes in eukaryotic evolutionary innovation.

Authors:  Aoife McLysaght; Daniele Guerzoni
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2015-09-26       Impact factor: 6.237

View more
  3 in total

Review 1.  The state of Medusozoa genomics: current evidence and future challenges.

Authors:  Mylena D Santander; Maximiliano M Maronna; Joseph F Ryan; Sónia C S Andrade
Journal:  Gigascience       Date:  2022-05-17       Impact factor: 7.658

2.  Heterologous expression of naturally evolved putative de novo proteins with chaperones.

Authors:  Lars A Eicholt; Margaux Aubel; Katrin Berk; Erich Bornberg-Bauer; Andreas Lange
Journal:  Protein Sci       Date:  2022-08       Impact factor: 6.993

3.  Giant Starship Elements Mobilize Accessory Genes in Fungal Genomes.

Authors:  Emile Gluck-Thaler; Timothy Ralston; Zachary Konkel; Cristhian Grabowski Ocampos; Veena Devi Ganeshan; Anne E Dorrance; Terry L Niblack; Corlett W Wood; Jason C Slot; Horacio D Lopez-Nicora; Aaron A Vogan
Journal:  Mol Biol Evol       Date:  2022-05-03       Impact factor: 8.800

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.