Literature DB >> 35689633

Species Tree Estimation and the Impact of Gene Loss Following Whole-Genome Duplication.

Haifeng Xiong1, Danying Wang1, Chen Shao1, Xuchen Yang1, Jialin Yang2, Tao Ma1, Charles C Davis3, Liang Liu2, Zhenxiang Xi1.   

Abstract

Whole-genome duplication (WGD) occurs broadly and repeatedly across the history of eukaryotes and is recognized as a prominent evolutionary force, especially in plants. Immediately following WGD, most genes are present in two copies as paralogs. Due to this redundancy, one copy of a paralog pair commonly undergoes pseudogenization and is eventually lost. When speciation occurs shortly after WGD; however, differential loss of paralogs may lead to spurious phylogenetic inference resulting from the inclusion of pseudoorthologs-paralogous genes mistakenly identified as orthologs because they are present in single copies within each sampled species. The influence and impact of including pseudoorthologs versus true orthologs as a result of gene extinction (or incomplete laboratory sampling) are only recently gaining empirical attention in the phylogenomics community. Moreover, few studies have yet to investigate this phenomenon in an explicit coalescent framework. Here, using mathematical models, numerous simulated data sets, and two newly assembled empirical data sets, we assess the effect of pseudoorthologs on species tree estimation under varying degrees of incomplete lineage sorting (ILS) and differential gene loss scenarios following WGD. When gene loss occurs along the terminal branches of the species tree, alignment-based (BPP) and gene-tree-based (ASTRAL, MP-EST, and STAR) coalescent methods are adversely affected as the degree of ILS increases. This can be greatly improved by sampling a sufficiently large number of genes. Under the same circumstances, however, concatenation methods consistently estimate incorrect species trees as the number of genes increases. Additionally, pseudoorthologs can greatly mislead species tree inference when gene loss occurs along the internal branches of the species tree. Here, both coalescent and concatenation methods yield inconsistent results. These results underscore the importance of understanding the influence of pseudoorthologs in the phylogenomics era. [Coalescent method; concatenation method; incomplete lineage sorting; pseudoorthologs; single-copy gene; whole-genome duplication.].
© The Author(s) 2022. Published by Oxford University Press on behalf of the Society of Systematic Biologists.

Entities:  

Mesh:

Year:  2022        PMID: 35689633      PMCID: PMC9558847          DOI: 10.1093/sysbio/syac040

Source DB:  PubMed          Journal:  Syst Biol        ISSN: 1063-5157            Impact factor:   9.160


  100 in total

1.  Following tetraploidy in an Arabidopsis ancestor, genes were removed preferentially from one homeolog leaving clusters enriched in dose-sensitive genes.

Authors:  Brian C Thomas; Brent Pedersen; Michael Freeling
Journal:  Genome Res       Date:  2006-06-07       Impact factor: 9.043

2.  Phybase: an R package for species tree analysis.

Authors:  Liang Liu; Lili Yu
Journal:  Bioinformatics       Date:  2010-02-15       Impact factor: 6.937

Review 3.  Polyploidy: adaptation to the genomic environment.

Authors:  Jesse D Hollister
Journal:  New Phytol       Date:  2014-07-18       Impact factor: 10.151

4.  PartitionFinder 2: New Methods for Selecting Partitioned Models of Evolution for Molecular and Morphological Phylogenetic Analyses.

Authors:  Robert Lanfear; Paul B Frandsen; April M Wright; Tereza Senfeld; Brett Calcott
Journal:  Mol Biol Evol       Date:  2017-03-01       Impact factor: 16.240

5.  Independent sorting-out of thousands of duplicated gene pairs in two yeast species descended from a whole-genome duplication.

Authors:  Devin R Scannell; A Carolin Frank; Gavin C Conant; Kevin P Byrne; Megan Woolfit; Kenneth H Wolfe
Journal:  Proc Natl Acad Sci U S A       Date:  2007-05-09       Impact factor: 11.205

6.  Identifying single copy orthologs in Metazoa.

Authors:  Christopher J Creevey; Jean Muller; Tobias Doerks; Julie D Thompson; Detlev Arendt; Peer Bork
Journal:  PLoS Comput Biol       Date:  2011-12-01       Impact factor: 4.475

7.  Clade- and species-specific features of genome evolution in the Saccharomycetaceae.

Authors:  Kenneth H Wolfe; David Armisén; Estelle Proux-Wera; Seán S ÓhÉigeartaigh; Haleema Azam; Jonathan L Gordon; Kevin P Byrne
Journal:  FEMS Yeast Res       Date:  2015-06-10       Impact factor: 2.796

8.  One thousand plant transcriptomes and the phylogenomics of green plants.

Authors: 
Journal:  Nature       Date:  2019-10-23       Impact factor: 49.962

9.  The impact of paralogy on phylogenomic studies - a case study on annelid relationships.

Authors:  Torsten H Struck
Journal:  PLoS One       Date:  2013-05-07       Impact factor: 3.240

10.  FastMulRFS: fast and accurate species tree estimation under generic gene duplication and loss models.

Authors:  Erin K Molloy; Tandy Warnow
Journal:  Bioinformatics       Date:  2020-07-01       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.