Literature DB >> 17339634

Multiple hypothesis testing to detect lineages under positive selection that affects only a few sites.

Maria Anisimova1, Ziheng Yang.   

Abstract

Detection of positive Darwinian selection has become ever more important with the rapid growth of genomic data sets. Recent branch-site models of codon substitution account for variation of selective pressure over branches on the tree and across sites in the sequence and provide a means to detect short episodes of molecular adaptation affecting just a few sites. In likelihood ratio tests based on such models, the branches to be tested for positive selection have to be specified a priori. In the absence of a biological hypothesis to designate so-called foreground branches, one may test many branches, but a correction for multiple testing becomes necessary. In this paper, we employ computer simulation to evaluate the performance of 6 multiple test correction procedures when the branch-site models are used to test every branch on the phylogeny for positive selection. Four of the methods control the familywise error rates (FWERs), whereas the other 2 control the false discovery rate (FDR). We found that all correction procedures achieved acceptable FWER except for extremely divergent sequences and serious model violations, when the test may become unreliable. The power of the test to detect positive selection is influenced by the strength of selection and the sequence divergence, with the highest power observed at intermediate divergences. The 4 correction procedures that control the FWER had similar power. We recommend Rom's procedure for its slightly higher power, but the simple Bonferroni correction is useable as well. The 2 correction procedures that control the FDR had slightly more power and also higher FWER. We demonstrate the multiple test procedures by analyzing gene sequences from the extracellular domain of the cluster of differentiation 2 (CD2) gene from 10 mammalian species. Both our simulation and real data analysis suggest that the multiple test procedures are useful when multiple branches have to be tested on the same data set.

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 17339634     DOI: 10.1093/molbev/msm042

Source DB:  PubMed          Journal:  Mol Biol Evol        ISSN: 0737-4038            Impact factor:   16.240


  110 in total

1.  Evolution of the Cinnamyl/Sinapyl Alcohol Dehydrogenase (CAD/SAD) gene family: the emergence of real lignin is associated with the origin of Bona Fide CAD.

Authors:  Dong-Mei Guo; Jin-Hua Ran; Xiao-Quan Wang
Journal:  J Mol Evol       Date:  2010-08-19       Impact factor: 2.395

2.  Large distribution and high sequence identity of a Copia-type retrotransposon in angiosperm families.

Authors:  Elaine Silva Dias; Clémence Hatt; Serge Hamon; Perla Hamon; Michel Rigoreau; Dominique Crouzillat; Claudia Marcia Aparecida Carareto; Alexandre de Kochko; Romain Guyot
Journal:  Plant Mol Biol       Date:  2015-08-06       Impact factor: 4.076

Review 3.  Models of coding sequence evolution.

Authors:  Wayne Delport; Konrad Scheffler; Cathal Seoighe
Journal:  Brief Bioinform       Date:  2008-10-29       Impact factor: 11.622

4.  Pervasive positive selection on duplicated and nonduplicated vertebrate protein coding genes.

Authors:  Romain A Studer; Simon Penel; Laurent Duret; Marc Robinson-Rechavi
Journal:  Genome Res       Date:  2008-06-18       Impact factor: 9.043

5.  Patterns of molecular evolution of the germ line specification gene oskar suggest that a novel domain may contribute to functional divergence in Drosophila.

Authors:  Abha Ahuja; Cassandra G Extavour
Journal:  Dev Genes Evol       Date:  2014-01-10       Impact factor: 0.900

6.  A random effects branch-site model for detecting episodic diversifying selection.

Authors:  Sergei L Kosakovsky Pond; Ben Murrell; Mathieu Fourment; Simon D W Frost; Wayne Delport; Konrad Scheffler
Journal:  Mol Biol Evol       Date:  2011-06-13       Impact factor: 16.240

Review 7.  Statistics and truth in phylogenomics.

Authors:  Sudhir Kumar; Alan J Filipski; Fabia U Battistuzzi; Sergei L Kosakovsky Pond; Koichiro Tamura
Journal:  Mol Biol Evol       Date:  2011-08-26       Impact factor: 16.240

8.  Peptide vocabulary analysis reveals ultra-conservation and homonymity in protein sequences.

Authors:  Derek Gatherer
Journal:  Bioinform Biol Insights       Date:  2009-11-24

9.  Molecular evolution and functional diversification of fatty acid desaturases after recurrent gene duplication in Drosophila.

Authors:  Shu Fang; Chau-Ti Ting; Cheng-Ruei Lee; Kuang-Hsi Chu; Chuan-Chan Wang; Shun-Chern Tsaur
Journal:  Mol Biol Evol       Date:  2009-03-23       Impact factor: 16.240

10.  FoxO gene family evolution in vertebrates.

Authors:  Minghui Wang; Xiangzhe Zhang; Hongbo Zhao; Qishan Wang; Yuchun Pan
Journal:  BMC Evol Biol       Date:  2009-09-07       Impact factor: 3.260

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.