Literature DB >> 26315912

Hierarchical boosting: a machine-learning framework to detect and classify hard selective sweeps in human populations.

Marc Pybus1, Pierre Luisi2, Giovanni Marco Dall'Olio3, Manu Uzkudun1, Hafid Laayouni4, Jaume Bertranpetit1, Johannes Engelken1.   

Abstract

MOTIVATION: Detecting positive selection in genomic regions is a recurrent topic in natural population genetic studies. However, there is little consistency among the regions detected in several genome-wide scans using different tests and/or populations. Furthermore, few methods address the challenge of classifying selective events according to specific features such as age, intensity or state (completeness).
RESULTS: We have developed a machine-learning classification framework that exploits the combined ability of some selection tests to uncover different polymorphism features expected under the hard sweep model, while controlling for population-specific demography. As a result, we achieve high sensitivity toward hard selective sweeps while adding insights about their completeness (whether a selected variant is fixed or not) and age of onset. Our method also determines the relevance of the individual methods implemented so far to detect positive selection under specific selective scenarios. We calibrated and applied the method to three reference human populations from The 1000 Genome Project to generate a genome-wide classification map of hard selective sweeps. This study improves detection of selective sweep by overcoming the classical selection versus no-selection classification strategy, and offers an explanation to the lack of consistency observed among selection tests when applied to real data. Very few signals were observed in the African population studied, while our method presents higher sensitivity in this population demography.
AVAILABILITY AND IMPLEMENTATION: The genome-wide results for three human populations from The 1000 Genomes Project and an R-package implementing the 'Hierarchical Boosting' framework are available at http://hsb.upf.edu/.
© The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Year:  2015        PMID: 26315912     DOI: 10.1093/bioinformatics/btv493

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  37 in total

1.  Genomic Signatures of Selective Pressures and Introgression from Archaic Hominins at Human Innate Immunity Genes.

Authors:  Matthieu Deschamps; Guillaume Laval; Maud Fagny; Yuval Itan; Laurent Abel; Jean-Laurent Casanova; Etienne Patin; Lluis Quintana-Murci
Journal:  Am J Hum Genet       Date:  2016-01-07       Impact factor: 11.025

2.  Detection and Classification of Hard and Soft Sweeps from Unphased Genotypes by Multilocus Genotype Identity.

Authors:  Alexandre M Harris; Nandita R Garud; Michael DeGiorgio
Journal:  Genetics       Date:  2018-10-12       Impact factor: 4.562

3.  Genomic analysis of Andamanese provides insights into ancient human migration into Asia and adaptation.

Authors:  Mayukh Mondal; Ferran Casals; Tina Xu; Giovanni M Dall'Olio; Marc Pybus; Mihai G Netea; David Comas; Hafid Laayouni; Qibin Li; Partha P Majumder; Jaume Bertranpetit
Journal:  Nat Genet       Date:  2016-07-25       Impact factor: 38.330

4.  Background Selection Does Not Mimic the Patterns of Genetic Diversity Produced by Selective Sweeps.

Authors:  Daniel R Schrider
Journal:  Genetics       Date:  2020-08-26       Impact factor: 4.562

5.  Detecting Recent Positive Selection with a Single Locus Test Bipartitioning the Coalescent Tree.

Authors:  Zongfeng Yang; Junrui Li; Thomas Wiehe; Haipeng Li
Journal:  Genetics       Date:  2017-12-07       Impact factor: 4.562

6.  Evaluating hierarchical machine learning approaches to classify biological databases.

Authors:  Pâmela M Rezende; Joicymara S Xavier; David B Ascher; Gabriel R Fernandes; Douglas E V Pires
Journal:  Brief Bioinform       Date:  2022-07-18       Impact factor: 13.994

7.  Sporadic occurrence of recent selective sweeps from standing variation in humans as revealed by an approximate Bayesian computation approach.

Authors:  Guillaume Laval; Etienne Patin; Pierre Boutillier; Lluis Quintana-Murci
Journal:  Genetics       Date:  2021-12-10       Impact factor: 4.402

8.  Variances and covariances of linear summary statistics of segregating sites.

Authors:  Yun-Xin Fu
Journal:  Theor Popul Biol       Date:  2022-04-04       Impact factor: 1.514

Review 9.  Human adaptation over the past 40,000 years.

Authors:  Iain Mathieson
Journal:  Curr Opin Genet Dev       Date:  2020-08-01       Impact factor: 5.578

10.  Genomic selection signatures in autism spectrum disorder identifies cognitive genomic tradeoff and its relevance in paradoxical phenotypes of deficits versus potentialities.

Authors:  Anil Prakash; Moinak Banerjee
Journal:  Sci Rep       Date:  2021-05-13       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.