Literature DB >> 27601374

pcadapt: an R package to perform genome scans for selection based on principal component analysis.

Keurcien Luu1, Eric Bazin2, Michael G B Blum1.   

Abstract

The R package pcadapt performs genome scans to detect genes under selection based on population genomic data. It assumes that candidate markers are outliers with respect to how they are related to population structure. Because population structure is ascertained with principal component analysis, the package is fast and works with large-scale data. It can handle missing data and pooled sequencing data. By contrast to population-based approaches, the package handle admixed individuals and does not require grouping individuals into populations. Since its first release, pcadapt has evolved in terms of both statistical approach and software implementation. We present results obtained with robust Mahalanobis distance, which is a new statistic for genome scans available in the 2.0 and later versions of the package. When hierarchical population structure occurs, Mahalanobis distance is more powerful than the communality statistic that was implemented in the first version of the package. Using simulated data, we compare pcadapt to other computer programs for genome scans (BayeScan, hapflk, OutFLANK, sNMF). We find that the proportion of false discoveries is around a nominal false discovery rate set at 10% with the exception of BayeScan that generates 40% of false discoveries. We also find that the power of BayeScan is severely impacted by the presence of admixed individuals whereas pcadapt is not impacted. Last, we find that pcadapt and hapflk are the most powerful in scenarios of population divergence and range expansion. Because pcadapt handles next-generation sequencing data, it is a valuable tool for data analysis in molecular ecology.
© 2016 John Wiley & Sons Ltd.

Entities:  

Keywords:  Mahalanobis distance; R package; outlier detection; population genetics; principal component analysis

Mesh:

Year:  2016        PMID: 27601374     DOI: 10.1111/1755-0998.12592

Source DB:  PubMed          Journal:  Mol Ecol Resour        ISSN: 1755-098X            Impact factor:   7.090


  165 in total

1.  Rare genetic variation and balanced polymorphisms are important for survival in global change conditions.

Authors:  Reid S Brennan; April D Garrett; Kaitlin E Huber; Heidi Hargarten; Melissa H Pespeni
Journal:  Proc Biol Sci       Date:  2019-06-12       Impact factor: 5.349

2.  Introduction to Population Genomics Methods.

Authors:  Thibault Leroy; Quentin Rougemont
Journal:  Methods Mol Biol       Date:  2021

3.  Detecting Adaptive Differentiation in Structured Populations with Genomic Data and Common Gardens.

Authors:  Emily B Josephs; Jeremy J Berg; Jeffrey Ross-Ibarra; Graham Coop
Journal:  Genetics       Date:  2019-01-24       Impact factor: 4.562

4.  Inferring Population Structure and Admixture Proportions in Low-Depth NGS Data.

Authors:  Jonas Meisner; Anders Albrechtsen
Journal:  Genetics       Date:  2018-08-21       Impact factor: 4.562

5.  Detecting Selection from Linked Sites Using an F-Model.

Authors:  Marco Galimberti; Christoph Leuenberger; Beat Wolf; Sándor Miklós Szilágyi; Matthieu Foll; Daniel Wegmann
Journal:  Genetics       Date:  2020-10-16       Impact factor: 4.562

6.  The Genetic Legacy of the Indian Ocean Slave Trade: Recent Admixture and Post-admixture Selection in the Makranis of Pakistan.

Authors:  Romuald Laso-Jadart; Christine Harmant; Hélène Quach; Nora Zidane; Chris Tyler-Smith; Qasim Mehdi; Qasim Ayub; Lluis Quintana-Murci; Etienne Patin
Journal:  Am J Hum Genet       Date:  2017-11-09       Impact factor: 11.025

7.  Small population size and low genomic diversity have no effect on fitness in experimental translocations of a wild fish.

Authors:  M C Yates; E Bowles; D J Fraser
Journal:  Proc Biol Sci       Date:  2019-11-27       Impact factor: 5.349

8.  Climate and Urbanization Drive Mosquito Preference for Humans.

Authors:  Noah H Rose; Massamba Sylla; Athanase Badolo; Joel Lutomiah; Diego Ayala; Ogechukwu B Aribodor; Nnenna Ibe; Jewelna Akorli; Sampson Otoo; John-Paul Mutebi; Alexis L Kriete; Eliza G Ewing; Rosemary Sang; Andrea Gloria-Soria; Jeffrey R Powell; Rachel E Baker; Bradley J White; Jacob E Crawford; Carolyn S McBride
Journal:  Curr Biol       Date:  2020-07-23       Impact factor: 10.834

9.  Strong trans-Pacific break and local conservation units in the Galapagos shark (Carcharhinus galapagensis) revealed by genome-wide cytonuclear markers.

Authors:  Diana A Pazmiño; Gregory E Maes; Madeline E Green; Colin A Simpfendorfer; E Mauricio Hoyos-Padilla; Clinton J A Duffy; Carl G Meyer; Sven E Kerwath; Pelayo Salinas-de-León; Lynne van Herwerden
Journal:  Heredity (Edinb)       Date:  2018-01-11       Impact factor: 3.821

10.  Efficient analysis of large-scale genome-wide data with two R packages: bigstatsr and bigsnpr.

Authors:  Florian Privé; Hugues Aschard; Andrey Ziyatdinov; Michael G B Blum
Journal:  Bioinformatics       Date:  2018-08-15       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.