Literature DB >> 30659755

Minor allele frequency thresholds strongly affect population structure inference with genomic data sets.

Ethan Linck1, C J Battey2.   

Abstract

A common method of minimizing errors in large DNA sequence data sets is to drop variable sites with a minor allele frequency (MAF) below some specified threshold. Although widespread, this procedure has the potential to alter downstream population genetic inferences and has received relatively little rigorous analysis. Here we use simulations and an empirical single nucleotide polymorphism data set to demonstrate the impacts of MAF thresholds on inference of population structure-often the first step in analysis of population genomic data. We find that model-based inference of population structure is confounded when singletons are included in the alignment, and that both model-based and multivariate analyses infer less distinct clusters when more stringent MAF cutoffs are applied. We propose that this behaviour is caused by the combination of a drop in the total size of the data matrix and by correlations between allele frequencies and mutational age. We recommend a set of best practices for applying MAF filters in studies seeking to describe population structure with genomic data.
© 2019 John Wiley & Sons Ltd.

Keywords:  zzm321990structurezzm321990; minor allele frequency; population genetic structure; principal components analysis

Mesh:

Year:  2019        PMID: 30659755     DOI: 10.1111/1755-0998.12995

Source DB:  PubMed          Journal:  Mol Ecol Resour        ISSN: 1755-098X            Impact factor:   7.090


  52 in total

Review 1.  Opportunities and challenges of macrogenetic studies.

Authors:  Deborah M Leigh; Charles B van Rees; Katie L Millette; Martin F Breed; Chloé Schmidt; Laura D Bertola; Brian K Hand; Margaret E Hunter; Evelyn L Jensen; Francine Kershaw; Libby Liggins; Gordon Luikart; Stéphanie Manel; Joachim Mergeay; Joshua M Miller; Gernot Segelbacher; Sean Hoban; Ivan Paz-Vinas
Journal:  Nat Rev Genet       Date:  2021-08-18       Impact factor: 53.242

2.  The population genomic structure of green turtles (Chelonia mydas) suggests a warm-water corridor for tropical marine fauna between the Atlantic and Indian oceans during the last interglacial.

Authors:  Jurjan P van der Zee; Marjolijn J A Christianen; Martine Bérubé; Mabel Nava; Kaj Schut; Frances Humber; Alonzo Alfaro-Núñez; Leontine E Becking; Per J Palsbøll
Journal:  Heredity (Edinb)       Date:  2021-10-11       Impact factor: 3.821

3.  Contrasting levels of hybridization across the two contact zones between two hedgehog species revealed by genome-wide SNP data.

Authors:  Pavel Hulva; Barbora Černá Bolfíková; Kristýna Eliášová; J Ignacio Lucas Lledó; José Horacio Grau; Miroslava Loudová; Anna A Bannikova; Katerina I Zolotareva; Vladimír Beneš
Journal:  Heredity (Edinb)       Date:  2022-10-13       Impact factor: 3.832

4.  In silico identification of the rare-coding pathogenic mutations and structural modeling of human NNAT gene associated with anorexia nervosa.

Authors:  Muhammad Bilal Azmi; Unaiza Naeem; Arisha Saleem; Areesha Jawed; Haroon Usman; Shamim Akhtar Qureshi; M Kamran Azim
Journal:  Eat Weight Disord       Date:  2022-06-02       Impact factor: 3.008

5.  Genomic analyses reveal three independent introductions of the invasive brown rat (Rattus norvegicus) to the Faroe Islands.

Authors:  Emily E Puckett; Eyðfinn Magnussen; Liudmila A Khlyap; Tanja M Strand; Åke Lundkvist; Jason Munshi-South
Journal:  Heredity (Edinb)       Date:  2019-08-09       Impact factor: 3.821

6.  Ecological basis and genetic architecture of crypsis polymorphism in the desert clicker grasshopper (Ligurotettix coquilletti).

Authors:  Timothy K O'Connor; Marissa C Sandoval; Jiarui Wang; Jacob C Hans; Risa Takenaka; Myron Child; Noah K Whiteman
Journal:  Evolution       Date:  2021-08-17       Impact factor: 3.694

7.  Genetic diversity and selection signatures in maize landraces compared across 50 years of in situ and ex situ conservation.

Authors:  Francis Denisse McLean-Rodríguez; Denise Elston Costich; Tania Carolina Camacho-Villa; Mario Enrico Pè; Matteo Dell'Acqua
Journal:  Heredity (Edinb)       Date:  2021-03-30       Impact factor: 3.821

8.  Genomic data support management of anadromous Arctic Char fisheries in Nunavik by highlighting neutral and putatively adaptive genetic variation.

Authors:  Xavier Dallaire; Éric Normandeau; Julien Mainguy; Jean-Éric Tremblay; Louis Bernatchez; Jean-Sébastien Moore
Journal:  Evol Appl       Date:  2021-05-27       Impact factor: 5.183

9.  Spatial population genetics in heavily managed species: Separating patterns of historical translocation from contemporary gene flow in white-tailed deer.

Authors:  Tyler K Chafin; Zachery D Zbinden; Marlis R Douglas; Bradley T Martin; Christopher R Middaugh; M Cory Gray; Jennifer R Ballard; Michael E Douglas
Journal:  Evol Appl       Date:  2021-05-04       Impact factor: 5.183

10.  A spectral theory for Wright's inbreeding coefficients and related quantities.

Authors:  Olivier François; Clément Gain
Journal:  PLoS Genet       Date:  2021-07-19       Impact factor: 5.917

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.