Robin S Waples1, Eric C Anderson2. 1. NOAA Fisheries, Northwest Fisheries Science Center, 2725 Montlake Blvd. East, Seattle, WA, 98112, USA. 2. NOAA Fisheries, Southwest Fisheries Science Center, 110 McAllister Way, Santa Cruz, CA, 95060, USA.
Abstract
Interest has surged recently in removing siblings from population genetic data sets before conducting downstream analyses. However, even if the pedigree is inferred correctly, this has the potential to do more harm than good. We used computer simulations and empirical samples of coho salmon to evaluate strategies for adjusting samples to account for family structure. We compared performance in full samples and sibling-reduced samples of estimators of allele frequency (P^), population differentiation (F^ST) and effective population size (N^e). RESULTS: (i) unless simulated samples included large family groups together with a component of unrelated individuals, removing siblings generally reduced precision of P^ and F^ST; (ii) N^e based on the linkage disequilibrium method was largely unbiased using full random samples but became increasingly upwardly biased under aggressive purging of siblings. Under nonrandom sampling (some families over-represented), N^e using full samples was downwardly biased; removing just the right 'Goldilocks' fraction of siblings could produce an unbiased estimate, but this sweet spot varied widely among scenarios; (iii) weighting individuals based on the inferred pedigree (to produce a best linear unbiased estimator, BLUE) maximized precision of P^ when the inferred pedigree was correct but performed poorly when the pedigree was wrong; (iv) a variant of sibling removal that leaves intact small sibling groups appears to be more robust to errors in inferences about family structure. Our results illustrate the complex challenges posed by presence of family structure, suggest that no single optimal solution exists and argue for caution in adjusting population genetic data sets for the presence of putative siblings without fully understanding the consequences. Published 2017. This article is a U.S. Government work and is in the public domain in the USA.
Interest has surged recently in removing siblings from population genetic data sets before conducting downstream analyses. However, even if the pedigree is inferred correctly, this has the potential to do more harm than good. We used computer simulations and empirical samples of coho salmon to evaluate strategies for adjusting samples to account for family structure. We compared performance in full samples and sibling-reduced samples of estimators of allele frequency (P^), population differentiation (F^ST) and effective population size (N^e). RESULTS: (i) unless simulated samples included large family groups together with a component of unrelated individuals, removing siblings generally reduced precision of P^ and F^ST; (ii) N^e based on the linkage disequilibrium method was largely unbiased using full random samples but became increasingly upwardly biased under aggressive purging of siblings. Under nonrandom sampling (some families over-represented), N^e using full samples was downwardly biased; removing just the right 'Goldilocks' fraction of siblings could produce an unbiased estimate, but this sweet spot varied widely among scenarios; (iii) weighting individuals based on the inferred pedigree (to produce a best linear unbiased estimator, BLUE) maximized precision of P^ when the inferred pedigree was correct but performed poorly when the pedigree was wrong; (iv) a variant of sibling removal that leaves intact small sibling groups appears to be more robust to errors in inferences about family structure. Our results illustrate the complex challenges posed by presence of family structure, suggest that no single optimal solution exists and argue for caution in adjusting population genetic data sets for the presence of putative siblings without fully understanding the consequences. Published 2017. This article is a U.S. Government work and is in the public domain in the USA.
Entities:
Keywords:
allele frequency; effective population size; family structure; genetic differentiation; precision
Authors: Anna Brüniche-Olsen; R Jorge Urban; Vladimir V Vertyankin; Céline A J Godard-Codding; John W Bickham; J Andrew DeWoody Journal: Biol Lett Date: 2018-10-10 Impact factor: 3.703
Authors: Lúcio M Barbosa; Bruna C Barros; Moreno de Souza Rodrigues; Luciano K Silva; Mitermayer G Reis; Ronald E Blanton Journal: Int J Parasitol Date: 2018-10-24 Impact factor: 3.981
Authors: René D Clark; Matthew L Aardema; Peter Andolfatto; Paul H Barber; Akihisa Hattori; Jennifer A Hoey; Humberto R Montes; Malin L Pinsky Journal: Proc Biol Sci Date: 2021-06-09 Impact factor: 5.530
Authors: Jurjan P van der Zee; Marjolijn J A Christianen; Martine Bérubé; Mabel Nava; Sietske van der Wal; Jessica Berkel; Tadzio Bervoets; Melanie Meijer Zu Schlochtern; Leontine E Becking; Per J Palsbøll Journal: Mol Ecol Date: 2021-12-14 Impact factor: 6.622
Authors: Cristian B Canales-Aguirre; Lisa W Seeb; James E Seeb; María I Cádiz; Selim S Musleh; Ivan Arismendi; Gonzalo Gajardo; Ricardo Galleguillos; Daniel Gomez-Uchida Journal: Ecol Evol Date: 2017-11-28 Impact factor: 2.912
Authors: Emma V A Sylvester; Paul Bentzen; Ian R Bradbury; Marie Clément; Jon Pearce; John Horne; Robert G Beiko Journal: Evol Appl Date: 2017-09-14 Impact factor: 5.183
Authors: David J X Tan; Balaji Chattopadhyay; Kritika M Garg; Emilie Cros; Per G P Ericson; Martin Irestedt; Frank E Rheindt Journal: Sci Rep Date: 2018-08-24 Impact factor: 4.379
Authors: Brendan F Wringe; Nicholas W Jeffery; Ryan R E Stanley; Lorraine C Hamilton; Eric C Anderson; Ian A Fleming; Carole Grant; J Brian Dempson; Geoff Veinott; Steven J Duffy; Ian R Bradbury Journal: Commun Biol Date: 2018-08-09