Literature DB >> 20636893

High-grading bias: subtle problems with assessing power of selected subsets of loci for population assignment.

Robin S Waples1.   

Abstract

Recognition of the importance of cross-validation ('any technique or instance of assessing how the results of a statistical analysis will generalize to an independent dataset'; Wiktionary, en.wiktionary.org) is one reason that the U.S. Securities and Exchange Commission requires all investment products to carry some variation of the disclaimer, 'Past performance is no guarantee of future results.' Even a cursory examination of financial behaviour, however, demonstrates that this warning is regularly ignored, even by those who understand what an independent dataset is. In the natural sciences, an analogue to predicting future returns for an investment strategy is predicting power of a particular algorithm to perform with new data. Once again, the key to developing an unbiased assessment of future performance is through testing with independent data--that is, data that were in no way involved in developing the method in the first place. A 'gold-standard' approach to cross-validation is to divide the data into two parts, one used to develop the algorithm, the other used to test its performance. Because this approach substantially reduces the sample size that can be used in constructing the algorithm, researchers often try other variations of cross-validation to accomplish the same ends. As illustrated by Anderson in this issue of Molecular Ecology Resources, however, not all attempts at cross-validation produce the desired result. Anderson used simulated data to evaluate performance of several software programs designed to identify subsets of loci that can be effective for assigning individuals to population of origin based on multilocus genetic data. Such programs are likely to become increasingly popular as researchers seek ways to streamline routine analyses by focusing on small sets of loci that contain most of the desired signal. Anderson found that although some of the programs made an attempt at cross-validation, all failed to meet the 'gold standard' of using truly independent data and therefore produced overly optimistic assessments of power of the selected set of loci--a phenomenon known as 'high grading bias.'

Mesh:

Year:  2010        PMID: 20636893     DOI: 10.1111/j.1365-294X.2010.04675.x

Source DB:  PubMed          Journal:  Mol Ecol        ISSN: 0962-1083            Impact factor:   6.185


  7 in total

1.  Detection of outlier loci and their utility for fisheries management.

Authors:  Michael A Russello; Stephanie L Kirk; Karen K Frazer; Paul J Askey
Journal:  Evol Appl       Date:  2011-09-17       Impact factor: 5.183

2.  Single nucleotide polymorphisms to discriminate different classes of hybrid between wild Atlantic salmon and aquaculture escapees.

Authors:  Victoria L Pritchard; Jaakko Erkinaro; Matthew P Kent; Eero Niemelä; Panu Orell; Sigbjørn Lien; Craig R Primmer
Journal:  Evol Appl       Date:  2016-08-18       Impact factor: 5.183

3.  Accuracy of Assignment of Atlantic Salmon (Salmo salar L.) to Rivers and Regions in Scotland and Northeast England Based on Single Nucleotide Polymorphism (SNP) Markers.

Authors:  John Gilbey; Eef Cauwelier; Mark W Coulson; Lee Stradmeyer; James N Sampayo; Anja Armstrong; Eric Verspoor; Laura Corrigan; Jonathan Shelley; Stuart Middlemas
Journal:  PLoS One       Date:  2016-10-10       Impact factor: 3.240

4.  One species or four? Yes!...and, no. Or, arbitrary assignment of lineages to species obscures the diversification processes of Neotropical fishes.

Authors:  Stuart C Willis
Journal:  PLoS One       Date:  2017-02-24       Impact factor: 3.240

5.  Is the Red Wolf a Listable Unit Under the US Endangered Species Act?

Authors:  Robin S Waples; Roland Kays; Richard J Fredrickson; Krishna Pacifici; L Scott Mills
Journal:  J Hered       Date:  2018-06-27       Impact factor: 2.645

6.  Leveraging genomics to understand threats to migratory birds.

Authors:  Brenda Larison; Alec R Lindsay; Christen Bossu; Michael D Sorenson; Joseph D Kaplan; David C Evers; James Paruk; Jeffrey M DaCosta; Thomas B Smith; Kristen Ruegg
Journal:  Evol Appl       Date:  2021-04-10       Impact factor: 5.183

7.  A SNP assay for assessing diversity in immune genes in the honey bee (Apis mellifera L.).

Authors:  Dora Henriques; Ana R Lopes; Nor Chejanovsky; Anne Dalmon; Mariano Higes; Clara Jabal-Uriel; Yves Le Conte; Maritza Reyes-Carreño; Victoria Soroker; Raquel Martín-Hernández; M Alice Pinto
Journal:  Sci Rep       Date:  2021-07-28       Impact factor: 4.379

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.