Literature DB >> 34710714

Ancestry may confound genetic machine learning: Candidate-gene prediction of opioid use disorder as an example.

Alexander S Hatoum1, Frank R Wendt2, Marco Galimberti2, Renato Polimanti3, Benjamin Neale4, Henry R Kranzler5, Joel Gelernter6, Howard J Edenberg7, Arpana Agrawal8.   

Abstract

BACKGROUND: Machine learning (ML) models are beginning to proliferate in psychiatry, however machine learning models in psychiatric genetics have not always accounted for ancestry. Using an empirical example of a proposed genetic test for OUD, and exploring a similar test for tobacco dependence and a simulated binary phenotype, we show that genetic prediction using ML is vulnerable to ancestral confounding.
METHODS: We utilize five ML algorithms trained with 16 brain reward-derived "candidate" SNPs proposed for commercial use and examine their ability to predict OUD vs. ancestry in an out-of-sample test set (N = 1000, stratified into equal groups of n = 250 cases and controls each of European and African ancestry). We rerun analyses with 8 random sets of allele-frequency matched SNPs. We contrast findings with 11 genome-wide significant variants for tobacco smoking. To document generalizability, we generate and test a random phenotype.
RESULTS: None of the 5 ML algorithms predict OUD better than chance when ancestry was balanced but were confounded with ancestry in an out-of-sample test. In addition, the algorithms preferentially predicted admixed subpopulations. Random sets of variants matched to the candidate SNPs by allele frequency produced similar bias. Genome-wide significant tobacco smoking variants were also confounded by ancestry. Finally, random SNPs predicting a random simulated phenotype show that the bias attributable to ancestral confounding could impact any ML-based genetic prediction.
CONCLUSIONS: Researchers and clinicians are encouraged to be skeptical of claims of high prediction accuracy from ML-derived genetic algorithms for polygenic traits like addiction, particularly when using candidate variants.
Copyright © 2021 Elsevier B.V. All rights reserved.

Entities:  

Keywords:  Algorithmic bias; Ancestry; Candidate genes; Machine learning; Opioid use disorder

Mesh:

Year:  2021        PMID: 34710714      PMCID: PMC9358969          DOI: 10.1016/j.drugalcdep.2021.109115

Source DB:  PubMed          Journal:  Drug Alcohol Depend        ISSN: 0376-8716            Impact factor:   4.852


  32 in total

1.  Genetic Variant in CHRNA5 and Response to Varenicline and Combination Nicotine Replacement in a Randomized Placebo-Controlled Trial.

Authors:  Li-Shiun Chen; Timothy B Baker; J Philip Miller; Michael Bray; Nina Smock; Jingling Chen; Faith Stoneking; Robert C Culverhouse; Nancy L Saccone; Christopher I Amos; Robert M Carney; Douglas E Jorenby; Laura J Bierut
Journal:  Clin Pharmacol Ther       Date:  2020-08-04       Impact factor: 6.875

2.  No Evidence That Schizophrenia Candidate Genes Are More Associated With Schizophrenia Than Noncandidate Genes.

Authors:  Emma C Johnson; Richard Border; Whitney E Melroy-Greif; Christiaan A de Leeuw; Marissa A Ehringer; Matthew C Keller
Journal:  Biol Psychiatry       Date:  2017-07-13       Impact factor: 13.382

Review 3.  Polygenic Risk Scores in Clinical Psychology: Bridging Genomic Risk to Individual Differences.

Authors:  Ryan Bogdan; David A A Baranger; Arpana Agrawal
Journal:  Annu Rev Clin Psychol       Date:  2018-05-07       Impact factor: 18.561

4.  Dissecting racial bias in an algorithm used to manage the health of populations.

Authors:  Ziad Obermeyer; Brian Powers; Christine Vogeli; Sendhil Mullainathan
Journal:  Science       Date:  2019-10-25       Impact factor: 47.728

Review 5.  A critical review of the first 10 years of candidate gene-by-environment interaction research in psychiatry.

Authors:  Laramie E Duncan; Matthew C Keller
Journal:  Am J Psychiatry       Date:  2011-09-02       Impact factor: 18.112

6.  The Fagerström Test for Nicotine Dependence: a revision of the Fagerström Tolerance Questionnaire.

Authors:  T F Heatherton; L T Kozlowski; R C Frecker; K O Fagerström
Journal:  Br J Addict       Date:  1991-09

Review 7.  Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD): the TRIPOD Statement.

Authors:  G S Collins; J B Reitsma; D G Altman; K G M Moons
Journal:  Br J Surg       Date:  2015-02       Impact factor: 6.939

8.  Population structure and eigenanalysis.

Authors:  Nick Patterson; Alkes L Price; David Reich
Journal:  PLoS Genet       Date:  2006-12       Impact factor: 5.917

9.  A global reference for human genetic variation.

Authors:  Adam Auton; Lisa D Brooks; Richard M Durbin; Erik P Garrison; Hyun Min Kang; Jan O Korbel; Jonathan L Marchini; Shane McCarthy; Gil A McVean; Gonçalo R Abecasis
Journal:  Nature       Date:  2015-10-01       Impact factor: 49.962

10.  The mutational constraint spectrum quantified from variation in 141,456 humans.

Authors:  Konrad J Karczewski; Laurent C Francioli; Grace Tiao; Beryl B Cummings; Jessica Alföldi; Qingbo Wang; Ryan L Collins; Kristen M Laricchia; Andrea Ganna; Daniel P Birnbaum; Laura D Gauthier; Harrison Brand; Matthew Solomonson; Nicholas A Watts; Daniel Rhodes; Moriel Singer-Berk; Eleina M England; Eleanor G Seaby; Jack A Kosmicki; Raymond K Walters; Katherine Tashman; Yossi Farjoun; Eric Banks; Timothy Poterba; Arcturus Wang; Cotton Seed; Nicola Whiffin; Jessica X Chong; Kaitlin E Samocha; Emma Pierce-Hoffman; Zachary Zappala; Anne H O'Donnell-Luria; Eric Vallabh Minikel; Ben Weisburd; Monkol Lek; James S Ware; Christopher Vittal; Irina M Armean; Louis Bergelson; Kristian Cibulskis; Kristen M Connolly; Miguel Covarrubias; Stacey Donnelly; Steven Ferriera; Stacey Gabriel; Jeff Gentry; Namrata Gupta; Thibault Jeandet; Diane Kaplan; Christopher Llanwarne; Ruchi Munshi; Sam Novod; Nikelle Petrillo; David Roazen; Valentin Ruano-Rubio; Andrea Saltzman; Molly Schleicher; Jose Soto; Kathleen Tibbetts; Charlotte Tolonen; Gordon Wade; Michael E Talkowski; Benjamin M Neale; Mark J Daly; Daniel G MacArthur
Journal:  Nature       Date:  2020-05-27       Impact factor: 69.504

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.