Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Ancestry may confound genetic machine learning: Candidate-gene prediction of opioid use disorder as an example.

Literature DB >> 34710714

Ancestry may confound genetic machine learning: Candidate-gene prediction of opioid use disorder as an example.

Alexander S Hatoum¹, Frank R Wendt², Marco Galimberti², Renato Polimanti³, Benjamin Neale⁴, Henry R Kranzler⁵, Joel Gelernter⁶, Howard J Edenberg⁷, Arpana Agrawal⁸.

Abstract

BACKGROUND: Machine learning (ML) models are beginning to proliferate in psychiatry, however machine learning models in psychiatric genetics have not always accounted for ancestry. Using an empirical example of a proposed genetic test for OUD, and exploring a similar test for tobacco dependence and a simulated binary phenotype, we show that genetic prediction using ML is vulnerable to ancestral confounding.
METHODS: We utilize five ML algorithms trained with 16 brain reward-derived "candidate" SNPs proposed for commercial use and examine their ability to predict OUD vs. ancestry in an out-of-sample test set (N = 1000, stratified into equal groups of n = 250 cases and controls each of European and African ancestry). We rerun analyses with 8 random sets of allele-frequency matched SNPs. We contrast findings with 11 genome-wide significant variants for tobacco smoking. To document generalizability, we generate and test a random phenotype.
RESULTS: None of the 5 ML algorithms predict OUD better than chance when ancestry was balanced but were confounded with ancestry in an out-of-sample test. In addition, the algorithms preferentially predicted admixed subpopulations. Random sets of variants matched to the candidate SNPs by allele frequency produced similar bias. Genome-wide significant tobacco smoking variants were also confounded by ancestry. Finally, random SNPs predicting a random simulated phenotype show that the bias attributable to ancestral confounding could impact any ML-based genetic prediction.
CONCLUSIONS: Researchers and clinicians are encouraged to be skeptical of claims of high prediction accuracy from ML-derived genetic algorithms for polygenic traits like addiction, particularly when using candidate variants.

Entities: Chemical

Keywords: Algorithmic bias; Ancestry; Candidate genes; Machine learning; Opioid use disorder

Mesh：

Year: 2021 PMID： 34710714 PMCID： PMC9358969 DOI： 10.1016/j.drugalcdep.2021.109115

Source DB: PubMed Journal: Drug Alcohol Depend ISSN： 0376-8716 Impact factor: 4.852

Keyword Cloud
References

32 in total

1. Genetic Variant in CHRNA5 and Response to Varenicline and Combination Nicotine Replacement in a Randomized Placebo-Controlled Trial.

Authors: Li-Shiun Chen; Timothy B Baker; J Philip Miller; Michael Bray; Nina Smock; Jingling Chen; Faith Stoneking; Robert C Culverhouse; Nancy L Saccone; Christopher I Amos; Robert M Carney; Douglas E Jorenby; Laura J Bierut
Journal: Clin Pharmacol Ther Date: 2020-08-04 Impact factor: 6.875

2. No Evidence That Schizophrenia Candidate Genes Are More Associated With Schizophrenia Than Noncandidate Genes.

Authors: Emma C Johnson; Richard Border; Whitney E Melroy-Greif; Christiaan A de Leeuw; Marissa A Ehringer; Matthew C Keller
Journal: Biol Psychiatry Date: 2017-07-13 Impact factor: 13.382

Review 3. Polygenic Risk Scores in Clinical Psychology: Bridging Genomic Risk to Individual Differences.

Authors: Ryan Bogdan; David A A Baranger; Arpana Agrawal
Journal: Annu Rev Clin Psychol Date: 2018-05-07 Impact factor: 18.561

4. Dissecting racial bias in an algorithm used to manage the health of populations.

Authors: Ziad Obermeyer; Brian Powers; Christine Vogeli; Sendhil Mullainathan
Journal: Science Date: 2019-10-25 Impact factor: 47.728

Review 5. A critical review of the first 10 years of candidate gene-by-environment interaction research in psychiatry.

Authors: Laramie E Duncan; Matthew C Keller
Journal: Am J Psychiatry Date: 2011-09-02 Impact factor: 18.112

6. The Fagerström Test for Nicotine Dependence: a revision of the Fagerström Tolerance Questionnaire.

Authors: T F Heatherton; L T Kozlowski; R C Frecker; K O Fagerström
Journal: Br J Addict Date: 1991-09

Review 7. Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD): the TRIPOD Statement.

Authors: G S Collins; J B Reitsma; D G Altman; K G M Moons
Journal: Br J Surg Date: 2015-02 Impact factor: 6.939

8. Population structure and eigenanalysis.

Authors: Nick Patterson; Alkes L Price; David Reich
Journal: PLoS Genet Date: 2006-12 Impact factor: 5.917

9. A global reference for human genetic variation.

Authors: Adam Auton; Lisa D Brooks; Richard M Durbin; Erik P Garrison; Hyun Min Kang; Jan O Korbel; Jonathan L Marchini; Shane McCarthy; Gil A McVean; Gonçalo R Abecasis
Journal: Nature Date: 2015-10-01 Impact factor: 49.962

10. The mutational constraint spectrum quantified from variation in 141,456 humans.

Authors: Konrad J Karczewski; Laurent C Francioli; Grace Tiao; Beryl B Cummings; Jessica Alföldi; Qingbo Wang; Ryan L Collins; Kristen M Laricchia; Andrea Ganna; Daniel P Birnbaum; Laura D Gauthier; Harrison Brand; Matthew Solomonson; Nicholas A Watts; Daniel Rhodes; Moriel Singer-Berk; Eleina M England; Eleanor G Seaby; Jack A Kosmicki; Raymond K Walters; Katherine Tashman; Yossi Farjoun; Eric Banks; Timothy Poterba; Arcturus Wang; Cotton Seed; Nicola Whiffin; Jessica X Chong; Kaitlin E Samocha; Emma Pierce-Hoffman; Zachary Zappala; Anne H O'Donnell-Luria; Eric Vallabh Minikel; Ben Weisburd; Monkol Lek; James S Ware; Christopher Vittal; Irina M Armean; Louis Bergelson; Kristian Cibulskis; Kristen M Connolly; Miguel Covarrubias; Stacey Donnelly; Steven Ferriera; Stacey Gabriel; Jeff Gentry; Namrata Gupta; Thibault Jeandet; Diane Kaplan; Christopher Llanwarne; Ruchi Munshi; Sam Novod; Nikelle Petrillo; David Roazen; Valentin Ruano-Rubio; Andrea Saltzman; Molly Schleicher; Jose Soto; Kathleen Tibbetts; Charlotte Tolonen; Gordon Wade; Michael E Talkowski; Benjamin M Neale; Mark J Daly; Daniel G MacArthur
Journal: Nature Date: 2020-05-27 Impact factor: 69.504