Literature DB >> 35649387

KLFDAPC: a supervised machine learning approach for spatial genetic structure analysis.

Xinghu Qin1, Charleston W K Chiang2, Oscar E Gaggiotti1.   

Abstract

Geographic patterns of human genetic variation provide important insights into human evolution and disease. A commonly used tool to detect and describe them is principal component analysis (PCA) or the supervised linear discriminant analysis of principal components (DAPC). However, genetic features produced from both approaches could fail to correctly characterize population structure for complex scenarios involving admixture. In this study, we introduce Kernel Local Fisher Discriminant Analysis of Principal Components (KLFDAPC), a supervised non-linear approach for inferring individual geographic genetic structure that could rectify the limitations of these approaches by preserving the multimodal space of samples. We tested the power of KLFDAPC to infer population structure and to predict individual geographic origin using neural networks. Simulation results showed that KLFDAPC has higher discriminatory power than PCA and DAPC. The application of our method to empirical European and East Asian genome-wide genetic datasets indicated that the first two reduced features of KLFDAPC correctly recapitulated the geography of individuals and significantly improved the accuracy of predicting individual geographic origin when compared to PCA and DAPC. Therefore, KLFDAPC can be useful for geographic ancestry inference, design of genome scans and correction for spatial stratification in GWAS that link genes to adaptation or disease susceptibility.
© The Author(s) 2022. Published by Oxford University Press.

Entities:  

Keywords:  individual geographic origin; machine learning; population structure

Mesh:

Year:  2022        PMID: 35649387      PMCID: PMC9294434          DOI: 10.1093/bib/bbac202

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   13.994


  52 in total

Review 1.  Viral mutation rates.

Authors:  Rafael Sanjuán; Miguel R Nebot; Nicola Chirico; Louis M Mansky; Robert Belshaw
Journal:  J Virol       Date:  2010-07-21       Impact factor: 5.103

2.  fastsimcoal: a continuous-time coalescent simulator of genomic diversity under arbitrarily complex evolutionary scenarios.

Authors:  Laurent Excoffier; Matthieu Foll
Journal:  Bioinformatics       Date:  2011-03-12       Impact factor: 6.937

3.  Comparisons of likelihood and machine learning methods of individual classification.

Authors:  B Guinand; A Topchy; K S Page; M K Burnham-Curtis; W F Punch; K T Scribner
Journal:  J Hered       Date:  2002 Jul-Aug       Impact factor: 2.645

4.  An application of Random Forests to a genome-wide association dataset: methodological considerations & new findings.

Authors:  Benjamin A Goldstein; Alan E Hubbard; Adele Cutler; Lisa F Barcellos
Journal:  BMC Genet       Date:  2010-06-14       Impact factor: 2.797

5.  LinkImpute: Fast and Accurate Genotype Imputation for Nonmodel Organisms.

Authors:  Daniel Money; Kyle Gardner; Zoë Migicovsky; Heidi Schwaninger; Gan-Yuan Zhong; Sean Myles
Journal:  G3 (Bethesda)       Date:  2015-09-15       Impact factor: 3.154

6.  Exome sequencing of Finnish isolates enhances rare-variant association power.

Authors:  Adam E Locke; Karyn Meltz Steinberg; Charleston W K Chiang; Susan K Service; Aki S Havulinna; Laurel Stell; Matti Pirinen; Haley J Abel; Colby C Chiang; Robert S Fulton; Anne U Jackson; Chul Joo Kang; Krishna L Kanchi; Daniel C Koboldt; David E Larson; Joanne Nelson; Thomas J Nicholas; Arto Pietilä; Vasily Ramensky; Debashree Ray; Laura J Scott; Heather M Stringham; Jagadish Vangipurapu; Ryan Welch; Pranav Yajnik; Xianyong Yin; Johan G Eriksson; Mika Ala-Korpela; Marjo-Riitta Järvelin; Minna Männikkö; Hannele Laivuori; Susan K Dutcher; Nathan O Stitziel; Richard K Wilson; Ira M Hall; Chiara Sabatti; Aarno Palotie; Veikko Salomaa; Markku Laakso; Samuli Ripatti; Michael Boehnke; Nelson B Freimer
Journal:  Nature       Date:  2019-07-31       Impact factor: 49.962

7.  Geography predicts neutral genetic diversity of human populations.

Authors:  Franck Prugnolle; Andrea Manica; François Balloux
Journal:  Curr Biol       Date:  2005-03-08       Impact factor: 10.834

Review 8.  Chapter 11: Genome-wide association studies.

Authors:  William S Bush; Jason H Moore
Journal:  PLoS Comput Biol       Date:  2012-12-27       Impact factor: 4.475

9.  Discriminant analysis of principal components and pedigree assessment of genetic diversity and population structure in a tetraploid potato panel using SNPs.

Authors:  Sofía I Deperi; Martín E Tagliotti; M Cecilia Bedogni; Norma C Manrique-Carpintero; Joseph Coombs; Ruofang Zhang; David Douches; Marcelo A Huarte
Journal:  PLoS One       Date:  2018-03-16       Impact factor: 3.240

10.  Genomic insights into the formation of human populations in East Asia.

Authors:  Chuan-Chao Wang; Hui-Yuan Yeh; Alexander N Popov; Hu-Qin Zhang; Hirofumi Matsumura; Kendra Sirak; Olivia Cheronet; Alexey Kovalev; Nadin Rohland; Alexander M Kim; Swapan Mallick; Rebecca Bernardos; Dashtseveg Tumen; Jing Zhao; Yi-Chang Liu; Jiun-Yu Liu; Matthew Mah; Ke Wang; Zhao Zhang; Nicole Adamski; Nasreen Broomandkhoshbacht; Kimberly Callan; Francesca Candilio; Kellie Sara Duffett Carlson; Brendan J Culleton; Laurie Eccles; Suzanne Freilich; Denise Keating; Ann Marie Lawson; Kirsten Mandl; Megan Michel; Jonas Oppenheimer; Kadir Toykan Özdoğan; Kristin Stewardson; Shaoqing Wen; Shi Yan; Fatma Zalzala; Richard Chuang; Ching-Jung Huang; Hana Looh; Chung-Ching Shiung; Yuri G Nikitin; Andrei V Tabarev; Alexey A Tishkin; Song Lin; Zhou-Yong Sun; Xiao-Ming Wu; Tie-Lin Yang; Xi Hu; Liang Chen; Hua Du; Jamsranjav Bayarsaikhan; Enkhbayar Mijiddorj; Diimaajav Erdenebaatar; Tumur-Ochir Iderkhangai; Erdene Myagmar; Hideaki Kanzawa-Kiriyama; Masato Nishino; Ken-Ichi Shinoda; Olga A Shubina; Jianxin Guo; Wangwei Cai; Qiongying Deng; Longli Kang; Dawei Li; Dongna Li; Rong Lin; Rukesh Shrestha; Ling-Xiang Wang; Lanhai Wei; Guangmao Xie; Hongbing Yao; Manfei Zhang; Guanglin He; Xiaomin Yang; Rong Hu; Martine Robbeets; Stephan Schiffels; Douglas J Kennett; Li Jin; Hui Li; Johannes Krause; Ron Pinhasi; David Reich
Journal:  Nature       Date:  2021-02-22       Impact factor: 49.962

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.