Literature DB >> 33126243

Revisiting genome-wide association studies from statistical modelling to machine learning.

Shanwen Sun1, Benzhi Dong2, Quan Zou1.   

Abstract

Over the last decade, genome-wide association studies (GWAS) have discovered thousands of genetic variants underlying complex human diseases and agriculturally important traits. These findings have been utilized to dissect the biological basis of diseases, to develop new drugs, to advance precision medicine and to boost breeding. However, the potential of GWAS is still underexploited due to methodological limitations. Many challenges have emerged, including detecting epistasis and single-nucleotide polymorphisms (SNPs) with small effects and distinguishing causal variants from other SNPs associated through linkage disequilibrium. These issues have motivated advancements in GWAS analyses in two contrasting cultures-statistical modelling and machine learning. In this review, we systematically present the basic concepts and the benefits and limitations in both methods. We further discuss recent efforts to mitigate their weaknesses. Additionally, we summarize the state-of-the-art tools for detecting the missed signals, ultrarare mutations and gene-gene interactions and for prioritizing SNPs. Our work can offer both theoretical and practical guidelines for performing GWAS analyses and for developing further new robust methods to fully exploit the potential of GWAS.
© The Author(s) 2020. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  Bayesian; GWAS; SNPs; epistasis; fine-map; machine learning; regression

Year:  2021        PMID: 33126243     DOI: 10.1093/bib/bbaa263

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  7 in total

Review 1.  Genome-Wide Association Study Statistical Models: A Review.

Authors:  Mohsen Yoosefzadeh-Najafabadi; Milad Eskandari; François Belzile; Davoud Torkamaneh
Journal:  Methods Mol Biol       Date:  2022

2.  Prediction of Plant Resistance Proteins Based on Pairwise Energy Content and Stacking Framework.

Authors:  Yifan Chen; Zejun Li; Zhiyong Li
Journal:  Front Plant Sci       Date:  2022-05-31       Impact factor: 6.627

Review 3.  Research Progress of Gliomas in Machine Learning.

Authors:  Yameng Wu; Yu Guo; Jun Ma; Yu Sa; Qifeng Li; Ning Zhang
Journal:  Cells       Date:  2021-11-15       Impact factor: 6.600

Review 4.  Optimising Cardiometabolic Risk Factors in Pregnancy: A Review of Risk Prediction Models Targeting Gestational Diabetes and Hypertensive Disorders.

Authors:  Eleanor P Thong; Drishti P Ghelani; Pamada Manoleehakul; Anika Yesmin; Kaylee Slater; Rachael Taylor; Clare Collins; Melinda Hutchesson; Siew S Lim; Helena J Teede; Cheryce L Harrison; Lisa Moran; Joanne Enticott
Journal:  J Cardiovasc Dev Dis       Date:  2022-02-10

5.  Functional coding haplotypes and machine-learning feature elimination identifies predictors of Methotrexate Response in Rheumatoid Arthritis patients.

Authors:  Ashley J W Lim; Lee Jin Lim; Brandon N S Ooi; Ee Tzun Koh; Justina Wei Lynn Tan; Samuel S Chong; Chiea Chuen Khor; Lisa Tucker-Kellogg; Khai Pang Leong; Caroline G Lee
Journal:  EBioMedicine       Date:  2022-01-10       Impact factor: 8.143

6.  Genetic incompatibility of the reproductive partners: an evolutionary perspective on infertility.

Authors:  Jukka Kekäläinen
Journal:  Hum Reprod       Date:  2021-11-18       Impact factor: 6.918

Review 7.  Labels in a haystack: Approaches beyond supervised learning in biomedical applications.

Authors:  Artur Yakimovich; Anaël Beaugnon; Yi Huang; Elif Ozkirimli
Journal:  Patterns (N Y)       Date:  2021-12-10
  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.