Literature DB >> 18271057

Application of machine learning algorithms to predict coronary artery calcification with a sibship-based design.

Yan V Sun1, Lawrence F Bielak, Patricia A Peyser, Stephen T Turner, Patrick F Sheedy, Eric Boerwinkle, Sharon L R Kardia.   

Abstract

As part of the Genetic Epidemiology Network of Arteriopathy study, hypertensive non-Hispanic White sibships were screened using 471 single nucleotide polymorphisms (SNPs) to identify genes influencing coronary artery calcification (CAC) measured by computed tomography. Individuals with detectable CAC and CAC quantity > or =70th age- and sex-specific percentile were classified as having a high CAC burden and compared to individuals with CAC quantity <70th percentile. Two sibs from each sibship were randomly chosen and divided into two data sets, each with 360 unrelated individuals. Within each data set, we applied two machine learning algorithms, Random Forests and RuleFit, to identify the best predictors of having high CAC burden among 17 risk factors and 471 SNPs. Using five-fold cross-validation, both methods had approximately 70% sensitivity and approximately 60% specificity. Prediction accuracies were significantly different from random predictions (P-value<0.001) based on 1,000 permutation tests. Predictability of using 287 tagSNPs was as good as using all 471 SNPs. For Random Forests, among the top 50 predictors, the same eight tagSNPs and 15 risk factors were found in both data sets while eight tagSNPs and 12 risk factors were found in both data sets for RuleFit. Replicable effects of two tagSNPs (in genes GPR35 and NOS3) and 12 risk factors (age, body mass index, sex, serum glucose, high-density lipoprotein cholesterol, systolic blood pressure, cholesterol, homocysteine, triglycerides, fibrinogen, Lp(a) and low-density lipoprotein particle size) were identified by both methods. This study illustrates how machine learning methods can be used in sibships to identify important, replicable predictors of subclinical coronary atherosclerosis.

Entities:  

Mesh:

Year:  2008        PMID: 18271057      PMCID: PMC2828904          DOI: 10.1002/gepi.20309

Source DB:  PubMed          Journal:  Genet Epidemiol        ISSN: 0741-0395            Impact factor:   2.135


  44 in total

1.  Selecting a maximally informative set of single-nucleotide polymorphisms for association analyses using linkage disequilibrium.

Authors:  Christopher S Carlson; Michael A Eberle; Mark J Rieder; Qian Yi; Leonid Kruglyak; Deborah A Nickerson
Journal:  Am J Hum Genet       Date:  2003-12-15       Impact factor: 11.025

2.  Genotyping over 100,000 SNPs on a pair of oligonucleotide arrays.

Authors:  Hajime Matsuzaki; Shoulian Dong; Halina Loi; Xiaojun Di; Guoying Liu; Earl Hubbell; Jane Law; Tam Berntsen; Monica Chadha; Henry Hui; Geoffrey Yang; Giulia C Kennedy; Teresa A Webster; Simon Cawley; P Sean Walsh; Keith W Jones; Stephen P A Fodor; Rui Mei
Journal:  Nat Methods       Date:  2004-11       Impact factor: 28.547

3.  Imputing missing genotypic data of single-nucleotide polymorphisms using neural networks.

Authors:  Yan V Sun; Sharon Lr Kardia
Journal:  Eur J Hum Genet       Date:  2008-01-16       Impact factor: 4.246

4.  Quantification of coronary artery calcium using ultrafast computed tomography.

Authors:  A S Agatston; W R Janowitz; F J Hildner; N R Zusmer; M Viamonte; R Detrano
Journal:  J Am Coll Cardiol       Date:  1990-03-15       Impact factor: 24.094

5.  Coronary artery calcification measured at electron-beam CT: agreement in dual scan runs and change over time.

Authors:  L F Bielak; P F Sheedy; P A Peyser
Journal:  Radiology       Date:  2001-01       Impact factor: 11.105

6.  Summary of the second report of the National Cholesterol Education Program (NCEP) Expert Panel on Detection, Evaluation, and Treatment of High Blood Cholesterol in Adults (Adult Treatment Panel II)

Authors: 
Journal:  JAMA       Date:  1993-06-16       Impact factor: 56.272

7.  Association of fibrinogen with quantity of coronary artery calcification measured by electron beam computed tomography.

Authors:  L F Bielak; G G Klee; P F Sheedy; S T Turner; R S Schwartz; P A Peyser
Journal:  Arterioscler Thromb Vasc Biol       Date:  2000-09       Impact factor: 8.311

8.  A smoking-dependent risk of coronary artery disease associated with a polymorphism of the endothelial nitric oxide synthase gene.

Authors:  X L Wang; A S Sim; R F Badenhop; R M McCredie; D E Wilcken
Journal:  Nat Med       Date:  1996-01       Impact factor: 53.440

9.  Low-density lipoprotein particle size and coronary atherosclerosis in subjects belonging to hypertensive sibships.

Authors:  Iftikhar J Kullo; Kent R Bailey; Joseph P McConnell; Patricia A Peyser; Lawrence F Bielak; Sharon L R Kardia; Patrick F Sheedy; Eric Boerwinkle; Stephen T Turner
Journal:  Am J Hypertens       Date:  2004-09       Impact factor: 2.689

10.  Classification of rheumatoid arthritis status with candidate gene and genome-wide single-nucleotide polymorphisms using random forests.

Authors:  Yan V Sun; Zhaohui Cai; Kaushal Desai; Rachael Lawrance; Richard Leff; Ansar Jawaid; Sharon Lr Kardia; Huiying Yang
Journal:  BMC Proc       Date:  2007-12-18
View more
  27 in total

1.  SAR Studies of N-[2-(1H-Tetrazol-5-yl)phenyl]benzamide Derivatives as Potent G Protein-Coupled Receptor-35 Agonists.

Authors:  Lai Wei; Tao Hou; Chang Lu; Jixia Wang; Xiuli Zhang; Ye Fang; Yaopeng Zhao; Jiatao Feng; Jiaqi Li; Lala Qu; Hai-Long Piao; Xinmiao Liang
Journal:  ACS Med Chem Lett       Date:  2018-04-09       Impact factor: 4.345

2.  Structured learning algorithm for detection of nonobstructive and obstructive coronary plaque lesions from computed tomography angiography.

Authors:  Dongwoo Kang; Damini Dey; Piotr J Slomka; Reza Arsanjani; Ryo Nakazato; Hyunsuk Ko; Daniel S Berman; Debiao Li; C-C Jay Kuo
Journal:  J Med Imaging (Bellingham)       Date:  2015-03-06

3.  High-throughput identification and characterization of novel, species-selective GPR35 agonists.

Authors:  Zaynab Neetoo-Isseljee; Amanda E MacKenzie; Craig Southern; Jeff Jerman; Edward G McIver; Nicholas Harries; Debra L Taylor; Graeme Milligan
Journal:  J Pharmacol Exp Ther       Date:  2012-12-21       Impact factor: 4.030

Review 4.  Random forests for genomic data analysis.

Authors:  Xi Chen; Hemant Ishwaran
Journal:  Genomics       Date:  2012-04-21       Impact factor: 5.736

5.  Machine Learning Approach for Cardiovascular Risk and Coronary Artery Calcification Score.

Authors:  C R Aditya; Naveen Chakravarthy Sattaru; Kumaraguruparan Gopal; R Rahul; G Chandra Shekara; Omaima Nasif; Sulaiman Ali Alharbi; S S Raghavan; S Arockia Jayadhas
Journal:  Biomed Res Int       Date:  2022-06-23       Impact factor: 3.246

6.  Genome-wide prediction of discrete traits using Bayesian regressions and machine learning.

Authors:  Oscar González-Recio; Selma Forni
Journal:  Genet Sel Evol       Date:  2011-02-17       Impact factor: 4.297

7.  GPR35 as a Novel Therapeutic Target.

Authors:  A E Mackenzie; J E Lappin; D L Taylor; S A Nicklin; G Milligan
Journal:  Front Endocrinol (Lausanne)       Date:  2011-11-09       Impact factor: 5.555

8.  Label-free phenotypic profiling identified D-luciferin as a GPR35 agonist.

Authors:  Haibei Hu; Huayun Deng; Ye Fang
Journal:  PLoS One       Date:  2012-04-12       Impact factor: 3.240

9.  Discovery of 2-(4-methylfuran-2(5H)-ylidene)malononitrile and thieno[3,2-b]thiophene-2-carboxylic acid derivatives as G protein-coupled receptor 35 (GPR35) agonists.

Authors:  Huayun Deng; Haibei Hu; Mingqian He; Jieyu Hu; Weijun Niu; Ann M Ferrie; Ye Fang
Journal:  J Med Chem       Date:  2011-10-04       Impact factor: 7.446

10.  Disruption of GPR35 Exacerbates Dextran Sulfate Sodium-Induced Colitis in Mice.

Authors:  Shukkur M Farooq; Yuning Hou; Hainan Li; Megan O'Meara; Yihan Wang; Chunying Li; Jie-Mei Wang
Journal:  Dig Dis Sci       Date:  2018-07-24       Impact factor: 3.199

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.