Literature DB >> 34552349

Classifiers for Predicting Coronary Artery Disease Based on Gene Expression Profiles in Peripheral Blood Mononuclear Cells.

Jie Liu1,2, Xiaodong Wang1,2, Junhua Lin1, Shaohua Li1, Guoxiong Deng1,2, Jinru Wei1,2.   

Abstract

OBJECTIVE: Coronary artery disease (CAD) is a serious global health concern. Current diagnostic methods for CAD involve risk to the patient and are costly, so better diagnostic tools are needed. We defined four classifiers based on gene expression profiles in peripheral blood mononuclear cells and determined their potential for CAD detection.
METHODS: We downloaded a CAD-related data set (GSE113079) from the Gene Expression Omnibus (GEO) database. We identified differentially expressed genes (DEGs) in peripheral blood mononuclear cells between CAD samples and healthy controls. DEGs were analyzed for functional enrichment. To create a robust CAD classifier, DEGs were identified by feature selection using the principal component analysis. Then, least absolute shrinkage and selection operator (LASSO) logistic regression, random forest, and support vector machine (SVM) models were created. Gene set variation analysis (GSVA) score and gene set enrichment analysis (GSEA) were also conducted. The performance of the models was evaluated in terms of the area under receiver operating characteristic curves (AUC).
RESULTS: In the training set, we found 135 up-regulated genes and 104 down-regulated genes in CAD patients compared with controls. The DEGs were involved in some pathways associated with CAD, such as pathways involving calcium and interleukin-17 signaling. Twenty genes were identified as optimal features and used to generate the logistic classifier based on LASSO. The AUC for the classifier was 1.00 in the training set and 0.997 in the test set. Using the 20 DEGs, SVM and random forest classifiers were also generated and showed high diagnostic efficacy, with respective AUCs of 0.997 and 1.00 against the training set. A GSVA score was also established using the top 20 significant DEGs, which showed an AUC of 0.971 in the training set and 0.989 in the test set. Furthermore, GSEA showed autophagy and the proteasome to be major pathways involving the DEGs.
CONCLUSION: We identified a set of genes specific for CAD whose expression can be measured non-invasively. Using these genes, we defined four diagnostic classifiers using multiple methods.
© 2021 Liu et al.

Entities:  

Keywords:  classifier; coronary artery disease; diagnosis; gene expression

Year:  2021        PMID: 34552349      PMCID: PMC8450378          DOI: 10.2147/IJGM.S329005

Source DB:  PubMed          Journal:  Int J Gen Med        ISSN: 1178-7074


  46 in total

1.  Cytoscape: a software environment for integrated models of biomolecular interaction networks.

Authors:  Paul Shannon; Andrew Markiel; Owen Ozier; Nitin S Baliga; Jonathan T Wang; Daniel Ramage; Nada Amin; Benno Schwikowski; Trey Ideker
Journal:  Genome Res       Date:  2003-11       Impact factor: 9.043

2.  clusterProfiler: an R package for comparing biological themes among gene clusters.

Authors:  Guangchuang Yu; Li-Gen Wang; Yanyan Han; Qing-Yu He
Journal:  OMICS       Date:  2012-03-28

Review 3.  Coronary heart disease as a case study in prevention: potential role of incentives.

Authors:  Philip A Ades; Diann E Gaalema
Journal:  Prev Med       Date:  2012-01-28       Impact factor: 4.018

Review 4.  What is a support vector machine?

Authors:  William S Noble
Journal:  Nat Biotechnol       Date:  2006-12       Impact factor: 54.908

5.  Simulation-selection-extrapolation: Estimation in high-dimensional errors-in-variables models.

Authors:  Linh Nghiem; Cornelis Potgieter
Journal:  Biometrics       Date:  2019-08-28       Impact factor: 2.571

6.  Characterization of LncRNA expression profile and identification of novel LncRNA biomarkers to diagnose coronary artery disease.

Authors:  Lin Li; Laiyuan Wang; Hongfan Li; Xikun Han; Shufeng Chen; Bin Yang; Zunsong Hu; Huijuan Zhu; Can Cai; Jichun Chen; Xiangdong Li; Jianfeng Huang; Dongfeng Gu
Journal:  Atherosclerosis       Date:  2018-06-18       Impact factor: 5.162

7.  Coronary heart disease diagnosis by artificial neural networks including aortic pulse wave velocity index and clinical parameters.

Authors:  Alexandre Vallée; Alexandre Cinaud; Vincent Blachier; Hélène Lelong; Michel E Safar; Jacques Blacher
Journal:  J Hypertens       Date:  2019-08       Impact factor: 4.844

8.  Ensemble of heterogeneous classifiers for diagnosis and prediction of coronary artery disease with reduced feature subset.

Authors:  Durgadevi Velusamy; Karthikeyan Ramasamy
Journal:  Comput Methods Programs Biomed       Date:  2020-09-30       Impact factor: 5.428

9.  An SVM-based method for assessment of transcription factor-DNA complex models.

Authors:  Rosario I Corona; Sanjana Sudarshan; Srinivas Aluru; Jun-Tao Guo
Journal:  BMC Bioinformatics       Date:  2018-12-21       Impact factor: 3.169

10.  GSVA: gene set variation analysis for microarray and RNA-seq data.

Authors:  Sonja Hänzelmann; Robert Castelo; Justin Guinney
Journal:  BMC Bioinformatics       Date:  2013-01-16       Impact factor: 3.169

View more
  2 in total

1.  Construction of genetic classification model for coronary atherosclerosis heart disease using three machine learning methods.

Authors:  Wenjuan Peng; Yuan Sun; Ling Zhang
Journal:  BMC Cardiovasc Disord       Date:  2022-02-12       Impact factor: 2.298

2.  RUNX1 and CCL3 in Diabetes Mellitus-Related Coronary Artery Disease: A Bioinformatics Analysis.

Authors:  Yi Zhong; Guoyong Du; Jie Liu; Shaohua Li; Junhua Lin; Guoxiong Deng; Jinru Wei; Jun Huang
Journal:  Int J Gen Med       Date:  2022-01-28
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.