| Literature DB >> 34130601 |
Yinhui Yao1, Jingyi Zhao2, Xiaohui Zhou3, Junhui Hu1, Ying Wang1.
Abstract
In this study, we evaluated the diagnostic value of key genes in myocardial infarction (MI) based on data from the Gene Expression Omnibus (GEO) database. We used data from GSE66360 to identify a set of significant differentially expressed genes (DEGs) between MI and healthy controls. Logistic regression, least absolute shrinkage and selection operator (LASSO) regression, support vector machine recursive feature elimination (SVM-RFE), and SignalP 3.0 server were used to identify the potential role of genes in predicting diagnosis in patients with MI. Principal component analysis (PCA), receiver operating characteristic (ROC) curve analyses, area under the curve (AUC) analyses, and C-index were used to estimate the diagnostic value of genes in patients with MI. The association was validated using six other independent data sets. Subsequently, bioinformatics analysis was conducted based on the aforementioned potential genes. A meta-analysis was performed to evaluate the diagnostic value of the genes in MI. Forty-four DEGs were selected from the GSE66360 dataset. A three-gene signature consisting of CCL20, IL1R2, and ITLN1 could effectively distinguish patients with MI. The three-gene signature was validated in seven independent cohorts. Functional enrichment and Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses were performed to reveal the involvement of the three-gene signature in inflammation-related biological processes and pathways. Moreover, diagnostic meta-analysis results of the three-gene signature showed that the pooled sensitivity, specificity, and AUC for MI were 0.80, 0.90, and 0.93, respectively. These results suggest that the three-gene signature is a novel candidate biomarker for distinguishing MI from healthy controls.Entities:
Keywords: Gene signature; biomarker; diagnosis; meta-analysis; myocardial infarction
Mesh:
Substances:
Year: 2021 PMID: 34130601 PMCID: PMC8806758 DOI: 10.1080/21655979.2021.1938498
Source DB: PubMed Journal: Bioengineered ISSN: 2165-5979 Impact factor: 3.269
Information on the included microarray data sets
| GEO accession | Country | Platform | Cases/controls | Source of tissue |
|---|---|---|---|---|
| GSE141512 | Russia | GPL17586 | 6/6 | Whole blood |
| GSE24519 | Italy | GPL2895 | 34/4 | Whole blood |
| GSE34198 | Czech Republic | GPL6102 | 49/48 | Whole blood |
| GSE48060 | USA | GPL570 | 31/21 | Whole blood |
| GSE60993 | South Korea | GPL6884 | 17/7 | Whole blood |
| GSE66360 | USA | GPL570 | 49/50 | CD146+ circulating endothelial cells |
| GSE109048 | Italy | GPL17586 | 19/19 | Platelets |
Figure 1.Flow chart of microarray data set selection
Figure 2.Differentially expressed genes (DEGs) between myocardial infarction (MI) and healthy control tissues
Figure 3.Univariate logistic regression of differentially expressed genes (DEGs) between myocardial infarction (MI) and healthy control tissues
Figure 4.Identification of the three-gene signature for patients with myocardial infarction (MI) in the GSE66360 data set. (a, b) Eight differentially expressed genes (DEGs) were identified by least absolute shrinkage and selection operator (LASSO) regression. (c, d) Line plot of 5-fold cross-validation of the support vector machine recursive feature elimination (SVM-RFE) algorithm for feature selection. (e) Venn diagram of LASSO and SVM-RFE. (f) Three DEGs were identified by using the SignalP 3.0 server. (g)The three-gene signature was identified by multivariable logistic regression
Figure 5.Principal component analysis of the three-gene signature for patients with myocardial infarction (MI) in the GSE66360 data set
Figure 6.The relative expression of three hub genes validated by six Gene Expression Omnibus (GEO) terms
Sensitivity, specificity, AUC, and C-index of the classification performance of the three-gene signature in six data sets
| GEO accession | TP | FP | FN | TN | Sensitivity (95% CI) | Specificity (95% CI) | AUC (95% CI) | C-index (95% CI) | |
|---|---|---|---|---|---|---|---|---|---|
| GSE141512 | 3 | 1 | 3 | 5 | 0.500 (0.139–0.860) | 0.833 (0.364–0.991) | 0.639 (0.311–0.967) | 0.639 (0.00788–1.00) | |
| GSE24519 | 32 | 0 | 2 | 4 | 0.941 (0.789–0.989) | 1.00 (0.395–1.000) | 0.978 (0.934–1.00) | 0.978 (0.899–1.00) | |
| GSE34198 | 27 | 10 | 22 | 38 | 0.551 (0.403–0.691) | 0.79 (0.645–0.890) | 0.652 (0.542–0.762) | 0.652 (0.446–0.857) | |
| GSE48060 | 24 | 4 | 7 | 17 | 0.774 (0.584–0.897) | 0.809 (0.574–0.937) | 0.78 (0.641–0.920) | 0.78 (0.501–1.00) | |
| GSE60993 | 15 | 1 | 2 | 6 | 0.824 (0.558–0.953) | 1.00 (0.561–1.00) | 0.882 (0.744–1.000) | 0.882 (0.613–1.00) | |
| GSE66360 | 45 | 1 | 4 | 49 | 0.918 (0.795–0.973) | 0.980 (0.879–0.998) | 0.975 (0.948–1.000) | 0.975 (0.922–1.00) | |
| GSE109048 | 14 | 2 | 5 | 17 | 0.736 (0.485–0.898) | 0.894 (0.654–0.981) | 0.867 (0.749–0.985) | 0.867 (0.635–1.00) | |
GO functional annotation of the three-gene signature
| Category | ID | GO term | Gene | |
|---|---|---|---|---|
| BP | GO:0071347 | cellular response to interleukin-1 | 0.00014 | |
| BP | GO:0070555 | response to interleukin-1 | 0.00020 | |
| BP | GO:0050711 | negative regulation of interleukin-1 secretion | 0.0025 | |
| BP | GO:1900016 | negative regulation of cytokine production involved in inflammatory response | 0.0031 | |
| BP | GO:0035584 | calcium-mediated signaling using intracellular calcium source | 0.0035 | |
| BP | GO:0032692 | negative regulation of interleukin-1 production | 0.0045 | |
| BP | GO:2000406 | positive regulation of T cell migration | 0.0048 | |
| BP | GO:0070207 | protein homotrimerization | 0.0052 | |
| BP | GO:0046326 | positive regulation of glucose import | 0.0053 | |
| BP | GO:2000403 | positive regulation of lymphocyte migration | 0.0058 | |
| BP | GO:0010955 | negative regulation of protein processing | 0.0061 | |
| BP | GO:1903318 | negative regulation of protein maturation | 0.0061 | |
| BP | GO:0010828 | positive regulation of glucose transmembrane transport | 0.0064 | |
| BP | GO:1900015 | regulation of cytokine production involved in inflammatory response | 0.0064 | |
| BP | GO:2000404 | regulation of T cell migration | 0.0064 | |
| BP | GO:0002534 | cytokine production involved in inflammatory response | 0.0069 | |
| BP | GO:0050704 | regulation of interleukin-1 secretion | 0.0079 | |
| BP | GO:0070206 | protein trimerization | 0.0087 | |
| BP | GO:0046324 | regulation of glucose import | 0.0088 | |
| BP | GO:0070498 | interleukin-1-mediated signaling pathway | 0.0088 | |
| BP | GO:0001960 | negative regulation of cytokine-mediated signaling pathway | 0.0090 | |
| BP | GO:0050701 | interleukin-1 secretion | 0.0090 | |
| BP | GO:0072678 | T cell migration | 0.0095 | |
| BP | GO:2000401 | regulation of lymphocyte migration | 0.0095 | |
| BP | GO:0060761 | negative regulation of response to cytokine stimulus | 0.0097 | |
| BP | GO:0002532 | production of molecular mediator involved in inflammatory response | 0.010 | |
| BP | GO:0046323 | glucose import | 0.010 | |
| BP | GO:0048247 | lymphocyte chemotaxis | 0.010 | |
| BP | GO:0002548 | monocyte chemotaxis | 0.010 | |
| BP | GO:0050710 | negative regulation of cytokine secretion | 0.010 | |
| BP | GO:0010827 | regulation of glucose transmembrane transport | 0.011 | |
| BP | GO:0032652 | regulation of interleukin-1 production | 0.012 | |
| BP | GO:0071674 | mononuclear cell migration | 0.013 | |
| BP | GO:0032612 | interleukin-1 production | 0.014 | |
| BP | GO:0070098 | chemokine-mediated signaling pathway | 0.014 | |
| BP | GO:1990868 | response to chemokine | 0.015 | |
| BP | GO:1990869 | cellular response to chemokine | 0.015 | |
| BP | GO:0030593 | neutrophil chemotaxis | 0.016 | |
| BP | GO:1904659 | glucose transmembrane transport | 0.016 | |
| BP | GO:0072676 | lymphocyte migration | 0.016 | |
| BP | GO:0008645 | hexose transmembrane transport | 0.017 | |
| BP | GO:0015749 | monosaccharide transmembrane transport | 0.017 | |
| BP | GO:0034219 | carbohydrate transmembrane transport | 0.018 | |
| BP | GO:1990266 | neutrophil migration | 0.018 | |
| BP | GO:0019730 | antimicrobial humoral response | 0.019 | |
| BP | GO:0071621 | granulocyte chemotaxis | 0.019 | |
| BP | GO:0002687 | positive regulation of leukocyte migration | 0.019 | |
| BP | GO:0050709 | negative regulation of protein secretion | 0.020 | |
| BP | GO:0097530 | granulocyte migration | 0.021 | |
| BP | GO:0002792 | negative regulation of peptide secretion | 0.021 | |
| BP | GO:0050728 | negative regulation of inflammatory response | 0.022 | |
| BP | GO:0008643 | carbohydrate transport | 0.022 | |
| BP | GO:0001959 | regulation of cytokine-mediated signaling pathway | 0.024 | |
| BP | GO:0060759 | regulation of response to cytokine stimulus | 0.026 | |
| BP | GO:0070613 | regulation of protein processing | 0.027 | |
| BP | GO:1903317 | regulation of protein maturation | 0.027 | |
| BP | GO:0071346 | cellular response to interferon-gamma | 0.028 | |
| BP | GO:0002685 | regulation of leukocyte migration | 0.028 | |
| BP | GO:0051224 | negative regulation of protein transport | 0.029 | |
| BP | GO:0050707 | regulation of cytokine secretion | 0.029 | |
| BP | GO:1904950 | negative regulation of establishment of protein localization | 0.029 | |
| BP | GO:0034764 | positive regulation of transmembrane transport | 0.030 | |
| BP | GO:1903531 | negative regulation of secretion by cell | 0.031 | |
| BP | GO:0034341 | response to interferon-gamma | 0.031 | |
| BP | GO:0031348 | negative regulation of defense response | 0.031 | |
| BP | GO:0097529 | myeloid leukocyte migration | 0.031 | |
| BP | GO:0050663 | cytokine secretion | 0.033 | |
| BP | GO:0030595 | leukocyte chemotaxis | 0.034 | |
| BP | GO:0019722 | calcium-mediated signaling | 0.035 | |
| BP | GO:0051048 | negative regulation of secretion | 0.035 | |
| BP | GO:0071356 | cellular response to tumor necrosis factor | 0.037 | |
| MF | GO:0070492 | oligosaccharide binding | 0.0025 | |
| MF | GO:0048020 | CCR chemokine receptor binding | 0.0072 | |
| MF | GO:0008009 | chemokine activity | 0.0083 | |
| MF | GO:0042379 | chemokine receptor binding | 0.011 | |
| MF | GO:0004896 | cytokine receptor activity | 0.015 | |
| MF | GO:0019955 | cytokine binding | 0.021 | |
| MF | GO:0019838 | growth factor binding | 0.023 |
KEGG pathway analysis of the three-gene signature
| ID | KEGG term | Gene | |
|---|---|---|---|
| hsa04060 | Cytokine-cytokine receptor interaction | 0.0010 | |
| hsa05323 | Rheumatoid arthritis | 0.023 | |
| hsa04657 | IL-17 signaling pathway | 0.023 | |
| hsa05215 | Prostate cancer | 0.024 | |
| hsa04640 | Hematopoietic cell lineage | 0.024 | |
| hsa04061 | Viral protein interaction with cytokine and cytokine receptor | 0.025 | |
| hsa05146 | Amoebiasis | 0.025 | |
| hsa04668 | TNF signaling pathway | 0.028 | |
| hsa05418 | Fluid shear stress and atherosclerosis | 0.034 | |
| hsa04062 | Chemokine signaling pathway | 0.046 | |
| hsa05202 | Transcriptional misregulation in cancer | 0.047 |
Figure 7.Meta-analysis of the three-gene signature for predicting diagnosis in patients with myocardial infarction (MI). (a) Forest plots of the pooled sensitivity and specificity of the three-gene signature in the diagnosis of MI. (b) Summary receiver operating characteristic (SROC) curve of the three-gene signature. (c) Fagan’s nomogram was used to evaluate the clinical utility of the three-gene signature for the diagnosis of MI. (d) Likelihood ratio scattergram
Figure 8.The source of heterogeneity was analyzed from the perspectives of publication bias and a bivariate box plot. (a) Deeks’ funnel plot asymmetry test for identifying publication bias. (b) Bivariate boxplot
Figure 9.Univariable meta-regression and subgroup analysis in the meta-analysis