| Literature DB >> 27573188 |
Luran Liu1, Yan Liu1, Chang Liu1, Zhuobo Zhang1, Yaojun Du1, Hao Zhao1.
Abstract
The present study aimed to identify potential biomarkers for atherosclerosis via analysis of gene expression profiles. The microarray dataset no. GSE20129 was downloaded from the Gene Expression Omnibus database. A total of 118 samples from the peripheral blood of female patients was used, including 47 atherosclerotic and 71 non‑atherosclerotic patients. The differentially expressed genes (DEGs) in the atherosclerosis samples were identified using the Limma package. Gene ontology term and Kyoto Encyclopedia of Genes and Genomes pathway analyses for DEGs were performed using the Database for Annotation, Visualization and Integrated Discovery tool. The recursive feature elimination (RFE) algorithm was applied for feature selection via iterative classification, and support vector machine classifier was used for the validation of prediction accuracy. A total of 430 DEGs in the atherosclerosis samples were identified, including 149 up‑ and 281 downregulated genes. Subsequently, the RFE algorithm was used to identify 11 biomarkers, whose receiver operating characteristic curves had an area under curve of 0.92, indicating that the identified 11 biomarkers were representative. The present study indicated that APH1B, JAM3, FBLN2, CSAD and PSTPIP2 may have important roles in the progression of atherosclerosis in females and may be potential biomarkers for early diagnosis and prognosis as well as treatment targets for this disease.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27573188 PMCID: PMC5042771 DOI: 10.3892/mmr.2016.5650
Source DB: PubMed Journal: Mol Med Rep ISSN: 1791-2997 Impact factor: 2.952
Figure 1Heat map of hierarchical clustering analysis. The abscissa represents clustering of specimens with atherosclerosis samples in dark blue and non-atherosclerosis samples in light blue, and the mixture of the two sample groups in orange. The ordinate represents differentially expressed genes and the clustering of genes. Red indicates upregulated genes while green signifies downregulated genes.
Over-represented GO terms enriched by differentially expressed genes.
| Term | Count | P-value |
|---|---|---|
| Upregulated genes | ||
| GO:0022613 - Ribonucleoprotein complex biogenesis | 6 | 0.010748 |
| GO:0042254 - Ribosome biogenesis | 5 | 0.012705 |
| GO:0042102 - Positive regulation of T-cell proliferation | 3 | 0.033574 |
| GO:0007010 - Cytoskeletal organization | 8 | 0.042602 |
| GO:0006941 - Striated muscle contraction | 3 | 0.045367 |
| Downregulated genes | ||
| GO:0032496 - Response to lipopolysaccharides | 7 | 9.36×10−4 |
| GO:0006955 - Immune response | 22 | 0.001271 |
| GO:0002237 - Response to molecules of bacterial origin | 7 | 0.001665 |
| GO:0009611 - Response to wounding | 18 | 0.002148 |
| GO:0002684 - Positive regulation of immune system processes | 11 | 0.002768 |
| GO:0001892 - Embryonic placenta development | 4 | 0.005054 |
| GO:0045936 - Negative regulation of phosphate metabolic processes | 5 | 0.005334 |
| GO:0010563 - Negative regulation of phosphorus metabolic processes | 5 | 0.005334 |
| GO:0050778 - Positive regulation of immune response | 8 | 0.005621 |
| GO:0042127 - Regulation of cell proliferation | 22 | 0.005981 |
| GO:0006954 - Inflammatory response | 12 | 0.008674 |
GO, gene ontology
Significantly enriched Kyoto Encyclopedia of Genes and Genomes pathways among differentially expressed genes.
| Term | Count | P-value |
|---|---|---|
| Upregulated | ||
| hsa05416: Viral myocarditis | 3 | 0.02366 |
| hsa00230: Purine metabolism | 4 | 0.03880 |
| Downregulated | ||
| hsa04670: Leukocyte transendothelial migration | 7 | 0.01940 |
| hsa04330: Notch signaling pathway | 4 | 0.04248 |
hsa, Homo sapiens.
Biomarkers screened using the recursive feature elimination algorithm.
| Biomarker | P-value | logFC |
|---|---|---|
| APH1B | 5.81×10−4 | −0.089736 |
| HIATL1 | 1.44×10−3 | −0.111004 |
| JAM3 | 2.24×10−3 | 0.164411 |
| HOXB13 | 2.25×10−3 | −0.031921 |
| FBLN2 | 2.35×10−3 | 0.049115 |
| RNF148 | 2.82×10−3 | −0.030481 |
| RALB | 2.84×10−3 | −0.093036 |
| CPEB3 | 3.07×10−3 | −0.059267 |
| ABHD14B | 6.30×10−3 | 0.069626 |
| CSAD | 7.75×10−3 | −0.041617 |
| PSTPIP2 | 9.45×10−3 | −0.096693 |
FC, fold change.
Figure 2ROC of support vector machine classifier constructed by 11 biomarkers. Five curves represent the results of every time (refinement cycle) in five-fold cross-validation of all samples. The black dotted line represents the mean ROC, and the light gray dotted line represents the random ROC. ROC, receiver operating characteristic curve; area, area under curve.
Figure 3ROC of support vector machine classifier constructed by all differentially expressed genes. Five curves represent the results of every time (refinement cycle) in five-fold cross-validation of all samples. The black dotted line represents the mean ROC, and the light gray dotted line represents the random ROC. ROC, receiver operating characteristic curve; area, area under curve.