| Literature DB >> 34688276 |
Feng Zhu1,2,3, Lili Zuo4, Rui Hu5, Jin Wang6, Zhihua Yang1,7, Xin Qi3, Limin Feng8.
Abstract
BACKGROUND: Atherosclerosis is the leading cause of cardiovascular disease with a high mortality worldwide. Understanding the atherosclerosis pathogenesis and identification of efficient diagnostic signatures remain major problems of modern medicine. This study aims to screen the potential diagnostic genes for atherosclerosis.Entities:
Keywords: Atherosclerosis; GO analysis; KEGG analysis; Logistic regression diagnostic mode; PPI network
Mesh:
Substances:
Year: 2021 PMID: 34688276 PMCID: PMC8540101 DOI: 10.1186/s12872-021-02323-9
Source DB: PubMed Journal: BMC Cardiovasc Disord ISSN: 1471-2261 Impact factor: 2.298
Fig. 1The WGCNA analysis. A Cluster analysis of 119 samples. B Selection of the soft threshold in the WGCNA analysis. The red line represents the correlation coefficient, and the first point above the red line corresponds to the soft threshold. C Clustering dendrogram of gene modules. Different gene modules are represented by distinct colors, and the genes that could not be grouped into other modules are placed in the grey module. D The correlation analysis between gene modules and phenotypes. The darker color indicates greater correlation between the gene module and the corresponding phenotype
Fig. 2The GO and KEGG pathway enrichment analysis. A The top 20 GO terms with the greatest number of enriched genes. The horizontal axis denotes the gene number, and the vertical axis denotes the GO term. B The top 20 KEGG pathways with the greatest number of enriched genes. The horizontal axis denotes the gene number, and the vertical axis denotes the KEGG pathway
Fig. 3Construction of the PPI network. A The PPI network, in which the dot denotes the node. More lines connected to the dot indicates higher degree of this node and significant importance of the corresponding gene in the network. B The top 50 high-degree genes screened from the PPI network by using MCC algorithm. The darker the color is, the higher the degree is
Fig. 4Establishment and evaluation of the logistic regression model. A The logistic regression model. The red dashed line indicates the COOK distance. In general, points with the COOK distance larger than 0.5 (influential points) may influence the model accuracy. B The ROC curve. The horizontal axis represents false positive rate (FPR), and the vertical axis represents true positive rate (TPR). AUC value could assess the performance of the model, and the high AUC value ranging from 0 to 1 indicates good performance of the model