| Literature DB >> 31754224 |
Kang-Wen Xiao1, Jia-Li Li2, Zi-Hang Zeng2, Zhi-Bo Liu1, Zhi-Qiang Hou1, Xin Yan1, Lin Cai3.
Abstract
Osteoporosis is one of the most common metabolic bone disease among pre- and postmenopausal women. As the precursors of osteoclast cells, circulating monocytes play important role in bone destruction and remodeling. The aim of study is to identify potential key genes and pathways correlated with the pathogenesis of osteoporosis. Then we construct novel estimation model closely linked to the bone mineral density (BMD) with key genes. Weighted gene co-expression network analysis (WGCNA) were conducted by collecting gene data set with 80 samples from gene expression omnibus (GEO) database. Besides, hub genes were identified by series of bioinformatics and machine learning algorithms containing protein-protein interaction (PPI) network, receiver operating characteristic curve and Pearson correlation. The direction of correlation coefficient were performed to screen for gene signatures with high BMD and low BMD. A novel BMD score system was put forward based on gene set variation analysis and logistic regression, which was validated by independent data sets. We identified six modules correlated with BMD. Finally 100 genes were identified as the high bone mineral density signatures while 130 genes were identified as low BMD signatures. Besides, we identified the significant pathway in monocytes: ribonucleoprotein complex biogenesis. What's more, our score validated it successfully.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31754224 PMCID: PMC6872746 DOI: 10.1038/s41598-019-53843-6
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
The basic information of patients with osteoporosis.
| BMD | GSE56815 | GSE2208 | ||
|---|---|---|---|---|
| high BMD | low BMD | high BMD | low BMD | |
| postmenopausal | 20 | 20 | 5 | 5 |
| premenopausal | 20 | 20 | 5 | 4 |
Figure 1(A) Flowchart of this study; (B) Differential gene expression analysis for identifying BMD related gene signatures in training data set.
Figure 2Process of WGCNA. (A) dynamic tree cut based on 1- TOM; (B) heat map of the correlation between the module eigenvalue and the BMD phenotype and menopausal phenotype; (C) clustering heat map of module eigenvalue; (D) Analysis of GS across modules.
Figure 3PPI network of modules. (A) sub network of midnight blue; (B) sub network of salmon; (C–E): sub network of brown.
Figure 4ROC analysis of genes and novel BMD score.
Figure 5GSEA analysis and box plot. (A) box plot of training data; (B) box plot of validation data; (C) GSEA analysis of 230 BMD related genes; (D) box plot of ribonucleoprotein complex biogenesis activity and BMD in training data; (E) box plot of ribonucleoprotein complex biogenesis activity and BMD in validation data.
Figure 6Gene clustering heat map analysis, ROC analysis of genes and elastic net regression network and box plot. (A) Gene clustering heat map analysis; (B,C) ROC curve of training data set (GSE56815) and validation data set (GSE2208) with elastic net regression model; (D,E) Box plot for training data set (GSE56815) and validation data set (GSE2208).