| Literature DB >> 31829182 |
Mengfei Guo1, Yanan Yu1, Tiancai Wen2,3, Xiaoping Zhang4, Baoyan Liu5, Jin Zhang6, Runshun Zhang7, Yanning Zhang8, Xuezhong Zhou9.
Abstract
BACKGROUND: Disease comorbidity is popular and has significant indications for disease progress and management. We aim to detect the general disease comorbidity patterns in Chinese populations using a large-scale clinical data set.Entities:
Keywords: Complex network; Disease comorbidity; Network medicine
Mesh:
Year: 2019 PMID: 31829182 PMCID: PMC6907122 DOI: 10.1186/s12920-019-0629-x
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Fig. 1The framework to predict disease occurrence using the comorbid trajectories of patients
Fig. 2Basic properties of the network. a Distribution of degree. b Weight distribution of edges. c Distribution of CC1. d Distribution of BC. e Distribution of CC2. f The top 10 diseases with the highest degree, CC2 and BC, respectively
Fig. 3The relationship between topological properties and the network structure. a Degree and CC1; b CC2 and CC1; c Degree and CC2; d BC and CC2; e Degree and BC; f CC1 and BC; g Modules in the network; h One specific disease comorbidity module in the network
PCC between the disease comorbidity and shared molecular mechanisms
| Shared genes | Shared pathways | |
|---|---|---|
| RR | 0.05312( | 0.008511 ( |
| Φ-correlation | 0.23688( | 0.037891 ( |
Fig. 4The shared molecular mechanisms of disease comorbidity. a The relationship between shared genes and intensity of disease comorbidity b. The relationship between shared pathways and intensity of disease comorbidity c. Disease comorbidity of Alzheimer’s Disease and Arteriosclerotic Heart Disease
PCC between disease similarity and molecular mechanisms
| Jaccard | Cosine | |
|---|---|---|
| Shared genes | 0.1166 ( | 0.1312 ( |
| Shared pathways | 0.0705 ( | 0.0826 ( |
Positive and negative sample distribution in the data set
| Data set | Positive | Negative | Total |
|---|---|---|---|
| Hypertension | 10,000 | 10,000 | 20,000 |
| Psychiatric diseases | 3500 | 3500 | 7000 |
Settings and parameters for classification methods
| Methods | Setting |
|---|---|
| LR | using L2 regularization norm regularization intensity = 1 |
| SVM | using the linear kernel function penalty parameter of the error term = 10 |
| RF | Decision tree = 180 Bootstrap Sample oob_score = true Feature = Gini coefficient |
| NN | Using multilayer feedforward neural network learning rate = 0.001 maximum number of iterations = 200 two hidden layers randomly optimizing the size of mini batches |
The classification results of the four models on hypertension and psychiatric diseases
| Model | Hypertension | Psychiatric diseases | ||||
|---|---|---|---|---|---|---|
| Precision | Recall | F1-score | Precision | Recall | F1-score | |
| LR | 0.6837 | 0.6498 | 0.6900 | 0.6681 | ||
| SVM | 0.6038 | 0.7199 | 0.6567 | 0.6334 | 0.7041 | 0.6668 |
| RF | 0.6034 | 0.6386 | ||||
| NN | 0.5919 | 0.6166 | 0.6038 | 0.6306 | 0.6534 | 0.6415 |
The highest values of the related measures are showed in bold values
Important diseases for hypertension and psychiatric diseases in classification method0073
| LR | SVM | RF | |||||||
|---|---|---|---|---|---|---|---|---|---|
| ICD10 | Disease | Regression coefficient | ICD10 | Disease | Feature weights | ICD10 | Disease | Importance | |
| Hypertension | H35.0 | Background retinopathy and retinal vascular changes | 1.5174 | I35.2 | Aortic (valve) stenosis with insufficiency | 1.6055 | Z51.1 | Chemotherapy session for neoplasm | 0.0326 |
| A15.6 | Tuberculous pleurisy, confirmed bacteriologically and histologically | 1.4360 | A15.6 | Tuberculous pleurisy, confirmed bacteriologically and histologically | 1.5705 | I25.1 | Atherosclerotic heart disease | 0.0274 | |
| I11.0 | Hypertensive heart disease with (congestive) heart failure | 1.3145 | E53.9 | Vitamin B deficiency, unspecified | 1.4400 | B18.1 | Chronic viral hepatitis B without delta-agent | 0.0188 | |
| H52.3 | Anisometropia and aniseikonia | 1.2809 | E15.X | Nondiabetic hypoglycaemic coma | 1.4358 | I63.9 | Cerebral infarction, unspecified | 0.0184 | |
| R10.1 | Pain localized to upper abdomen | 1.2530 | M89.9 | Disorder of bone, unspecified | 1.3565 | I50.9 | Heart failure, unspecified | 0.0184 | |
| Psychiatric diseases | R00.2 | Palpitations | 1.6927 | I63.1 | Polydipsia | 1.5527 | Z51.1 | Chemotherapy session for neoplasm | 0.0323 |
| R62.8 | Other lack of expected normal physiological development | 1.4442 | I26.0 | Pulmonary embolism with mention of acute cor pulmonale | 1.5192 | C34.9 | Bronchus or lung, unspecified | 0.0250 | |
| R79.8 | Other specified abnormal findings of blood chemistry | 1.3983 | K75.8 | Other specified inflammatory liver diseases | 1.5137 | I63.9 | Cerebral infarction, unspecified | 0.0236 | |
| I63.1 | Cerebral infarction due to embolism of precerebral arteries | 1.3883 | K70.9 | Alcoholic liver disease, unspecified | 1.4400 | G30.9 | Alzheimer disease, unspecified | 0.0210 | |
| E11.0 | Type 2 diabetes mellitus | 1.3871 | R79.8 | Other specified abnormal findings of blood chemistry | 1.3510 | C78.7 | Secondary malignant neoplasm of liver and intrahepatic bile duct | 0.0189 | |