| Literature DB >> 34848731 |
Zixin Shu1, Jingjing Wang1, Hailong Sun1, Ning Xu2, Chenxia Lu3, Runshun Zhang4, Xiaodong Li3, Baoyan Liu5, Xuezhong Zhou6.
Abstract
Symptom phenotypes have continuously been an important clinical entity for clinical diagnosis and management. However, non-specificity of symptom phenotypes for clinical diagnosis is one of the major challenges that need be addressed to advance symptom science and precision health. Network medicine has delivered a successful approach for understanding the underlying mechanisms of complex disease phenotypes, which will also be a useful tool for symptom science. Here, we extracted symptom co-occurrences from clinical textbooks to construct phenotype network of symptoms with clinical co-occurrence and incorporated high-quality symptom-gene associations and protein-protein interactions to explore the molecular network patterns of symptom phenotypes. Furthermore, we adopted established network diversity measure in network medicine to quantify both the phenotypic diversity (i.e., non-specificity) and molecular diversity of symptom phenotypes. The results showed that the clinical diversity of symptom phenotypes could partially be explained by their underlying molecular network diversity (PCC = 0.49, P-value = 2.14E-08). For example, non-specific symptoms, such as chill, vomiting, and amnesia, have both high phenotypic and molecular network diversities. Moreover, we further validated and confirmed the approach of symptom clusters to reduce the non-specificity of symptom phenotypes. Network diversity proposes a useful approach to evaluate the non-specificity of symptom phenotypes and would help elucidate the underlying molecular network mechanisms of symptom phenotypes and thus promotes the advance of symptom science for precision health.Entities:
Mesh:
Year: 2021 PMID: 34848731 PMCID: PMC8632989 DOI: 10.1038/s41540-021-00206-5
Source DB: PubMed Journal: NPJ Syst Biol Appl ISSN: 2056-7189
Fig. 1Quantifying the phenotypic and molecular network diversity of symptom phenotypes.
a Curation of symptom-symptom relationships. The associations between symptoms are based on their co-occurrence in a symptom cluster of a textbook named differential diagnosis of traditional Chinese medicine symptom. b Constructing symptom clinical association network. The nodes represent symptoms and size reflects the phenotypic diversity in network. c Extracting high-quality symptom-gene associations. d Integrating both symptom-gene associations and protein–protein interaction (PPI) database to obtain molecular network diversity of symptom phenotypes. e The main steps of symptom network diversity analysis. We measured symptom diversity from both phenotypic and molecular network contexts.
Fig. 2The basic statistics of high-quality symptom-gene associations.
a The distribution of symptom-related genes. b The distribution of gene-related symptoms. c The distribution of related system categories of symptoms. We compared the class information of symptoms with gene information to the ontology. d Mapping distribution of symptoms with genetic information to SCN. We compared the different system categories of symptoms with genes information grouped by mapping to SCN. The full name of the system: NSS Nervous System Symptom, HNS Head and Neck Symptom, AS Abdominal Symptom, SITS Skin and Integumentary Tissue Symptom, NPS Neurological and Physiological Symptom, DSS Digestive System Symptom, RSCS Respiratory System and Chest Symptom, MSS Musculoskeletal System Symptom, HISS Hemic and Immune System Symptom, GS General Symptom, USS Urinary System Symptom, NMDS Nutrition, Metabolism, and Development Symptom, CSS Cardiovascular System Symptom, RSS Reproductive System Symptom.
Fig. 3Construction of symptom clinical association network(SCN).
The nodes indicate the symptoms and interconnecting edges in SCN represent the clinical co-occurrence. Node size and color reflected the diversity of symptom phenotypes in SCN (a high diversity is represented by large size node and deep orange color node). Here, filtering the node and related edges of symptom phenotypic diversity value <60 in the network and remaining 144 nodes and 6894 edges are visualized.
Quantifying the diversity of symptom phenotypes in SCN (including the top 50 symptoms sorted by the phenotypic diversity in SCN).
| Symptom | PDa | PEb | Symptom | PD | PE |
|---|---|---|---|---|---|
| Dysphoria | 100.33 | 623 | Cough | 85.76 | 321 |
| Emotional lability | 99.34 | 632 | Blurred vision | 85.73 | 344 |
| Yellowish complexion | 91.26 | 327 | Impaired vision | 85.65 | 345 |
| Rash | 91.15 | 367 | Coughing of phlegm | 85.37 | 264 |
| Bitter taste | 90.98 | 330 | Red eyes | 85.26 | 251 |
| Palpitation | 90.07 | 366 | Chill | 85.06 | 502 |
| Short urine | 89.81 | 319 | Hypochondriac pain | 84.71 | 225 |
| Dry throat | 89.75 | 388 | Constipation | 84.65 | 539 |
| Hypologia | 89.68 | 253 | Diarrhea | 84.24 | 230 |
| Night sweats | 89.44 | 282 | Consciousness disorder | 84.09 | 351 |
| Chest distress | 89.24 | 381 | Cold hands | 83.83 | 254 |
| Tachypnea | 89.22 | 381 | Cold feet | 83.63 | 250 |
| Whitish complexion | 89.17 | 412 | Cold limbs | 83.43 | 349 |
| Reddish complexion | 88.65 | 451 | Oliguria | 83.25 | 220 |
| Nausea | 87.75 | 301 | Chest pain | 83.12 | 197 |
| Vomiting | 87.54 | 311 | Convulsion | 83.01 | 284 |
| Do not like to drink | 87.38 | 243 | Abdomen distention | 82.97 | 457 |
| Emaciation | 87.36 | 363 | Coma | 82.85 | 261 |
| Cacochroea | 87.31 | 252 | Clear urine | 82.84 | 213 |
| Soreness of loins | 87.25 | 360 | Fullness in the stomach | 82.80 | 244 |
| Loose stools | 86.94 | 432 | Yellow urine | 82.53 | 507 |
| Spontaneous sweating | 86.69 | 229 | Insomnia | 82.37 | 514 |
| Headache | 86.29 | 394 | Lower extremity weakness | 82.33 | 230 |
| Skin patches | 86.27 | 249 | Dark complexion | 81.62 | 200 |
| Tinnitus | 85.89 | 343 | Cold body | 81.51 | 195 |
aPD means the symptom phenotypic diversity in SCN; bPE means the symptom phenotype degree in SCN.
Fig. 4Symptom network diversity analysis.
a The MGD and MGE distribution of symptoms in SCN. b Correlations of the symptom diversity between phenotypic and molecular networks. c Compared the MGD and MGE distribution of symptoms and diseases. On each box, the central mark indicates the median, the bottom and top edges of the box indicate the 25th and 75th percentiles, respectively. The whiskers extend to the most extreme data represent the minimum and maximum value. d Compared the MGD and MGE distribution of symptoms and symptom pairs.
Quantifying the molecular network diversity of symptom phenotype in SCN (including the top 50 symptoms sorted by the molecular network diversity in SCN).
| Symptom | MGDa | MGEb | Symptom | MGD | MGE |
|---|---|---|---|---|---|
| Convulsion | 491.39 | 1186 | Brash | 249.61 | 410 |
| Vomiting | 491.39 | 777 | Thin hair | 249.61 | 410 |
| Nausea | 491.39 | 777 | Voice hoarseness | 249.61 | 410 |
| Chest pain | 491.39 | 777 | Nail thinness | 249.61 | 410 |
| Headache | 491.39 | 777 | Rigidity | 244.27 | 515 |
| Chill | 422.73 | 1400 | Fatigue | 244.27 | 515 |
| Dyscalculia | 380.77 | 1186 | Fever | 241.09 | 502 |
| Obesity | 367.89 | 639 | Loose stools | 241.09 | 502 |
| Deafness | 367.89 | 558 | Cough | 241.09 | 527 |
| Body weakness | 367.89 | 527 | Skin pruritus | 233.94 | 362 |
| Decreased hearing | 367.89 | 558 | Difficulty in movement | 229.93 | 437 |
| Tremor | 367.89 | 527 | Consciousness disorder | 229.93 | 639 |
| Emaciation | 323.43 | 515 | Failure to thrive | 229.93 | 437 |
| Edema | 323.43 | 527 | Disorder joint | 229.93 | 437 |
| Speech scanning | 299.95 | 639 | Jaundice | 229.93 | 437 |
| Amnesia | 299.95 | 639 | Limb pain | 229.93 | 437 |
| Depression | 299.95 | 522 | Joint pain | 229.93 | 437 |
| Rash | 286.07 | 558 | Insomnia | 229.93 | 437 |
| Body pain | 285.63 | 444 | Low back pain | 229.93 | 437 |
| Abdominal pain | 285.63 | 515 | Joint swollen | 229.93 | 437 |
| Constipation | 282.94 | 558 | Aphtha | 229.93 | 437 |
| Dyspnea | 282.94 | 558 | Poor appetite | 213.66 | 350 |
| Skin patches | 282.94 | 558 | Anorexia | 213.66 | 350 |
| Delay language | 282.94 | 558 | Emotional lability | 213.66 | 382 |
| Tachypnea | 282.94 | 558 | Blindness | 208.68 | 350 |
aMGD means the maximum node diversity of the symptom-related genes in PPI network; bMGE means the maximum node degree of the symptom-related genes in PPI network.
Fig. 5Correlations of the symptom network diversity and related drug-targets diversity.
a Correlations between the symptom diversity (phenotypic and molecular networks) and the number of related drugs. b Correlations between the symptom diversity (phenotypic and molecular networks) and the number of related drug-targets.
The basic molecular features of insomnia symptom cluster (sorted by the co-occurrences).
| Symptoms | Co-occurrences | Overlap pathways | Overlap genes |
|---|---|---|---|
| Emotional lability | 160 (32.72) | 26 (25.74) | NDST1, SLC18A2, TSHR, PRNP, DCTN1 |
| Dysphoria | 156 (32.64) | 2 (16.67) | LEP, PRNP |
| Dizzy | 135 (33.25) | 19 (29.23) | TNXB |
| Fever | 121 (16.78) | 41 (29.08) | IL6, HLA-DRB1, PRNP, PRL, CRP |
| Thirst | 99 (18.20) | 1 (25.00) | LEP |
| Fatigue | 79 (14.42) | 49 (26.78) | HESX1, LHX3, TNXB, SLC18A2, TSHR, DNMT1, HLA-DRB1, PRNP, DCTN1, PROP1, TSHB, POU1F1 |
| Blurred vision | 69 (38.76) | 19 (23.75) | HESX1, CLIP2, LIMK1, BAZ1B, PRNP, GTF2IRD1, GTF2I, RFC2, TBL2, ELN |
| Night sweats | 53 (42.06) | 38 (31.40) | SLC18A2, HLA-DRB1, DDC, PRNP, HMBS |
| Poor appetite | 51 (11.26) | 31 (36.90) | HMBS |
| Constipation | 47 (13.35) | 34 (24.46) | DDC, PRNP, RAI1, NR4A2, THRA, TSHB, POU1F1, FLII, HESX1, TSHR, THRB, HMBS, CLIP2, SNCAIP, LHX3, TRHR, TNXB, LIMK1, BAZ1B, GTF2IRD1, PROP1, RFC2, GTF2I, TBL2, ELN, CPOX |
| Amnesia | 44 (66.67) | 44 (32.35) | NPS, HCRT, IL6, HLA-DRB1, DNMT1, PRNP, HLA-DQB1, MOG, ZNF365 |
| Tachypnea | 33 (15.57) | 20 (20.83) | HLA-DRB1, DCTN1 |
| Emaciation | 33 (23.91) | 44 (29.73) | TSHR, HLA-DRB1, SLC9A6, PRNP, HLA-DQB1, SNCA, DCTN1, LEP |
| Loose stools | 23 (8.68) | 33 (31.43) | TSHR, NAGLU, DDC, HMBS, SGSH, CPOX, GNS |
| Headache | 23 (9.54) | 21 (32.31) | IL6 |
| Rash | 21 (10.66) | 46 (26.29) | TNXB, IL6, HLA-DRB1, CRP, SIN3A |
| Body pain | 19 (6.57) | 36 (25.35) | CLIP2, TNXB, IL6, LIMK1, HLA-DRB1, BAZ1B, ELN, GTF2IRD1, GTF2I, HMBS, RFC2, TBL2, CPOX |
| Cough | 16 (7.80) | 28 (49.12) | HLA-DRB1 |
| Consciousness disorder | 16 (11.27) | 28 (24.59) | IL6, TSHB |
aThe co-occurrences are presented as n/N (%), where n is the co-occurrence frequency of the symptom and insomnia in a textbook named differential diagnosis of traditional Chinese medicine symptom; N is the total occurrence frequency of symptom in this book. bThe overlap pathways are presented as n/N (%), where n is the number of overlapped enriched KEGG pathways between the symptom and insomnia; N is the total enriched KEGG pathways of the symptom.
Fig. 6The overlapped pathways of insomnia symptom clusters.
The enriched KEGG pathways is evaluated by P-value with <0.05.
Fig. 7Construction the PPI network of insomnia-fever-rash cluster.
We extracted a PPI subnetwork of insomnia-fever-rash symptom clusters which consisted of 363 nodes and 1860 edges. The nodes indicate the related genes of these symptoms in PPI network and edges represent the interactions of these genes in PPI network. Node size reflected the degree of symptom in the network (a high degree is represented by large node). Node colors represent genes associated with different symptoms.
The GO BP of overlapping genes enriched of insomnia-fever-rash cluster.
| ID | GO_BP | |
|---|---|---|
| 1 | Immune response | 2.70E-12 |
| 2 | Adaptive immune response | 7.90E-07 |
| 3 | B-cell receptor signaling pathway | 5.80E-06 |
| 4 | B-cell differentiation | 1.30E-05 |
| 5 | T-cell receptor signaling pathway | 1.70E-05 |
| 6 | Interferon-gamma-mediated signaling pathway | 1.70E-05 |
| 7 | Positive regulation of gene expression | 2.10E-05 |
| 8 | Positive regulation of nitric oxide biosynthetic process | 1.10E-04 |
| 9 | Platelet activation | 1.20E-04 |
| 10 | Growth hormone receptor signaling pathway | 1.30E-04 |
| 11 | Negative regulation of lipid storage | 1.30E-04 |
| 12 | Intrinsic apoptotic signaling pathway in response to DNA damage | 1.50E-04 |
| 13 | Cytokine-mediated signaling pathway | 1.90E-04 |
| 14 | Positive regulation of transcription from RNA polymerase II promoter | 2.20E-04 |
| 15 | Antigen processing and presentation | 2.40E-04 |
| 16 | Humoral immune response | 2.70E-04 |
| 17 | Negative regulation of apoptotic process | 4.40E-04 |
| 18 | JAK-STAT cascade involved in growth hormone signaling pathway | 4.90E-04 |
| 19 | T-cell costimulation | 6.70E-04 |
| 20 | Blood coagulation | 6.90E-04 |
| 21 | Defense response to protozoan | 7.90E-04 |
| 22 | Inflammatory response | 1.40E-03 |
| 23 | Positive regulation of sequence-specific DNA binding transcription factor activity | 1.60E-03 |
| 24 | Cellular response to lipopolysaccharide | 2.00E-03 |
| 25 | Tumor necrosis factor-mediated signaling pathway | 2.20E-03 |
| 26 | Positive regulation of NF-kappaB transcription factor activity | 3.10E-03 |
| 27 | Positive regulation of tyrosine phosphorylation of Stat3 protein | 3.20E-03 |
| 28 | Extrinsic apoptotic signaling pathway via death domain receptors | 3.20E-03 |
| 29 | Acute-phase response | 3.30E-03 |
| 30 | Positive regulation of B-cell proliferation | 3.30E-03 |
| 31 | Negative regulation of gene expression | 3.40E-03 |
| 32 | Extrinsic apoptotic signaling pathway | 3.80E-03 |
| 33 | Defense response to bacterium | 4.00E-03 |
| 34 | Viral process | 4.10E-03 |
| 35 | Positive regulation of vitamin D biosynthetic process | 4.40E-03 |
| 36 | Positive regulation of growth factor dependent skeletal muscle satellite cell proliferation | 4.40E-03 |
| 37 | Positive regulation of interferon-gamma production | 4.60E-03 |
| 38 | Positive regulation of tumor necrosis factor production | 4.80E-03 |
| 39 | Positive regulation of transcription, DNA-templated | 5.20E-03 |
| 40 | Neutrophil apoptotic process | 6.60E-03 |
| 41 | Positive regulation of calcidiol 1-monooxygenase activity | 6.60E-03 |
| 42 | Positive regulation of ERK1 and ERK2 cascade | 6.70E-03 |
| 43 | Positive regulation of T cell proliferation | 7.70E-03 |
| 44 | I-kappaB kinase/NF-kappaB signaling | 7.70E-03 |
| 45 | Regulation of cell proliferation | 7.80E-03 |
To measure the function of overlapping genes in PPI network of insomnia-fever-rash cluster, we obtained the specific gene ontology function categories terms in biological process (GO_BP) of 38 overlapping genes (including the overlapping genes for two symptoms) for the cluster (P-value < 0.01).