| Literature DB >> 32657473 |
Xiaoyang Ji1,2,3, Wenting Tan4, Chunming Zhang2,3,5,6, Yubo Zhai5,7, Yiching Hsueh2,3, Zhonghai Zhang5, Chunli Zhang3, Yanqiu Lu8, Bo Duan5,6, Guangming Tan5,6, Renhua Na1, Guohong Deng4, Gang Niu2,3,6.
Abstract
Faced with the current large-scale public health emergency, collecting, sorting, and analyzing biomedical information related to the "SARS-CoV-2" should be done as quickly as possible to gain a global perspective, which is a basic requirement for strengthening epidemic control capacity. However, for human researchers studying viruses and hosts, the vast amount of information available cannot be processed effectively and in a timely manner, particularly if our scientific understanding is also limited, which further lowers the information processing efficiency. We present TWIRLS (Topic-wise inference engine of massive biomedical literatures), a method that can deal with various scientific problems, such as liver cancer, acute myeloid leukemia, and so forth, which can automatically acquire, organize, and classify information. Additionally, this information can be combined with independent functional data sources to build an inference system via a machine-based approach, which can provide relevant knowledge to help human researchers quickly establish subject cognition and to make more effective decisions. Using TWIRLS, we automatically analyzed more than three million words in more than 14,000 literature articles in only 4 hr. We found that an important regulatory factor angiotensin-converting enzyme 2 (ACE2) may be involved in host pathological changes on binding to the coronavirus after infection. On triggering functional changes in ACE2/AT2R, the cytokine homeostasis regulation axis becomes imbalanced via the Renin-Angiotensin System and IP-10, leading to a cytokine storm. Through a preliminary analysis of blood indices of COVID-19 patients with a history of hypertension, we found that non-ARB (Angiotensin II receptor blockers) users had more symptoms of severe illness than ARB users. This suggests ARBs could potentially be used to treat acute lung injury caused by coronavirus infection.Entities:
Keywords: coronavirus; cytokine storm; literature mining; renin-angiotensin system; topic inference
Year: 2020 PMID: 32657473 PMCID: PMC7404951 DOI: 10.1002/ddr.21717
Source DB: PubMed Journal: Drug Dev Res ISSN: 0272-4391 Impact factor: 5.004
FIGURE 1Flow chart of the knowledge‐driven literature mining method, the basic steps of the literature mining includes: (a) identify genes with accurate relevance to the subject, (b) identify entities with accurate relevance to the subject, (c) entities were classified by calculating the association strength between genes and entities, (d) alignment with KEGG database to establish an association matrix between pathways and entity‐categories
FIGURE 2(a, b) The entity cloud (CSSE cloud) associated with ACE2 and DPP4 in the coronavirus knowledge graph. (c–e) The entity clouds of the three IFITMs family proteins (IFITM1‐3) in the coronavirus knowledge graph. (f) The gene cloud associated with coronavirus‐C3 entity category
Coronavirus‐entity category labels and genes associated with each category. MISC indicates the label cannot be summarized
| Category | HR label |
|---|---|
| C0 | MISC |
| C1 | Canine coronavirus |
| C2 | Porcine epidemic diarrhea (PED) |
| C3 | Neurotropic coronavirus correlated with immune‐mediated demyelination |
| C4 | Coronavirus that infects humans |
| C5 | Coronavirus spike protein |
| C6 | Protease enhances SARS‐CoV infection |
| C7 | Monoclonal antibody to the coronavirus nucleocapsid protein |
| C8 | SARS‐CoV genome |
| C9 | Avian infectious bronchitis coronavirus |
| C10 | Coronavirus and interferon |
| C11 | Feline infectious peritonitis (FIP) |
| C12 | Vectors of novel coronaviruses |
| C13 | Mouse hepatitis virus |
| C14 | Interaction between coronaviruses and receptors |
| C15 | Coronavirus‐related vaccines |
| C16 | Identification of MHC class I restricted T‐cell epitopes |
| C17 | Transmissible gastroenteritis coronavirus |
| C18 | SARS coronavirus inhibitors and diagnostic methods |
| C19 | Coronavirus fusion with host cells and virus replication |
| C20 | Gene therapy‐inhibition of coronavirus by antisense RNA, sense RNA and protein |
| C21 | Imaging |
| C22 | Cytotoxic T‐lymphocyte escape |
| C23 | SARS coronavirus compound inhibitors |
| C24 | Coronavirus studies using biophysical methods |
| C25 | Detection of viral pathogenicity and distribution (RT‐PCR, immunohistochemistry and in situ hybridization) |
| C26 | Coronavirus immunization |
| C27 | Gastroenteritis virus and coronavirus |
| C28 | Effects of coronavirus infection on the body |
| C29 | Coronavirus detection evidence and methods |
| C30 | Human respiratory coronavirus NL63 |
| C31 | The antibodies against SARS‐CoV |
FIGURE 3Cluster analysis of time information. Entity categories were grouped based on the distribution of date information corresponding to the topic entities contained in each category. In the heatmap, the rows represent the time information on a yearly basis and the columns represent the topic‐entity categories. To generate the heatmap/cluster dendrogram, Euclidean measure for distance matrix and complete agglomeration method for clustering was applied
The most relevant and least relevant signaling pathways of each coronavirus‐entity category
| Class | Likely pathway | Z score | Unlikely pathway | Z score |
|---|---|---|---|---|
| C0 | PKCθ signaling in T lymphocytes | 1.5782 | Toll‐like receptor signaling | −1.7195 |
| C1 | AMPK signaling | 5.1816 | TGF‐β signaling | −4.0841 |
| C2 | Extrinsic prothrombin activation pathway | 4.3314 | Melanocyte development and pigmentation signaling | −3.8316 |
| C3 | Renin‐angiotensin signaling | 5.6100 | AMPK signaling | −4.4763 |
| C4 | April mediated signaling | 3.2382 | Neuregulin signaling | −2.4121 |
| C5 | Melanocyte development and pigmentation signaling | 3.7887 | Extrinsic prothrombin activation pathway | −3.9823 |
| C6 | Leukocyte extravasation signaling | 2.3615 | Pancreatic adenocarcinoma Siganling | −2.3694 |
| C7 | Role of BRCA1 in DNA damage response | 4.6871 | NF‐κB signaling | −4.8545 |
| C8 | PKCθ signaling in T lymphocytes | 1.9902 | VDR_RXR activation | −2.6143 |
| C9 | Toll‐like receptor signaling | 4.2000 | Role of BRCA1 in DNA damage response | −2.5017 |
| C10 | April mediated signaling | 4.3113 | TGF‐β signaling | −3.5853 |
| C11 | Acute phase response signaling | 2.5066 | Renin‐angiotensin signaling | −3.2492 |
| C12 | ATM signaling | 2.7020 | PKCθ signaling in T lymphocytes | −1.9846 |
| C13 | Retinoic acid mediated apoptosis signaling | 2.6069 | Interferon signaling | −1.8953 |
| C14 | ATM signaling | 2.9244 | PKCθ signaling in T lymphocytes | −1.9667 |
| C15 | AMPK signaling | 1.9142 | Leukocyte extravasation signaling | −2.4233 |
| C16 | Role of BRCA1 in DNA damage response | 3.1033 | Colorectal cancer metastasis signaling | −3.1033 |
| C17 | Production of nitric oxide and reactive oxygen species in macrophages | 3.1398 | mTOR signaling | −2.8893 |
| C18 | Extrinsic prothrombin activation pathway | 1.8215 | Intrinsic prothrombin activation pathway | −1.5680 |
| C19 | Role of NFAT in regulation of the immune response | 2.0089 | CD40 signaling | −2.2696 |
| C20 | April mediated signaling | 2.3687 | Aryl hydrocarbon receptor signaling | −1.5471 |
| C21 | NRF2‐mediated oxidative stress response | 1.4679 | CD40 signaling | −2.4042 |
| C22 | Toll‐like receptor signaling | 2.2825 | Melanocyte development and pigmentation signaling | −1.6007 |
| C23 | ATM signaling | 2.5518 | mTOR signaling | −1.8228 |
| C24 | Melanocyte development and pigmentation signaling | 2.1700 | Aryl hydrocarbon receptor signaling | −2.1700 |
| C25 | Extrinsic prothrombin activation pathway | 1.9825 | IL‐22 signaling | −1.7638 |
| C26 | mTOR signaling | 2.7449 | Aryl hydrocarbon receptor signaling | −1.7790 |
| C27 | AMPK signaling | 2.4033 | Intrinsic prothrombin activation pathway | −1.7579 |
| C28 | NRF2‐mediated oxidative stress response | 2.0846 | PKCθ signaling in T lymphocytes | −1.3415 |
| C29 | TGF‐β signaling | 2.4379 | Ephrin A signaling | −2.0030 |
| C30 | Role of NFAT in regulation of the immune response | 2.1383 | Leukocyte extravasation signaling | −1.8524 |
| C31 | Role of NFAT in regulation of the immune response | 1.7979 | Neuregulin signaling | −1.4467 |
Recommended signaling pathway most relevant to entity category C3
| Pathway | Likely class | Z score | Unlikely class | Z score |
|---|---|---|---|---|
| Renin‐angiotensin signaling | C3 | 5.6100 | C5 | −3.5918 |
| VDR_RXR activation | C3 | 4.7060 | C5 | −3.2514 |
| Aryl hydrocarbon receptor signaling | C3 | 4.3746 | C5 | −3.7887 |
| Chemokine signaling | C3 | 3.9999 | C7 | −2.4881 |
| IL‐8 signaling | C3 | 3.5211 | C2 | −2.6692 |
| Neuregulin signaling | C3 | 3.2914 | C1 | −3.4134 |
FIGURE 4Gene interaction network centered on 119 CSHGs. The yellow nodes represent 119 CSHGs, the blue nodes represent genes that interact with CSHG in the string database (combination score > 800), and the red squares mark the most relevant entity category of CSHG
The signaling pathways enriched by 119 CSHGs
| KEGG_PATHWAY | Count | % |
| Bonferroni | Benjamini | FDR |
|---|---|---|---|---|---|---|
|
| ||||||
| ptr05162:Measles | 18 | 11% | 1.05E‐13 | 1.38E‐11 | 1.38E‐11 | 1.22E‐10 |
| ptr05164:Influenza A | 19 | 11% | 7.05E‐13 | 9.30E‐11 | 4.65E‐11 | 8.23E‐10 |
| ptr05160:Hepatitis C | 15 | 9% | 2.30E‐10 | 3.04E‐08 | 7.60E‐09 | 2.69E‐07 |
| ptr05161:Hepatitis B | 11 | 7% | 7.37E‐06 | 9.72E‐04 | 1.22E‐04 | 8.61E‐03 |
| ptr05169:Epstein–Barr virus infection | 6 | 4% | 9.87E‐03 | 7.30E‐01 | 9.58E‐02 | 1.09E+01 |
| ptr05168:Herpes simplex infection | 14 | 8% | 2.73E‐07 | 3.60E‐05 | 6.00E‐06 | 3.18E‐04 |
|
| ||||||
| ptr04620:Toll‐like receptor signaling pathway | 15 | 9% | 6.78E‐12 | 8.95E‐10 | 2.98E‐10 | 7.92E‐09 |
| ptr04062:Chemokine signaling pathway | 16 | 10% | 9.91E‐10 | 1.31E‐07 | 2.62E‐08 | 1.16E‐06 |
| ptr04060:Cytokine‐cytokine receptor interaction | 13 | 8% | 2.80E‐06 | 3.69E‐04 | 5.28E‐05 | 3.27E‐03 |
| ptr04622:RIG‐I‐like receptor signaling pathway | 7 | 4% | 1.41E‐04 | 1.84E‐02 | 2.06E‐03 | 1.64E‐01 |
| ptr04623:Cytosolic DNA‐sensing pathway | 6 | 4% | 7.45E‐04 | 9.37E‐02 | 9.79E‐03 | 8.66E‐01 |
| ptr04630:Jak–STAT signaling pathway | 7 | 4% | 6.11E‐03 | 5.54E‐01 | 7.09E‐02 | 6.90E+00 |
| ptr04668:TNF signaling pathway | 6 | 4% | 7.61E‐03 | 6.35E‐01 | 8.06E‐02 | 8.53E+00 |
FIGURE 5The gene interaction networks centered around DPP4, ACE2, and IFITM1, respectively. The yellow nodes represent the ACE2, DPP4 and IFITM1 genes, purple nodes represent genes that have 1° of interaction with the core genes, green circled purple nodes represent the genes connecting CSHG and C3 category‐related genes, and pink nodes represent genes with 2° of interaction with the core gene. The red diamonds show the most relevant entity category symbol for CSHG
FIGURE 6The specific distribution of COVID‐19 patient clinical data. (a) Age and gender distribution. (b) Distribution of mild, severe and critical illness in patients with or without basic medical history. (c) The distribution of illness and anti‐hypertension drug in patients with hypertension history. (d) Cluster graph of clinical characteristics of mild illness, severe illness, critical illness, ARB or non‐ ARB users with hypertension, and patients with other medical history
The z score values of clinical characteristics
| Clinical characteristics | ARB | Non‐ARB | With other medical history | Severe illness | Critical illness | Mild illness | |
|---|---|---|---|---|---|---|---|
|
Routine blood tests | White blood cells(WBC)count | 1.35 | 1.06 | 0.90 | 1.93 | −0.58 | −0.32 |
| Neutrophil count(Neu) | 1.42 | 1.47 | 1.23 | 2.77 | 0.03 | −0.58 | |
| Lymphocyte count(lymph) | −0.26 | −1.37 | −0.93 | −2.78 | −1.75 | 0.82 | |
| Monocyte count(mono) | 1.20 | −0.21 | −0.03 | 1.02 | −1.63 | 0.02 | |
| Neu% | 1.18 | 2.51 | 1.43 | 3.49 | 1.52 | −0.94 | |
| Lymph% | −1.25 | −2.58 | −1.44 | −3.66 | −1.25 | 0.94 | |
| Mono% | −0.27 | −1.57 | −0.76 | −0.61 | −0.85 | 0.24 | |
| Red blood cell (RBC) count | 0.76 | −2.24 | −1.05 | −1.77 | −0.48 | 0.43 | |
| Hemoglobin(Hb) | 0.55 | −0.09 | 0.99 | 1.38 | −0.04 | −0.26 | |
| Platelet(PLT) | 0.41 | 0.97 | 0.72 | 0.97 | −1.91 | 0.07 | |
|
Liver function | Alanine aminotransferase (ALT) | −0.41 | −0.73 | −0.07 | 1.61 | 0.52 | −0.40 |
| Aspartate aminotransferase (AST) | −0.68 | 0.39 | 0.58 | 0.22 | 2.99 | −0.47 | |
| Total bilirubin (TBIL) | 1.01 | 0.92 | 0.11 | 0.34 | 0.20 | −0.11 | |
| Direct bilirubin (DBIL) | −0.04 | −0.51 | −0.13 | −0.32 | 2.96 | −0.37 | |
| Indirect bilirubin (IBIL) | 1.26 | 0.67 | 0.03 | 0.49 | 0.00 | −0.11 | |
| Gamma‐glutamyl transferase (GGT) | 0.18 | 0.34 | 0.64 | 3.83 | 1.39 | −0.97 | |
| Alkaline phosphatase (ALP) | 0.11 | −0.01 | −0.41 | −0.30 | −0.50 | 0.13 | |
| Lactic dehydrogenase (LDH) | −0.54 | 0.32 | 0.74 | 2.70 | 3.55 | −1.07 | |
| Total protein (TP) | 3.10 | −0.73 | −1.24 | −1.11 | −1.35 | 0.42 | |
| Albumin (Alb) | 1.97 | −1.72 | −2.22 | −3.65 | −1.52 | 0.99 | |
| Globulin (Glb) | 2.77 | 0.54 | 0.28 | 2.10 | −0.47 | −0.38 | |
| ALB/GLB (A/G) | −1.20 | −1.64 | −1.54 | −3.71 | −0.49 | 0.86 | |
|
Renal function | Na+ | 0.31 | 0.81 | 0.44 | 0.63 | −3.49 | 0.37 |
| K+ | 1.31 | −1.97 | −0.47 | −3.13 | −1.36 | 0.83 | |
| Fasting blood glucose (FBG) | 0.03 | 1.94 | 0.05 | 0.41 | −0.34 | −0.05 | |
| Uric acid(UA) | 3.07 | −1.99 | −0.74 | −0.84 | −1.14 | 0.34 | |
| Urea nitrogen(UN) | 0.40 | −0.55 | −0.28 | −0.21 | −0.33 | 0.10 | |
| Creatinine(Cr) | 2.33 | −1.57 | −0.76 | −0.51 | −0.47 | 0.17 | |
| Glomerular filtration rate(CDK‐EPI) | −2.83 | −1.89 | −0.43 | −0.48 | −1.34 | 0.29 | |
| Cardiac enzymes | Creatine kinase(CK) | −0.44 | 1.14 | 0.56 | −0.26 | 2.79 | −0.32 |
| Creatine kinase‐MB (CK‐MB) | −0.67 | 0.74 | 1.76 | 0.37 | 0.71 | −0.18 | |
FIGURE 7Disequilibrium of RAS‐cytokine signaling homeostasis causing cytokine storms triggered by ACE2‐ mediated coronaviral infection