| Literature DB >> 34004362 |
Jayanta Kumar Das1, Swarup Roy2, Pietro Hiram Guzzi3.
Abstract
The development of therapeutic targets for COVID-19 relies on understanding the molecular mechanism of pathogenesis. Identifying genes or proteins involved in the infection mechanism is the key to shedding light on the complex molecular mechanisms. The combined effort of many laboratories distributed throughout the world has produced protein and genetic interactions. We integrated available results and obtained a host protein-protein interaction network composed of 1432 human proteins. Next, we performed network centrality analysis to identify critical proteins in the derived network. Finally, we performed a functional enrichment analysis of central proteins. We observed that the identified proteins are primarily associated with several crucial pathways, including cellular process, signaling transduction, neurodegenerative diseases. We focused on the proteins that are involved in human respiratory tract diseases. We highlighted many potential therapeutic targets, including RBX1, HSPA5, ITCH, RAB7A, RAB5A, RAB8A, PSMC5, CAPZB, CANX, IGF2R, and HSPA1A, which are central and also associated with multiple diseases.Entities:
Keywords: COVID-19; Centrality; Disease; Pathways; Protein-protein interaction; SARS-CoV-2
Mesh:
Substances:
Year: 2021 PMID: 34004362 PMCID: PMC8123524 DOI: 10.1016/j.meegid.2021.104921
Source DB: PubMed Journal: Infect Genet Evol ISSN: 1567-1348 Impact factor: 3.342
Fig. 1The complete work-flow design of the current study.
Fig. 2The quantitative information of host-viral interactions. (a) The abundance (percentage) of collected interacting human host protein for different SARS-CoV-2 viral proteins; (b) A host-viral interaction network pattern, where few host proteins interactions can be seen associated with multiple viral proteins although most of interactions are virus protein specific unique (more details can be seen from Supplementary-A).
Fig. 3The gain (connected) component of PPI network obtained from whole PPI network. The network is consisting of 1111 unique nodes (proteins) and 7043 interactions edges (more details can be seen from Supplementary-A).
Fig. 4The degree distribution of all 1111 nodes (proteins) in the gain component of PPI network. The X-axis indicates degree distribution, whereas Y-axis shows relative frequency distributions.
Correlation analysis among all centrality parameters computed for 1111 proteins. The correlation score (>0.5) among the three centrality measures (Dc,Bc,Cc) are similar (highlighted in bold).
| 1 | |||||
| 0.209 | −0.168 | 1 | |||
| −0.32 | −0.283 | 0.451 | 1 | ||
| 0.557 | 0.1101 | 0.346 | −0.139 | 1 | |
| 0.213 | −0.329 | 0.684 |
Fig. 5The top 7 enriched pathways from each category of KEGG pathways. In each category, pathways are shown ordered by − log 10(p) value.
Fig. 6The top 10 enriched terms in each category of gene ontology (BP-Biological process, MF-Molecular function, CC-Cellular component). In each category, terms are shown ordered by log10(combined score) value.
Fig. 7Comparison of three groups of disease categories (Cardiovascular, Respiratory, Immune system) using venn-diagram. (a) based on number of proteins count in each category; (b) based on number of disease associated proteins (curated from database) among the observed proteins in each category.
The shortlisted 64 host proteins with their degree in host-host PPI , disease count (out of 204), disease type count (out of 3), and pathway count (out of 31). Some of the proteins that are known to be targeted by other viruses were also highlighted.
| Gene/Protein | Degree in PPI network | Pathway count | Disease category | Disease count | Known target virus |
|---|---|---|---|---|---|
| 10 | 2 | Immune | 1 | ||
| 26 | 1 | Cardiovascular | 1 | ||
| 17 | 1 | Respiratory | 2 | ||
| 20 | 2 | Respiratory, Cardiovascular, Immune | 10 | ||
| 29 | 6 | Cardiovascular, Immune | 8 | ||
| 10 | 1 | Cardiovascular | 11 | ||
| 19 | 5 | Cardiovascular | 1 | Human SARS coronavirus, Bovine papillomavirus type 1, Human papillomavirus type 16 | |
| 17 | 5 | Respiratory | 2 | Human adenovirus 5 | |
| 25 | 4 | Cardiovascular, Immune | 4 | Hepatitis C virus genotype 1b (isolate Con1) | |
| 31 | 2 | Cardiovascular | 1 | ||
| 34 | 1 | Cardiovascular | 1 | ||
| 22 | 2 | Respiratory, Cardiovascular, Immune | 17 | Poliovirus type 1 (strain Sabin) | |
| 19 | 1 | Immune | 1 | ||
| 9 | 3 | Cardiovascular | 1 | ||
| 11 | 5 | Cardiovascular | 4 | ||
| 10 | 6 | Respiratory, Cardiovascular | 2 | ||
| 13 | 1 | Cardiovascular, Immune | 2 | ||
| 9 | 2 | Cardiovascular | 5 | ||
| 9 | 1 | Cardiovascular | 10 | ||
| 23 | 2 | Cardiovascular | 1 | Hepatitis C virus genotype 1b (isolate Con1), Epstein-Barr virus (strain GD1) | |
| 16 | 1 | Cardiovascular | 5 | ||
| 14 | 5 | Cardiovascular | 1 | ||
| 16 | 1 | Immune | 1 | ||
| 12 | 5 | Respiratory | 2 | Human herpesvirus 1 (strain 17), Human papillomavirus type 16, Human papillomavirus type 31 | |
| 15 | 8 | Immune | 5 | Epstein-Barr virus (strain GD1), Human papillomavirus type 16 | |
| 14 | 8 | Immune | 7 | ||
| 11 | 1 | Immune | 4 | ||
| 30 | 5 | Cardiovascular | 1 | Epstein-Barr virus (strain GD1) | |
| 46 | 1 | Respiratory, Cardiovascular | 3 | Epstein-Barr virus (strain GD1) | |
| 10 | 3 | Respiratory, Cardiovascular, Immune | 10 | Sendai virus (strain Fushimi) | |
| 31 | 2 | Respiratory | 1 | ||
| 46 | 1 | Immune | 1 | Epstein-Barr virus (strain B95-8) | |
| 10 | 3 | Immune | 1 | ||
| 29 | 5 | Cardiovascular | 10 | Hepatitis C virus genotype 1b (isolate Con1) | |
| 22 | 2 | Cardiovascular | 6 | ||
| 14 | 1 | Cardiovascular | 4 | ||
| 20 | 2 | Cardiovascular | 4 | ||
| 18 | 4 | Respiratory | 1 | ||
| 20 | 3 | Cardiovascular | 3 | ||
| 16 | 2 | Cardiovascular | 3 | ||
| 23 | 3 | Cardiovascular | 4 | Hepatitis C virus genotype 1b (isolate Con1) | |
| 11 | 2 | Cardiovascular | 1 | ||
| 15 | 3 | Cardiovascular | 2 | ||
| 14 | 1 | Respiratory, Cardiovascular, Immune | 17 | Human herpesvirus 1 (strain 17) | |
| 25 | 3 | Immune | 1 | Human herpesvirus 1 (strain 17) | |
| 14 | 3 | Cardiovascular | 1 | ||
| 24 | 1 | Respiratory, Cardiovascular, Immune | 59 | ||
| 14 | 4 | Cardiovascular | 2 | ||
| 15 | 1 | Respiratory, Immune | 3 | Human herpesvirus 1 (strain 17) | |
| 34 | 1 | Immune | 7 | Human adenovirus 5, Human adenovirus 12, Simian virus 40 | |
| 24 | 1 | Immune | 2 | ||
| 22 | 1 | Cardiovascular | 6 | ||
| 40 | 3 | Cardiovascular | 1 | ||
| 41 | 2 | Cardiovascular | 1 | ||
| 40 | 3 | Immune | 1 | ||
| 58 | 4 | Immune | 5 | ||
| 16 | 4 | Cardiovascular | 8 | ||
| 12 | 2 | Cardiovascular | 7 | ||
| 14 | 1 | Cardiovascular | 3 | ||
| 11 | 1 | Respiratory, Immune | 4 | ||
| 29 | 7 | Respiratory, Cardiovascular, Immune | 29 | Hepatitis C virus genotype 1b (isolate Con1) | |
| 17 | 9 | Respiratory, Cardiovascular | 6 | ||
| 16 | 8 | Respiratory, Cardiovascular | 13 | ||
| 25 | 2 | Cardiovascular | 2 |
Fig. 8The interaction network represents the most influential host protein and viral protein. The network is consisting of sixty-four (64) host proteins interacting with twenty-five (25) SARS-CoV-2 viral proteins. The yellow colour node represents the viral proteins in the network, whereas the green one represents the host proteins.