| Literature DB >> 35764916 |
Shengrong Xia1,2, Yingchun Xia1,2, Chulei Xiang1,2, Hui Wang1, Chao Wang1,2,3, Jin He1,2,3, Guolong Shi1, Lichuan Gu4,5,6.
Abstract
BACKGROUND: Target drugs play an important role in the clinical treatment of virus diseases. Virus-encoded proteins are widely used as targets for target drugs. However, they cannot cope with the drug resistance caused by a mutated virus and ignore the importance of host proteins for virus replication. Some methods use interactions between viruses and their host proteins to predict potential virus-target host proteins, which are less susceptible to mutated viruses. However, these methods only consider the network topology between the virus and the host proteins, ignoring the influences of protein complexes. Therefore, we introduce protein complexes that are less susceptible to drug resistance of mutated viruses, which helps recognize the unknown virus-target host proteins and reduce the cost of disease treatment.Entities:
Keywords: Host protein; Protein complexes recognition; Protein–protein interaction network; Virus–target protein
Mesh:
Year: 2022 PMID: 35764916 PMCID: PMC9238269 DOI: 10.1186/s12859-022-04792-x
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.307
Fig. 1Process of CCA-SE
Fig. 2An illumination of virus–host PPI Network
Effects in different scores
| Score | Seed | Precision | Recall | F-score |
|---|---|---|---|---|
| 0.1398 | 1035 | 0.9317 | 0.6399 | 0.7587 |
| 0.1498 | 982 | 0.9350 | 0.6352 | 0.7565 |
| 0.1598 | 939 | 0.9390 | 0.6381 | 0.7598 |
| 0.1698 | 868 | 0.9414 | 0.6406 | 0.7624 |
| 0.1798 | 821 | 0.9383 | 0.6334 | 0.7563 |
| 0.1898 | 781 | 0.9272 | 0.6119 | 0.7373 |
| 0.1998 | 726 | 0.9399 | 0.6078 | 0.7382 |
Partial protein complex data
| ID | Complex_size | Host_protein | Complex |
|---|---|---|---|
| 1 | 4 | P67870; P19784; | P09429; P19784; P67870; P68400; |
| 2 | 5 | Q15370; P62877; Q15369; Q13617; | P40337; Q15370; P62877; Q15369; Q13617; |
| Q15369; Q13617; | Q15369; Q13617; | ||
| 3 | 5 | Q15370; Q15369; | Q15369; Q15370; Q93034; |
| Q9UBF6; Q9Y576; | |||
| 4 | 7 | Q92769; | O00422; O75446; Q09028; Q13547; |
| Q16576; Q92769; Q96ST3; | |||
| 5 | 3 | P13861; | P13861; P24588; Q08209; |
| 6 | 5 | Q9UHD2; | P18124; P08238; Q04206; Q9UHD2; |
| Q00653; | |||
| 7 | 3 | Q86VM9; | Q86VM9; Q9BXB5; Q15287; |
| 8 | 5 | P11940; O75534; | P11940; O75534; Q16549; Q14103; |
| O60506; | |||
| 9 | 16 | O43633; Q70EL1; | Q9BY43; O95630; Q9UN37; Q70EL1; |
| Q96FZ7; Q96CF2; Q9NZZ3; Q9H444; | |||
| Q70EL1; Q9HD42; O43633; O75351; | |||
| Q9UN37; Q7LBR1; Q9H444; Q9BY43; | |||
| 10 | 9 | O60885; Q7L2J0; | O94992; O60563; O60583; O60563; |
| P06400; O94992; O60885; P50750; Q7L2J0; |
Classification table of predict results
| Type of host protein | Result (whether host protein) | Type of evaluation |
|---|---|---|
| Host protein | Positive | TP |
| Host protein | Negative | FN |
| Not host protein | Positive | FP |
| Not host protein | Negative | TN |
Fig. 3The influence of parameter k
Fig. 4Comparison of AUC values in multiple control experiments
Experimental comparison of whether the complex is integrated or not
| Comparision algorithm | Number of recognition | TPR (%) | FPR (%) |
|---|---|---|---|
| Unintegrated data | 17 | 68.67 | 38.08 |
| Integrated data | 22 | 72 | 40.31 |
Some of the potential virus–target host proteins
| Uniprot ID | Protein name | MF | CC | BP |
|---|---|---|---|---|
| O15264 | Mitogen-activated protein | GO:0005515; | GO:0005829; | GO:0071347; |
| Kinase 13 (MAPK13) | GO:0005524; | GO:0005737; | GO:0018105 | |
| GO:0004674; | GO:0005634; | GO:0070301; | ||
| GO:0004707; | GO:0007049; | |||
| GO:0050729; | ||||
| GO:1903936; | ||||
| GO:0006970; | ||||
| GO:0051403; | ||||
| Q92598 | Heat shock protein family H | GO:0005515; | GO:0005654; | GO:1900034; |
| (Hsp110) member 1 (HSPH1) | GO:0005524; | GO:0005829; | GO:0051135; | |
| GO:0000774; | GO:0005737; | GO:0045345; | ||
| GO:0043014; | GO:0005634; | GO:0051085; | ||
| GO:0070062; | GO:0006986; | |||
| GO:0032991; | GO:0050790; | |||
| GO:0071682; | GO:0006898; | |||
| GO:0005874; | ||||
| GO:0005576; | ||||
| Q13177 | p21 (RAC1) activated kinase | GO:0005515; | GO:0005829; | GO:0071407; |
| 2(PAK2) | GO:0042802; | GO:0005737; | GO:0050770; | |
| GO:0019901; | GO:0005634; | GO:0051497; | ||
| GO:0005524; | GO:0098978; | GO:0018105; | ||
| GO:0045296; | GO:0005911; | GO:0031295; | ||
| GO:0004674; | GO:0014069; | GO:0040008; | ||
| GO:0030296; | GO:0048471; | GO:0043066; | ||
| GO:0031267; | GO:0005886; | GO:0006469; | ||
| GO:0004672; | GO:0150105; | |||
| GO:0034333; | ||||
| GO:0006468; | ||||
| GO:0046777; | ||||
| GO:0050852; | ||||
| GO:0031098; | ||||
| Q15311 | ralA binding protein | GO:0005515; | GO:0005829; | GO:0043547; |
| 1(RALBP1) | GO:0042910; | GO:0016020; | GO:0007264; | |
| GO:0005096; | GO:1990961; | |||
| GO:0042626; | GO:0051056; | |||
| GO:0022857; | GO:0043087; | |||
| GO:0006935; | ||||
| GO:0006897; | ||||
| GO:0055085; | ||||
| P78545 | E74 like ETS transcription | GO:0005515; | GO:0005654; | GO:0045892; |
| Factor 3(ELF3) | GO:0000978; | GO:0005829; | GO:0045944; | |
| GO:0001228; | GO:0005634; | GO:0006366; | ||
| GO:1990837; | GO:0005794; | GO:0045747; | ||
| GO:0003700; | GO:0006357; | |||
| GO:0000981; | GO:0030855; | |||
| GO:0060056; | ||||
| GO:0006954; | ||||
| GO:0001824; | ||||
| GO:0030198; | ||||
| GO:0030154; |
Results of GO enrichment analysis
| GO ID | GO terminology | |
|---|---|---|
| GO:0016032 | Viral process | 1.23e−4 |
| GO:0003677 | DNA binding | 1.40e−4 |
| GO:0004842 | Ubiquitin-protein transferase activity | 1.32e−4 |
| GO:0005515 | Protein binding | 1.33e−4 |
| GO:0031625 | Ubiquitin protein ligase binding | 1.09e−05 |
Fig. 5Enrichment function analysis of KEGG signaling pathway
Fig. 6Renal cell carcinoma pathway