| Literature DB >> 29940841 |
Carlos A García-Pérez1, Xianwu Guo1, Juan García Navarro2, Diego Alonso Gómez Aguilar1, Edgar E Lara-Ramírez3.
Abstract
BACKGROUND: The influenza A virus (IAV) is a constant threat for humans worldwide. The understanding of motif-domain protein participation is essential to combat the pathogen.Entities:
Keywords: Data-mining; Influenza; Motif-domain interactions; Protein-protein interactions
Mesh:
Substances:
Year: 2018 PMID: 29940841 PMCID: PMC6019528 DOI: 10.1186/s12859-018-2237-8
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
The total number of sequence information retrieved for IAV
| Protein | H1N1 | H5N1 | H3N2 | H1N2 | H2N2 | Total |
|---|---|---|---|---|---|---|
| HA | 6329 | 215 | 6831 | 33 | 80 | 13,488 |
| NA | 4196 | 184 | 5514 | 29 | 92 | 10,015 |
| M1 | 621 | 37 | 426 | 6 | 26 | 1116 |
| M2 | 593 | 56 | 701 | 9 | 23 | 1382 |
| NS1 | 1735 | 130 | 2160 | 13 | 61 | 4099 |
| NS2 | 550 | 71 | 459 | 8 | 26 | 1114 |
| NP | 1172 | 97 | 1286 | 8 | 69 | 2632 |
| PA | 2561 | 140 | 2664 | 12 | 67 | 5444 |
| PB1 | 2260 | 123 | 2367 | 11 | 63 | 4824 |
| PB1-F2 | 230 | 58 | 1360 | 6 | 36 | 1690 |
| PB2 | 2556 | 155 | 2653 | 8 | 66 | 5438 |
| Total | 22,803 | 1266 | 26,421 | 143 | 609 | 51,242 |
Fig. 1Flow diagram of the data mining methodology employed
Comparison of the Motif counts in the five IAV strains per proteins
| strain | H1N1 | H5N1 | H3N2 | H1N2 | H2N2 | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Occurrence | > 70% | < 70% | > 70% | < 70% | > 70% | < 70% | > 70% | < 70% | > 70% | < 70% | Total |
| HA | 5 | 46 | 8 | 15 | 5 | 37 | 10 | 5 | 3 | 9 | 143 |
| NA | 3 | 41 | 3 | 15 | 8 | 28 | 8 | 7 | 5 | 4 | 122 |
| M1 | 2 | 13 | 2 | 1 | 2 | 6 | 2 | 0 | 2 | 0 | 30 |
| M2 | 1 | 11 | 2 | 0 | 1 | 12 | 1 | 0 | 2 | 0 | 30 |
| NS1 | 9 | 29 | 8 | 9 | 6 | 25 | 7 | 2 | 7 | 2 | 104 |
| NS2 | 1 | 11 | 2 | 5 | 1 | 8 | 1 | 0 | 1 | 1 | 31 |
| NP | 7 | 23 | 9 | 3 | 7 | 24 | 7 | 5 | 8 | 1 | 94 |
| PA | 12 | 33 | 15 | 13 | 14 | 25 | 14 | 1 | 15 | 4 | 146 |
| PA-X | 3 | 14 | 4 | 3 | 3 | 19 | 2 | 1 | 2 | 3 | 54 |
| PB1 | 14 | 22 | 15 | 10 | 14 | 20 | 14 | 2 | 15 | 1 | 127 |
| PB1-F2 | 0 | 7 | 2 | 4 | 2 | 12 | 1 | 3 | 1 | 2 | 34 |
| PB2 | 9 | 18 | 9 | 3 | 9 | 20 | 9 | 1 | 9 | 5 | 92 |
| Total | 66 | 268 | 79 | 81 | 72 | 236 | 76 | 27 | 70 | 32 | 1007 |
Fig. 2Circular layout network representation for MDIs mapped across the five strains and proteins. a) Strain-MDIs network b) Protein-MDI network. The red diamond represents strains a) and proteins b) respectively, the black ellipse represents the MDIs. The edges highlighted are the motifs mentioned in the text. Both figures can be reproduced in Cytoscape with the first four column information presented in the Additional file 2: Table S3
Fig. 3The structural localization for the Pkinase_Tyr_LIG_1–1 MDIs on the HA and the amino acid motifs for the H1N1 strain. a) The Haemagglutinin (HA, PDBID: 3Al4), with the Pkinase_Tyr_LIG_1–1 MDI marked in red. b) The RegExp, c, d) Seqlogo showing the frequently amino acid motifs found in the H1NI strain
Fig. 4Biological processes related to the MDIs and GO clusters of the IAV-Human mined Network. a) REVIGO summary for the MDIs related GO terms b, c, d, e, f) Identified GO clusters in the IAV-human Network related to the MDIs with the MCODE app for cytoscape