| Literature DB >> 31719605 |
Miguel Angel Fuertes1, Silvia López-Arguello2, Carlos Alonso2.
Abstract
Picornaviridae family includes several viruses of great economic and medical importance. Among all members of the family we focused our attention on the human rhinovirus, the most important etiologic agent of the common cold and on the foot-and-mouth disease virus that cause of an economically important disease in cattle. Despite the low sequence similarity of the polyprotein coding open reading frames of these highly divergent picornaviruses, they have in common structural and functional similarities including a similar genomic organization, a capsid structure composed of 60 copies of four different proteins, or 3D-structures showing similar general topology, among others. We hypothesized that such similarities could be reflected in emergent common compositional structures interspersed in their genomes which were not observed heretofore. Using a methodology categorizing nucleotide triplets by their gross-composition we have found two human rhinoviruses sharing compositional structures interspersed along their genomic RNA with three foot-and-mouth disease viruses. The shared compositional structures are in one case composed by nucleotide triplets containing all nearest-neighbours of A and G and in other case containing all nearest-neighbours of A, and C. The structures are under strong evolutionary constraints for variability, allowing the access to novel viral genomic motifs with likely biological relevance. The conserved fragments would be useful to predict critical mutation points sites important from the evolutionary point of view.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31719605 PMCID: PMC6851159 DOI: 10.1038/s41598-019-53013-8
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
List of all tCPs and their associated NT-triplets (adapted from[10,36]).
| tCPs | NT-triplets per tCP |
|---|---|
|
| |
| <A> | AAA |
| <T> | TTT |
| <G> | GGG |
| <C> | CCC |
|
| |
| <AC> | AAC, CAA, ACA, CCA, ACC, CAC |
| <AT> | AAT, TAA, ATA, TTA, ATT, TAT |
| <AG> | AAG, GAA, AGA, GGA, AGG, GAG |
| <CG> | CCG, GCC, CGC, GGC, CGG, GCG |
| <GT> | GGT, TGG, GTG, TTG, GTT, TGT |
| <CT> | CCT, TCC, CTC, TTC, CTT, TCT |
| <AGC> | AGC, GCA, CAG, ACG, CGA, GAC |
| <AGT> | AGT, GTA, TAG, ATG, TGA, GAT |
| <ACT> | ACT, CTA, TAC, ATC, TCA, CAT |
| <TCG> | TCG, CGT, GTC, TGC, GCT, CTG |
Figure 1Cumulative tCP-usage frequency graphs with FMDV-polyprotein length. (a) Cumulative
tCPs shared and percentages of identity and gaps between the serotypes of HRV and FMDV polyproteins encoding ORFs. Comparison with their random (RND) generated polyproteins encoding ORFs.
| Organisms | NT-identity (%) | NT-gaps (%) | tCP-identity (%) | tCP-gaps (%) | tCP shared |
|---|---|---|---|---|---|
| HRV14 and HRV14 | 100 | 0 | 100 | 0 | 14 |
| HRV14 and HRV-C | 57 | 24 | 33 | 31 | 4 |
| HRV14 and HRV-A | 56 | 26 | 33 | 32 | 1 |
| HRV14 and HRV2 | 57 | 24 | 33 | 31 | 1 |
| HRV-C and HRV-A | 57 | 23 | 33 | 32 | 3 |
| HRV-C and HRV2 | 56 | 25 | 35 | 33 | 1 |
| HRV-A and HRV2 | 75 | 4 | 33 | 32 | 2 |
| FMDV-O and FMDV-A | 84 | 3 | 66 | 8 | 6 |
| FMDV-O and FMDV-C | 85 | 3 | 70 | 5 | 6 |
| FMDV-O and FMDV-SAT1 | 75 | 8 | 53 | 15 | 3 |
| FMDV-SAT1and FMDV-C | 75 | 8 | 51 | 19 | 5 |
| FMDV-SAT1and FMDV-A | 75 | 7 | 53 | 16 | 4 |
| FMDV-C and FMDV-A | 84 | 3 | 35 | 9 | 5 |
| FMDV-C and HRV14 | 43 | 44 | 25 | 44 | 1 |
| FMDV-A and HRV14 | 43 | 43 | 25 | 44 | 1 |
| FMDV-A and HRV-C | 43 | 44 | 26 | 44 | 0 |
| FMDV-A and HRV-A | 42 | 45 | 25 | 42 | 0 |
| FMDV-A and HRV2 | 43 | 44 | 26 | 44 | 0 |
| FMDV-C and HRV-C | 43 | 44 | 25 | 45 | 1 |
| FMDV-C and HRV-A | 42 | 45 | 24 | 44 | 0 |
| FMDV-C and HRV2 | 43 | 43 | 25 | 45 | 0 |
| FMDV-O and HRV14 | 42 | 44 | 25 | 44 | 0 |
| FMDV-O and HRV-C | 42 | 46 | 26 | 44 | 0 |
| FMDV-O and HRV-A | 42 | 44 | 24 | 47 | 0 |
| FMDV-O and HRV2 | 43 | 43 | 26 | 44 | 0 |
| FMDV-A and RND FMDV-A | 43 | 45 | 23 | 40 | 0 |
| FMDV-C and RND FMDV-C | 42 | 46 | 26 | 47 | 0 |
| FMDV-SAT1 and RND FMDV-SAT1 | 41 | 44 | 24 | 40 | 0 |
| FMDV-O and RND FMDV-O | 42 | 47 | 25 | 46 | 0 |
| HRV14 and RND HRV14 | 43 | 45 | 25 | 36 | 0 |
| HRV-C and RND HRV-C | 44 | 45 | 24 | 37 | 0 |
| HRV-A and RND HRV-A | 43 | 45 | 24 | 36 | 0 |
| HRV2 and RND HRV2 | 42 | 43 | 23 | 37 | 0 |
Figure 2Comparison of the 14 tCP-profiles obtained from the alignments of the tCP-sequences of FMDV-C (red) and HRV14 (blue) polyproteins.
Conserved tCPs and correlation coefficients (r) between serotypes of FMDV and HRV.
| Serotype | FMDV-A | FMDV-C | r** | FMDV-SAT1 | r | FMDV-O | r | HRV14 | r | HRV-C | r | HRV-A | r | HRV2 | r |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| FMDV-A | All tCPs* | <A> | 0.85 | <AC> | 0.94 | <A> | 0.87 | <AG> | 0.85 | — | — | — | — | — | — |
| <AC> | 0.94 | <AG> | 0.98 | <AC> | 0.88 |
|
| ||||||||
| <AG> | 0.98 | <AGT> | 0.91 | <AG> | 0.99 | ||||||||||
| <AGT> | 0.97 | <TCG> | 0.86 | <CT> | 0.86 | ||||||||||
| <ACT> | 0.86 | <AGT> | 0.95 | ||||||||||||
| FMDV-C | All tCPs | <G> | 0.89 | <C> | 0.91 | <AG> | 0.86 | — | — |
|
| — | — | ||
| <AC> | 0.95 | <AC> | 0.95 | ||||||||||||
| <AG> | 0.99 | <AG> | 0.99 | ||||||||||||
| <CT> | 0.86 | <CT> | 0.87 | ||||||||||||
| <AGT> | 0.91 | <AGT> | 0.93 | ||||||||||||
| <TCG> | 0.88 | ||||||||||||||
| FMDV-SAT1 | All tCPs | <AC> | 0.85 |
|
|
|
| <AC> | 0.85 |
|
| ||||
| <AG> | 0.98 | ||||||||||||||
| <AGT> | 0.91 | ||||||||||||||
| FMDV-O | All tCPs | — | — | — | — | — | — | — | — | ||||||
| HRV14 | All tCPs | <A> | 0.86 | <TCG> | 0.87 | <AG> | 0.89 | ||||||||
| <AG> | 0.98 | ||||||||||||||
| <AC> | 0.91 | ||||||||||||||
| <AT> | 0.86 | ||||||||||||||
| HRV-C | All tCPs | <G> | 0.87 | <AG> | 0.85 | ||||||||||
| <AC> | 0.90 | ||||||||||||||
| <AT> | 0.91 | ||||||||||||||
| HRV-A | All tCPs | <AT> | 0.85 | ||||||||||||
| <AG> | 0.87 | ||||||||||||||
| HRV2 | All tCPs |
*All tCPs indicate that when compared two identical tCP (or NT) sequences all tCPs (or all NTs) are conserved. Underlined; correlation coefficients lower but near the cut-off.
Figure 3Comparison of the 14 tCP-profiles obtained from the alignments of the tCP-sequences of FMDV-C (blue) and FMDV-O (red) polyproteins.
Figure 4Comparison of the 14 tCP-profiles obtained from the alignments of the tCP-sequences of HRV14 (red) and HRV-C (blue) polyproteins.
Figure 5Bar diagram showing the percent of NT-mismatches in genomic regions in which FMDV-A and HRV14 (white bars) share the tCP
Figure 6Differences between tCP-observed (tCPo), and estimated (tCPe) events for the conserved tCPs in the alignment of the polyprotein coding ORFs of FMDV and HRV. An schematic of the RNA genomic organization of the polyprotein-encoding ORFs of FMDV and HRV are superimposed to the tCP-profiles according to the L-P1-P2-P3 structural scheme[24]. (a) The L, P1, P2 and P3 genomic organization of FMDV-A, FMDV-C and HRV14 are in black, grey, orange and blue colours. L represents in FMDV-A and FMDV-C, the leader proteinase; HRV14 lacks this region in the polyprotein coding ORF. P1 contains the sequences coding for viral structural proteins: VP4, VP2, VP3 and VP1. P2 contains the sequences coding for non-structural genes 2 A, 2B, and 2 C. P3 contains the sequences coding for non-structural genes 3 A, 3B, 3 C, and 3D. The tCP-profiles are represented as follows: FMDV-A (red), FMDV-C (orange) and HRV14 (in black, the alignment FMDV-C vs HRV14 and in grey, the alignment FMDV-A vs HRV14). Arrows underline significant differences between tCPo and tCPe events between FMDV-A, FMDV-C and HRV14 polyprotein-encoding ORFs. (b) The genomic organization of FMDV-SAT1 and HRV-A are as indicated before. The tCP-profiles are represented as follows: FMDV-SAT1 (red) and HRV-A (black).