| Literature DB >> 30862832 |
Abstract
Human adenovirus (HAdV) group C are the common etiologic in infants with severe acute respiratory infections (SARI). In the study, we report that a novel recombinant HAdV-C group strain (SH2016) was isolated from an infant with SARI in Shanghai in Feb. 4, 2016. The whole-genome sequence of SH2016 strain was generated and compared to other HAdV genomes publicly available. The strain SH2016 genome contains 35,946 nucleotides and coded 40 putative proteins, which was divided into 11 regions. RDP and phylogenetic analyses of the complete genome showed that the SH2016 strain was arranged into a novel subtype and might be recombined with HAdV-1 and HAdV-2. Our finding indicated that the frequent recombination among the HAdV-C group played an important role in driving force for polymorphism of human HAdV-C group prevalent in Shanghai, China. Further epidemiological surveillance of HAdV-C group is necessary to explore whether the novel HAdV-C group will maintain long-term stability. And the pathogenicity and clinical characteristics of the novel HAdV-C group member should be done more.Entities:
Mesh:
Substances:
Year: 2019 PMID: 30862832 PMCID: PMC6414723 DOI: 10.1038/s41598-018-37756-4
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Genome annotation of SH2016.
| Gene | Product | Location | No. of amino residues |
|---|---|---|---|
| E1A | cds | 560–1,546 | NT |
| E1A 29 kDa | 560–1,112, 1,230–1,546 | 289 | |
| E1A 26 kDa | 560–974, 1,230–1,546 | 243 | |
| E1A 6 kDa | 560–637, 1,230–1,546 | 55 | |
| E1B | cds | 1,717–3,515 | NT |
| E1B 19 kDa | 1,717–2,250 | 177 | |
| E1B 55 kDa | 2,022–3,515 | 497 | |
| pIX | pIX 14 kDa | 3,613–4,035 | 142 |
| Iva2 C | Iva2 51 kDa | 4,094–5,430, 5,709–5,721 | 449 |
| E2B C | E2B pol | 5,200–8,787, 14,117–14,125 | 1198 |
| L1 | L1 13.6 kDa | 7,981–8,430 | 149 |
| pTP C | E2B pTP | 8,586–10,592, 14,117–14,125 | 671 |
| L1 | L1 52 kDa | 11,053–12,300 | 415 |
| pIIIa | 12,321–14,078 | 585 | |
| L2 | Penton | 14,162–15,886 | 574 |
| pVII | 15,893–16,489 | 198 | |
| pV | 16,559–17,665 | 368 | |
| pX | 17,693–17,935 | 80 | |
| L3 | pVI | 18,019–18,771 | 250 |
| Hexon | 18,858–21,751 | 964 | |
| Protease 23 kDa | 21,785–22,399 | 204 | |
| E2A C | DBP | 22,497–24,086 | 529 |
| L4 | cds | 24,115–27,905 | NT |
| Hexon-assembly protein 33 kDa | 24,115–26,538 | 807 | |
| Spicing factor 33 kDa | 26,249–26,564, 26,767–27,134 | 227 | |
| Encapsidation protein 22 kDa | 26,249–26,833 | 194 | |
| pVIII | 27,222–27,905 | 227 | |
| E3 | cds | 27,906–30,842 | |
| Immune modulating protein 12.5 kDa | 27,906–28,229 | 107 | |
| CR1-alpha | 28,642–28,827 | 61 | |
| Immune modulating protein 19 kDa | 28,824–29,303 | 159 | |
| CR1-beta | 29,480–29,785 | 101 | |
| RID-alpha | 29,793–30,068 | 91 | |
| RID-beta | 30,071–30,643 | 130 | |
| Control protein 14.7 kDa | 30,456–30,842 | 128 | |
| Putative protein U C | 30,865–31,032 | 56 | |
| L5 | Fiber | 31,043–32,791 | 582 |
| E4 | cds | 32,920–35,537 | |
| Control protein orf 6/7 C | 32,920–33,198, 33,910–34,083 | 150 | |
| Control protein orf 5 (34 kDa) C | 33,199–34083 | 294 | |
| Control protein orf 3/4 C | 34,004–34,088, 34,614–34,714 | 61 | |
| Control protein orf 4 C | 34,004–34,348 | 114 | |
| Control protein orf 3 C | 34,364–34,714 | 116 | |
| Control protein orf 2 C | 34,711–35,103 | 130 | |
| Control protein orf 1 C | 35,151–35,537 | 128 |
The complementary strand functions are marked as ‘c’, for example Iva2c.
SH2016 genome non-coding motifs annotations.
| Motif (5′ to 3′) | Function | Nucleotide locatity |
|---|---|---|
| CATCAT…CGTAG | ITR | 1–103 |
| AATAATATACC | DNApol-pTP-binding site | 8–18 |
| TATGATAATGA | NFIII-binding site | 39–49 |
| GGGGGTGG | Sp1-recognition | 50–56 |
| TGACGT | Transcription factor ATF-binding site | 64–69 |
| GGGCGTGG | Transcription factor Sp1-recognition site | 76–83 |
| CGGGGCGG | Transcription factor Sp1-recognition site | 87–92 |
| TGACGT | Transcription factor ATF-binding site | 96–101 |
| TAAATA | TATA box for E1A | 468–473 |
| AATAAA | polyA signal for E1A | 1,612–1,617 |
| TATATA | TATA box for E1B | 1,675–1,680 |
| ACTGAA | polyA signal for E1B | 3,519–3,525 |
| TATATAA | TATA box for pIX gene | 3,557–,3563 |
| AATAAA | polyA signal for pIX gene | 4,042–4,047 |
| AATACA | polyA signal for E2B | 4,081–4,086 C |
| TGATTGGTTT | Inverted CAAT box for MLP | 5,970–5,979 |
| GCCACGTGAC | Upstream element for MLP | 5,990–5,999 |
| GAAGGGGGGG | MAZ/Sp1-binding site for MLP | 6,011–6,020 |
| TATAAAA | TATA box for MLP | 6,021–6,027 |
| GGGGGTGGGGG | MAZ/SP1-binding site for MLP | 6,028–6,038 |
| TCACTCT | Initiator element for MLP | 6,050–6,056 |
| TTGTCAGTTTCCA | DE1 for MLP | 6,137–6,149 |
| AACGAGGAGGATTTGA | DE2a & DE2b for MLP | 6,152–6,167 |
| AATAAA | polyA signal for L1 | 14,103–14,109 |
| ATTAAA | polyA signal for VII gene | 16,495–16,500 |
| AATAAA | polyA signal for V gene | 17,967–17,972 |
| AATAAA | polyA signal for L3 | 22,425–22,430 |
| TATAAC | TATA box for E3 | 27,587–27,592 |
| AACACA | polyA signal for L4 | 27,938–27,943 |
| AATAAA | polyA signal for E3 | 30,844–30,849 |
| AAAAAA | polyA signal for L5 | 32,837–32,842 |
| AATATA | polyA signal for E4 | 32,947–32,952 C |
| TATATATA | TATA box for E4 | 35,643–35,650 C |
| ATAATATACC | DNApol-pTP-binding site | 35,929–35,938 C |
| CATCAT…CGTAG | Inverted terminal repeat | 35,844–35,946 C |
ITR: Inverted terminal repeat, DNApol-pTP-binding site: the pre-terminal protein-DNA polymerase complex binding site, NFIII: nuclear factor III, MLP: major-late promoter, DE1: Downstream sequence element 1, DE 2a & DE2b: Downstream sequence element 2a & 2b, MAZ: Myc-associated Zinc Finger Protein.
DNA non-coding sequence motifs are recognized for the novel Shanghai HAdV-1 strain (human/CHN/SH/2016/1[P1H1F1]). The nucleotide signatures and their putative functions are indicated. The complementary strand functions are marked as ‘c’, for example 4081–4086c.
Figure 1Neighbor-joining phylogenetic trees based on the open reading frame sequences of the whole genome (A), hexon gene (B), penton gene (C) and fiber gene (D) of SH2016 strain in this study and those of HAdV-C whole genome reference strains from GenBank. Strain SH2016 highlighted with a solid square was characterized in this study. The trees were constructed using the neighbor-joining method of MEGA 6.06 with 1000 bootstrap trials performed to assign confidence to the grouping.
The nucleotides sequence identities between SH2016 and HAdV-C reprensentative strains.
| Region | % nucleotide identities of Novel-HAdV-1 | ||||
|---|---|---|---|---|---|
| HAdV-1 | HAdV-2 | HAdV-5 | HAdV-6 | HAdV-57 | |
| 5′-Terminal (no ITR) | 98.44% | 98.45% | 98.00% | 99.12% | 99.12% |
| E1A | 98.56% | 98.77% | 99.28% | 98.87% | 98.77% |
| E1B | 99.21% | 99.27% | 98.58% | 99.16% | 99.44% |
| pIX | 99.53% | 99.05% | 99.29% | 99.05% | 99.53% |
| Iva2 | 98.88% | 98.58% | 98.65% | 98.58% | 98.81% |
| E2B | 98.11% | 98.34% | 98.51% | 98.42% | 98.23% |
| L1 13.6 kDa | 99.33% | 99.10% | 99.11% | 99.55% | 99.33% |
| pTP | 99.10% | 99.15% | 99.75% | 99.30% | 99.10% |
| L1 52 kDa | 99.11% | 99.60% | 98.46% | 99.44% | 99.11% |
| pIIIa | 99.31% | 99.43% | 98.85% | 99.03% | 99.14% |
| Penton | 99.82% | 99.18% | 98.41% | 99.59% | 99.82% |
| pVII | 99.16% | 98.30% | 98.65% | 99.33% | 99.16% |
| pV | 99.73% | 98.54% | 97.89% | 99.00% | 99.36% |
| pX | 99.59% | 99.59% | 99.59% | 99.17% | 99.17% |
| pVI | 99.87% | 97.70% | 98.79% | 96.59% | 96.45% |
| Hexon | 99.68% | 84.48% | 82.34% | 83.16% | 87.88% |
| Protease 23 kDa | 99.18% | 98.02% | 97.36% | 99.51% | 97.69% |
| DBP | 99.49% | 96.85% | 96.38% | 96.72% | 96.52% |
| L4 | 99.60% | 98.53% | 97.25% | 98.56% | 97.71% |
| E3 | 82.79% | 98.39% | 79.54% | 98.20% | 98.31% |
| Putative protein U | 85.50% | 96.33% | 83.10% | 98.19% | 98.19% |
| Fiber | 98.79% | 66.08% | 69.10% | 72.90% | 72.69% |
| E4 | 98.69% | 98.80% | 98.41% | 98.92% | 98.45% |
| 3′-Terminal (no ITR) | 98.30% | 97.96% | 98.65% | 97.95% | 97.96% |
| Complete genome | 97.93% | 96.44% | 94.66% | 96.62% | 96.78% |
Figure 2Genetic recombinant analyses of the complete genome of the novel strain SH2016. (A) Genomic map of strain SH2016. The l-strand of the genome is represented by a straight line. Rightward (top) and leftward (bottom) ORFs are represented by grey arrows. (B) Recombination events predicted in strain SH2016. Strain SH2016 genome is shown as a thick black line. The likely backbone is shown as a cyan line. Genetic components predicted by RDP4 to be involved in a recombination event are shown as purple line. Likely breakpoint positions are shown below the genome. (C) Similarity analyses of SH2016. SH2016 was used as the query sequence to compare with other 2 representative strains of HAdV-C. The default setting of SimPlot software was used as followed: Window size 200 bp, step size 20 bp, replicates 1000 times and tree model neighbor-joining.
The number of algorithms of the RDP4 package that were predicting the recombination event and recombinant score are shown.
| Recombinant strain | Parent major/minor | Recombinant region in alignment | Model (average p-value) | Recombinant score | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| RDP | GENECONV | Bootscan | MaxChi | Chimaera | Siscan | 3Seq | ||||
| SH2016 | JX173083/KX384959 | 28185–31576 | 2.347 × 10−187 | 8.807 × 10−190 | 8.032 × 10−172 | 1.448 × 10−49 | 4.506 × 10−51 | 9.637 × 10−59 | 2.179 × 10−12 | 0.606 |
Figure 3Neighbor-joining phylogenetic trees based on the left region (A) of recombinant point (5′-end), the right region (B) of recombinant point (3′-end) and the recombinant frame (C) sequences of the recombinant regionof SH2016 strain in this study and those of HAdV-C whole genome reference strains from GenBank. Strain SH2016 highlighted with a solid square was characterized in this study. The trees were constructed using the neighbor-joining method of MEGA 6.06 with 1000 bootstrap trials performed to assign confidence to the grouping.