| Literature DB >> 29133946 |
Naiying Mao1, Zhen Zhu1, Pierre Rivailler1, Meng Chen2, Qin Fan1, Fang Huang3, Wenbo Xu4.
Abstract
Human mastadenovirus species C (HAdV-C) are the most common etiologic agents of respiratory disease in young children and are frequently detected worldwide including China. Two recombinant HAdV-C strains (BJ04 and BJ09) were isolated from infants with acute respiratory infection (ARI) in Beijing in 2012-2013. The whole genome sequences (WGS) of BJ04 and BJ09 were generated and compared to other 35 HAdV-C WGSs publicly available. Phylogenetic analyses showed that the BJ04 strain might be the result of three homologous recombination events involving the parental strains JX173086 (HAdV-1), NC_001405 (HAdV-2) and LC068718 (HAdV-6), whereas BJ09 viral genome might be made of genetic elements from JX173083 (HAdV-1), KF268199 (HAdV-5), and KR699642 (strain CBJ113). Despite intratypic recombination, amino acid analysis showed that the gene repertoire of BJ04 and BJ09 were similar to type 2 viruses. Finally, this analysis revealed that at least three lineages of HAdV-C have been identified in China, represented by BJ04 related to NC_001405, BJ09 related to CBJ113, and KF951595 (strain DD28) related to virus isolated in Japan. This study showed that the frequent recombination played an important driving force for complexity of the HAdV-C epidemic in Beijing, thereby demonstrating the necessity for epidemiological and virological surveillance for HAdV-C in China.Entities:
Mesh:
Year: 2017 PMID: 29133946 PMCID: PMC5684218 DOI: 10.1038/s41598-017-15336-2
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Highest identity to BJ04 and BJ09 among the 35 GB sequences. 1Identity percentage were computed from p-distances displayed in Supplementary Figure 2. 2Backbone sequences are shaded in dark grey. Additional recombinant partners are shaded in light grey.
Figure 1Neighbor joining phylogenetic trees on 20 HAdV-C WGS. The genomic region used to generate the trees is indicated for each tree. Sequences of BJ04 and BJ09 viruses are in bold font. BJ04 sequences are indicated with a black square whereas BJ09 sequences are indicated with a black triangle. The Chinese sequences KR699642 (CBJ113) and KF951595 (DD28) are underlined. Each prototype sequences are indicated by a black dot. Only bootstrap values greater than 70% are indicated. Types designation for the hexon and fiber genes are indicated by brackets. Sequences are indicated by their GB ID followed by the collection year and the country of collection. The type is also indicated for the 5 prototype sequences. Finally, the name of Chinese strain CBJ113 and DD28 is also indicated.
Figure 2Schematic representation of recombination events within BJ04 and BJ09 genomes. (A) Genomic map of HAdV-C. The l-strand of the genome is represented by a straight line. Rightward (top) and leftward (bottom) ORFs are represented by grey arrows based on NC_001405 annotation. (B) Recombination events predicted in BJ04. BJ04 genome is shown as a thick black line. The likely backbone and the other genetic components were identified based on Table 1. The likely backbone is shown as a black block. Genetic components predicted by RDP4 to be involved in a recombination event are shown as dark grey blocks whereas genetic components identified only in the bootscan analysis are shown as light grey blocks. Likely breakpoint positions are shown below the genome. The lowest (best) p-values for the prediction of recombination as well as breakpoint are indicated. The number of algorithms of the RDP4 package that were predicting the recombination event is also shown. PHF designations based on our study are identified by an asterisk. (C) Recombination events predicted in BJ09. Representation layout similar to (B).
Comparison of 37 BJ04 amino acid (AA) sequences with homologous sequences encoded by the 5 prototype viruses.
| BJ04 amino acid analysisa | Numberb,c | Percentage (%)b |
|---|---|---|
| Overall number of amino acids (37 ORFs) | 11540 | / |
| BJ04 residues conserved in all prototype sequences | 10696 | 92.7 |
| BJ04 residues specific to prototype type 2 sequence | 147 | 17.4 |
| Type 2 residues found in another prototype sequence | 643 | 76.2 |
| BJ04 residues found in other prototype sequence (not type 2) | 29 | 3.4 |
| BJ04 residues not found in any prototype sequences | 25 | 3 |
Note: aGB annotations were harmonized based on NC_001405 type 2 annotation.
bThe number and percentage of BJ04 amino acid residues that are not conserved in all prototype sequences are indicated. As BJ04 is highly related to type 2 prototype, the comparison was done in reference to type 2 sequences.
cSupplementary Table 5 was used to generate the numbers.
List of 25 BJ04 amino acid (AA) residues that were not found in any prototype sequences.
| ORF Name | Length(AA) | AA change | Genomic positiona | Other 30 GB sequencesb | PROVEAN analysisc |
|---|---|---|---|---|---|
| Control protein E1A | 289 | A29T | 643 | 2 | NDd |
| Control protein E1B 55 K | 495 | T51A | 2157 | 5 | ND |
| DNA polymerase | 1198 | V89I | 8519 | 0 | Neutral |
| N254H | 8024 | 0 | Neutral | ||
| R274C | 7964 | 0 | Deleterious | ||
| Terminal protein precursor pTP | 671 | A64T | 10399 | 0 | Neutral |
| A283T | 9742 | 0 | Neutral | ||
| D432N | 9313 | 1 | ND | ||
| A527T | 9028 | 1 | ND | ||
| Encapsidation protein 52 K | 415 | P74S | 11259 | 0 | Neutral |
| Capsid protein precursor pIIIa | 585 | L474V | 13727 | 0 | Neutral |
| Penton base | 571 | A368T | 15266 | 3 | ND |
| Single-stranded DNA-binding protein | 529 | R67C | 23881 | 0 | Neutral |
| Hexon assembly protein 100 K | 805 | A122T | 24471 | 0 | Neutral |
| Encapsidation protein 22 K | 195 | S75N | 26462 | 0 | Neutral |
| Protein 33 K | 228 | S75N | 26462 | 0 | Neutral |
| Control protein E3 12.5 K | 107 | T50I | 28047 | 0 | Neutral |
| Membrane glycoprotein E3 gp19K | 159 | K62R | 28995 | 0 | Neutral |
| Fiber | 582 | S178T | 31561 | 5 | ND |
| Control protein E4orf6/7 | 150 | P66S | 33171 | 10 | ND |
| Q69R | 33161 | 9 | ND | ||
| Control protein E4 34 K | 294 | Y61F | 33896 | 5 | ND |
| L90I | 33810 | 0 | Neutral | ||
| Control protein E4orf4 | 114 | H38R | 34230 | 0 | Neutral |
| Control protein E4orf3 | 116 | V70I | 34499 | 0 | Neutral |
Note: aGenomic positions as well as amino changes for each ORF were indicated.
bThe number of sequences within the remaining 30 GB sequences featuring the same amino change is shown.
cThe PROVEAN prediction on the effect of amino changes only found in BJ04 is indicated.
dNot done (ND).
Comparison of 37 BJ09 amino acid (AA) sequences with homologous sequences encoded by the 5 prototype viruses as well as the Chinese CBJ113 strain.
| BJ09 amino acid analysisa | Numberb,c | Percentage (%)b |
|---|---|---|
| Overall number of amino acids (37 ORFs) | 11502 | / |
| BJ09 residues conserved in all prototype sequences and CBJ113 | 10675 | 92.3 |
| BJ09 residues specific to CBJ113 | 6 | 0.7 |
| CBJ113 residues found in any prototype sequences | 775 | 93.9 |
| BJ09 residues found in any prototype sequences (not in CBJ113) | 30 | 3.6 |
| BJ09 residues not found in any prototype sequences nor CBJ113 | 15 | 1.8 |
| BJ09 residues specific to type 1 | 6 | / |
| BJ09 residues specific to type 2 | 143 | / |
| BJ09 residues specific to type 5 | 10 | / |
| BJ09 residues specific to type 6 | 0 | / |
| BJ09 residues specific to type 57 | 1 | / |
Note: aGB annotations were harmonized based on NC_001405 type 2 annotation.
bThe number and percentage of BJ09 amino acid residues that are not conserved in all prototype sequences and in CBJ113 are indicated. As BJ09 is highly related to CBJ113, the comparison was done in reference to CBJ113 sequences. The number of BJ09 residues that are specific to any of the 5 types is shown.
cSupplementary Table 6 was used to generate the numbers.
List of 15 BJ09 amino acid (AA) residues that were not found in any prototype sequences nor CBJ113 sequence.
| ORF Name | Length(AA) | AA change | Genomic positiona | Other 29 GB sequencesb | PROVEAN analysisc |
|---|---|---|---|---|---|
| Control protein E1A | 289 | H94Y | 838 | 0 | Neutral |
| Control protein E1B 55 K | 495 | A116S | 2346 | 1 | NDd |
| R130K | 2389 | 1 | ND | ||
| DNA polymerase | 1198 | S613N | 6946 | 0 | Neutral |
| P1005H | 5770 | 0 | Neutral | ||
| Terminal protein precursor pTP | 671 | P397_E398insEEEEE | 9397 | 0 | Neutral |
| Penton base | 571 | P23S | 14217 | 0 | Neutral |
| Core protein V | 369 | A250T | 17286 | 0 | Neutral |
| Protein 33 K | 228 | S59L | 26414 | 0 | Neutral |
| Encapsidation protein 22 K | 195 | S59L | 26414 | 0 | Neutral |
| V134I | 26638 | 0 | Neutral |
Note: aGenomic positions as well as amino changes for each ORF were indicated.
bThe number of sequences within the remaining 29 GB sequences featuring the same amino change is shown.
cThe PROVEAN prediction on the effect of amino changes only found in BJ09 is indicated.
dNot done (ND).