| Literature DB >> 30778154 |
Pierre Rivailler1, Naiying Mao1, Zhen Zhu, Wenbo Xu2.
Abstract
This study aims at analyzing all publicly available HAdV-C whole genome sequences (WGSs) and describes the genetic relationships between these genomes as well as identifies potential hotspots for recombination throughout the viral genome. In addition to the 4 prototypical genomic sequences, this analysis identified 20 HAdV-C WGSs which should be relevant for future recombination analysis of HAdV-C. This report confirmed the recombinogenic property of HAdV-C genomes and identified two main regions for breakpoints, within the hexon gene and around the fiber genomic region. No obvious recombination was detected between HAdV-Cs and non-human mastadenoviruses or non-C HAdVs. Finally, it highlighted the need for a surveillance of HAdVs in order to detect novel recombinant types that might represent health risks and develop possible prevention measures. Genetic analyses of recombination between recently collected HAdV-Cs and the assessment of their potential virulence are necessary steps towards the establishment of a surveillance of HAdVs in the future.Entities:
Year: 2019 PMID: 30778154 PMCID: PMC6379361 DOI: 10.1038/s41598-019-38719-z
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
List of the 25 HAdV-C WGSs analyzed for recombination among the 43 WGSs downloaded from GenBank.
| GenBank-IDa | Complete name or annotation | Country of collection | Year of collection | Informative in RDP4b | Recombination analysisc | Prototype | Reference |
|---|---|---|---|---|---|---|---|
| AF534906 | USA | 1953 | Yes | Yes | Type 1 |
[ | |
| NC_001405 | USA | 1953 | Yes | Yes | Type 2 |
[ | |
| AC_000008 | USA | 1953 | Yes | Yes | Type 5 |
[ | |
| FJ349096 | USA | 1953 | Yes | Yes | Type 6 |
[ | |
| LC068713 | strain: 870550 | Japan | 1987 | Yes | Yes | Iida | |
| KF268310 | human/USA/Pitts_00109/1992/2[P2H2F2] | USA | 1992 | Yes | Yes | Madupu | |
| LC068714 | strain: 930113 | Japan | 1993 | Yes | Yes | Iida | |
| LC068715 | strain: 940162 | Japan | 1994 | Yes | No | Iida | |
| JX173078 | human/ARG/A15812/2000/1[P1H1F1] | Argentina | 2000 | Yes | Yes | Madupu | |
| HQ003817 | human/AZE/16700/2001/57[P1H57F6] | Azerbaijan | 2001 | Yes | Yes | Type 57 |
[ |
| JX173081 | human/EGY/E53/2001/2[P2H2F2] | Egypt | 2001 | Yes | Yes | Madupu | |
| JX173080 | human/EGY/E13/2001/1[P1H1F1] | Egypt | 2001 | Yes | Yes | Madupu | |
| JX173079 | human/ARG/A15932/2002/2[P2H2F2] | Argentina | 2002 | Yes | Yes | Madupu | |
| KX384959 | strain T215/Ft Jackson South Carolina USA/2002 | USA | 2002 | Yes | Yes |
[ | |
| JX173084 | human/USA/VT5544/2003/2[P2H2F2] | USA | 2003 | Yes | Yes | Madupu | |
| LC068716 | strain: 1030787 | Japan | 2003 | Yes | Yes | Iida | |
| JX173082 | human/USA/VT384/2003/1[P1H1F1] | USA | 2003 | Yes | Yes | Madupu | |
| JX173083 | human/USA/VT2672/2003/1[P1H1F1] | USA | 2003 | Yes | Yes | Madupu | |
| JX173085 | human/USA/VT2612/2003/1[P1H1F1] | USA | 2003 | Yes | No | Madupu | |
| KF268130 | human/USA/UFL_Adv2/2004/2[P2H2F2] | USA | 2004 | Yes | No | Madupu | |
| LC068717 | strain: 1040264 | Japan | 2004 | Yes | Yes | Iida | |
| LC068718 | strain: 1040502 | Japan | 2004 | Yes | Yes | Iida | |
| JX173086 | human/USA/VT13862/2004/1[P1H1F1] | USA | 2004 | Yes | No | Madupu | |
| JX173077 | human/ARG/A8649/2005/2[P2H2F2] | Argentina | 2005 | Yes | No | Madupu | |
| LC068720 | strain: 1050158 | Japan | 2005 | Yes | No | Iida | |
| KF268129 | human/USA/UFL_Adv6/2005/6[P6H6F6] | USA | 2005 | Yes | Yes | Madupu | |
| JX423389 | human/USA/ak31_AdV6/2007/6[P6H6F6] | USA | 2007 | Yes | Yes | Madupu | |
| KF268199 | human/USA/UFL_Adv5/2008/5[P2/H5/F5] | USA | 2008 | Yes | Yes | Madupu | |
| KR699642 | strain CBJ113 | China | 2009 | Yes | Yes |
[ | |
| MF315028 | human/CHN/BJ04/2012/[P1/H2/F2] | China | 2012 | Yes | Yes |
[ | |
| KF951595 | strain DD28 | China | 2013 | Yes | No | An | |
| MF315029 | human/CHN/BJ09/2013/[P1/H2/F2] | China | 2013 | Yes | Yes |
[ | |
| AY601635 | strain NHRC Ad5FS 7151 | USA | Yes | Nod | Tibbetts | ||
| AC_000007 | same as NC_001405 | USA | 1953 | No |
[ | ||
| J01917 | same as NC_001405 | USA | 1953 | No |
[ | ||
| HQ413315 | Tonsil 99 | USA | 1953 | No |
[ | ||
| AC_000017 | same as AF534906 | USA | 1953 | No |
[ | ||
| M73260 | same as AC_000008 | USA | 1953 | No |
[ | ||
| LC068712 | strain: 870550 | Japan | 1987 | No | Iida | ||
| KF268127 | human/USA/CL_42/1988/5[P5H5F5] | USA | 1988 | No | Madupu | ||
| KF429754 | human/USA/Pitts_00149/1990/5[P5H5F5], strain F268310 | USA | 1990 | No | Madupu | ||
| LC068719 | strain: 1050156 | Japan | 2005 | No | Iida | ||
| AY339865 | No |
aThe genomes were sorted based on the columns “informative in RDP4”, “prototype” and “year of collection”.
bOnly WGSs identified as informative in RDP4 were analyzed.
cThe genomes closely related to another genome of the dataset based on phylogenetics and genetic distance and not involved in any major recombination event were not selected for recombination analysis.
dThis genome was not selected because the collection year was unknown.
Figure 1Phylogenetic network built with 25 HAdV-C WGSs. The clusters corresponding to type 1, 2, 5 and 6 are shown with different colors, type 1 in pastel pink, type 2 in blue, type 5 in pastel yellow and type 6 in pastel green. The sequences corresponding to the prototype viruses of type 1, 2, 5, 6 and 57 are shown in bold face and a dot.
Figure 2Neighbor joining phylogenetic trees of 25 HAdV-Cs. WGS as well contiguous regions were analyzed. The genomic region used to build the tree is shown for each tree. Type designations for the hexon and fiber genes are indicated by brackets. The sequences identified as outliers based on the p-distances from Supplementary Table S1 are identified with an asterisk. Bootstrap values greater than 70% are shown.
Figure 3Graphic representation of the genetic relationships between 32 HAdV-Cs. Each HAdV-C genome is represented in scale, as a colored box with 9 portions corresponding to the 9 genomic regions used in this analysis. The start and the end of the genome as well as the regions corresponding to the penton (P), hexon (H) and fiber (F) genes are indicated. The genomes of the prototype of type 1, 2, 5 and 6 are shown on the top. Type 1, 2, 5 and 6 are represented in pastel pink, blue, pastel yellow and pastel green, respectively. The other colors are indicating sequences with unknown origin. Hypothetical genomes are shown as dotted line boxes. Hypothetical genomic regions not involved in any recombination event are shown in light grey. The relationship between genomes was established based on phylogenetic trees (Fig. 2 and Supplementary Fig. S2) and p-distances (Supplementary Table S1). The relationship between genomes is indicated as follows: the related genomes that are not involved in major recombination events are linked with a thick black line (for example, KF268310 and KF268130). A recombinant virus is linked to its major parental genome (or backbone) by two parallel straight lines. Finally, the genomic sequences involved in recombination are linked by a parallelogram shaded in grey. The names of the genomes sharing the same genotype as a prototype virus are shown in the same color as the prototype name. The number of the RDP event confirming the genetic relationship is shown in a black box. The collection year is indicated on the left side of the figure.
Figure 4Breakpoint distribution across HAdV-C genome. The distribution plot was generated by RDP4 using 40 recombination events. A genomic map is shown above, featuring the genome as a straight line, the rightward ORFs above this line and the leftward ORFs below this line. ORFs are represented as thick arrows whereas introns are represented as a broken line. The genome NC_001405 was used for the annotation and numbering.
List of the 40 first RDP events.
| Event #a,b | Beginsc | Endsc | Recombinant Sequence(s)d,e | Minor Parental Sequence(s)e | Major Parental Sequence(s)e | RDPf | GENECONVf | Bootscanf | Maxchif | Chimaeraf | SiSscanf | 3Seqf | Mapg |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 28060 | 31054 | JX173080.1-2001 | 21 sequences | 6 sequences | 7.34E-186 | 1.25E-189 | 1.03E-183 | 1.60E-49 | 3.40E-51 | 5.81E-53 | 1.65E-12 | no |
| 2 | 27327 | 31044 | KF268129.1-2005 | 10 unknown sequences | 10 sequences | 9.04E-117 | 1.03E-117 | 1.05E-126 | 2.12E-46 | 2.02E-47 | 1.52E-55 | 1.10E-12 | no |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4 | 27327 | 31052 | 11 sequences | 10 sequences | 6 sequences | 5.87E-98 | 4.50E-115 | 3.58E-112 | 3.26E-37 | 2.44E-28 | 1.45E-56 | 5.51E-13 | no |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 8 | 4435 | 19128 | ^JX173081.1-2001 | 3 unknown sequences | JX173079.1-2002; KX384959.1-2002 | 6.57E-67 | 1.74E-47 | 6.97E-36 | 1.68E-24 | 5.61E-10 | 3.36E-20 |
| no |
| 9 | 5504 | 18363 | ^KF268129.1-2005 | 6 unknown sequences | 11 sequences | 5.65E-63 | 1.25E-48 | 3.76E-26 | 1.13E-25 |
| 3.05E-12 |
| no |
| 10 | 22362 | 26094 | ^KF268129.1-2005 | 2 unknown sequences | FJ349096.1-1953 | NS | 4.47E-61 |
| 3.30E-21 | 7.40E-21 | 1.29E-18 | 1.10E-12 | no |
| 11 | 20327 | 21795 | 11 sequences | 3 sequences | 7 sequences | 1.23E-43 | 8.38E-55 | 5.41E-50 | 1.64E-16 | 2.37E-14 | 1.17E-27 | 5.51E-13 | no |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 15 | 23695 | 28059* | JX173081.1-2001 | 7 sequences | MF315028-2012 | NS | 2.89E-10 |
| 1.21E-17 | 1.26E-14 |
| 1.65E-12 | no |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 18 | 32764 | 34473 | 8 sequences | 3 sequences | JX173080.1-2001 | 2.40E-39 | 9.91E-30 | 1.87E-39 |
|
| 2.27E-10 | 1.65E-12 | no |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 21 | 171* | 4504* | KF268129.1-2005 | 3 unknown sequences | 8 sequences | 3.06E-32 | 3.21E-28 | 8.74E-19 | 1.13E-11 | 2.32E-10 |
| 1.10E-12 | no |
| 22 | 6874 | 13947 | 5 sequences | JX173079.1-2002 | JX173082.1-2003 | 7.73E-31 | 4.06E-17 | 3.92E-20 |
|
|
| 2.20E-12 | no |
| 23 | 28137 | 28931 | ^AC_000008.1-1953; KF268199.1-2008 | 1 unknown sequence | 6 sequences | 2.27E-24 | 1.09E-29 | 3.72E-16 |
| 8.39E-13 | 9.30E-23 |
| no |
| 24 | 3163 | 6835* | ^JX173079.1-2002 | AF534906.1-1953 | 4 unknown sequences | 2.25E-28 | 4.61E-17 | 1.94E-28 |
|
| 8.19E-23 | 3.85E-12 | no |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 27 | 950* | 10020* | ^HQ003817.1-2001 | 5 unknown sequences | 7 sequences |
|
|
|
|
| 2.40E-13 | 5.03E-37 | no |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 29 | 34781* | 35890* | 5 sequences | 7 unknown sequences | 9 sequences | 1.97E-20 | 2.31E-24 | 5.52E-22 |
|
|
|
| no |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 32 | 18364* | 19197 | ^AC_000008.1-1953 | 10 sequences | 1 unknown sequence | 5.47E-21 | 9.04E-13 | 4.26E-20 |
|
| NS | 1.44E-10 | no |
| 33 | 28155 | 28899 | ^KF268129.1-2005 | 5 sequences | 6 unknown sequences | 1.99E-19 | NS |
|
| 3.50E-11 | 6.15E-18 |
| no |
| 34 | 18419* | 19197 | ^KF268129.1-2005 | 1 unknown sequence | 10 sequences | 6.65E-19 |
| 4.97E-19 |
|
| NS |
| no |
| 35 | 33322 | 34308* | 8 sequences | 1 unknown sequence | 7 sequences | 2.02E-18 | 1.59E-17 | 9.24E-19 |
|
|
|
| no |
| 36 | 34970* | 35937* | ^KF268129.1-2005 | 5 unknown sequences | 5 sequences | 1.59E-14 |
| 9.58E-18 |
|
| NS | NS | no |
| 37 | 20154 | 20845 | ^HQ003817.1-2001 | 5 sequences | 7 sequences | 1.43E-10 | 5.59E-11 |
|
|
| 6.64E-17 | 5.51E-13 | no |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 39 | 5397 | 6297 | JX173083.1-2003 | 13 sequences | 3 sequences | NS | 1.52E-12 | 1.00E-15 | NS | NS | NS |
| no |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
aRDP events are listed in Supplementary Table S13.
bRDP events that are identified in recombination map in Fig. 3 are shown in bold.
cUndetermined breakpoints are shown with “*”.
dThe potentially misidentified recombinants are indicated with ^.
eIn cases where more than 2 sequences are listed as potential candidate for a recombination event, the number of listed sequences is indicated.
fp-values > 1E-10 are italicized.
gRDP events shown in bold are featured in the recombination map in Fig. 3.