| Literature DB >> 24055951 |
Zhiqiang Mi1, Azeem Mehmood Butt2, Xiaoping An1, Tao Jiang1, Wei Liu1, Chengfeng Qin1, Wu-Chun Cao3, Yigang Tong4.
Abstract
Human adenovirus type 14 (HAdV-B14) was first reported in 1955 from the Netherlands and since then had been associated with outbreaks of febrile respiratory illness (FRI). In China, sporadic HAdV-B14 infections were first identified in 2010, in Guangzhou and Beijing. In 2012, an outbreak of FRI occurred in Beijing and the etiological agent was determined to be HAdV-B14. We present a complete HAdV-B14 genome sequence isolated from this recent FRI outbreak. Virus in 30 throat swab samples was detected using polymerase chain reaction assays, and confirmed by sequencing of the fiber, hexon and penton genes. Comparative genomics and phylogenetic analysis showed that the newly isolated HAdV-B14 (HAdV-B14 CHN) shared highest sequence homology with a 2006 isolate from the United States and clustered closely with other HAdV-B14 strains. It is expected that data from the present study will help in devising better protocols for virus surveillance, and in developing preventative measures.Entities:
Keywords: Bioinformatics; Genome analysis; Human adenovirus; Respiratory disease
Mesh:
Year: 2013 PMID: 24055951 PMCID: PMC7126778 DOI: 10.1016/j.ygeno.2013.09.001
Source DB: PubMed Journal: Genomics ISSN: 0888-7543 Impact factor: 5.736
Fig. 1Genome organization of HAdV-B14 CHN strain.
The genome is represented by two black horizontal lines marked at 5000 bp intervals. Protein encoding regions are shown as arrows indicating transcriptional orientation. Forward arrows (above the horizontal black line) show coding regions in the 5′ to 3′ direction and arrows pointing to the left (below the horizontal black line) show the coding regions encoded on the complementary strand. Genes are colored in blue and coding sequences are colored in light yellow.
Genome organization and coding sequences annotation from the HAdV-B14 CHN strain.
| Region | Nucleotide locality | Gene product | Product length | TATA box | ATG site | Stop site | Poly (A) signal |
|---|---|---|---|---|---|---|---|
| ITR | 1–137 | ||||||
| E1A | 587–1166, 1251–1459 | 29.1 kDa | 262 | 494–501 | 587 | 1457 | 1518–1523 |
| E1A | 587–1073, 1251–1459 | 25.7 kDa | 231 | 494–501 | 587 | 1457 | 1518–1523 |
| E1A | 587–658, 1251–1355 | 6.5 kDa | 58 | 494–501 | 587 | 1353–1355 | 1518–1523 |
| E1B | 1629–2171 | 20 kDa | 180 | 1575–1581 | 1629 | 2169–2171 | 3471–3476 |
| E1B | 1934–3418 | 54.9 kDa | 494 | 1575–1581 | 1934 | 3416–3418 | 3471–3476 |
| IVa2 | 3985c–5031c | IVa2 protein | 348 | ND | 5607c | 3985c–3987c | 3963c–3968c |
| E2A | 21833c–23389c | DNA binding protein | 518 | ND | 23387c | 21833c | 21791c–21796c |
| E2B | 5088c–8660c, 13643c–13651c | DNA polymerase | 1122 | 13693c–13698c | 13649c | 5088c | ND |
| E2B | 8459–10420, 13643–13651 | pTP | 656 | 13693c–13698c | ND | ND | ND |
| IX | 3498–3917 | 14.2 kDa | 139 | 3416–3421 | 3501 | 3918 | 3942–3947 |
| E4 | 31801c–32052c | ORF6/7 | 141 | 34452c–34459c | 32940c–32942c | 31795c–31797c | 31778c–31783c |
| E4 | 32043c–32942c | ORF6; 34 kDa | 299 | 34452c–34459c | 32940c–32942c | 32043c–32045c | 31778c–31783c |
| E4 | 32845c–33213c | ORF4; 13 kDa | 122 | 34452c–34459c | 33211c–33213c | 32845c–32847c | 31778c–31783c |
| E4 | 33222c–33575c | ORF3; 13.5 kDa | 117 | 34452c–34459c | 33573c–33575c | 33222c–33224c | 31778c–31783c |
| E4 | 33572c–33961c | ORF2; 14.3 kDa | 129 | 34452c–34459c | 33959c–33961c | 33572c–33574c | 31778c–31783c |
| E4 | 34004c–34381c | ORF1; 14.2 kDa | 125 | 34452c–34459c | 34379c–34381c | 34004c–34006c | 31778c–31783c |
| L1 | 10668–11828 | 43.9 kDa | 386 | ND | 10668–10670 | 11826–11828 | 13624–13629 |
| L1 | 11854–13617 | pIIIa; 65.6 kDa | 587 | ND | 11854–11856 | 13615–13617 | 13624–13629 |
| L2 | 13699–15375 | Penton | 558 | ND | 13699–13701 | 15373–15375 | 17335–17340 |
| L2 | 15380–15958 | pVII | 192 | ND | 15380–15382 | 15956–15958 | 17335–17340 |
| L2 | 16001–17056 | pV | 351 | ND | 16001–16003 | 17054–17056 | 17335–17340 |
| L2 | 17085–17315 | pX | 76 | ND | 17085–17087 | 17313–17315 | 17335–17340 |
| L3 | 17396–18136 | pVI | 246 | ND | 17396–17398 | 18134–18136 | 21779–21784 |
| L3 | 18252–21089 | Hexon | 945 | ND | 18252–18254 | 21087–21089 | 21779–21784 |
| L3 | 21126–21755 | 23 kDa | 209 | ND | 21126–21128 | 21753–21755 | 21779–21784 |
| L4 | 23420–25858 | 100 kDa | 812 | ND | 23420–23422 | 25856–25858 | 27333–27338 |
| L4 | 25590–26165 | 22 kDa | 191 | ND | 25590–25592 | 26163–26165 | 27333–27338 |
| L4 | 26047–26439 | 33 kDa | 130 | ND | 25590–25592 | 26437–26439 | 27333–27338 |
| L4 | 26489–27172 | pVIII | 227 | ND | 26489–26491 | 27170–27172 | 27333–27338 |
| E3 | 27172–27489 | 11.7 kDa | 105 | 26854–26859 | 27172–27174 | 27487–27489 | 30596–30601 |
| E3 | 27443–27838 | 14.6 kDa | 131 | 26854–26859 | 27443–27445 | 27836–27838 | 30596–30601 |
| E3 | 27823–28323 | 18.4 kDa | 166 | 26854–26859 | 27823–27825 | 28321–28323 | 30596–30601 |
| E3 | 28343–28888 | 20.1 kDa | 181 | 26854–26859 | 28343–28345 | 28886–28888 | 30596–30601 |
| E3 | 28906–29457 | 20.8 kDa | 183 | 26854–26859 | 28906–28908 | 29455–29457 | 30596–30601 |
| E3 | 29501–29776 | 10.1 kDa | 91 | 26854–26859 | 29501–29503 | 29774–29776 | 30596–30601 |
| E3 | 29781–30185 | 14.9 kDa | 134 | 26854–26859 | 29781–29783 | 30183–30185 | 30596–30601 |
| E3 | 30178–30585 | 15 kDa | 135 | 26854–26859 | 30178–30180 | 30583–30585 | 30596–30601 |
| U | 30609c–30733c | U protein | 54 | ND | 30771c | 30609c | ND |
| ND | 102c–281c | Hypothetical protein | 141 | ND | 102c | 281c | ND |
| L5 | 30788–31759 | Fiber | 323 | ND | 30788–30790 | 31757–31759 | 31762–31767 |
| ITR | 34624c–34760c |
Coding sequences with spliced regions are indicated by double entries separated by “,” in the nucleotide locality column. Coding sequences transcribed from the complementary strand are designated by “c”. ND; not determined. p: protein.
GenBank accession numbers and details of adenoviruses genomes used in present study.
| S no | Viruses | Serotype | Accession IDs | Host | Collection date (year) | Country |
|---|---|---|---|---|---|---|
| 1 | HAdV-B11 (Slobitski) | B11 | Human | 1956 | USA | |
| 2 | HAdV-B14p1 (303600) | B14 | Human | 2006 | USA | |
| 3 | HAdV-B14p1 (BJ430) | B14 | Human | 2011 | China | |
| 4 | HAdV-B14 (CHN)# | B14 | Human | 2012 | China | |
| 5 | HAdV-B14p (de Wit) | B14 | Human | 1955 | Netherlands | |
| 6 | HAdV-B14 (GZ01) | B14 | Human | 2010 | China | |
| 7 | HAdV-B16 (ch. 79) | B16 | Human | – | USA | |
| 8 | HAdV-B21 (AV-1645) | B21 | Human | – | USA | |
| 9 | HAdV-B3 (GB) | B3 | Human | 2004 | USA | |
| 10 | HAdV-B3 (Guangzhou01) | B3 | Human | 2005 | China | |
| 11 | HAdV-B3 (Guangzhou02) | B3 | Human | 2004 | China | |
| 12 | HAdV-B3 (NHRC 1276) | B3 | Human | 1997 | USA | |
| 13 | HAdV-B34 (Compton) | B34 | Human | 1972 | USA | |
| 14 | HAdV-B35 (Holden) | B35 | Human | 1973 | USA | |
| 15 | HAdV-B50 (Wan) | B50 | Human | – | USA | |
| 16 | HAdV-B7 (Gomen) | B7 | Human | 1954 | USA | |
| 17 | HAdV-B7 (NHRC 1315) | B7 | Human | 1997 | USA | |
| 18 | HAdV-B7 (0901 HZ) | B7 | Human | 2009 | China | |
| 19 | HAdV-B55 (QS-DLL) | B55 | Human | 2006 | China | |
| 20 | SAdV-21 | B21 | AC_000010 | Chimpanzee | – | – |
| 21 | HAdV-D9 | D9 | Human | – | – | |
| 22 | HAdV-E4 | E4 | Human | – | – | |
| 23 | HAdV-G52 | G52 | Human | – | – | |
| 24 | HAdV-C1 | C1 | Human | – | – | |
| 25 | HAdV-F40 | F40 | Human | – | – | |
| 26 | HAdV-A12 | A12 | AC_000005 | Human | – | – |
Dashes (–) indicate data not available. HAdV-B14 isolate sequenced in the present study is indicated by (#).
Fig. 2Phylogenetic analysis.
Maximum-likelihood phylogenetic trees were estimated using full-length (A) genomic sequences, (B) E1A gene sequences, (C) hexon gene sequences, and (D) fiber gene sequences of AdVs. The GenBank accession IDs and details of the full-length genomic sequences and E1A, hexon, and fiber genes sequences used in phylogenetic analysis are given in Table 1 and Table S1 respectively. Reference sequences representing different AdV genomes together with the newly sequenced HAdV-B14 CHN (bold format) were included in each tree construction dataset. Isolates were named using the following format: AdV type (sampling country/isolate name/isolation year). The percentage of trees in which the associated taxa clustered together is shown next to the branches. The reliability of the tree was assessed by bootstrap analysis with 1000 replications.