| Literature DB >> 16169033 |
Dominique Sirena1, Zsolt Ruzsics, Walter Schaffner, Urs F Greber, Silvio Hemmi.
Abstract
Human adenovirus (Ad) serotype 3 causes respiratory infections. It is considered highly virulent, accounting for about 13% of all Ad isolates. We report here the complete Ad3 DNA sequence of 35,343 base pairs (GenBank accession DQ086466). Ad3 shares 96.43% nucleotide identity with Ad7, another virulent subspecies B1 serotype, and 82.56 and 62.75% identity with the less virulent species B2 Ad11 and species C Ad5, respectively. The genomic organization of Ad3 is similar to the other human Ads comprising five early transcription units, E1A, E1B, E2, E3, and E4, two delayed early units IX and IVa2, and the major late unit, in total 39 putative and 7 hypothetical open reading frames. A recombinant E1-deleted Ad3 was generated on a bacterial artificial chromosome. This prototypic virus efficiently transduced CD46-positive rodent and human cells. Our results will help in clarifying the biology and pathology of adenoviruses and enhance therapeutic applications of viral vectors in clinical settings.Entities:
Mesh:
Substances:
Year: 2005 PMID: 16169033 PMCID: PMC7172737 DOI: 10.1016/j.virol.2005.08.024
Source DB: PubMed Journal: Virology ISSN: 0042-6822 Impact factor: 3.616
Previously published human adenovirus 3 sequences used for primer design
| Sequence origin | GenBank Accession No. | Reference | % Homology compared to the discussed sequence (number of differences) | Region (bp) |
|---|---|---|---|---|
| Ad3 complete sequence | This publication | 1–35,342 | ||
| Ad3 left end fragment containing ITR, E1A | ( | 99.24 (12/1572) | 1–1569 | |
| Ad3 ITR, left end | ( | 99.4 (1/158) | 3–160 | |
| Ad3 ITR, left end | ( | 95.8 (32/770) | 12–762 | |
| Ad3 E1A 9S protein, E1A 13S protein, and E1A 12S protein genes, complete cds | ( | 100 | 576–1455 | |
| Ad3 E1A protein gene, partial cds | Lin et al., 2003, unpublished | 99.3 (3/430) | 704–1134 | |
| Ad3 polypeptide IX gene, complete cds | ( | 100 | 3413–3965 | |
| Ad3 DNA polymerase gene, partial cds | Chmielewicz et al., 2004, unpublished | 100 | 5398–5646 | |
| Ad3 virus-associated RNA, pre-terminal protein and 52/55-kDa protein genes, partial cds | Ma et al., 1996, unpublished | 98.3 (10/582) | 10,305–10,890 | |
| Ad3 virus-associated RNA I and RNA II genes | ( | 99.7 (1/450) | 10,399–10,849 | |
| Ad3 gene for pIIIa, pVII and penton base protein | ( | 99.8 (3/1986) (penton 100% identical) | 13,686–15,668 (13,905–15,540) | |
| Ad3 hexon gene | ( | 99.6 (10/2835) (hexon 3 aa differences) | 18,417–21,251 | |
| Ad3 hexon gene, partial cds (nonfunctional) | Lin et al., 2003, unpublished | 97.7 (18/778) | 18,524–19,295 | |
| Ad3 hexon gene, partial cds | Ju et al., 2004, unpublished | 98.7 (5/397) | 19,010–19,406 | |
| Ad3 L3–23-kDa gene for chymotrypsin-like endoprotease | ( | 99.8 (2/1273) | 20,917–22,190 | |
| Ad3 E3 region | ( | 99.9 (3/4379) | 26,993–31,372 | |
| Ad3 fiber polypeptide gene | ( | 100 | 31,118–32,447 | |
| Ad3 fiber protein gene, partial cds | Lin et al., 2003, unpublished | 98.7 (9/673) | 31,406–32,079 | |
| Ad3 ITR, right end | ( | 99.4 (1/158) | 35,183–35,340 |
Cds, coding sequence; ITR, inverted terminal repeat; aa, amino acid residue.
Base composition and GC contents of selected human adenovirus genomes
| Ad species | Ad serotype | GenBank Accession No. | Mol% | % GC content | |||
|---|---|---|---|---|---|---|---|
| A | C | G | T | ||||
| A | Ad12 | 27.34 | 23.48 | 23.04 | 26.14 | 46.52 | |
| B1 | Ad3 | 25.29 | 25.72 | 25.33 | 23.66 | 51.05 | |
| B1 | Ad7 | 25.37 | 25.78 | 25.25 | 23.61 | 51.03 | |
| B2 | Ad11p | 26.04 | 24.42 | 24.46 | 25.08 | 48.88 | |
| B2 | Ad35p | 26.04 | 24.40 | 24.49 | 25.08 | 48.88 | |
| C | Ad5 | 23.29 | 28.03 | 27.16 | 21.51 | 55.20 | |
| D | Ad17 | 22.76 | 28.14 | 28.44 | 20.66 | 56.58 | |
| E | Ad4 | 21.95 | 28.93 | 28.72 | 20.39 | 57.65 | |
| F | Ad40 | 24.87 | 26.21 | 25.01 | 23.91 | 51.22 | |
Although all three sequences show identical base composition, AY598970 differs by four and AF532578 by eleven nucleotides positions from AY163756.
Percent genome homology among selected human adenovirus genomes
| Type | Species | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| A | B1 | B1 | B2 | B2 | C | D | E | F | |
| Ad12 ( | Ad3 | Ad7 ( | Ad11p ( | Ad35p ( | Ad5 ( | Ad17 ( | Ad4 ( | Ad40 ( | |
| Ad12 | 100 | 60.14 | 60.35 | 60.17 | 59.99 | 59.78 | 58.94 | 58.77 | 61.52 |
| Ad3 | 100 | 96.43 | 82.56 | 82.45 | 62.75 | 65.74 | 73.18 | 59.44 | |
| Ad7 | 100 | 83.65 | 82.84 | 62.94 | 65.55 | 72.99 | 59.36 | ||
| Ad11p | 100 | 98.14 | 61.54 | 63.62 | 70.17 | 59.16 | |||
| Ad35p | 100 | 61.59 | 63.69 | 70.22 | 59.15 | ||||
| Ad5 | 100 | 62.71 | 64.23 | 58.94 | |||||
| Ad17 | 100 | 68.86 | 59.69 | ||||||
| Ad4 | 100 | 60.17 | |||||||
| Ad40 | 100 | ||||||||
Fig. 1Genome organization of human Ad3. The map of the Ad3 genome (35,343 bp) is divided into 100 map units (mu). The genome consists of five early transcription units, E1A, E1B, E2, E3, and E4, two delayed early units IX and IVa2, and one late unit, the major late unit. 39 potential protein-coding regions and the direction of transcription are shown as solid arrows, relative to their position and orientation. Most late gene transcripts contain the tripartite leader consisting of three spliced leader sequences (L1–L3) at their 5′ ends. Splice events contributing to the transcript generation are indicated by diagonal thin lines. In addition, the Ad3 genome encodes VA RNA I and II transcription units and seven hypothetical (hyp) ORFs with unknown biological functions.
Predicted noncoding motifs of the adenovirus 3 genome
| Motif | Description | Position |
|---|---|---|
| CTATCT..TGACGT | ITR | 1–136 |
| ATAATATACC | DNApol-pTP binding site | 9–18 (ITR) |
| TGGAATGGTGCCAA | NFI binding motif | 26–39 (ITR) |
| CATGTAAATGA | NFIII binding motif | 40–50 (ITR) |
| TATTTA | TATA box for E1A | 480–485 |
| AATAAA | PolyA signal for E1A | 1494–1499 |
| TATATA | TATA box for E1B | 1549–1554 |
| TAAAGT | TATA box for pIX | 3384–3389 |
| AATAAA | PolyA signal pIX | 3909–3914 |
| AATAAA | PolyA signal for E2B | 3947–3952c |
| TGATTGGCTT | Inverted CAAT box for MLP | 5821–5830 |
| GGCCACGTGACC | Upstream element for MLP | 5840–5851 |
| GCCGGGGGGG | MAZ binding motif for MLP | 5862–5871 |
| TATAAAAG | TATA box for MLP | 5872–5879 |
| GGGGGCGGGCC | MAZ/SP1 binding motif for MLP | 5879–5889 |
| TCACTGT | Initiator element for MLP | 5901–5907 |
| TTGTCAGTTTC | DE1 for MLP | 5988–5998 |
| AACGAGGAGGATTTGA | DE2a and DE2b for MLP | 6003–6018 |
| AATAAA | PolyA signal for L1 | 13,830–13,835 |
| AATAAA | PolyA signal for L2 | 17,496–17,501 |
| AATAAA | PolyA signal for L3 | 21,938–21,943 |
| AATAAA | PolyA signal for E2A | 21,950–21,955c |
| TTAA | E2 TATA-box-like element | 26,639–26,642c |
| TATAA | TATA box for E3 | 27,085–27,089 |
| AATAAA | PolyA signal for L4 | 27,712–27,717 |
| AATAAA | PolyA signal for E3A | 29,001–29,006 |
| AATAAA | PolyA signal for E3B | 31,181–31,186 |
| AATAAA | PolyA signal for L5 | 32,335–32,340 |
| AATAAA | PolyA signal for L6 | 34,866–34,871 |
| AATAAA | PolyA signal for E4 | 32,352–32,357c |
| TATATATT | TATA box for E4 | 35,034–35,040c |
| ATAATATACC | DNApol-pTP binding site | 35,305–35,318c |
| CTATCT..TGACGT | ITR | 35,208–35,343c |
The nucleotide positions of putative motifs are noted in the 5′ to 3′ orientation.
cds, coding sequence; ITR, inverted terminal repeat; MLP, major late promoter; c, complementary.
Splice site predicted using ERPIN RNA structure prediction (see Materials and methods) cds, coding sequence; ITR, inverted terminal repeat; MLP, major late promoter; c, complementary.
Forty-six predicted translation products and VA RNA genes of the Ad3 genome
| Feature | MW (kDa) | Ad5 (others) equivalent | ATG | STOP |
|---|---|---|---|---|
| E1 region | ||||
| E1A 13s protein | 28.4 | E1A 13s mRNA 32-kDa | 576 j1250 | 1155 1455 |
| E1A 12s protein | 24.6 | E1A 12s mRNA 26-kDa | 576 j1250 | 1062 1455 |
| E1A 9s protein | 6.7 | E1A 9s mRNA 6-kDa | 576 j1250 | 647 1351 |
| E1B 21-kDa | 20.5 | E1B 19-kDa | 1603 | 2139 |
| E1B 55-kDa | 54.7 | E1B 55-kDa | 1908 | 3386 |
| Intermediate transcription regions IX and IVa2 | ||||
| Protein IX | 14.1 | IX | 3480 | 3896 |
| Protein IVa2 | 50.6 | IVa2 | 5572c j5281c | 5560c 3948c |
| E2 region | ||||
| E2B DNA pol | 128.6 | DNA pol | 8419c | 5051c |
| E2B pTP | 73.8 | Terminal protein | 10,344c | 8422c |
| E2A DBP | 58.3 | DBP | 23,557c | 22,004c |
| E3 region | ||||
| E3 12.1-kDa | 12.1 | E3 12.5-kDa | 27,403 | 27,723 |
| E3 16-kDa | 16.0 | 27,677 | 28,117 | |
| E3 gp19-kDa | 19.0 | gp19-kDa | 28,102 | 28,620 |
| E3 20.1-kDa | 20.0 | 28,650 | 29,189 | |
| E3 20.5-kDa | 20.5 | 29,202 | 29,771 | |
| E3 9-kDa | 9.0 | 29,786 | 30,019 | |
| E3 10.2-kDa | 10.3 | 30,061 | 30,336 | |
| E3 15.2-kDa | 15.2 | 30,341 | 30,745 | |
| E3 15.3-kDa | 15.2 | 30,738 | 31,148 | |
| E4 region | ||||
| E4 ORF1 | 14.1 | E4 ORF1 | 34,953c | 34,576c |
| E4 ORF2 | 16.0 | E4 ORF B | 34,579c | 34,145c |
| E4 ORF3 | 13.6 | E4 ORF3 | 34,148c | 33,795c |
| E4 ORF4 | 14.3 | E4 ORF4 | 33,786c | 33,418c |
| E4 ORF6 34-kDa | 34.7 | E4 34-kDa | 33,515c | 32,616c |
| E4 ORF6/7 | 16.0 | E4 ORF6/7 | 33,515c j32,619c | 33,342c 32,368c |
| VA RNA region | ||||
| VA RNA I | – | VA I | 10,422 | 10,591 |
| VA RNA II | – | VA II | 10,666 | 10,837 |
| L region | ||||
| L1 52/55-kDa | 43.8 | 52/55-kDa protein | 10,869 | 12,026 |
| L1 pIIIa | 65.7 | Pro-IIIa | 12,051 | 13,817 |
| L2 III (penton base) | 61.8 | Penton protein | 13,905 | 15,539 |
| L2 pVII | 21.2 | Pro-VII | 15,551 | 16,129 |
| L2 pV | 40.1 | Pro-V | 16,172 | 17,221 |
| L2 pX | 8.3 | Pro-X | 17,250 | 17,477 |
| L3 pVI | 27.1 | Pro-VI | 17,553 | 18,305 |
| L3 II (hexon) | 106.2 | Hexon | 18,418 | 21,252 |
| L3 23-kDa (protease) | 23.8 | 23-kDa (protease) | 21,289 | 21,918 |
| L4 100-kDa | 92.3 | L4 100-kDa protein | 23,588 | 26,074 |
| L4 22-kDa | 22.5 | 22-kDa | 25,776 | 26,375 |
| L4 33-kDa | 30.9 | 33-kDa | 25,776 j26,294 | 26,244 26,649 |
| L4 pVIII | 24.9 | Pro-VIII | 26,720 | 27,403 |
| L5 IV (fiber) | 34.8 | L5 IV fiber | 31,368 | 32,327 |
| L6 (agnoprotein, hyp) | 18.8 | – | 33,641 | 34,150 |
| Miscellaneous proteins | ||||
| 20.3-kDa (hyp) | 20.3 | (Ad7/Ad4) 20.6-kDa/19.4-kDa (hyp) | 5123 | 5692 |
| 11.5-kDa (hyp) | 11.5 | (Ad7/Ad11/Ad35/Ad4) 11.5-kDa (hyp) | 6144 | 6464 |
| 21.9-kDa (hyp) | 21.9 | (Ad7/Ad4/Ad11) (hyp) | 7829 | 8425 |
| 14.6-kDa (hyp) | 14.6 | (Ad7/Ad4) 14.5-kDa (hyp) | 8548 | 8949 |
| 19-kDa (hyp) | 19k | (Ad7) E2B 19-kDa (hyp) | 7389c | 6868c |
| 11.5-kDa (hyp) | 11.5 | (Ad7) E2B 11.3-kDa (hyp) | 9857c | 9543c |
C, complementary strand; j, join; hyp, hypothetical.
Sizes determined from our sequences may deviate from previously determined values.
Comparison of primary amino-acid sequences of predicted Ad3 proteins with corresponding proteins of other human adenoviruses
| Species | ||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| A | B | C | D | E | F | |||||||||||
| B1 | B2 | |||||||||||||||
| Serotypes | Ad12 | Ad7 | Ad16 | Ad21 | Ad11 | Ad14 | Ad35 | Ad2 | Ad5 | Ad8 | Ad9 | Ad17 | Ad36 | Ad4 | Ad40 | Ad41 |
| E1 region | ||||||||||||||||
| E1A 12s protein | 39 | 98 | 76 | 76 | 36 | 41 | 53 | 56 | 33 | 35 | ||||||
| E1A 13s protein | 42 | 98 | 98 | 96 | 78 | 88 | 77 | 36 | 36 | 58 | 43 | 44 | 58 | 57 | 40 | 42 |
| E1A 9s protein | 98 | 72 | 72 | 49 | ||||||||||||
| E1B 21-kDa | 41 | 98 | 88 | 87 | 50 | 48 | 52 | 53 | 61 | 44 | ||||||
| E1B 55-kDa | 47 | 98 | 85 | 85 | 53 | 53 | 57 | 62 | 71 | 47 | ||||||
| Intermediate transcription regions | ||||||||||||||||
| Protein IX | 58 | 100 | 88 | 89 | 51 | 51 | 61 | 78 | 57 | 57 | ||||||
| Protein IVa2 | 78 | 99 | 93 | 93 | 82 | 82 | 85 | 85 | 91 | 80 | ||||||
| E2 region | ||||||||||||||||
| E2B DNA pol | 73 | 98 | 91 | 91 | 76 | 76 | 81 | 80 | 88 | 72 | ||||||
| E2B pTP | 77 | 99 | 94 | 94 | 80 | 80 | 80 | 79 | 89 | 73 | ||||||
| E2A DBP | 50 | 96 | 83 | 83 | 54 | 54 | 60 | 59 | 74 | 51 | 51 | |||||
| E3 region | ||||||||||||||||
| E3 12.1-kDa | 67 | 99 | 100 | 99 | 87 | 88 | 88 | 55 | 55 | 66 | 66 | 66 | 79 | |||
| E3 16-kDa | 97 | 98 | 94 | 62 | 61 | 62 | 30 | 27 | 35 | 28 | 33 | 54 | ||||
| E3 gp19-kDa | 23 | 98 | 98 | 98 | 80 | 70 | 80 | 32 | 36 | 35 | 35 | 34 | 63 | |||
| E3 20.1-kDa | 24 | 97 | 97 | 97 | 75 | 73 | 75 | 32 | 32 | 32 | 31 | |||||
| E3 20.5-kDa | 96 | 97 | 98 | 62 | 61 | 62 | 33 | 27 | 33 | 33 | ||||||
| E3 9-kDa | 66 | 81 | 98 | 40 | 42 | 42 | 48 | |||||||||
| E3 10.2-kDa | 52 | 100 | 98 | 100 | 93 | 95 | 93 | 51 | 47 | 65 | 64 | 75 | 42 | 43 | ||
| E3 15.2-kDa | 29 | 98 | 95 | 96 | 64 | 64 | 63 | 37 | 31 | 43 | 44 | 49 | 48 | 32 | 32 | |
| E3 15.3-kDa | 56 | 87 | 87 | 88 | 86 | 86 | 86 | 49 | 53 | 74 | 77 | 76 | 79 | 51 | 51 | |
| E4 region | ||||||||||||||||
| E4 ORF1 | 50 | 97 | 96 | 95 | 46 | 46 | 45 | 68 | ||||||||
| E4 ORF2 | 30 | 97 | 93 | 92 | 32 | 32 | 51 | 39 | 58 | 28 | ||||||
| E4 ORF3 | 61 | 100 | 98 | 98 | 49 | 49 | 72 | 72 | 86 | 57 | ||||||
| E4 ORF4 | 37 | 95 | 92 | 92 | 45 | 45 | 43 | 53 | 33 | |||||||
| E4 ORF6 34-kDa | 52 | 97 | 98 | 97 | 59 | 59 | 69 | 67 | 71 | 71 | ||||||
| E4 ORF6/7 | 42 | 97 | 97 | 97 | 47 | 47 | 66 | 34 | ||||||||
| L region | ||||||||||||||||
| L1 52/55-kDa | 73 | 100 | 94 | 95 | 70 | 69 | 79 | 79 | 84 | 77 | ||||||
| L1 IIIa | 73 | 99 | 93 | 93 | 75 | 75 | 79 | 79 | 88 | 72 | ||||||
| L2 III (penton base) | 73 | 99 | 85 | 85 | 70 | 70 | 76 | 76 | 84 | 72 | 73 | |||||
| L2 pVII | 74 | 98 | 91 | 91 | 69 | 69 | 81 | 87 | 69 | |||||||
| L2 pV | 59 | 99 | 84 | 84 | 61 | 60 | 65 | 64 | 79 | 57 | ||||||
| L2 pX | 62 | 100/99 | 92 | 92 | 69 | 69 | 77 | 88 | 67 | |||||||
| L3 pVI | 64 | 99 | 86 | 86 | 63 | 63 | 70 | 70 | 79 | 61 | ||||||
| L3 II (hexon) | 78 | 95 | 86 | 85 | 86 | 85 | 77 | 77 | 84 | 83 | 84 | 79 | 78 | |||
| L3 23-kDa (protease) | 81 | 98 | 89 | 89 | 80 | 80 | 83 | 79 | 89 | 81 | 83 | |||||
| L4 100-kDa | 64 | 97 | 85 | 85 | 64 | 69 | 62 | 76 | 65 | 63 | ||||||
| L4 22-kDa | 43 | 98 | 78 | 77 | 50 | 51 | 53 | 63 | 73 | 45 | ||||||
| L4 33-kDa | 98 | 69 | 69 | 46 | 47 | 62 | 40 | 38 | ||||||||
| L4 pVIII | 76 | 98 | 99 | 99 | 94 | 93 | 94 | 79 | 79 | 82 | 83 | 82 | 90 | 79 | 81 | |
| L5 IV (fiber) | 33 | 57 | 62 | 58 | 58 | 57 | 58 | 29 | 29 | 28 | 28 | 31 | 31 | 31 | 27 | |
| L6 (agnoprotein) | 97 | 94 | 95 | |||||||||||||
Fig. 2Generation of a BAC of E1-deleted Ad3 encoding the eGFP reporter gene. (A) The plasmid pBSAd3LRzeo contains a zeocin selection cassette flanked by a 468-bp fragment of the Ad3 left end sequence and a 528-bp fragment of the Ad3 right end sequence. (B) Transfer of the NotIB–HindIII fragment containing the Ad3LRzeo cassette to the single copy pKSB2 plasmid. (C) Transformation of DH10B bacteria expressing the phage lambda recombinases allows homologous recombination between the SalI-linearized pKSB2Ad3LRzeo and Ad3 genomic DNA, resulting in pKSBAd3wt. (D) Subsequent homologous recombination between pKSBAd3wt and a CMV-eGFP/zeo cassette flanked by short homologous sequences of 40 bp results in pKSB2 Ad3CMV-eGFP, which contains the eGFP expression cassette in reverse orientation. (E) Release of the viral genome by the flanking unique MluI endonuclease sites followed by transfection of helper 911-Ad3E1B cells yielding Ad3CMV-eGFP. (For details of the procedure, see Materials and methods.)
Fig. 3Ad3-mediated e-GFP expression in permissive human cells and CD46-transfected rodent cells. The indicated cells were incubated with eGFP expressing Ad3, Ad5, or Ad5/F3 at different MOIs. eGFP expression was analyzed 2 days post-infection by flow cytometry. Results are expressed as means of fluorescence intensity (MFI).