| Literature DB >> 19939241 |
Soeren Hofmayer1, Ijad Madisch, Sebastian Darr, Fabienne Rehren, Albert Heim.
Abstract
BACKGROUND:Entities:
Mesh:
Substances:
Year: 2009 PMID: 19939241 PMCID: PMC2794291 DOI: 10.1186/1471-2164-10-557
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Map of the genome organization and transcription units of HAdV-A31. Early and late transcription units are represented in different colors, intermediate gene products in white. The block arrows represent the predicted protein, titled either by protein name or predicted molecular size. Orientation of the arrows indicates the direction of transcription.
HadV-A31 annotation of coding regions
| Region | Common name | Product | Location | Length in aa |
|---|---|---|---|---|
| E1A | n.n. | 29.4K | 479-1045 & 1124-1348 | 266 |
| E1B | 19 K small t-antigen | 18.3 K | 1488-1955 | 156 |
| 55 K large t-antigen | 53.1 K | 1793-3208 | 472 | |
| Intermediate | IX | 14.9 K | 3290-3721 | 144 |
| IVa2 | 50.9 K | c3757-5093 & c5372-5384 | 449 | |
| E2B | DNA Polymerase | 134.6 K | c4866-8042 & c13193-13201 | 1184 |
| pTP | 73.1 K | c8222-9982 & c13193-13201 | 633 | |
| L1 | 52/55 K | 41.8 K | 10307-11413 | 368 |
| pIIIa | 64.4 K | 11435-13171 | 578 | |
| L2 | III (Penton Protein) | 57.1 K | 13246-14763 | 505 |
| V | 39.8 K | 15373-16416 | 347 | |
| pVII | 20.4 K | 14777-15340 | 187 | |
| pX | 7.9 K | 16440-16658 | 72 | |
| L3 | pVI | 28.5 K | 16734-17516 | 260 |
| Hexon | 103.7 | 17575-20343 | 922 | |
| Protease | 23.3 K | 20370-20981 | 203 | |
| E2A | DNA binding Protein | 55.3 K | c21065-22522 | 485 |
| L4 | 100 K | 86.8 K | 22551-24878 | 775 |
| 22 K | 20.2 K | 24616-25146 | 176 | |
| 33 K | 23 K | 24616-24913 & 25092-25393 | 199 | |
| pVIII | 25.2 K | 25452-26153 | 233 | |
| E3 | 12.5 K | 12.1 K | 26153-26470 | 105 |
| CR 1 alpha | 28.6 K | 26424-27212 | 250 | |
| CR 1 beta | 29.4 K | 27215-27967 | 262 | |
| RID alpha | 10.5 K | 27999-28274 | 91 | |
| RID beta | 12.5 K | 28271-28600 | 109 | |
| 14.7 K | 14.6 K | 28593-28979 | 128 | |
| L5 | Fiber | 58.9 K | 29157-30827 | 556 |
| E4 | ORF 1 | 13.9 K | c33046-33429 | 127 |
| ORF 2 | 14.9 K | c32619-33014 | 131 | |
| ORF 3 | 13.4 K | c32272-32622 | 116 | |
| ORF 4 | 13.7 K | c31904-32266 | 120 | |
| ORF 5 | 33.9 K | c31102-31971 | 289 | |
| ORF 6/7 | 13.9 K | c30861-31082 & c31834-31971 | 119 |
Figure 2Phylogenetic analysis of all available complete genomic HAdV sequences representing all human adenovirus species (A to G), including the newly generated HAdV-A31 sequence. The tree was generated with MEGA 3.1 using neighbor-joining method, bootstrap values (%) were generated with 1,000 pseudoreplicates. For nucleotide accession numbers see Methods section.
Figure 3Global pairwise sequence alignment of the HAdV-A31 genome with representative types of each HAdV species. The x axis shows the genome position, the y axis shows the sequence conservation in percent. Arrows on top display the transcription units and the direction of their transcription.
Figure 4Schematic view of the predicted E3 CR1 beta (A) and the E3 RID beta (B) proteins of HAdV-A31, -A12 and -F40 or -C5, respectively. (A): a V-Set domain (red box) was only predicted within the N-terminal region of the E3 CR1 beta protein of HAdV-A31. (B): N-terminal phosphorylation sites (small red boxes) were predicted both for the HAdV-A31 and -C5 RID beta proteins, but not for HAdV-A12. Domain predictions were carried out using web based Pfam, ProSite and BLASTp.
Figure 5Multiple alignment of the protein IX amino acid sequences of HAdV-A31, -A12, -C5, -F40 and -G52. The RGD motif found in the protein IX of HAdV-A31 is highlighted.
Sequence comparison of clinical isolates to the HAdV-A31 prototype sequence
| Clinical isolate | V04-03789 | 0105019310 | 2006001610 | 95/8866 | 95/6956 | 96/783 | 95/6315 |
|---|---|---|---|---|---|---|---|
| Origin | Regensburg, Germany | Hannover, Germany | Hannover, Germany | Nancy, France | Nancy, France | Nancy, France | Nancy, France |
| Year | 2004 | 2001 | 2006 | 1995 | 1995 | 1996 | 1995 |
| E1A | S56C, V222N (5) | ||||||
| E3 12.1 | E34Q (1) | A5T (3) | |||||
| E3 CR1 α | Q71E, N159S (9) | ||||||
| E3 CR1 β | Q164E, T242I (2) | T242I (7) | Q164E, T242I (3) | D130N, Q164E (2) | Q164E (3) | Q164E, T242I (4) | Q164E, T242I (2) |
| E3 RID β | M13V, E107G (2) | E107G (1) | E107G (1) | E107G (1) | E107G (1) | E107G (1) | |
| E3 14.7 | T106N (3) | T74A, T106N (4) | T106N (4) | T106N (5) | T106N (5) | T106N (3) | |
| PB | F259Y (7) | F259Y(8) | |||||
| FK | V40L, R62Q (4) | R62Q (4) | R62Q (3) | R62Q (3) | R62Q (3) | V40L, T53A, R62Q (5) | |
| E4 ORF1 | P119T (1) | V59L (2) | |||||
| E4 ORF2 | L84P (1) | ||||||
| E4 ORF5 | R51K (6) | ||||||
| E4 ORF6/7 | K105N, I115S (3) |
The coding regions which showed amino acid substitutions in at least one isolate are listed. For each isolate, the sequence positions of amino acid substitution are noted. Additionally, the number of mutations in the nucleotide sequence is indicated in brackets.