| Literature DB >> 18649324 |
Amanda J Bradley1, Ida J Kovács, Derek Gatherer, Derrick J Dargan, Khaled R Alkharsah, Paul K S Chan, William F Carman, Martin Dedicoat, Vincent C Emery, Colin C Geddes, Giuseppe Gerna, Bassam Ben-Ismaeil, Steve Kaye, Alistair McGregor, Paul A Moss, Rozalia Pusztai, William D Rawlinson, Gillian M Scott, Gavin W G Wilkinson, Thomas F Schulz, Andrew J Davison.
Abstract
Most human cytomegalovirus (HCMV) genes are highly conserved in sequence among strains, but some exhibit a substantial degree of variation. Two of these genes are UL146, which encodes a CXC chemokine, and UL139, which is predicted to encode a membrane glycoprotein. The sequences of these genes were determined from a collection of 184 HCMV samples obtained from Africa, Australia, Asia, Europe, and North America. UL146 is hypervariable throughout, whereas variation in UL139 is concentrated in a sequence encoding a potentially highly glycosylated region. The UL146 sequences fell into 14 genotypes, as did all previously reported sequences. The UL139 sequences grouped into 8 genotypes, and all previously reported sequences fell into a subset of these. There were minor differences among continents in genotypic frequencies for UL146 and UL139, but no clear geographical separation, and identical nucleotide sequences were represented among communities distant from each other. The frequent detection of multiple genotypes indicated that mixed infections are common. For both genes, the degree of divergence was sufficient to preclude reliable sequence alignments between genotypes in the most variable regions, and the mode of evolution involved in generating the genotypes could not be discerned. Within genotypes, constraint appears to have been the predominant mode, and positive selection was detected marginally at best. No evidence was found for linkage disequilibrium. The emerging scenario is that the HCMV genotypes developed in early human populations (or even earlier), becoming established via founder or bottleneck effects, and have spread, recombined and mixed worldwide in more recent times.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18649324 PMCID: PMC2658010 DOI: 10.1002/jmv.21241
Source DB: PubMed Journal: J Med Virol ISSN: 0146-6615 Impact factor: 2.327
Primers Used for PCR and Sequencing
| Gene | Primer | Sequence(5′–3′) | Genome location |
|---|---|---|---|
| UL146 | AB4 | TAGACACTACGTCGTAAATG | 180494–180513 |
| UL146 | A162 | TGTAGAATTAGTCTAGATTCCTGA | 181524–181501 |
| UL146 | UL146–4A | GCTTGCGCGTTAGGATTGAGACAC | 180571–180594 |
| UL146 | UL146–3A | ATACCGGATATTACGAATT | 181341–181323 |
| UL139 | AB1 | GTCATTGTGAAAGTGACGTCTCAG | 186389–186412 |
| UL139 | AB2 | ATCTACTGTAAACCCTCTGCTCTG | 187148–187125 |
| UL139 | UL140–11A | GCGGCATTGGTGTACGCGTG | 186553–186572 |
| UL139 | UL140–3A | GTGGAAATTTTTACGTCATT | 187077–187058 |
With reference to RefSeq accession NC_006273.2 (HCMV strain Merlin).
UL146 Diversity
| Alignment lenth | Diversity | ||||||
|---|---|---|---|---|---|---|---|
| Genotype | Samples | Frequences(%) | DNA | Protein | DNA | Protein | dN/dS |
| G1 | 34 | 9.71 | 345 | 115 | 0.011 | 0.026 | 1.19 |
| G2 | 25 | 7.14 | 360 | 120 | 0.002 | 0.005 | 1.48 |
| G3 | 10 | 2.86 | 375 | 125 | 0.010 | 0.016 | 0.50 |
| G4 | 8 | 2.29 | 369 | 123 | 0.004 | 0.006 | 0.26 |
| G5 | 16 | 4.57 | 348 | 116 | 0.007 | 0.012 | 0.71 |
| G6 | 2 | 0.57 | 351 | 117 | 0.029 | 0.051 | ND |
| G7 | 57 | 16.3 | 354 | 118 | 0.011 | 0.015 | 1.29 |
| G8 | 22 | 6.29 | 342 | 114 | 0.006 | 0.008 | 0.38 |
| G9 | 49 | 14 | 351 | 117 | 0.017 | 0.032 | 0.94 |
| G10 | 12 | 3.43 | 291 | 97 | 0.003 | 0.005 | 0.30 |
| G11 | 19 | 5.43 | 339 | 113 | 0.007 | 0.015 | 0.50 |
| G12 | 43 | 12.3 | 354 | 118 | 0.016 | 0.018 | 0.45 |
| G13 | 47 | 13.4 | 357 | 119 | 0.007 | 0.015 | 1.58 |
| G14 | 6 | 1.71 | 354 | 118 | 0 | 0 | ND |
| All | 350 | 100 | 225 | 75 | 0.642 | 0.521 | 0.27 |
Gaps removed
Jukes–Cantor Pi from DnaSP 4.10.9
Protein diversity p from MEGA4.0
dN/dS(omega) from PAML 3.15 under the single–rate model.ND, not determined
Five percent significance for positive selection
Calculated from a comparison of a single of a member of each genotype
Figure 1Phylogenetic analysis of UL139.A:Alignment (CLUSTAL W) of amino acid sequences representing the eight genotypes. Predicted signal peptide and transmembrane sequences are highlighted in gray. Completely conserved residues are indicated in the consensus row (con). Below this is the CCMV sequence, which is included to illustrate conservation of the SETTTGTSSNSS motif (underlined). The CCMV sequence [Davison et al., 2003] provided is theC–terminal portion (final 12 residues not shown) of a larger protein, the N–terminal portion of which lacks a counterpart in HCMV but is related to a protein (encoded by gene rh174) in rhesus cytomegalovirus. B: Unrooted neighborjoining tree for the HCMV amino acid sequences shown in (A), computed using Mega4.0 (Poisson correction method with gaps removed). Bootstrap values (out of 100) are shown, and values below 70 indicate regions of unresolved branching order. The scale bar indicates the number of amino acid substitutions per site.
UL139 Diversity
| Alignment lenth | Diversity | ||||||
|---|---|---|---|---|---|---|---|
| Genotype | Samples | Frequency(%) | DNA | Protein | DNA | Protein | dN/dS |
| G1 | 48 | 16 | 255 | 85 | 0.059 | 0.095 | 0.82 |
| G2 | 82 | 27.33 | 240 | 80 | 0.009 | 0.009 | 0.76 |
| G3 | 29 | 9.66 | 339 | 113 | 0.014 | 0.013 | 0.34 |
| G4 | 68 | 22.66 | 201 | 67 | 0.023 | 0.015 | 0.76 |
| G5 | 28 | 9.33 | 255 | 124 | 0.018 | 0.024 | 0.65 |
| G6 | 24 | 8 | 312 | 104 | 0.010 | 0.013 | 1.08 |
| G7 | 14 | 4.66 | 237 | 79 | 0.006 | 0.015 | 2.38 |
| G8 | 7 | 2.33 | 228 | 140 | 0.007 | 0.007 | 0.19 |
| All | 300 | 100 | 153 | 51 | 0.285 | 0.275 | 0.48 |
Gaps removed
Jukes–Cantor Pi from DnaSP 4.10.9
Protein diversity p from MEGA4.0.
dN/dS(omega) from PAML 3.15 under the single–rate model
One persent significance for positive selection
Calculated from a comparision of a single member of each genotype
Geographical Distribution of UL146 Genotypes
| Genotype | Africa | Asia | Europe | Australia |
|---|---|---|---|---|
| G1 | 4 | 2 | 7 | 0 |
| G2 | 2 | 1 | 7 | 2 |
| G3 | 4 | 0 | 2 | 0 |
| G4 | 1 | 0 | 4 | 2 |
| G5 | 3 | 0 | 2 | 1 |
| G6 | 0 | 1 | 0 | 0 |
| G7 | 5 | 6 | 21 | 1 |
| G8 | 2 | 0 | 5 | 0 |
| G9 | 6 | 2 | 11 | 3 |
| G10 | 0 | 0 | 8 | 0 |
| G11 | 0 | 0 | 3 | 0 |
| G12 | 5 | 1 | 16 | 0 |
| G13 | 11 | 0 | 17 | 0 |
| G14 | 2 | 0 | 1 | 0 |
| Total=177 | 45 | 13 | 104 | 15 |
Geographical Distribution of UL139 Genotypes
| Genotype | Africa | Asia | Europe | Australia |
|---|---|---|---|---|
| G1 | 8 | 2 | 16 | 2 |
| G2 | 9 | 1 | 23 | 5 |
| G3 | 1 | 4 | 7 | 0 |
| G4 | 8 | 5 | 23 | 5 |
| G5 | 10 | 3 | 14 | 0 |
| G6 | 0 | 1 | 11 | 1 |
| G7 | 3 | 0 | 5 | 5 |
| G8 | 2 | 2 | 1 | 1 |
| Totals=178 | 41 | 18 | 100 | 19 |
Analysis of Linkage Disequilibrium
| UL139 genotype | ||||||||
|---|---|---|---|---|---|---|---|---|
| UL146 genotype | G1 | G2 | G3 | G4 | G5 | G6 | G7 | G8 |
| G1 | 1 | 0 | 0 | 3 | 1 | 0 | 0 | 0 |
| G2 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 |
| G3 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| G4 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 0 |
| G5 | 0 | 1 | 0 | 1 | 0 | 0 | 1 | 0 |
| G6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| G7 | 1 | 4 | 1 | 3 | 2 | 1 | 0 | 0 |
| G8 | 0 | 1 | 0 | 1 | 1 | 0 | 1 | 0 |
| G9 | 0 | 1 | 0 | 0 | 0 | 2 | 1 | 0 |
| G10 | 0 | 3 | 0 | 1 | 0 | 0 | 0 | 0 |
| G11 | 0 | 2 | 0 | 1 | 0 | 0 | 0 | 0 |
| G12 | 1 | 1 | 1 | 1 | 2 | 1 | 0 | 0 |
| G13 | 2 | 2 | 0 | 5 | 1 | 0 | 0 | 1 |
| G14 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Totals=60 | 8 | 16 | 3 | 17 | 7 | 4 | 3 | 2 |