| Literature DB >> 29301233 |
Joshua G Tweedy1, Eric Escriva2,3, Maya Topf4, Ursula A Gompels5.
Abstract
Tissue-culture adaptation of viruses can modulate infection. Laboratory passage and bacterial artificial chromosome (BAC)mid cloning of human cytomegalovirus, HCMV, resulted in genomic deletions and rearrangements altering genes encoding the virus entry complex, which affected cellular tropism, virulence, and vaccine development. Here, we analyse these effects on the reference genome for related betaherpesviruses, Roseolovirus, human herpesvirus 6A (HHV-6A) strain U1102. This virus is also naturally "cloned" by germline subtelomeric chromosomal-integration in approximately 1% of human populations, and accurate references are key to understanding pathological relationships between exogenous and endogenous virus. Using whole genome next-generation deep-sequencing Illumina-based methods, we compared the original isolate to tissue-culture passaged and the BACmid-cloned virus. This re-defined the reference genome showing 32 corrections and 5 polymorphisms. Furthermore, minor variant analyses of passaged and BACmid virus identified emerging populations of a further 32 single nucleotide polymorphisms (SNPs) in 10 loci, half non-synonymous indicating cell-culture selection. Analyses of the BAC-virus genome showed deletion of the BAC cassette via loxP recombination removing green fluorescent protein (GFP)-based selection. As shown for HCMV culture effects, select HHV-6A SNPs mapped to genes encoding mediators of virus cellular entry, including virus envelope glycoprotein genes gB and the gH/gL complex. Comparative models suggest stabilisation of the post-fusion conformation. These SNPs are essential to consider in vaccine-design, antimicrobial-resistance, and pathogenesis.Entities:
Keywords: HHV-6; HHV-6A; Roseolovirus; betaherpesvirus; cell fusion; deep sequencing; gB; gH/gL; glycoprotein structure model; human herpesvirus; reference genome; virus entry complex
Mesh:
Substances:
Year: 2017 PMID: 29301233 PMCID: PMC5795429 DOI: 10.3390/v10010016
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Long range PCR genome amplicon primers.
| Amplicon | Forward Primer (5′-3′) | Reverse Primer (5′-3′) | Size (bp) |
|---|---|---|---|
| DR1 | ACAACCGCATCTTCTTCAG | ATGAACGGAGATCTGGGAG | 1750 |
| DR1DR6 | CCTCATCTGTTATTCCCTTCC | ACTCACCGCAGTCTTGAAC | 2872 |
| DR6 | TCCAAACTGTACGGCTATG | TGGCGTCTAATCCAGGTAC | 1858 |
| DR_UL | CATGAGAGGACACTGGACGTACTC | GACATCTGTGGACCATGCA | 2339 |
| 0203 | ACCATGTGCGGAGAAAGTTG | ACGAGAATGAATATGCCGATG | 2174 |
| 0306 | ACTCCTATCCGTCTAAACTGCT | ATTTCTCACGCCGGTATTC | 4417 |
| 0611 | ATGCCAAATCTAGCTGCTG | GAATCGGAGGAAGCTAACGT | 4757 |
| 1114 | TCTTCAGGTGTCTCAACCTTC | TGATGATCTGCTGTTGGAAC | 5737 |
| 1419 | TTGAGATGCAGAGCCTCTC | ACGTGGTGGATAATTCTTCAG | 5370 |
| 1923 | CTACATCATCGGTTGCCATA | TACAGAAGTTTCGGCAACTG | 5216 |
| 2328 | TAGTCGCTGGATGTCGATAC | GAGGAATTGTGGACAACGCT | 4721 |
| 2830 | CATCCAACTCCTTGTCCGT | TCGTTCGACGTGTCTGAC | 5753 |
| 3031 | GACACTGTCCAACGCATCT | CTCTTGGGCATAAGTCCAGTA | 6552 |
| 3134 | ACAGAGCGACACCCAATATTC | AGCTGTGTCGATGTCTTGCTA | 2787 |
| 3439 | TGCATTCTCTCACAGCATG | TGTTTCCTTATGCCACCAATC | 6309 |
| 3941 | TGAGGCATGAGACTCAGAAG | AGCGGCATCCATAATGCTTA | 4796 |
| 4142 | AGCATCGTTTGAAGACACC | GAGTTGTTGTTCGCTGGTCT | 2810 |
| 4142INT | TTGTTTCAGGCCAGCATC | ATACCGAAAACTCAGGCAAC | 2983 |
| 4245 | AGACCGCATTCGTATCGCT | CACGATCAAGTTCGAATCGTA | 5092 |
| 4549 | GAGTGCTCGGATAAGTTCATT | ACGATACAATCCGGAATGC | 5922 |
| 4953 | TGGAACTCGGAATTGTTTCT | GTTTGAGACTGTATCTGCGCA | 4282 |
| 5357 | TATGGTGAAGAACCGTCG | TCCGCAGATAACAGAATGG | 4707 |
| 5758 | CAGTGCCATCTGTTATGAATAG | AGTCGCAGTAAGGTCCACGT | 6792 |
| 5864 | TGCTGAAATTACCTCAGTGAG | CCATTCAGTATAATGCATCC | 4229 |
| 6469 | CTGTATCAATATCAAGGCGG | ACCACTGAATACGAACGCTC | 5111 |
| 6974 | GCGATTGTTCTGCGATAGAG | CTTATATCCGAGCCTTGCAGT | 6472 |
| 7476 | GAGAACATGCTAGACAATTGG | CAAGAACTGCGACTCAATCC | 2668 |
| 7679 | GTTGCTAGTTGTATGACTTGG | CAATATCCACCGTTAGAGAAC | 7681 |
| 7984 | AGATGTATGCTGAAGAACGTG | GGATCGTCAACCGTTAGTG | 5109 |
| 8486 | CACAGTGTGTTCGCCGGAAG | ACAGACAATGCACATCCTCTG | 4773 |
| 8690 | AGGTTGATACGGCAACGAG | GCAATCATTAGCATACAGATG | 3326 |
| 8690INT | TGAGGAATCACGTGTTTG | ATCGAATCTATCCATGAAGATG | 1746 |
| 90R3 | CATCATTGTTATCGCTTTCAC | GCAACCGCAGTTCCTGTT | 4912 |
| R395 | GCGGTACCCACTGATCTT | AGTCTACCAGGCATTCCGT | 6250 |
| 95100 | GAGGAGGGTCTGTCTAGATGT | TCGGAGATATCATAATCTGCGT | 4031 |
| 100DR | TTATAGTTGCTCCCGAAAGC | AAGAAGATGCGGTTGTCTTG | 2358 |
Comparison of Illumina resequence human betaherpesvirus 6 (HHV-6A) U1102 to Reference sequence NC_001664.
| ORF or Locus | Refseq bp | Refseq:SNP | Non-Synonymous SNP Amino Acid Change, Codons and (Reverse Complement Codons) | This Study or as Cited |
|---|---|---|---|---|
| DR-L DR6 | 6663 | G:GG | ||
| “ | 6665 | G:GG | Frameshift extension | [ |
| DR-L repeat | 7623 | G:C | “ | “ |
| Repeats left end U | 8262 | AGGAG:A | ||
| “ | 8362 | C:CGCG | ||
| “ | 8518 | T:C | ||
| “ | 8559 | ACAACA:CAC | ||
| “ | 8591 | AT:A | ||
| “ | 8602 | GG:G | ||
| U10 | 18884 | C:Indel1 | Frameshift extension | [ |
| U40 | 63871 | G:T | Thr:His (ACG:CAG) | |
| “ | 63872 | T:G | “ | |
| U42 | 70446 | A:ATC | HisCys:GlnLeuAsp (CACTGT:CAACTGCATG) | Jones & Teo 1993 in Genbank L20954 |
| “ | 70449 | G:GT | “ | Jones, M. in Genbank X92436 |
| U57 mcp | 92149 | C:G | Lys:Asn ( AAG:AAC) | |
| U58 tbp | 94068 | A:G | Met:Val ATG:GTG | |
| U83 chemokine | 123740 | C:G | AspGlu:GluGln GACGAG:GAGCAG | [ |
| “ | 123741 | G:C | “ | “ |
| U86 IE2 | (128132) | (Indel G) | Frameshift U86/U87 | In RefSeq; [ |
| R1 repeats | 128325 | C:T | ||
| “ | 128443 | C:T | ||
| “ | 128914 | G:A | ||
| “ | 129917 | A:ATCC | ||
| R2 repeats | 132312 | GG:G | ||
| R3 repeats | 137831 | C:G | ||
| “ | 137832 | G:C | ||
| “ | 137906 | C:G | ||
| “ | 137907 | G:C | ||
| U100 | 148154 | C:G | ||
| “ | 148155 | G:C | ||
| U100 gQ1 | 149553 | C:G | Arg:Ala (CGT: GCT) | |
| “ | 149554 | G:C | “ | |
| DR-R DR6 | 157898 | G:GG | Frameshift extension | [ |
| “ | 157900 | G:GG | “ | |
| DR-R repeat | 158857 | G:C |
ORF = open reading frame; SNP = single nucleotide polymorphisms; Repetitive regions are shaded; Indel1: CACACCTATCGCGACCCTGTTTTATGTAGAAAGAATGACAAGCCAGAACAATGA; (“) indicates same as above.
Minor variant polymorphism populations in HHV-6A U1102, passaged, and BACmid virus.
| ORF or Loci | RefSeq | RefSeq | Amino Acid Change, Codons and (Reverse Complement Codons) | % New Reference | % BAC Virus | % Passaged Virus | Comment, Citations, and Other Strains with SNPs |
|---|---|---|---|---|---|---|---|
| DR | 8016 | A:C | - | 87% | - | End T2 | |
| “ | 8077 | C:CGA | 96% | - | - | HHV-6A AJ KP257,584.1, GS KC465,951.1; CiHHV6A 2284 KT895,199.1, 1501 KT355,575.1; [ | |
| “ | 8080 | G:GAC | - | 86% | - | CiHHV6A 2284 KT895,199.1, 5055, 5814 [ | |
| Uleft repeat | 8157 | G:GC | - | - | 78% | ||
| “ | 8558 | A:AC | 88% | AAC:A 99% | 90% | AC repeat | |
| “ | 8561 | AAC:A | - | 100% | 98% | HHV-6A AJ [ | |
| “ | 8601 | CG:C | 97% | 87% | 86% | ||
| U2 | 8876 | G:T | Ala314:Glu(GCG:GAG) | - | - | 53% | |
| U19 UTR | 28,371 | C:CA | - | 52% | 72% | HHV-6A AJ | |
| U31 tegument | 45,667 | A:T | Asn173:Ile AAC:ATC | - | 73% | 96% | |
| U33 capsid | 52,148 | A:G | Tyr330:His(TAT:CAT) | - | 54% | 69% | |
| U39 gB* | 61,162 | T:C | Thr193:Ala(ACG:GCG) | - | 58% | 99% | |
| U41 dbp | 64,707 | C:T | Ala972:Thr (GCT:ACT) | - | - | 90% | HHV-6A GS KJ123,690.1 |
| ORI repeat | 67,674 | T:C | - | - | 54% | ||
| “ | 67,860 | T:C | - | - | 34% | HHV-6A GS KJ123,690.1 & KC465,951.1 | |
| U50 capsid | 81,583 | G:A | Ala258:Thr(GCA:ACA) | - | 59% | 78% | HHV-6A GS KC465,951.1 |
| “ | 82,158 | G:C | - | 67% | - | ||
| U73 obp | 109,214 | G:A | Ser297:Asn(AGT:AAT) | - | - | 99% | |
| U83 chemokine | 123,504 | G:GTT | f/s N-terminal extension | <1% | - | - | cDNA clone [ |
| U86 IE2 | 126,126 | T:C | Asn395:Asp(AAT:GAT) | - | - | 99% | |
| “ | 127,092 | C:CTGA | Ser976:SerSer | - | 45% | - | 2 Ser insertion CiHHV6A 5055; 5814; [ |
| R2 repeat | 131,092 | C:T | - | - | 56% | IE2 intron | |
| “ | 131,743 | G:C | - | - | 52% | “ | |
| “ | 131,745 | A:C | - | - | 62% | “ | |
| R2 region | 132,303 | TG:T | - | 64% | 56% | “ TGG:T HHV-6A AJ [ | |
| R3 repeat | 140,872 | G:A | - | 96% | - | End 30× repeats from 137,775; latency IE1/2 intron [ | |
| U95 | 142,969 | T:C | Trp10:Arg TGG:CGG | - | - | 54% | |
| “ | 142,979 | T:A | Ile13:Asn ATT:AAT | - | - | 53% | |
| “ | 143,591 | C:A | Ser217:Tyr TCC:TAC | - | - | 76% | |
| U100 gQ2 | 146,729 | C:T | C-terminal truncation; Trp186:stop (TGG:TGA) | <1% | - | - | [ |
| “ | 146,862 | G:A | Ser42:Leu (TCG:TTG) | - | 60% | 99% | Not in HHV-6A HHV-6A U1102 from [ |
| U100 gQ1 | 149,251 | Indel2:A | intron | - | 44% | 83% | HHV-6A AJ; CiHHV6A 2284, 5055, 5814 [ |
Indel2: AGGGGGGGG. Repetitive regions are shaded, DR repeats listed once here. * Adjacent to fusion loop 2 of Domain I (DI) homologue of human cytomegalovirus (HCMV) gB C246 [40], see text. (“) indicates same as above. (–) indicates no change in comparison to the reference genome sequence.
Figure 1Passaged bacterial artificial chromosome (BAC)-virus deletes the green fluorescent protein (GFP) cassette. In the BAC virus, the GFP cassette disrupts the U53/U53a transcriptional control elements, and after tissue culture passage it is deleted. Shaded and underlined are palindrome1, palindrome 2, and LoxP sites; note the insertion is in an end repaired EcoRI site. Underlined are the original and new poly A regulatory sequences disrupted by the BAC cassette.
Figure 2Human herpesvirus gB fusion loop 2, FL2, region sequence comparisons show unique SNP substitution in passaged HHV-6A virus. Comparison of the FL2 (in bold) and associated sequences of HHV-6A gB (residues 184–191 in FL2 in bold) from passaged virus (and cultured BAC virus) compared to gB from 51 clinical samples of HHV-6A/B [45], CiHHV-6A/B [14,24], and 50 clinical samples of HHV-7 gB [46] demonstrating the Thr193Ala mutation is only found in passaged virus. This is compared to reference sequences from alpha, beta, and gamma herpesvirus gB where the structure has been determined, as shown for HCMV gB structure, identifying the beta sheet 12 in domain I [40]. Alignments were made using Clustal as described [40] with gB from HCMV (shown for strains Merlin, Towne, and TB40E and 74 clinical strains; strain AD169 and six clinical strains have Val252Leu), EBV (strain B958), and HSV-1 (strain KOS) (UniProtKB P06473, Q777B0, P06437, respectively). The FL2 in HCMV, EBV, and HSV-1 gB are in bold, and predicted FL2 for HHV-6A/B and HHV-7 are similarly indicated. FL2 in HHV-6A/B gB is amino acids 184–191 and in HCMV, strain Merlin, 237–244. This is adjacent to a conserved beta sheet shown in the structures of EBV, HSV, and cytomegalovirus (CMV) gB as underlined, and that predicted for HHV-6A/B and HHV-7 indicated (See Methods). The Thr193Ala passage SNP is shaded in the FL2 adjacent region, disrupting the polar patch and increasing hydrophobicity. This Thr is conserved in HHV-6A/B, HHV-7, EBV, and HSV.
Figure 3Structural models of HHV-6A gB show conservation with HCMV gB structure identifying the passaged virus has a mutation in the fusion associated domain I affecting the trimer interface. (a) Model of the HCMV reference strain Merlin (ectodomain in residues 91–642 from a total of 907), alignment with corresponding region of HHV-6A gB is shown in the Figures S1–3. (b) Model of HHV-6A gB ectodomain, in the corresponding residues 38 to 579 (total 839 residues) using multiple templates of pdb (RCSB protein database) folds from gB HCMV strain AD169 (PDB ID: 5cxf) and strain Towne (PDB IDs: 4osn, 5c6t), gB EBV strain B95-8 (PDB ID: 3fvc), and gB HSV-1 strain KOS (PDB IDs: 2gum and 3nwa). Using the structure of most closely related HCMV gB, as a model, shows Domains I (blue), II (green), III (yellow), and IV (orange) in the post-fusion conformation. Domain I (blue) shows conserved beta sheets connected by the two fusion loops, FL1 and FL2, at the base of the model (purple). (c) Longitudinal view of the predicted trimer of HHV-6A gB showing the Thr193 position in red within three subunits, indicated in purple, blue, and orange. (d) Domain I (residues 79–291) is enlarged as indicated by the grey T bar, showing the position of Thr193, in red, in the beta sheet at the trimer interfaces with the fusion loops FL1 (residues 100–105) and FL2 (residues 184–191) exposed at the base of the structure.