| Literature DB >> 30955492 |
Ray Bridges1, Samantha Correia1, Fanny Wegner2, Cristina Venturini2, Anne Palser3, Robert E White1, Paul Kellam1,3, Judith Breuer2, Paul J Farrell1.
Abstract
Many regions of the Epstein-Barr virus (EBV) genome, repeated and unique sequences, contribute to the geographical variation observed between strains. Here we use a large alignment of curated EBV genome sequences to identify major sites of variation in the genome of type 1 EBV strains; the CAO deletion in latent membrane protein 1 (LMP1) is the most frequent major indel present in the unique regions of EBV strains from various parts of the world. Principal component analysis was used to identify patterns of sequence variation and nucleotide positions in the sequences that can distinguish EBV from some different geographical regions. Viral genome sequence variation also affects interpretation of genetic content; known genes, origins of replication and gene expression control regions explain most of the viral genome but there are still a few sections of unknown function. One of these EBV genome regions contains a large inverted repeat sequence (invR) within the IR-1 major internal repeat array. We deleted this invR sequence and showed that this abolished the ability of the virus to transform human B cells into lymphoblastoid cell lines. This article is part of the theme issue 'Silent cancer agents: multi-disciplinary modelling of human DNA oncoviruses'.Entities:
Keywords: Epstein–Barr virus; cell transformation; repeats; strain variation
Mesh:
Substances:
Year: 2019 PMID: 30955492 PMCID: PMC6501908 DOI: 10.1098/rstb.2018.0299
Source DB: PubMed Journal: Philos Trans R Soc Lond B Biol Sci ISSN: 0962-8436 Impact factor: 6.237
Figure 1.Principal component analysis of type 1 EBV sequence diversity coloured by geographical origin (colours as shown in part a) using 217 type 1 EBV strains (a) PC-1 versus PC-3, (b) PC-2 versus PC-3. PC-1 versus PC-2 with the same dataset is shown in fig. 1B of [6]. (c) Eigenvector coefficients plotted with a smoothing window of 20 nt along the MSA genome (plotted as sum of coefficients on the ordinate axis). Labelled peaks are listed in electronic supplementary material, figure S2. PNG, Papua New Guinea.
Figure 2.CAO deletion incidence in EBV strains from different geographical regions. The filled part of the boxes indicates the number of CAO strains within the total, also expressed as a percentage for each strain. PNG, Papua New Guinea.
Figure 3.PCA nucleotide contribution plots of geographical groups identify three major patterns consistent with three major clades in the phylogenetic tree of type 1 sequences. Nucleotide positions from the marked areas, which were chosen to characterize differences between the geographical groups, are listed in electronic supplementary material, table S3.
Figure 4.(a) Diagram of two IR-1 repeat units, numbered according to the NC007605 reference EBV sequence, showing the positions of the deletions of the inverted repeat invR. The locations of the BWRF1 open reading frame and the known Wp promoter region are marked, together with the W0, W1 and W2 exons of RNA expressed from Wp. The relationship between IR-1 and the repeated BamHI W restriction fragment is also indicated. (b) Transformation assay for human B lymphocytes using wild-type, revertant or invR deletion mutant virus strains. C33, W3-4, W1-11 and A29 are different producer lines for the wild-type virus and D6-62, D2-7 and D2-16 are producer lines for the invR-deleted virus. 1-3 and 21-7 are revertant virus strains. Virus titre is the quantity of virus (in green Raji units—GRU) added to 106 B cells and the outcome is scored as cell proliferation observed under the microscope 2 days after infection or lymphoblastoid cell line (LCL) outgrowth two weeks after infection. (c) Upper panel: EBNA-LP western blot showing endogenous protein in Raji cells and the extra EBNA-LP (arrowed) from superinfecting virus 2 days after infection with wild-type (WT) or invR (D2) virus. Uninfected Raji cells (U) do not have that band. Positions of size markers 46 kD and 58 kD are indicated. Lower panel: Same as upper panel but assayed 2 h after addition of virus, showing that the extra band is absent so was not owing to protein present in the virus preparation but a consequence of viral gene expression in the super-infected cells.