| Literature DB >> 31332191 |
W K Jacky Lam1,2,3,4, Peiyong Jiang1,2,3, K C Allen Chan1,2,3, Wenlei Peng1,2, Huimin Shang1,2, Macy M S Heung1,2, Suk Hang Cheng1,2, Haiqiang Zhang1,2, O Y Olivia Tse1,2, Radha Raghupathy3,5, Brigette B Y Ma3,5, Edwin P Hui3,5, Anthony T C Chan3,5, John K S Woo4, Rossa W K Chiu1,2,3, Y M Dennis Lo6,7,8.
Abstract
Epstein-Barr virus (EBV) is associated with a number of diseases, including malignancies. Currently, it is not known whether patients with different EBV-associated diseases have different methylation profiles of circulating EBV DNA. Through whole-genome methylation analysis of plasma samples from patients with nasopharyngeal carcinoma (NPC), EBV-associated lymphoma and infectious mononucleosis, we demonstrate that EBV DNA methylation profiles exhibit a disease-associated pattern. This observation implies a significant potential for the development of methylation analysis of plasma EBV DNA for NPC diagnostics. We further analyse the plasma EBV DNA methylome of NPC and non-NPC subjects from a prospective screening cohort. Plasma EBV DNA fragments demonstrate differential methylation patterns between NPC and non-NPC subjects. Combining such differential methylation patterns with the fractional concentration (count) and size of plasma EBV DNA, population screening of NPC is performed with an improved positive predictive value of 35.1%, compared to a count- and size-based only protocol.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31332191 PMCID: PMC6646310 DOI: 10.1038/s41467-019-11226-5
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Distinctive plasma EBV DNA methylation profiles among different EBV-associated diseases. Unsupervised hierarchical clustering analysis of plasma EBV DNA methylome for the 29 samples from patients with different EBV-associated diseases. Each vertical bar represents one plasma sample. Each horizontal bar represents the selected 500-bp regions in the viral genome which demonstrated the most variable methylation densities (coefficient of variation > 30%) across all 29 samples. The corresponding methylation density of each region for all samples were represented in different colours. Samples of different patients with the same EBV-associated diseases were clustered together
Subject characteristics in the exploratory and validation sample sets
| Non-NPC subjects with transiently positive plasma EBV DNA in the exploratory set | Non-NPC subjects with persistently positive plasma EBV DNA in the exploratory set | NPC patients from the screening cohort in the exploratory set | Non-NPC subjects with transiently positive plasma EBV DNA in the validation set | Non-NPC subjects with persistently positive plasma EBV DNA in the validation set | NPC patients from the screening cohort in the validation set | NPC patients from an external cohort in the validation set | |
|---|---|---|---|---|---|---|---|
| Number | 20 | 20 | 10 | 90 | 30 | 23 | 14 |
|
| |||||||
| M | 20 | 20 | 10 | 90 | 30 | 23 | 13 |
| F | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| Median age, year (IQR) | 53.5 (46–57) | 48 (43–58) | 52.5 (49–55) | 54 (49–58) | 54 (49–59) | 52 (44.5–55) | 57 (45.5–62) |
|
| |||||||
| I | NA | NA | 5 | NA | NA | 10 | 2 |
| II | NA | NA | 1 | NA | NA | 7 | 0 |
| III | NA | NA | 4 | NA | NA | 4 | 5 |
| IV | NA | NA | 0 | NA | NA | 2 | 7 |
Fig. 2Mining of NPC-associated DMRs in the EBV genome. a BamHI restriction map of the EBV genome is shown. b Methylation densities of CpG loci across the EBV DNA genome from the pooled sequencing data of the 15 cases of NPC and the pooled data of the 5 cases of infectious mononucleosis used for mining of DMRs. Each dot (grey and coloured) shows the methylation density at a corresponding CpG site. A DMR was constructed by two or more differentially methylated CpG sites (>80% in the pooled data of NPC and <60% in the pooled data of infectious mononucleosis) within 200 bp. The coloured dots highlight those CpG sites which fulfilled our criteria. The red dots are those CpG sites with the methylation densities greater than 80% in the pooled data of NPC and the blues dots are those with the methylation densities less than 60% in the pooled data of infectious mononucleosis. Viral DNA fragments mapped to the BamHI-W repeat region presented ambiguities in alignment to the exact member of the repeat family. Hence, this region was not used in the final approach
Fig. 3Methylation-, count- and size-based analyses of plasma EBV DNA in the exploratory sample set. a The EBV DNA methylation scores of the NPC patients and non-NPC subjects with transiently positive and persistently positive results are shown. A cutoff value in the EBV DNA methylation score was defined at 3 standard deviations below the mean of the methylation scores of these 10 NPC patients in the exploratory dataset. The cutoff value of 73.7 is denoted by the red dotted line. b The proportion of EBV DNA reads among the total number of sequenced plasma DNA reads (both human and viral) of the NPC patients and non-NPC subjects with transiently positive and persistently positive results are shown. A cutoff value in the proportion of plasma EBV DNA reads was defined at 3 standard deviations below the mean of the logarithmic values of portion of EBV DNA reads of the 10 NPC patients in the exploratory dataset. The cutoff value of 2.7 × 10−5 is denoted by the red dotted line. c The EBV DNA size ratios of the NPC patients and non-NPC subjects with transiently positive and persistently positive results are shown. A cutoff value was defined at 3 standard deviations above the mean values of the EBV size ratios of all the 10 NPC patients in the exploratory dataset. The cutoff value of 5.0 is denoted by the red dotted line. Source data are provided as a Source Data file
Sensitivities and specificities of the isolated and combined analysis at the defined cutoffs in both exploratory and validation cohorts. The corresponding 95% confidence intervals are bracketed
| Sensitivity (%) | Specificity (%) | ||
|---|---|---|---|
| Exploratory cohort | Methylation based | 100% | 77.5% (62.5–87.7%) |
| Count based | 100% | 70% (54.5–81.9%) | |
| Size based | 100% | 35% (22.1–50.5%) | |
| Combined methylation, count and size based | 100% | 87.5% (73.9–94.5%) | |
| Validation cohort | Methylation based | 100% | 83.3% (75.7–88.9%) |
| Count based | 100% | 75% (66.6–81.9%) | |
| Size based | 100% | 35.8% (27.8–44.7%) | |
| Combined methylation, count and size based | 100% | 94.2% (88.5–97.1%) |
Fig. 4Methylation-, count- and size-based analyses of plasma EBV DNA in the validation sample set. The cutoffs in the corresponding analyses defined in the exploratory dataset are shown. a The EBV DNA methylation scores of the NPC patients (from both the screening and external cohorts) and non-NPC subjects with transiently positive and persistently positive results are shown. The same cutoff value of 73.7 defined in the exploratory dataset is denoted by the red dotted line. b The proportion of EBV DNA reads among the total number of sequenced plasma DNA reads (both human and viral) of the NPC patients (from both the screening and external cohorts) and non-NPC subjects with transiently positive and persistently positive results are shown. The same cutoff value of 2.7 × 10−5 defined in the exploratory dataset is denoted by the red dotted line. c The EBV DNA size ratios of the NPC patients (from both the screening and external cohorts) and non-NPC subjects with transiently positive and persistently positive results are shown. The same cutoff value of 5.0 defined in the exploratory dataset is denoted by the red dotted line. Source data are provided as a Source Data file
Modelled performance of the methylation-, count-, size- and combined analyses in the screening cohort by target-capture bisulfite sequencing
| Sensitivity (%) | Specificity (%) | PPV (%) | |
|---|---|---|---|
| Methylation based | 97.1 | 99.1 | 15.9 |
| Count based | 97.1 | 98.7 | 11.2 |
| Size based | 97.1 | 96.6 | 4.7 |
| Combined count and size based | 97.1 | 99.1 | 16.6 |
| Combined methylation, count and size based | 97.1 | 99.7 | 35.1 |