| Literature DB >> 35105887 |
James E Barrett1,2,3, Allison Jones3, Iona Evans3, Daniel Reisel3, Chiara Herzog1,2, Kantaraja Chindera3, Mark Kristiansen4, Olivia C Leavy5,6, Ranjit Manchanda7,8,9, Line Bjørge10,11, Michal Zikan12,13, David Cibula13, Martin Widschwendter14,15,16,17.
Abstract
The vast majority of epithelial ovarian cancer arises from tissues that are embryologically derived from the Müllerian Duct. Here, we demonstrate that a DNA methylation signature in easy-to-access Müllerian Duct-derived cervical cells from women with and without ovarian cancer (i.e. referred to as the Women's risk IDentification for Ovarian Cancer index or WID-OC-index) is capable of identifying women with an ovarian cancer in the absence of tumour DNA with an AUC of 0.76 and women with an endometrial cancer with an AUC of 0.81. This and the observation that the cervical cell WID-OC-index mimics the epigenetic program of those cells at risk of becoming cancerous in BRCA1/2 germline mutation carriers (i.e. mammary epithelium, fallopian tube fimbriae, prostate) further suggest that the epigenetic misprogramming of cervical cells is an indicator for cancer predisposition. This concept has the potential to advance the field of risk-stratified cancer screening and prevention.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35105887 PMCID: PMC8807742 DOI: 10.1038/s41467-021-26615-y
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Sample heterogeneity, differential methylation, and development of discriminatory index.
a Distribution of p-values obtained by comparing cases and controls at each CpG site and after controlling for immune cell proportion and age. b The distribution of different cell types in the discovery dataset inferred using the EpiDISH algorithm. p-values were computed using a two-tailed Mann–Whitney test. For indicated significant differences, exact p-values = 0.00014 (epithelial), <0.0001 (neutrophil), <0.0001 (fibroblast), <0.0001 (eosinophil). The centre line of each box corresponds to the median. The lower and upper hinges correspond to the first and third quartiles. The upper whisker extends from the hinge to the largest value no further than 1.5 * IQR from the hinge (where IQR is the inter-quartile range, or distance between the first and third quartiles). The lower whisker extends from the hinge to the smallest value at most 1.5 * IQR of the hinge. Data beyond the end of the whiskers are plotted individually. c An example of a CpG with epithelial specific differential methylation. d Area under the curve (AUC) values in the internal validation set as a function of the number of CpGs used to train the classifier. e ROC curves of the WID-OC-index in the internal validation set for samples with an immune cell (IC) proportion ≤0.5 and >0.5. f Distribution of the WID-OC-index with respect to immune cell proportion in the internal validation set. g Distribution of the estimated variance in epithelial and immune cells across all CpGs used in the WID-OC- index. h Odds ratios corresponding to the four genomic regions when comparing the CpGs used in the WID-OC-index to the overall EPIC array. Error bars correspond to 95% confidence intervals computed using the median-unbiased estimation method. For indicated significant differences, all exact p values <0.0001. i AUC values in the internal validation set after training classifiers on different subsets of the CpGs used in the WID-OC-index. The top n CpGs were either retained or removed. CpGs were also split into separate bins of size 500. Source data are provided as a Source Data file.
Fig. 2External validation.
a The WID-OC-index versus immune cell proportion in an independent external validation set. b ROC curve from the external validation set. c The WID-OC-index versus immune cell proportion in a separate cohort of endometrial cancer samples and the same control samples from the internal validation set. d ROC curve from the endometrial cancer dataset. e The WID- OC-index versus immune cell proportion in a separate cohort of breast cancer samples and the same control samples from the internal validation set. f ROC curve from the breast cancer dataset. g The WID-OC-index versus immune cell proportion in an independent cohort of BRCA1 mutation carriers. h ROC curve from the BRCA1 dataset. Source data are provided as a Source Data file.
Odds ratios corresponding to quartiles defined using the internal validation ovarian cancer dataset.
| Quartile | Control ( | Internal validation ovarian cancer ( | Unadjusted OR (95% CI) | Adjusted OR (95% CI) |
|---|---|---|---|---|
| 75 | 3 | 1.00 (reference) | 1.00 (reference) | |
| 74 | 10 | 3.25 (0.93,15.68) | 2.38 (0.56, 12.99) | |
| 74 | 13 | 4.20 (1.27,19.79) | 3.65 (0.87, 19.42) | |
| 74 | 57 | 18.20 (6.33,79.95) | 10.26 (2.89, 49.1) |
Adjustment was based on a logistic regression model with age, menopausal status, age at menarche, number of first degree relatives with ovarian cancer, and BMI included as covariates for the ovarian cancer datasets. For endometrial cancers and the BRCA1 dataset age and menopause were included as covariates. For the endometrial cancers adjusted estimates are unavailable as the logistic regression model failed to converge. In addition, it was assumed that there was 1 cancer case in the first quartile in order to estimate ORs for the remaining quartiles.
Fig. 3Association with epidemiological, clinical, and technical factors.
a The WID-OC-index versus age in control samples from the internal and external validation datasets. b ROC curves for women above and below 50 years of age. c The WID-OC-index versus a 28 SNP ovarian cancer polygenic risk score (PRS) in the internal validation dataset. d ROC curve corresponding to the PRS score. e The distribution of the WID-OC-index across different histological subtypes (endometrioid borderline, mucinous-clear cell cancer, carcinosarcoma, and serous cancer with no information on grade have been classified as “other cancers”). p-values were computed using a two-tailed Mann–Whitney test. For indicated significant difference, exact p-value = 0.013. f The distribution of the WID-OC-index across different cancer stages. p-values were computed using a two-tailed Mann–Whitney test. For indicated significant difference, exact p value = 0.0023. For box plots in e and f, the centre line of each box corresponds to the median. The lower and upper hinges correspond to the first and third quartiles. The upper whisker extends from the hinge to the largest value no further than 1.5 * IQR from the hinge (where IQR is the inter-quartile range). The lower whisker extends to the smallest value at most 1.5 * IQR of the hinge. Data beyond the end of the whiskers are plotted individually. Source data are provided as a Source Data file.
Fig. 4Inferred proportion of tumour DNA and functional assessment of the WID-OC-index.
The estimated proportion of tumour DNA in each cervical smear sample as estimated using the EpiDISH algorithm for controls and a ovarian cancers and b endometrial cancers. c Distribution of the WID-OC-index with respect to tumour DNA fraction in controls and endometrial cancers. d ROC curve for samples with tumour DNA < 1% in the endometrial cancer set. e Results from real-time PCR to detect ZNF154, a pan-cancer marker primarily discovered in ovarian cancer. p-values were computed using a two-tailed Mann–Whitney test. For indicated significant differences, all exact p values < 0.001. The centre line of each box corresponds to the median. The lower and upper hinges correspond to the first and third quartiles. The upper whisker extends from the hinge to the largest value no further than 1.5 * IQR from the hinge (where IQR is the inter-quartile range). The lower whisker extends to the smallest value at most 1.5 * IQR of the hinge. Data beyond the end of the whiskers are plotted individually. f The WID-OC-index evaluated in eight different cell lines. g, h shows a subset of ENCODE tissue samples. The germline mutation proportion refers to the proportion of cancers in each tissue type that have a BRCA1 and BRCA2 mutation. p-values were computed using a two-tailed correlation test. Source data are provided as a Source Data file.