| Literature DB >> 33858496 |
Glenn Rademakers1, Maartje Massen1, Alexander Koch1, Muriel X Draht1, Nikkie Buekers1, Kim A D Wouters1, Nathalie Vaes1, Tim De Meyer2, Beatriz Carvalho3, Gerrit A Meijer3, James G Herman4, Kim M Smits1, Manon van Engeland1, Veerle Melotte5,6.
Abstract
PURPOSE: Colonoscopy and the fecal immunochemical test (FIT) are currently the most widely used screening modalities for colorectal cancer (CRC), however, both with their own limitations. Here we aim to identify and validate stool-based DNA methylation markers for the early detection of CRC and investigate the biological pathways prone to DNA methylation.Entities:
Keywords: Colorectal cancer; DNA methylation; Diagnostic markers; In silico discovery; Nervous system
Mesh:
Substances:
Year: 2021 PMID: 33858496 PMCID: PMC8048074 DOI: 10.1186/s13148-021-01067-9
Source DB: PubMed Journal: Clin Epigenetics ISSN: 1868-7075 Impact factor: 6.551
Fig. 1Pipeline to select candidate methylation markers using The Cancer Genome Atlas (TCGA) database. Marker discovery is based on a selection procedure using methylation data (right section) and gene expression data (left section). The DNA methylation analysis resulted in a list of Infinium 450 k probes that were: (1) located in promoter CpG islands, (2) unmethylated in normal colon tissue, and (3) hypermethylated in tumor samples over all four stages cancer development. This list was compared with a list of genes downregulated in tumor compared to normal samples, and we checked the methylation status of the remaining probes in normal samples from 14 different cancer types, resulting in a list of 221 genes. Finally, we designed and tested primers for the probes with the highest sensitivity and specificity based on the TCGA data, resulting in the top five potential early detection markers for CRC
Clinicopathological features of the stool samples obtained from a hospital-based series
| Patient demographics | Normal ( | Carcinoma ( |
|---|---|---|
| Sex | ||
| Male | 24 (48.0) | 28 (66.7) |
| Female | 26 (52.0) | 14 (33.3) |
| Age (years) | ||
| Median (± StDev) | 55.5 (± 3.7) | 70.5 (± 10.5) |
| Histological type | ||
| Adenocarcinoma | – | 31 (73.8) |
| Signet ring cell carcinoma | – | 1 (2.4) |
| Mucinous adenocarcinoma | – | 2 (4.8) |
| High-grade neuroendocrine carcinoma | – | 1 (2.4) |
| Unknown | 7 (16.6%) | |
| Differentiation grade | ||
| Poor | – | 6 (14.3) |
| Moderate/well | – | 25 (59.5) |
| Unknown | – | 11 (26.2) |
| T-stage | ||
| Stage 1 | – | 0 (0.0) |
| Stage 2 | – | 7 (16.7) |
| Stage 3 | – | 24 (57.1) |
| Stage 4 | – | 10 (23.8) |
| Unknown | – | 1 (2.4) |
| Location | ||
| Proximal | – | 14 (33.3) |
| Distal | – | 25 (59.5) |
| Unknown/other | – | 3 (7.2) |
Tissue retrieved retrospectively from the tissue archive of the department of Pathology of the Maastricht University Medical Center
P < 0.000 compared to normal, one-way ANOVA was used
Fig. 2Early detection methylation marker validation using fecal DNA from healthy controls and CRC patients. a Receiver operating characteristic (ROC) curve for marker validation of GDNF, HAND2, SLC35F3, SNAP91 and SORCS1 on fecal DNA to determine optimal sensitivity/specificity. The jagged lines indicate the different ROC curves for each independent marker. The dashed line represents the line of no discrimination between good and bad classification. b Methylation frequency (%) of single markers in fecal DNA of healthy controls (white bar) and CRC patients (black bar). The exact number of methylated samples is indicated in the table below for both groups (methylated samples/total number samples). For the healthy control group, the specificity is given; for carcinomas the sensitivity was determined. Pearson’s chi-square test was used to calculate P values. c ROC curve for the best performing marker panel (GDNF/SNAP91) based on the cutoffs for GDNF and SNAP91. The jagged (green) line indicates the ROC curve for this specific panel. The gray line represents the line of no discrimination between good and bad classification. d Methylation frequency of the GDNF/SNAP91 marker panel in fecal DNA of healthy controls (white bar) and carcinomas (dark gray bar). Fisher’s exact test was used to calculate P values. e ROC curve for the best performing marker panel (SLC35F3/SNAP91) without cutoff. f Methylation frequency of the SLC35F3/SNAP91 marker panel in fecal DNA of healthy controls (white bar) and carcinomas (dark gray bar). Fisher’s exact test was used to calculate P values
Promoter methylation markers in colorectal carcinomas compared with clinicopathological features
| M | U | M | U | M | U | M | U | M | U | |
|---|---|---|---|---|---|---|---|---|---|---|
| Tumors | ||||||||||
| Number | 17 (40.5) | 25 (59.5) | 11 (26.2) | 31 (73.8) | 15 (35.7) | 27 (64.3) | 19 (45.2) | 23 (54.8) | 17 (40.5) | 25 (59.5) |
| Sex | ||||||||||
| Male | 11 (64.7) | 17 (68.0) | 8 (72.7) | 20 (64.5) | 10 (66.7) | 18 (66.7) | 11 (57.9) | 17 (73.9) | 10 (58.9) | 18 (72.0) |
| Female | 6 (35.3) | 8 (32.0) | 3 (27.3) | 11 (35.5) | 5 (33.3) | 9 (33.3) | 8 (42.1) | 6 (26.1) | 7 (41.1) | 7 (28.0) |
| | 0.824 | 0.620 | 1.000 | 0.273 | 0.374 | |||||
| Cancer stage | ||||||||||
| Stage I | 0 (0.0) | 0 (0.0) | 0 (0.0) | 0 (0.0) | 0 (0.0) | 0 (0.0) | 0 (0.0) | 0 (0.0) | 0 (0.0) | 0 (0.0) |
| Stage II | 5 (31.3) | 2 (8.0) | 2 (20.0) | 5 (16.1) | 3 (21.4) | 4 (14.8) | 6 (33.3) | 1 (4.3) | 4 (25.0) | 3 (12.0) |
| Stage III | 8 (50.0) | 16 (64.0) | 5 (50.0) | 19 (61.3) | 7 (50.0) | 17 (63.0) | 8 (44.5) | 16 (69.6) | 8 (50.0) | 16 (64.0) |
| Stage IV | 3 (18.7) | 7 (28.0) | 3 (30.0) | 7 (22.6) | 4 (28.6) | 6 (22.2) | 4 (22.2) | 6 (26.1) | 4 (25.0) | 6 (24.0) |
| P value | 0.153 | 0.818 | 0.721 | 0.523 | ||||||
| Tumor location | ||||||||||
| Proximal | 0 (0.0) | 14 (60.9) | 1 (9.1) | 13 (46.4) | 0 (0.0) | 14 (56.0) | 0 (0.0) | 14 (66.7) | 0 (0.0) | 14 (60.9) |
| Distal | 16 (100.0) | 9 (39.1) | 10 (90.9) | 15 (53.6) | 14 (100.0) | 11 (44.0) | 18 (100.0) | 7 (33.3) | 16 (100.0) | 9 (39.1) |
| | 0.000 | 0.029 | 0.000 | 0.000 | 0.000 | |||||
| Age (years) | ||||||||||
| Mean age (± SD) | 71 (± 7.6) | 70 (± 12.2) | 73 (± 7.7) | 70 (± 11.2) | 70 (± 9.6) | 71 (± 11.1) | 70 (± 9.0) | 71 (± 11.7) | 70 (± 9.0) | 71 (± 11.5) |
| P value | 0.927 | 0.333 | 0.812 | 0.663 | 0.885 | |||||
Pearson's chi-square (sex, stage and tumor location) and independent samples t tests (age) were used to calculate P values
Fig. 3NDRG4 methylation and FIT performance in fecal DNA and combined with the previously established marker panel. a ROC curve for NDRG4. The jagged (green) lines indicate the ROC curve. The gray line represents the line of no discrimination between good and bad classification. b Methylation frequency (%) of NDRG4 methylation markers in fecal DNA of healthy controls (white bar) and carcinomas (dark gray bar). Pearson’s chi-square test was used to calculate P value. c The performance of FIT within the study population containing normal (n = 50) and carcinoma (n = 43) samples. d ROC curve for FIT alone and for FIT with GDNF/SNAP91/NDRG4. The green lines indicate the ROC curve for FIT alone while the purple line indicates the ROC for FIT/NDRG4/SNAP91. The gray line represents the line of no discrimination between good and bad classification. e, f The best performing marker panels GDNF/SNAP91/NDRG4 (e) and SNAP91/NDRG4 (f) in combination with FIT. For all figures, the 95% CI is shown with the error bars. Pearson’s chi-square test was used to calculate P values
Fig. 4Gene ontology enrichment and pathway analysis of the full set (n = 221) of identified methylation markers. a Comparisons of the frequency of neuro-related gene ontologies in the complete set of gene ontologies (yellow bar) versus the enriched sets obtained via analysis by GOrilla (green) and clusterProfiler (dark blue). The three major subsets of gene ontologies have been accounted for: cellular component (CC), biological process (BP) and molecular function (MF). b Pathway analysis was performed using the ToppGene tool based on 221 genes with potential as early detection biomarkers. ToppGene links, gene lists with pathways described in three major pathway databases (KEGG pathway, Reactome and PantherDB). Nervous system-related genes are highlighted in green, nervous system-related pathways in orange and genes and pathways with a neuronal background are linked with black lines, other with gray lines