| Literature DB >> 34151267 |
Jiajun Cai1, Chang Zeng2, Wei Hua2, Zengxin Qi2, Yanqun Song3, Xingyu Lu3, Dongdong Li3, Zhou Zhang2, Xiaolong Cui4, Xin Zhang1, Zixiao Yang1, Jinsen Zhang1, Kai Quan1, Wei Zhu1, Jiabin Cai5, Chuan He4, Shi-Yuan Cheng6, Wei Zhang2, Ying Mao7.
Abstract
BACKGROUND: Gliomas, especially the high-grade glioblastomas (GBM), are highly aggressive tumors in the central nervous system (CNS) with dismal clinical outcomes. Effective biomarkers, which are not currently available, may improve clinical outcomes through early detection. We sought to develop a noninvasive diagnostic approach for gliomas based on 5-hydroxymethylcytosines (5hmC) in circulating cell-free DNA (cfDNA).Entities:
Keywords: 5-hydroxymethylcytosine; biomarker; cell-free DNA; diagnosis; glioma
Year: 2021 PMID: 34151267 PMCID: PMC8209591 DOI: 10.1093/noajnl/vdab049
Source DB: PubMed Journal: Neurooncol Adv ISSN: 2632-2498
Demographics and Clinical Characteristics of the Study Participants
| WHO II/WHO III Glioma | WHO IV Glioma (GBM) | Healthy Subjects | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Training ( | Validation ( | Training ( | Validation ( | Training ( | Validation ( | ||||||||||
| No./Value | % | No./Value | % |
| No./Value | % | No./Value | % |
| No./Value | % | No./Value | % |
| |
| Age – Years | 42.5 ± 11.4 | 41.1 ± 9.6 | .67a | 51.8 ± 12.7 | 49.5 ± 12.5 | .50b | 47.4 ± 13.0 | 46.9 ± 13.0 | .83a | ||||||
| Sex (Male) | 23 | 69.7 | 11 | 78.6 | .79b | 26 | 65 | 14 | 58.3 | .79b | 50 | 66.7 | 24 | 66.7 | 1.00b |
|
| .79b | .82b | |||||||||||||
| Mutant | 23 | 69.7 | 11 | 78.6 | 3 | 7.5 | 3 | 12.5 | |||||||
| Wild-type | 10 | 30.3 | 3 | 21.4 | 37 | 92.5 | 21 | 87.5 | |||||||
|
| 1.00b | .64b | |||||||||||||
| Mutant | 9 | 27.3 | 3 | 33.3 | 12 | 30 | 11 | 45.8 | |||||||
| Wild-type | 13 | 39.4 | 6 | 66.7 | 11 | 27.5 | 6 | 25 | |||||||
| 1p/19q co-deletion NGS | .97b | ||||||||||||||
| Yes | 10 | 30.3 | 3 | 33.3 | / | / | / | / | |||||||
| No | 14 | 42.4 | 6 | 66.7 | 23 | 57.5 | 17 | 70.8 | |||||||
|
| .80b | .49b | |||||||||||||
| Mutant | 9 | 27.3 | 5 | 35.7 | 15 | 37.5 | 10 | 41.7 | |||||||
| Wild-type | 22 | 66.7 | 8 | 57.1 | 19 | 47.5 | 7 | 29.2 | |||||||
|
| .51b | .96b | |||||||||||||
| Mutant | 21 | 63.6 | 8 | 57.1 | 33 | 82.5 | 20 | 83.3 | |||||||
| Wild-type | 8 | 24.2 | 6 | 42.9 | 5 | 12.5 | 2 | 8.3 | |||||||
| Hemisphere | .44b | 1.00b | |||||||||||||
| Left | 17 | 51.5 | 6 | 42.9 | 12 | 30 | 7 | 29.2 | |||||||
| Right | 11 | 33.3 | 8 | 57.1 | 25 | 62.5 | 14 | 58.3 | |||||||
| Location | |||||||||||||||
| Frontal | 14 | 42.4 | 10 | 71.4 | 15 | 37.5 | 11 | 45.8 | |||||||
| Temporal | 11 | 33.3 | 5 | 35.7 | 14 | 35 | 6 | 25 | |||||||
| Parietal | 3 | 9.1 | 2 | 14.3 | 2 | 5 | 3 | 12.5 | |||||||
| Occipital | 1 | 3 | 5 | 12.5 | 2 | 8.3 | |||||||||
| Thalamus | 1 | 3 | 2 | 5 | 1 | 4.2 | |||||||||
| Ultimate Treatment | |||||||||||||||
| Radiation therapy | 32 | 97 | 13 | 92.9 | 36 | 87.5 | 20 | 83.3 | |||||||
| Chemotherapy | 32 | 97 | 12 | 85.7 | 34 | 85 | 19 | 79.2 | |||||||
| Event | .91b | .55b | |||||||||||||
| Event | 4 | 12.1 | 1 | 7.1 | 21 | 52.5 | 6 | 25 | |||||||
| Event Time - Months | 11.0 ± 6.8 | 17.0 ± 0.0 | 10.0 ± 5.1 | 7.7 ± 2.9 | |||||||||||
| Event Free | 15 | 45.5 | 8 | 57.1 | 11 | 27.5 | 6 | 25 | |||||||
| Event Time - Months | 15.3 ± 3.0 | 17.9 ± 4.3 | 14.6 ± 2.3 | 19.2 ± 6.4 | |||||||||||
ATRX, ATRX chromatin remodeler; IDH, isocitrate dehydrogenase; TERT, telomerase reverse transcriptase; TP53, tumor protein p53; WHO, World Health Organization.
p: P-values obtained from a the 2-tailed t-test and b the Pearson’s chi-squared test for 2-sample proportions.
IHC: the results were from immunohistochemical assay.
NGS: the results were from next-generation sequencing.
No./Value: the continuous variables will be displayed as (mean ± sd); the categorical variables will be displayed as the number of patients in each category.
%: percentage out of the total number.
/: data not available.
Figure 1.Study design. Study subjects with WHO II-IV gliomas and age- and gender-matched healthy individuals are randomly divided into the training set (2/3) and the internal validation set (1/3) with a balanced distribution of age, gender, and IDH1 mutation status, followed by modeling. The models are evaluated in an independent validation set. NGS, next-generation sequencing; GBM, WHO IV glioma; lncRNA, long noncoding RNA; GLM, generalized linear model.
Figure 2.Genomic distributions of 5hmC in cfDNA. The 5hmC-Seal data from the 111 glioma patients (WHO II-IV) and 111 healthy individuals are used to characterize genomic distributions. (A) The 5hmC profiles are distinctly distributed across various genomic feature types. The read counts are normalized to per million counts. (B) Genomic distributions of differentially hydroxymethylated features (FDR < 0.05 and fold change > 20%) between glioma patients and healthy individuals. Up: up-modified; Down: down-modified. (C) Hierarchical clustering of samples based on differentially hydroxymethylated features (FDR < 0.05 and fold-change > 20%) between glioma patients and healthy individuals. (D) Comparison across tissues shows tissue specificity regarding the proportions of differentially hydroxymethylated H3K4me1 and H3K27ac sites (FDR < 0.05). (E) Comparison of the ranks of differentially hydroxymethylated genes (FDR < 0.01) from cfDNA in normal tissues (5hmC and mRNA) shows enrichment in the brain. TSS: transcription start site; TES: transcription end site; D: splice donor site; A: splice acceptor site; FDR: false discovery rate.
Figure 3.Performance of the integrated diagnostic models for gliomas. (A) The heatmaps show the final features selected in the training set (T) and the internal validating set (V). The AUCs in the training set and internal validation set show the performance of the integrated diagnostic models of (B) Glioma versus HEA: Alu- and lncRNA-based wd-scores; (C) GBM versus HEA: Alu-, gene body-, and lncRNA-based wd-scores; (D) WHO II-III versus HEA: LINE-1-, H3K27ac-, and lncRNA-based wd-scores; (E) IDH1 wild-type versus HEA: Alu-, gene body-, and lncRNA-based wd-scores for distinguishing; (F) IDH1 mutant versus HEA: H3K27ac-, gene body-, and lncRNA-based wd-scores; (G) IDH1 mutant versus wild type: Alu-, LINE-1-, and lncRNA-based wd-scores; (H) GBM versus WHO II-III: Alu- and gene body-based wd-scores; and (I) GBM versus II-III: Alu-, gene body-based wd-scores and IDH1 mutation status. (J) Performance of the final diagnostic model for gliomas (Panel B) in an independent set of 27 patients. (K) The boxplots show the distributions of wd-score derived from 4 gene bodies in all samples: HEA (n = 111), WHO II-III (n = 47), GBM (n = 64), patients with wild-type IDH1 (n = 71), and patients with mutant IDH1 (n = 40). AUC, area under the curve. HEA: healthy controls; T: training set; V: internal validation set; IDH1-wt: IDH1 wild-type; IDH1-mut: IDH1 mutant.
Elastic Net Selected Final Features
| Feature | Category | BaseMean | log2(fold-change) |
| FDR | Chr | Start | End | Annotation_2KB | Annotation |
|---|---|---|---|---|---|---|---|---|---|---|
| RGS4 | Gene body | 10.76 | 0.32 | .000 | 0.000 | chr1 | 163038565 | 163046592 | ||
| NPHS2 | Gene body | 18.17 | 0.39 | .000 | 0.000 | chr1 | 179519674 | 179545087 | ||
| PRR15 | Gene body | 10.41 | 0.28 | .000 | 0.000 | chr7 | 29603427 | 29606911 | ||
| NR2F2 | Gene body | 17.26 | 0.39 | .000 | 0.000 | chr15 | 96869167 | 96883492 | ||
| AC092415.1 | lncRNA | 18.58 | 0.45 | .000 | 0.000 | chr3 | 28038701 | 28070095 | ||
| RP11-659P15.1 | lncRNA | 14.73 | 0.34 | .000 | 0.000 | chr11 | 57811582 | 57827610 | OR9Q1 | |
| RP11-643G5.6 | lncRNA | 27.77 | 0.47 | .000 | 0.000 | chr11 | 89279805 | 89322779 | ||
| RP11-2L4.1 | lncRNA | 18.84 | 0.48 | .000 | 0.000 | chr16 | 82587654 | 82608815 | ||
| RP11-173L6.1 | lncRNA | 14.79 | 0.29 | .000 | 0.000 | chr18 | 73556880 | 73580207 | ||
| AC003973.4 | lncRNA | 10.34 | 0.64 | .000 | 0.000 | chr19 | 22200761 | 22219117 | ||
| CTC-435M10.6 | lncRNA | 20.00 | 0.32 | .000 | 0.000 | chr19 | 41931264 | 41932142 | CTC-435M10.3 B3GNT8 | BCKDHA B3GNT8 |
| RP11-87M18.2 | lncRNA | 40.09 | 0.40 | .000 | 0.000 | chrX | 36383741 | 36458375 | ||
| SINE/Alu_AluSx1_197020 | Alu | 9.48 | 0.26 | .001 | 0.019 | chr1 | 95143645 | 95143908 | ||
| SINE/Alu_AluSz_2273844 | Alu | 11.35 | 0.25 | .000 | 0.007 | chr2 | 26331749 | 26332078 | RAB10 | |
| SINE/Alu_AluSc_4223943 | Alu | 9.59 | 0.48 | .000 | 0.000 | chr7 | 32707218 | 32707529 | ||
| LINE/L1_L1P2_603915 | L1 | 9.90 | 0.33 | .000 | 0.002 | chr10 | 97535364 | 97538037 | ENTPD1 | |
| LINE/L1_L1MB8_1301438 | L1 | 17.94 | 0.25 | .000 | 0.002 | chr13 | 101006542 | 101009464 | PCCA | |
| LINE/L1_L1PB1_2691404 | L1 | 17.04 | 0.39 | .000 | 0.000 | chr20 | 29649078 | 29651586 | ||
| LINE/L1_L1PA4_2945947 | L1 | 11.59 | 0.29 | .000 | 0.002 | chr3 | 23337809 | 23343301 | UBE2E2 | |
| LINE/L1_L1PA5_3107752 | L1 | 10.74 | 0.19 | .003 | 0.026 | chr3 | 119282028 | 119288164 | ||
| LINE/L1_L1MC4_3200492 | L1 | 9.88 | 0.27 | .001 | 0.012 | chr3 | 171915141 | 171916338 | FNDC3B | |
| LINE/L1_L1PA2_3230068 | L1 | 11.46 | 0.27 | .000 | 0.007 | chr3 | 187608009 | 187614002 | ||
| LINE/L1_L1PB_3652941 | L1 | 9.47 | 0.26 | .000 | 0.005 | chr5 | 53551600 | 53553176 | ARL15 | |
| LINE/L1_L1MB3_3745436 | L1 | 9.53 | 0.23 | .001 | 0.011 | chr5 | 108682005 | 108682581 | PJA2 | |
| Rank_2809 | H3K27ac | 17.33 | 0.26 | .000 | 0.000 | chr2 | 131625860 | 131632306 | ARHGEF4 | |
| Rank_9305 | H3K27ac | 14.76 | 0.27 | .000 | 0.000 | chr8 | 145923444 | 145928208 | ||
| Rank_15572 | H3K27ac | 19.36 | 0.31 | .000 | 0.000 | chr15 | 100024591 | 100028923 | ||
| Rank_22708 | H3K27ac | 12.77 | 0.20 | .001 | 0.007 | chr4 | 15377249 | 15378595 | C1QTNF7 | |
| Rank_29494 | H3K27ac | 13.80 | 0.22 | .000 | 0.002 | chr10 | 24495172 | 24499400 | KIAA1217 | |
| Rank_31171 | H3K27ac | 13.05 | 0.42 | .000 | 0.000 | chr15 | 60290939 | 60300474 | ||
| Rank_31379 | H3K27ac | 10.98 | 0.24 | .001 | 0.004 | chr2 | 69930813 | 69932615 | ANXA4 | |
| Rank_31544 | H3K27ac | 10.00 | 0.31 | .000 | 0.000 | chr7 | 29603478 | 29606668 | PRR15 | PRR15 |
| Rank_36687 | H3K27ac | 10.22 | 0.45 | .000 | 0.000 | chr16 | 10021857 | 10030768 | GRIN2A | |
| Rank_37578 | H3K27ac | 15.46 | 0.26 | .000 | 0.000 | chr12 | 92838256 | 92840403 | ||
| Rank_39601 | H3K27ac | 12.92 | 0.23 | .000 | 0.001 | chr1 | 203303037 | 203305772 | ||
| Rank_41373 | H3K27ac | 10.69 | 0.34 | .000 | 0.000 | chr5 | 172504968 | 172508599 | CREBRF | |
| Rank_41739 | H3K27ac | 16.92 | 0.30 | .000 | 0.000 | chr10 | 13862041 | 13865998 | FRMD4A | |
| Rank_42607 | H3K27ac | 10.29 | 0.21 | .001 | 0.007 | chr12 | 66537458 | 66540359 | RP11-745O10.4 TMBIM4 | |
| Rank_47609 | H3K27ac | 11.08 | 0.24 | .000 | 0.003 | chr8 | 103134371 | 103136917 | NCALD | NCALD |
| Rank_49686 | H3K27ac | 12.59 | 0.30 | .000 | 0.000 | chr5 | 90421443 | 90424504 | ADGRV1 | |
| Rank_50546 | H3K27ac | 12.70 | 0.26 | .000 | 0.000 | chr10 | 97455745 | 97458888 | ||
| T2Rank_53694 | H3K27ac | 26.55 | 0.30 | .000 | 0.000 | chr12 | 83114658 | 83128925 | TMTC2 |
BaseMean: mean of normalized counts for all samples.
log2(fold-change): the logarithmic fold change(ie, log2(glioma/healthy); log2(fold-change) > 0, higher 5hmC modification levels in patients with gliomas. log2(fold-change) < 0, higher 5hmC modification levels in healthy individuals).
P: Wald Statistic P-value.
FDR: FDR-adjusted P-value.
Chr: chromosome location.
Start: start genomic coordinate.
End: end genomic coordinate.
Annotation: residing gene of the feature.
Annotation 2K: neighboring genes (2KB).
Figure 4.Functional exploration. (A) Venn diagram of residing or neighboring/host genes associated with differentially hydroxymethylated genomic features. (B) The GO enrichment analysis of residing or neighboring/host genes associated with differentially hydroxymethylated genomic features. (C) The KEGG pathway analysis of gene bodies that are differentially modified between gliomas and healthy individuals. (D) The KEGG pathway analysis of lncRNA elements-derived host genes. (E) The KEGG pathway analysis of Alu-derived host genes. (F) The KEGG pathway analysis of H3K27ac loci-derived host genes. GO, Gene Ontology; KEGG, Kyoto Encyclopedia of Genes and Genomes.