| Literature DB >> 35769265 |
Ilhame Diboun1,2, Sachin Wani3, Stuart H Ralston3, Omar M E Albagha1,3.
Abstract
Background: Paget's disease of bone (PDB) is characterized by focal areas of dysregulated bone turnover resulting in increased bone loss and abnormal bone formation with variable severity. PDB has a complex etiology and both genetics and environmental factors have been implicated. A recent study has identified many differentially methylated loci in PDB compared to healthy subjects. However, associations between DNA methylation profiles and disease severity of PDB have not been investigated.Entities:
Keywords: DNA methylation; bone; epigenetics; genetics; paget’s disease of bone
Year: 2022 PMID: 35769265 PMCID: PMC9235511 DOI: 10.3389/fcell.2022.903612
Source DB: PubMed Journal: Front Cell Dev Biol ISSN: 2296-634X
Characteristic features of study subjects.
| Characteristic | PDB* patients |
|---|---|
| Number | 232 |
| Gender (Male:Female) | 124:108 |
| Age (years) | 72.3 ± 8.1 |
| Age at Diagnosis (years) | 62.1 ± 10.9 |
| PDB severity Score (n)** | |
| All subjects | 6.6 ± 2.8 (232) |
| Males | 6.9 ± 3.1 (124) |
| Females | 6.3 ± 2.6 (108) |
| Age 30–39 years | 8.0 (1) |
| Age 40–49 years | 8.3 ± 2.1 (3) |
| Age 50–59 years | 7.4 ± 2.0 (13) |
| Age 60–69 years | 6.7 ± 2.3 (55) |
| Age 70–79 years | 6.4 ± 3.0 (116) |
| Age 80–89 years | 6.7 ± 3.3 (44) |
| | 33 |
Paget’s disease of bone. Data are presented as Mean ± SD.
PDB, severity scores were not signficantly different between male and female subjects (p = 0.13) or between the listed age groups (p = 0.49).
FIGURE 1Site-level association analysis with PDB severity. (A). Manhattan plot shows the CpG sites associated with PDB severity at FDR <0.05 (red horizontal line). The top 10 FDR-significant CpG sites are highlighted in color and annotated with the closest gene. (B). Q-Q plot shows the expected versus observed -log10 P. Lambda indicates the genomic inflation factor. (C–L) Box and whiskers plots show PDB severity versus methylation levels from the top 10 sites after correction for confounders.
A subset of FDR-significant CpG sites from site level association analysis with PDB severity. Listed are the top 10 most significant sites together with CpG sites with lower ranks that were located genes with known bone functions.
| CpG Site | Chr | Position | Beta† |
| FDR | Gene | Location‡ | ChromHMM | Rank |
|---|---|---|---|---|---|---|---|---|---|
| cg14252008 | 17 | 57598958 | -0.040 | 4.60 × 10−7 | 0.016 |
| Body | 8 | 1 |
| cg13454199 | 7 | 127914981 | 0.027 | 1.16 × 10−6 | 0.016 |
| Downstream | 13 | 2 |
| cg00040708 | 3 | 49838627 | 0.023 | 1.31 × 10−6 | 0.016 |
| TSS1500 | 13 | 3 |
| cg03694580 | 7 | 2271722 | -0.035 | 1.91 × 10−6 | 0.016 |
| 5′UTR | 1 | 4 |
| cg01872872 | 11 | 60382923 | -0.050 | 2.26 × 10−6 | 0.016 |
| TSS1500 | 13 | 5 |
| cg19616083 | 15 | 74908514 | -0.022 | 2.30 × 10−6 | 0.016 |
| Body | 1 | 6 |
| cg11946072 | 13 | 47371514 | -0.029 | 3.07 × 10−6 | 0.019 |
| TSS200 | 1 | 7 |
| cg01641286 | 2 | 10483916 | -0.021 | 5.33 × 10−6 | 0.025 |
| 5′UTR | 13 | 8 |
| cg09121994 | 1 | 181205431 | -0.044 | 5.37 × 10−6 | 0.025 |
| Upstream | 13 | 9 |
| cg05080074 | 11 | 13517684 | 0.022 | 5.92 × 10−6 | 0.025 |
| TSS200 | 13 | 10 |
| cg05094429 | 6 | 167536184 | 0.025 | 6.55 × 10−6 | 0.025 |
| 5′UTR | 4 | 11 |
| cg01132064 | 11 | 46382562 | -0.025 | 8.39 × 10−6 | 0.026 |
| Body | 4 | 12 |
| cg19311431 | 2 | 162091931 | 0.034 | 1.64 × 10−5 | 0.027 |
| Body | 11 | 26 |
| cg03014008 | 20 | 57463767 | 0.027 | 2.01 × 10−5 | 0.028 |
| Body | 1 | 30 |
| cg23635663 | 22 | 30639979 | 0.040 | 2.44 × 10−5 | 0.028 |
| Body | 10 | 37 |
| cg20609803 | 1 | 161184305 | -0.015 | 2.85 × 10−5 | 0.029 |
| TSS1500 | 10 | 42 |
| cg03349241 | 8 | 145743334 | -0.023 | 3.11 × 10−5 | 0.029 |
| TSS200 | 6 | 46 |
| cg03177593 | 3 | 171890144 | -0.027 | 3.40 × 10−5 | 0.029 |
| Body | 7 | 49 |
| cg18362496 | 11 | 2019930 | 0.013 | 3.71 × 10−5 | 0.029 |
| TSS1500 | 8 | 54 |
| cg24335984 | 3 | 50361554 | 0.018 | 3.79 × 10−5 | 0.029 |
| TSS1500 | 11 | 56 |
| cg06217803 | 7 | 128830440 | 0.021 | 4.23 × 10−5 | 0.030 |
| Body | 6 | 59 |
| cg19327006 | 6 | 143084770 | 0.017 | 5.20 × 10−5 | 0.040 |
| Body | 11 | 69 |
| cg17359265 | 5 | 178567126 | -0.017 | 5.39 × 10−5 | 0.040 |
| Body | 13 | 71 |
| cg08452338 | 3 | 149271990 | 0.028 | 6.29 × 10−5 | 0.046 |
| Body | 11 | 84 |
| cg07341934 | 20 | 57463711 | 0.022 | 6.50 × 10−5 | 0.046 |
| Body | 1 | 89 |
| cg17775490 | 20 | 45179354 | -0.026 | 7.18 × 10−5 | 0.049 |
| TSS200 | 6 | 100 |
CpG was manually mapped to the closest gene. The rest of the sites were mapped by Illumina †The beta value refers to the effect size from the linear model. Positions are in reference to GRCh37 genome build.‡Location relative to gene based on the longest isoform, TSS (Transcription Start Site), UTR (untranslated regions). ChromHMM, indicates chromatin state in the lymphoblastoid cell line GM12878: 1 active promoter, 2 weak promoter, 3 inactive promoter, 4 & 5 strong enhancer, 6 & 7 weak enhancer, 8 insulator, 9 transcriptional transition, 10 transcriptional elongation, 11 weak transcribed, 12 polycomb-repressed, 13 heterochromatin, 14 & 15 repetitive/copy number variation.
FIGURE 2Functional annotations of CpG sites associated with PDB severity. Summary of enrichment analysis of gene ontology (GO) functional categories associated with genes located in the 100 FDR-significant sites.
Statistically significant islands, gene bodies and promoters from region-level association analysis with PDB severity (all with Bonferroni corrected p < 0.05).
| Region | Chr | Start | End | # Sites |
| FDR | Gene |
|---|---|---|---|---|---|---|---|
| Island | 10 | 75755183 | 75761004 | 12 | 3.56 × 10−7 | 0.004 |
|
| Island | 12 | 114839007 | 114840942 | 4 | 5.34 × 10−7 | 0.004 |
|
| Island | 1 | 10750680 | 10754162 | 6 | 5.77 × 10−7 | 0.004 |
|
| Island | 6 | 150262044 | 150266173 | 12 | 7.12 × 10−7 | 0.004 |
|
| Island | 13 | 48608110 | 48616161 | 16 | 1.00 × 10−6 | 0.004 |
|
| Island | 5 | 179246001 | 179251299 | 16 | 1.00 × 10−6 | 0.004 |
|
| Gene body | 3 | 45969823 | 46000900 | 17 | 1.02 × 10−6 | 0.012 |
|
| Gene body | 9 | 126145211 | 126157693 | 26 | 2.11 × 10−6 | 0.012 |
|
| Promoter | 17 | 41149985 | 41150394 | 7 | 1.09 × 10−6 | 0.012 |
|
| Promoter | 11 | 60382082 | 60382923 | 3 | 1.32 × 10−6 | 0.012 |
|
| Promoter | 12 | 110939283 | 110941087 | 9 | 1.83 × 10−6 | 0.012 |
|
Positions are in reference to GRCh37 genome build.
FIGURE 3VCL Island-level association analysis with PDB severity. The top Bonferroni significant island was located in the promoter region of the VCL gene. Green indicates significance from island level analysis The island encompasses 12 CpG sites showing a synergy of methylation effects. Cg00995584 is the most significantly associated site with PDB severity and the colour code in the top regional association plot indicates the correlation levels between each other site and the reference site cg00995584. Ensembl genes are shown in the middle part (orange boxes indicate exons). ChromHMM indicates chromatin state in the lymphoblastoid cell line GM12878. The bottom part denotes the correlation level for all possible pairs of CpG sites.
FIGURE 4SQSTM1 Island-level association analysis with PDB severity. The Bonferroni-significant island on chromosome five overlapping with the SQSTM1 gene. Green indicated significance from island level analysis. The island encompasses 16 CpG sites showing a synergy of methylation effects. Cg17987505 is the most significantly associated site with PDB severity and the color code in the top regional association plot indicates the correlation levels between each other site and the reference site cg17987505. Ensembl genes are shown in the middle part (orange boxes indicate exons). ChromHMM indicates chromatin state in the lymphoblastoid cell line GM12878. The bottom part denotes the correlation level for all possible pairs of CpG sites.
FIGURE 5Validation of predictive models for PDB severity scores. Orthogonal Partial Least Square (OPLS) classifier was trained on the discover set and tested on the cross-validation set. The features consisted of the list of significant sites from site and region level analysis based on the discovery samples (Panels A and C, n = 2,247 CpG sites) and separately, on a best explanatory subset from this pool of sites obtained with the Glmnet (Panels B and D, n = 33 CpG sites) (A,B) Scatter plots of predicted versus observed PDB_severity scores for each model. The observed PDB severity scores were also dichotomized into two groups based on the median PDB severity score of seven; low severity (PDB_severity_score <7; blue) and high severity (PDB_severity_score ≥7; red) and subsequently used for ROC curve analysis showing the area under curve AUC for pooled sites (C) and Glmnet selected sites (D).