Literature DB >> 33762478

Genome-wide association analysis reveals regulation of at-risk loci by DNA methylation in prostate cancer.

Qiang Liu1,2, Gang Liu3, Darryl T Martin2, Yu-Tong Xing4, Robert M Weiss2, Jun Qi1, Jian Kang1.   

Abstract

Epigenetic changes are potentially important for the ontogeny and progression of tumors but are not usually studied because of the complexity of analyzing transcript regulation resulting from epigenetic alterations. Prostate cancer (PCa) is characterized by variable clinical manifestations and frequently unpredictable outcomes. We performed an expression quantitative trait loci (eQTL) analysis to identify the genomic regions that regulate gene expression in PCa and identified a relationship between DNA methylation and clinical information. Using multi-level information published in The Cancer Genome Atlas, we performed eQTL-based analyses on DNA methylation and gene expression. To better interpret these data, we correlated loci and clinical indexes to identify the important loci for both PCa development and progression. Our data demonstrated that although only a small proportion of genes are regulated via DNA methylation in PCa, these genes are enriched in important cancer-related groups. In addition, single nucleotide polymorphism analysis identified the locations of CpG sites and genes within at-risk loci, including the 19q13.2-q13.43 and 16q22.2-q23.1 loci. Further, an epigenetic association study of clinical indexes detected risk loci and pyrosequencing for site validation. Although DNA methylation-regulated genes across PCa samples are a small proportion, the associated genes play important roles in PCa carcinogenesis.

Entities:  

Keywords:  CpG sites; DNA methylation; expression quantitative trait loci; genome-wide association study; prostate cancer

Mesh:

Year:  2021        PMID: 33762478      PMCID: PMC8451484          DOI: 10.4103/aja.aja_20_21

Source DB:  PubMed          Journal:  Asian J Androl        ISSN: 1008-682X            Impact factor:   3.285


INTRODUCTION

Prostate cancer (PCa) is a common malignancy and leading cause of cancer death among men in the US where it is estimated that approximately 191 930 new PCa cases are expected to be diagnosed in 2020.1 Further, according to a Chinese statistical report, 60 300 PCa cases were confirmed in 72 population-based cancer registries in China from 2009 to 2011, with a mortality of 26 600, resulting in rankings of sixth and eleventh place, respectively, among all cancers affecting men, in China.2 As the Chinese population continues to age, both the incidence and mortality of PCa are expected to continue increasing over the next two decades. PCa cells are known to harbor a variety of genetic defects, including gene mutations and translocations, all of which provide the cells with new capabilities for dysregulated proliferation, immune system evasion, tissue invasion and destruction, inappropriate survival, and metastasis.3 Furthermore, there is abundant evidence that along with genetic changes,4 somatic epigenetic alterations also contribute to PCa carcinogenesis and metastasis.5678 Epigenetic gene inactivation in cancer cells is largely based on transcriptional silencing mediated by the aberrant CpG methylation of CpG-rich promoter regions.69101112 DNA methylation is a widely recognized epigenetic marker associated with diagnosis and prognosis in many malignancies.61314 Notably, abnormal DNA methylation has been reported to contribute to the occurrence and progression of PCa.1516 Previous studies of DNA methylation and PCa risk have found that specific promoter sequences are hypermethylated at a higher frequency in PCa tumor tissues than those in nontumor tissues.461017 Quantitative genetics has made a significant progress in revealing the genetic bases of complex traits, especially in developing sophisticated tools to identify the location of genes that impact complex traits. A region of the genome contributing to the variation in a quantitative trait, also known as quantitative trait loci (QTLs), has been used to study gene expression phenotypes (expression quantitative trait loci [eQTLs]) on a massive scale. DNA methylation quantitative trait loci (meQTLs) have been identified in pathological and physiological contexts. The genome-wide gene expression studies can provide information on genetic variation that affects gene expression levels.18 We can use linkage or association mapping to map cis- and trans-acting factors for many genes to explain the inheritance patterns. Previous reports showed that cis/trans-meQTLs could target different CpGs and clarified DNA methylation involvement in diseases and cancers.1920212223 Thus, eQTL analysis is a straightforward and popular method for discovering regulatory genome sites242526 and detecting underlying associations across the genome in PCa studies.2728 The Cancer Genome Atlas (TCGA, http://cancergenome.nih.gov/) is a database that collects multiple types of “omics” from thousands of samples and provides public data access to researchers. Accordingly, modified eQTL methods involving calculations of correlations among gene expression, phenotype, DNA methylation, copy number variations, and single-nucleotide polymorphisms (SNPs) have proven to be a powerful tool.293031 However, genome-wide correlations between gene expression, DNA methylation, and clinical phenotypes in PCa are not yet understood. Therefore, we performed a meQTL analysis of PCa samples from the TCGA database. Although genes regulated via methylation comprise a minor proportion of the genome (0.8%), these genes are enriched in some gene ontology (GO) groups and in important canonical cancer-related pathways. In our study, we mainly identified meQTL pairs on chromosomes 16q and 19q, which have been reported as high-risk regions for PCa, using SNP analyses. We also identified DNA methylation regions and genes associated with clinical indexes at 11q13 and 16q13 and found several genes regulated by DNA methylation that are important for prognosis. According to our meQTL analyses, we selected some androgen receptor (AR) gene-related CpG sites and some sites that are altered during PCa genesis, and correlated them with Gleason score.

MATERIALS AND METHODS

Data preanalysis

A transcriptome of a prostate adenocarcinoma (PRAD) gene evaluation was downloaded with the TCGA dataset. The nonprimary PCa samples were discarded. Furthermore, the primary tumor samples not providing either gene expression (evaluated with RNA sequencing [RNA-seq]) or DNA methylation information also were excluded from later analysis. As result, a total of 419 samples were used for meQTL identification. Methylation data also were downloaded for the TCGA dataset and were correlated with expression data. Methylation levels at the evaluated sites were estimated using an Illumina Human Infilium 450k BeadChip (Illumina, San Diego, CA, USA). After sample normalization, samples were combined to a methylation matrix according to the gene ID.

meQTL analysis

We used R software MatrixeQTL package32 (version 2.15.1; R Project for Statistical Computing, Vienna, Austria) for eQTL analysis. Each methylated CpG level was regarded as a continuous variable rather than a discrete variable. Correlations between the methylation level of each CpG site and each gene were evaluated using MatrixeQTL. As DNA methylation mostly influences the expression of genes via promoter regions, we distinguished cis- and trans-regulation for further analysis. The gene and methylation CpG sites were extracted from Illumina Human Infilium 450k BeadChip annotation files and University of California, Santa Cruz (UCSC) reference gene list locations. For reference genes, the gene location was counted from the transcription start site to terminal site. We set a P value threshold of 1 × 10−5 for cis-regulated and 1 × 10−6 for trans-regulated eQTLs. Cis-regulated meQTLs were defined as interacting pairs with tested DNA methylation sites and genes <1 megabase (MB), whereas trans-regulated meQTLs were defined as pairs with genes >1 MB or located on other chromosomes.

Tissue specimens and bisulfite modification of DNA

A total of 70 PCa patients who underwent radical prostatectomy at Xinhua Hospital affiliated to Shanghai Jiao Tong University School of Medicine (Shanghai, China), between July 2014 and January 2017, were enrolled in this study. Informed consent was obtained from all subjects. All cases were histologically confirmed and had clinical stage II or III tumors, no clinical evidence of lymph node or distant metastasis, available pathology specimens, and complete clinical and serum prostate-specific antigen (PSA) data. Patients with missing any variable needed to accurately assign them to a risk group (PSA, tumor [T] stage, or Gleason score) were excluded. Patients with missing information on any available demographic variables were excluded. All samples were retrieved from the archive of the Institute of Pathology, Xinhua Hospital, and were anonymously analyzed in accordance with the guidelines of the Ethics Committee of Xinhua Hospital Affiliated to Shanghai Jiao Tong University School of Medicine. We confirm that all experimental protocols were approved by the local ethics committee (approval No. XHEC-D-2016-005). Patients were categorized as low-, moderate-, or high-risk PCa based on the 2015 National Comprehensive Cancer Network guidelines (). All 70 cases included tumor tissues and adjacent nontumor tissues. Prostate tissue samples from these cases were obtained at the time of radical prostatectomy, and tumor cell contents were determined to exceed 70.0% of all tissues. Tissue microdissection yielded tumor and adjacent nontumor tissues from which DNA was extracted using an FFPE DNA Kit (Omega Bio-Tek, Norcross, GA, USA) according to the manufacturer's instructions. Extracted DNA concentrations were measured using a NanoDrop 2000 spectrophotometer (Thermo Fisher Scientific, Wilmington, DE, USA) and subsequently adjusted to approximately 40 ng ml−1. Prepared DNA was subjected to bisulfite treatment using the EZ DNA Methylation-Gold kit (ZYMO, Orange, CA, USA); converted DNA was dissolved in Tris-EDTA (TE) buffer and stored at or below −20°C for later use. Patient baseline characteristics Patients who met the study criteria with 14.3%, 68.6%, and 17.1% were classified as low, moderate, and high risk, respectively. PSA: prostate-specific antigen; T: tumor; N: node; M: metastasis; s.d.: standard deviation

Primer design and PCR

Specific primers were designed for 30 loci using the publicly available MethPrimer software package (http://www.urogene.org/methprimer/). A complete list of the primer pairs is available in . Biotinylated reverse primers were substituted with 5′-tailed unlabeled reverse primers (aaccttcaacaccccaaccatata), allowing single (expansive) biotinylated primers to be used for subsequent pyrosequencing™. All primers and tag sequences were provided by Sangon (Shanghai, China). The primers of the polymerase chain reaction amplification and pyrosequencing F: forward primer; R: reverse primer; SEQ: sequencing primer PCR amplification of DNA was conducted using a nested-PCR protocol, using the primers shown in . Two rounds of amplification reactions were performed. In the first round, a total reaction volume of 10 μl contains 1 mmol l−1 first primers (1 μl), bisulfite converted DNA (1 μl), ×2 PCR master mix (5 μl), and ddH2O (2 μl). In the second round, the total reaction volume was 30 μl: 4 μl of first-round PCR product as template, plus 10 mmol l−1 second primers (0.5 μl), ×2 PCR Master Mix (15 μl), and ddH2O (10 μl). PCR conditions were as follows: 94°C for 5 min, 30 cycles of denaturation at 94°C for 30 s, annealing at 55°C for 40 s, elongation at 72°C for 30 s, and an additional elongation step at 72°C for 7 min. Approximately 8 μl of each PCR product was separated by electrophoresis using a 3% agarose gel stained with GelRed (Invitrogen, Carlsbad, CA, USA) for 40 min at 120 V and visualized using a Gene Genius bio-imaging system (Syngene, Cambridge, UK).

Pyrosequencing™ methylation analysis

PCR products (20 μl) were added to a mix comprising Streptavidin Sepharose HP™ (3 μl; GE Healthcare, Dornstadt, Germany) and binding buffer (37 μl; Qiagen, Hilden, Germany). The contents were mixed at 2000g (Centrifuge 5810R, Eppendorf, Hamburg, Germany) for 10 min at room temperature. Using the Vacuum Prep Tool™ (Qiagen), according to the manufacturer's instructions, single-stranded PCR products were prepared. Sepharose beads with attached single-stranded templates were released into a PSQ 96 Plate Low™ (Qiagen) containing a mix of 40 μl annealing buffer (Qiagen) and the corresponding sequencing primer at 400 nmol l−1 (). Pyrosequencing™ reactions were performed in a PyroMark ID System (Qiagen), according to the manufacturer's instructions, using the PyroMark Gold 96 Reagent Kit (Qiagen). CpG site quantification was performed using Pyro Q-CpG™ methylation software (Qiagen).

Statistical analyses

The cis- and trans-meQTL sites were identified using R package “MatrixeQTL”. The P values were adjusted using Bonferroni method, and cis-meQTLs with false discovery rate (FDR) <1 × 10−5 and trans-meQTLs with FDR <1 × 10−6 were identified as meQTLs. Association significance between clinical indexes and gene expression/methylation levels was identified using analysis of variance (ANOVA) test (P < 0.05 being considered as significant). Gene expression comparison between recurrence and nonrecurrence group was performed using Student's t-test, and P < 0.05 was considered as statistically significant.

RESULTS

Methylated meQTLs in PCa

We subjected PCa samples including 419 samples from TCGA to a meQTL analysis using expression levels evaluated via RNA-seq and CpG site methylation levels in CpG island (CGI) regions. Among 20 321 genes, the methylation levels of 485 513 CpG loci were used for meQTL identification. We identified 5852 cis-regulation and 5 156 662 trans-regulation meQTL pairs in our dataset. Cis-regulation pairs included 1717 genes (8.4% of all genes tested) and 4895 corresponding CpG sites, with an FDR (Bonferroni) of <0.01. Among these pairs, 784 genes (45.7%) were regulated by multiple CpG sites, and 1661 CpG sites (33.9%) regulated multiple genes. We noticed that adenosine triphosphate (ATP)-binding cassette subfamily A member 17, pseudogene (ABCA17P), chromosome 11 open reading frame 85 (C11orf85), and cyclic adenosine monophosphate (cAMP) responsive element binding protein 3 like 3 (CREB3L3) were regulated according to the methylation levels of 135, 95, and 93 nearby CpG sites, respectively, which are the top three CpG sites-related genes, suggesting the importance of DNA methylation on these genes. We also found that significantly high numbers of CpG sites in meQTLs were located on 19q13.2–q13.43 (P = 1 × 10−11) and 16q22.2–q23.1 (P = 1 × 10−11), as shown in and , consistent with previous reports.3334 According to GATHER (http://gather.genome.duke.edu/), the genes with significant involvement are located at 19q13, 16p13, 11q13, and 17q25 (). Through a GO and pathway analysis using the database for annotation, visualization, and integration discovery (DAVID),35 we discovered that although most GO groups appeared stochastic, several signaling pathways associated with PCa, including the Janus kinase/signal transducer and activator of tran-ions (JAK/STAT), vascular endothelial growth factor (VEGF), and cytokine–cytokine interactions, also are involved (Supplementary Table and ). Finally, the analysis of trans-regulation pairs identified 18 999 genes (93.5% of tested genes) and 239 808 CpG sites, with an FDR of <1 × 10−6. In summary, SNP analysis identified the locations of CpG sites and genes within at-risk loci, including the 19q13.2–q13.43 and 16q22.2–q23.1 loci. The CpG sites were significantly enriched on some cytobands Gene cytogenetics band enrichment loci of methylation quantitative trait loci-related genes according to a gene annotation tool to help explain relationships (http://gather.genome.duke.edu/) FE: Fisher’s exact test; FDR: false discovery rate Methylation quantitative trait loci related gene-enriched gene ontology terms according to database for annotation, visualization, and integration discovery GO: gene ontology; FDR: false discovery rate Methylation quantitative trait loci-related gene-enriched signaling pathways FDR: false discovery rate; KEGG: Kyoto Encyclopedia of Genes and Genomes Circos plot showing the CpG density of eQTLs. The density of CpG sites associated with eQTLs (from inside to outside) across the genome. Each wedge represents a chromosome and different colors show different cytobands. In the inner track, from inner to outer, the fold line represents the values. Higher values mean that more meQTL-related CpG sites are over-present in these regions. eQTLs: expression quantitative trait loci; meQTL: methylation quantitative trait loci.

Gene/CpG methylation and clinical information

To identify the mechanism by which these CpG sites affect carcinogenesis and development, we performed a correlation analysis and an ANOVA between gene expression levels and clinical indexes. The evaluated genes included 20 532 genes from the TCGA database that were evaluated via RNA-seq and correlated with clinical information, including primary and secondary Gleason score, node invasion stage, biochemical recurrence indicators, most recent PSA level, and clinical and pathological primary tumor stages. Each clinical index was associated with a number of genes, as shown in . Number of genes identified significantly associated with clinical observations N: node; PSA: prostate-specific antigen; T: tumor Genes associated with the same clinical information may affect each other. We further performed a similar analysis to compare methylation sites and clinical information and identified 57 719 CpG sites significantly correlated with the aforementioned clinical information (). Number of CpG sites significantly associated with clinical observations N: node; PSA: prostate-specific antigen; T: tumor

Methylation sites influence clinical performance by regulating gene expression

A canonical function of DNA methylation is the blockade of transcription factors from binding gene regulatory elements, thus inhibiting gene expression. To reduce the FDR and determine causality, we sought CpG sites that were significantly correlated with gene expression via cis-regulation according to methylated QTLs. These genes were positively correlated further with clinical information (), and the CpG sites also were positively correlated with the same clinical indexes. We identified 92 paired methylated QTLs that fulfilled the aforementioned criteria. Most methylated QTLs were associated with Gleason score; a DAVID-based GO analysis36 revealed that these genes were significantly enriched for DNA repair and cell cytoskeleton. Most of the sites associated with Gleason score were located in 11q13 and 16q13, and their correlated genes included essential meiotic structure-specific endonuclease subunit 2 (EME2), potassium channel tetramerization domain containing 13 (KCTD13), kelch like family member 17 (KLHL17), and WD repeat domain 90 (WDR90). Of these, EME2 is associated with genomic stability maintenance, whereas reports of the other genes in the context of all types of cancer are scarce. CpG sites and gene expression significantly associated with clinical information SNP: single nucleotide polymorphism; FDR: false discovery rate; CHR: chromosome; PSA: prostate-specific antigen We identified that calcium/calmodulin-dependent serine protein kinase interacting protein 1 (CASKIN1) was enriched with CpGs and CASKIN1 itself was also significantly associated with genes related to biochemical cancer recurrence (P = 0.001; ). According to the methylated QTL algorithm, these CpG sites’ methylation levels may act as cis-regulators of CASKIN1, suggesting that the methylation of nearby CpG sites probably influences the expression of CASKIN1 and thus affects the recurrence in these cases. The violin plot of relative expression of CASKIN1 between biomedical recurrence and nonrecurrence subgroup evaluated by TCGA. The expression of CASKIN1 is significantly different between recurrence and nonrecurrence group (P = 0.001). CASKIN1: calcium/calmodulin-dependent serine protein kinase interacting protein 1; TCGA: The Cancer Genome Atlas.

Methylated QTLs in PCa-related genes

An analysis of these regulation pairs should include the regulatory status in both prostate and PCa cells. To extract useful information for later analysis, we selected meQTLs concerning genes associated with PCa genesis and metastasis.373839404142 Ninety-two genes were used for meQTL selection. In our meQTL dataset, 38 cis-regulated meQTLs existed near these sites, and 1753 trans-regulated eQTLs were identified. Genes of interest among cis-eQTLs included alanyl aminopeptidase, membrane (ANPEP), activating transcription factor 3 (ATF3), B cell leukemia/lymphoma 2 (BCL2) associated X (BAX), and early growth response 1/3 (EGR1/3), suggesting that the expression levels of these PCa-related genes are regulated by the methylation levels of nearby CpG sites. For further analysis, these gene and methylation findings were validated in 70 pairs of clinical tumorous and nontumorous tissue samples. The patients from whom samples were collected were classified as low, moderate, and high risk according to Gleason score, mean PSA, and tumor/node/metastasis (TNM) clinical staging; a variety of tissues from different risk levels were subjected to methylation quantification via pyrosequencing™. In our results, the false discovery rate was relatively high because we did not detect the expression levels of the selected meQTL-related genes. However, we still detected several differences in methylation sites between normal and tumorous samples (). These sites included meQTL pairs involving FosB proto-oncogene, activator protein-1 (AP-1) transcription factor subunit (FOSB), ANPEP, and ras-related dexamethasone-induced 1 (RASD1). In tumorous samples, the methylation levels at these sites, particularly cg20664996 near RASD1, were low. We also noticed that in high- and moderate-risk samples, the methylation levels at this site reached 0. RASD1 has been reported as an apoptosis- and ras-related gene that prevents aberrant cell growth in several cell lines.43 In short, an epigenetic association study of clinical indexes detected risk loci and pyrosequencing for site validation. Although the number of DNA methylation-regulated genes across PCa samples is a small proportion, the associated genes play essential roles in PCa carcinogenesis. Four CpG loci that were significantly differently methylated according to the validated dataset. In each CpG locus, the left dots indicate the relative methylation level of the normal tissues and the right is the corresponding cancerous tissues, the P values between the normal tissues and cancerous tissues are shown from left to right.

DISCUSSION

DNA methylation is one among the most common and well-characterized epigenetic changes in PCa.567891011 The importance of large-scale methylome studies has increased in biomedical fields, and high-throughput sequencing technologies promise sensitive, quantitative, and high-resolution large-scale DNA methylation analyses. A genome-wide association study (GWAS) based on QTL analysis is an effective method with which cancer-related genes and risk loci throughout the genome can be studied;141544 similarly, the usefulness of methylation-based GWAS has been reported.45 However, despite reports on the association of DNA methylation with carcinogenesis and development in the prostate,46474849 genome-wide analyses of the associations between gene expression, DNA methylation, and clinical information related to PCa continue to yield vague results. Here, we utilized TCGA to obtain DNA methylation data from more than 400 000 CpG sites, expression levels of over 20 000 genes, and clinical indexes for 419 PCa samples, to perform a methylated QTL analysis in which we identified 5852 cis-regulating pairs comprising 1712 genes. Compared with previous breast cancer studies, only 0.8% of genes associated with PCa are affected by DNA methylation, and these genes exhibit significant involvement in several pathways that previously have been reported to play important roles in PCa genesis, including the JAK/STAT and VEGF pathways.5051 We also noticed that the associated genes and locations were significantly enriched at several genome locations, including 19q13.2–q13.43 and 16q22.2–q23.1. As these are cis-interactions within 1 MB, the regulated genes also are enriched at these sites. Previous reports of SNPs and PCa noted an association between SNPs at 19q13 and PCa risk.52 Another GWAS of a Chinese population demonstrated 19q13 as a novel risk locus for PCa,33 and a decade-long study detected chromosomal deletion and gene suppression at 16q22 in PCa.53 It was worth noting that this region was reported to show a loss of heterogeneity in breast cancer.54 In addition to somatic mutations, SNPs, and copy number variations, our data suggest that DNA methylation may also influence PCa genesis and probably plays important roles in this process by regulating the transcription of this gene nexus. Although a previous study based on eQTL algorithms aimed to define the relationships between genomic, epigenomic, and transcriptomic changes,2631 the link between the data obtained and clinical information remains unclear. Heyn et al.55 reported a TCGA QTL analysis using 13 tumor types from TCGA, including PCa, but their study lacks correlations to PCa-specific clinical variables such as Gleason score and PSA. Their data sets (cancer samples and adjacent normal tissues) did not include control subjects without the disease; future studies are needed to determine if GWAS risk alleles exhibit similar relationships in cancer-unrelated donors. To further investigate the relationships of DNA methylation and gene expression with clinical information, we performed an association study between the three pairs (expression-methylation, methylation-clinical information, and clinical information-expression) and identified 92 genes regulated by CpG site methylation. Furthermore, we associated the expression levels of these genes with clinical information, particularly the Gleason score. GO analysis revealed that these 92 genes were significantly enriched with respect to DNA repair and cell cytoskeleton, suggesting that the genes probably direct the development and prognosis of PCa. Of these genes, we noted that EME2 previously was reported to maintain genomic stability,56 whereas the roles of the other genes were poorly elucidated. A locus analysis of these 92 genes demonstrated significant enrichment at 11q13 (P = 0.008, 3 genes) and 16q13 (P < 0.0001, 6 genes). The genes identified were enriched according to an in-house R script of the hypergeometric distribution, using the genes assayed for meQTLs as background. Enriched genomic loci with P < 0.01 were identified, and some genes associated with the mQTL were significantly enriched for DNA repair and cell cytoskeleton. Most of the sites associated with Gleason score were in 11q13 and 16q13, and their correlated genes included EME2,56 KCTD13,57 KLHL17,58 and WDR90.59 These special genes played critical roles in the biological and pathological characteristics of an organism. Studies based on SNPs or somatic mutation profiling have shown that 11q13 and 16q13 loci are associated with aggressiveness, invasion, and poor prognosis of PCa. Our results show that epigenetic alterations in these regions are associated with genes considered necessary for the progression of PCa and suggest that methylation at 11q13 and 16q13 loci is essential to PCa progression. We were the first to prove that CASKIN1 was enriched with CpGs, and CASKIN1 itself was also significantly associated with genes related to biochemical cancer recurrence. Although the mechanism of how CASKIN1 affects PCa is unclear, its interacting protein, CASK has been associated with survival of patients with colorectal cancer.60 For validation, we collected 70 clinical tissue samples from 2014 to 2017. Although these patients are Asian, we used the 2020 National Comprehensive Cancer Network (NCCN) guidelines as a criterion to classify distinct groups. Some studies suggested racial differences in PCa. Non-Whites were associated with a significantly higher likelihood of presenting with high-risk PCa (22.9% of Hispanics, 23.8% of Blacks, and 23.3% of those of other races compared with 19.0% of Whites; all P < 0.001).61 Age-adjusted PCa mortality in black men was more than double that of white men (42.0/100 000 vs 18.7/100 000). Asian men had a lower incidence of PCa and death than white men. Chinese and Japanese men have a relatively low risk of PCa than their European and North American counterparts.62 The method we describe is based on detecting methylation and expression differences between samples of PCa. Therefore, it aims to identify correlations that occur within particular subsets of cases. For example, we found that some genes of interest among cis-eQTLs have played essential roles in PCa, including ANPEP,6364 ATF3,6566 BAX,67 and EGR1/3.68 These genes are involved in tumorigenesis, metastasis, and recurrence of PCa. We selected 30 high-confidence CpG sites that have been reported to associate with PCa-related genes and evaluated methylation at related CpG sites (). Despite a relatively high FDR, we determined differences in the methylation levels of several CpG sites between cancerous and noncancerous tissues (associated genes included ANPEP and FOSB). FDR is a tremendous concern for this study. To reduce FDR, we first enrolled the samples using the inclusion criteria described in the methods part. Second, we selected the P-value of meQTL pairs as 1 × 10−5 instead of 0.01/0.05. Last, the FDR values were calculated using Bonferroni, which is the most rigorous method available, and 0.01 was used for cutoff. The details of selected 30 high-confidence CpG sites via methylation quantitative trait loci reported to associate with prostate cancer-related genes PCa: prostate cancer; SNP: single nucleotide polymorphism; FDR: false discovery rate; FOSB: FosB proto-oncogene, activator protein-1 (AP-1) transcription factor subunit; SCGB1A1: secretoglobin family 1A member 1; MUC16: mucin 16; IL-6: interleukin 6; WIF1: Wnt inhibitory factor 1; TTN: titin; AMACR: alpha-methylacyl-coenzyme A racemase; GNMT: glycine N-methyltransferase; RASD1: ras-related dexamethasone induced 1; ANPEP: alanyl aminopeptidase, membrane; NPY: neuropeptide Y; IL-1A: interleukin 1A; RYR2: ryanodine receptor 2; PENK: proenkephalin; EGR3: early growth response 3; C15orf48: chromosome 15 open reading frame 48 We further identified a correlation between the Gleason score and methylation level at cg20664996, a CpG location associated with RASD1 expression, according to an eQTL analysis. RASD1 is a 30 kDa G-protein that belongs to the Ras superfamily of small GTPases. Several robustly upregulated bicalutamide-dependent genes were identified by micro-array in LNCaP-ARW741L cells that were not significantly enhanced by dihydrotestosterone (DHT) in the LNCaP-LacZ line, including RASD1.69 Liu et al.70 showed that formononetin inhibited cell proliferation and induced apoptosis in DU-145 cells throughout the RASD1/mitogen-activated protein kinase (MAPK)/BAX pathway in PCa. The selected high-confidence CpG sites are reported to associate with some PCa-related genes, including FOSB,71 secretoglobin family 1A member 1 (SCGB1A1),72 mucin 16 (MUC16),73 alpha-methylacyl-coenzyme A racemase (AMACR),74 and glycine N-methyltransferase (GNMT).75 Emerging evidence has shown that these genes may be useful diagnostic and prognostic biomarkers for PCa.

CONCLUSIONS

We demonstrated that novel meQTL pairs were associated with PCa, and for the first time, we identified the locations of CpG sites and genes within at-risk loci, including the 19q13.2–q13.43 and 16q22.2–q23.1 loci. We further used pyrosequencing™ for validation and identified several genes that may impact the development and prognosis of PCa.

AUTHOR CONTRIBUTIONS

QL conceived and participated in its design, searched databases, and extracted and assessed studies. QL and GL carried out the statistical analysis and interpretation of data and drafted the main manuscript. QL and DTM prepared the tables and figures. QL, YTX, and RMW prepared the figures and supplementary tables. QL, JK, and JQ participated in the conceptualization and design of the manuscript, performed the selection of studies, and drafted the manuscript. All authors reviewed and approved the final manuscript.

COMPETING INTERESTS

All authors declare no competing interests.
Table 1

Patient baseline characteristics

Variable Low risk Moderate risk High risk
Patient (n)10341412
Age (year), mean±s.d.65.4±7.767.3±9.565.6±8.768.1±12.3
PSA (ng ml-1), mean±s.d.8.6±1.215.4±2.417.2±1.732.7±10.5
Gleason score≤ 6 (3+3)7 (3+4)7 (4+3)≥ 8 (4+4, 4+5, 5+3, 5+4)
Clinical T-stage≤ T2aT2bT2b≥ T2c
Clinical N-stageN0N0N0N0
Clinical M-stageM0M0M0M0

Patients who met the study criteria with 14.3%, 68.6%, and 17.1% were classified as low, moderate, and high risk, respectively. PSA: prostate-specific antigen; T: tumor; N: node; M: metastasis; s.d.: standard deviation

Supplementary Table 1

The primers of the polymerase chain reaction amplification and pyrosequencing

Sites F1 F2
cg14338887, chr6:42928000,42929000ATTTGTTATTGGTTAGGTGGGGTTTGTTAGTAGTGTTTATGTTTTAAGTG
cg02213678, chr16:85112035,85113035TTGGTTATGGTTAGTAGAAAAGTTATTGGTTATGGTTAGTAGAAAAGTTA
cg09207578, chr13:114781449,114782449GTTTTAGGTTTGGTTGAGATGTAGATTGTTTTAGGTTTGGTTGAGATGTAGATT
cg15492820, chr11:61322613,61323613TTTTTAAAGTTTTATTATATTTTTGGGTTTTTTAAAGTTTTATTATATTTTTGGGT
cg12918275, chr17:18150454,18151454GTTATAGGGGTTAGGGTATGTTAGGGTTATAGGGGTTAGGGTATGTTAGG
cg08170140, chr22:31090247,31091247GATTTAGGAAGGATTTAGGGTTTAAATATGGGATTTAGGAAGGATTTAGGGTTTAAATATGG
cg20664996, chr17:17359321,17360321TATAATGAAGTGTGTATTAYGGGAGYGTTTTTAAGTTGATTTTTGGTTGTAGTG
cg09093477, chr5:34838689,34839689GAGATATAATATGTAGAGGTTAGTTTTGGGGTTTAATTTGYGGGGAAGGGAAAGYGGAG
cg14927321, chr15:89491114,89492114GAAGTAATTAGTATATTAATGTGGTATTTTGGGTGGGGGGTGATATTTTTTGGGT
cg15854606, chr19:18654048,18655048TTGGTTTYGTTTTTGTYGTGTTTATTGTTGGTTTATGTAGAYGTTYGTTTYGGG
cg186688 36, chr2:113102684,113103684GTTAGTTTTTAATTTTTGGGTTTAGGGATTGTATTGTATTAATTTTATTTATATATAGGTTTGTAT
cg19283427, chr18:29522880,29523880GAGGTGTATAATTGTAGGTYGGGTTTTTAAGTTYGTTTTTAGTAGAGTAGAGTTGAGAG
cg06577658, chr19:46295770,46296770TTYGTTTATAGGTGTTGTTAGAGTTTTGATTTAGGAAGGATTTAGGGTTTAAATATG
cg23158051, chr2:65454192,65455192TTAGTTTTTTYGTTATATTTTATTGTYGTAGTAGTGTTTGYGTTTTTTGTAGTTYGTTAAGTAG
cg04744134, chr19:8407628,8408628ATTTTTAATGTGGGAGGGGTTTGGTAGAGTTTTTAGTGGGGTTGGAAA
cg18770350, chr1:236849466,236850466TGGGTTYGTTTGTTAGTTAGTTYGTGTGGGTTYGTTTGTTAGTTAGTTYGTG
cg23139584, chr8:56987006,56988006TTGAGTTTAGTTGATATGAGTATTGGTAGTTGAGTTTAGTTGATATGAGTATTGGTAG
cg05407490, chr7:23749167,23750167TTAAATTTAYGTGAATAGATTAGGAAAGATTATTAAATTTAYGTGAATAGATTAGGAAAGATTA
cg23410069, chr22:20287122,20288122TAGTTTGTTGGTGTTGYGTTTTTGTTGTTGGYGYGTTAYGAGTAGGTG
cg05754508, chr12:65672031,65673031TTTTATGTTTATATTTGGATTYGGTTGTTAGGAATTTTTYGTTATGTTTTTYGTYGT
cg15850918, chr15:89900241,89901241TTTAGGTTAGATTAGGGATTAGTTTTAAGGGAGGATGAAGGATATAGAAG
cg25902616, chr11:47788693,47789693AAGGGTGAGAGTTTAGTTAATAGTGAAGGGTGAGAGTTTAGTTAATAGTG
cg23303108, chr8:23083078,23084078AAGAGTTTTGAAAGTTTTTAAATTAGGGGATAGGTTTTTTTAGYGGT
cg27069132, chrX: 135991043,135992043GAAATTTTTAGGATAGTTTTTTTTAGGYGGGGAAGTTTGGGGAT
cg09804858, chr6:42396621,42397621GAGTATTGGTGTTTGGAGAAATTTTTGGATGGATTTTGYGAGGTTTGG
cg14338887, chr6:42928000,42929000aaccttcaacaccccaaccatataCAACCCAAACAACCATACCTTATACaaccttcaacaccccaaccatataAGTGTTTATGTTTTAAGTGYGG
cg02213678, chr16:85112035,85113035aaccttcaacaccccaaccatataATACAAATAAATCTTCTCCCCCTTCaaccttcaacaccccaaccatataAGTGGGTGYGTAATTAGAGTTGTTTTTAG
cg09207578, chr13:114781449,114782449aaccttcaacaccccaaccatataCCAAAATTACCCCCTTACAACTAAaaccttcaacaccccaaccatataTAYGGTGAGGTYGGATTGTAGGGTTAT
cg15492820, chr11:61322613,61323613aaccttcaacaccccaaccatataACCCAAAACCATACATCTAAAAACAaaccttcaacaccccaaccatataTTGGGYGGGGYGGYGYGATG
cg12918275, chr17:18150454,18151454aaccttcaacaccccaaccatataCCCTCAAACTAAATCAAAAACTATACaaccttcaacaccccaaccatataAGGGTATGTTAGGGTGYGAAGGAT
cg08170140, chr22:31090247,31091247aaccttcaacaccccaaccatataCTTTCCCCATAAAACCRACCRACCaaccttcaacaccccaaccatataTATGGTTYGAGGTTGGTYGAGAG
cg20664996, chr17:17359321,17360321aaccttcaacaccccaaccatataATTTAAAATAAATACCCATCCAAACaaccttcaacaccccaaccatataTGTAGTGGTTTGGGTTTGTGTGG
cg09093477, chr5:34838689,34839689aaccttcaacaccccaaccatataCAACCCAAATCCRACCAAAAAAACAATACaaccttcaacaccccaaccatataAGGGTAAGGTAAGGGTTGTTGTAGG
cg14927321, chr15:89491114,89492114aaccttcaacaccccaaccatataACACCACCCTATAAAATTACTCATTAACaaccttcaacaccccaaccatataGTTTGGTAGTTTAATTTTTTTATTGTA
cg15854606, chr19:18654048,18655048aaccttcaacaccccaaccatataTATTAAAATCCAACCCAAAATCCCCaaccttcaacaccccaaccatataTTAAAGGGATYGAAGTTTTTYGGTTG
cg186688 36, chr2:113102684,113103684aaccttcaacaccccaaccatataTATTTCCTCCAACTTTCTAATTATTTAAAAAaaccttcaacaccccaaccatataTATAGGTTTGTATTTTTTTGTTAAAATT
cg19283427, chr18:29522880,29523880aaccttcaacaccccaaccatataCCAAATCRATTACCCACAACAACCAAAAAaaccttcaacaccccaaccatataAGTAGAGTAGAGTTGAGAGTTATTTA
cg06577658, chr19:46295770,46296770aaccttcaacaccccaaccatataCTTTCCCCATAAAACCRACCRACCaaccttcaacaccccaaccatataGGGTTTAAATATGGTTYGAGGTTGGTYG
cg23158051, chr2:65454192,65455192aaccttcaacaccccaaccatataATAAACTCCAATAAAAAATAAAAAAACaaccttcaacaccccaaccatataAGTTTTTGTGATATTTTAGGGAGAAAG
cg04744134, chr19:8407628,8408628aaccttcaacaccccaaccatataCAATCACCACCCCTACATTCCAAACCCaaccttcaacaccccaaccatataGTGYGTTTTYGAATTTATTTATTTAGGGG
cg18770350, chr1:236849466,236850466aaccttcaacaccccaaccatataTACTTCTCCCAAACTAAATCCAAAAACaaccttcaacaccccaaccatataTTYGTGYGTTYGAGTTTTTYGYGTTT
cg23139584, chr8:56987006,56988006aaccttcaacaccccaaccatataCCTTTCCAAAAATAAATATAAATACTTCTACTAAACaaccttcaacaccccaaccatataGTTGATATGAGTATTGGTAGAAATGTTT
cg05407490, chr7:23749167,23750167aaccttcaacaccccaaccatataCAAAAACCCCACACTACRCCTAaaccttcaacaccccaaccatataGTTTGAAGTTTTGYGAAAGGAATTGG
cg23410069, chr22:20287122,20288122aaccttcaacaccccaaccatataCCTCCCTTCCTAACTTCCCTTACaaccttcaacaccccaaccatataGGGTTTTTGTTYGTTTYGGGAG
cg05754508, chr12:65672031,65673031aaccttcaacaccccaaccatataAACTACRCACTTCCCAAAACaaccttcaacaccccaaccatataTTYGGAGTYGGGGAGGGAGGGAG
cg15850918, chr15:89900241,89901241aaccttcaacaccccaaccatataCCTCTAAAATATCTTTCAAATTCTCaaccttcaacaccccaaccatataGGTTAGTTAGGTTTATTTAGTTTTTAGG
cg25902616, chr11:47788693,47789693aaccttcaacaccccaaccatataAATAAAAAACAAATAAACAAAAAAAAaaccttcaacaccccaaccatataAGTGYGGTAGGGTTTGAATTAGAA
cg23303108, chr8:23083078,23084078aaccttcaacaccccaaccatataAAAAAAAATAAACATCAATCAACaaccttcaacaccccaaccatataTTTGTATTTAGYGTTGTAYGGG
cg27069132, chrX: 135991043,135992043aaccttcaacaccccaaccatataACTAATAACACTATAACTCCCAAAACaaccttcaacaccccaaccatataGTTYGGTTYGGYGGGGATTTTTA
cg09804858, chr6:42396621,42397621aaccttcaacaccccaaccatataAACAAAAAATACTTCCTTCCATTTCaaccttcaacaccccaaccatataGAGGTTTGGGAAGTTTTATTTATTTTA

F: forward primer; R: reverse primer; SEQ: sequencing primer

Supplementary Table 2

The CpG sites were significantly enriched on some cytobands

Locus Total genes with annotation Genes involved LOG10 P Genes
19q1311011011.19 A1BG APOC1 APOE ATP4A BCL2L12 CABP5 CAPN12 CEACAM4 CGB1 COX7A1 CPT1C CRX CYP2A13 CYP2B6 CYP2B7P1 CYP2F1 DBP DLL3 DMPK DPF1 EHD2 FLJ16165 FLJ23569 FLJ26850 FLJ32658 FLJ40125 FLJ40235 FLJ40321 FOSB FUT2 GIPR GPR32 GPR77 GRIN2D HSPB6 IGFL1 IL28B IL29 IL4I1 KCNA7 KCNJ14 KIR3DL3 KLC2L KLK12 KLK13 KLK14 KLK8 KLK9 LGALS14 LIG1 LMTK3 LOC112703 LOC126147 LOC147645 LOC199800 LOC342900 LOC388550 LOC440533 LRRC4B MAG MGC17986 MGC34799 MUC16 MYADM MYBPC2 NALP13 NALP4 NALP9 NANOS2 NAPSB NKG7 NKPD1 NPHS1 NUMBL NYD-SP11 PPP1R13L PPP1R15A PRKCG PRODH2 PSG1 PSG11 PSG3 PSG4 PSG7 R30953_1 RCN3 RPL18 RPS19 RSHL1 RUVBL2 SELV SIPA1L3 SLC6A16 SLC7A10 SPACA4 SPIB SYCN SYT3 TEX101 TIP39 TNNI3 TNNT1 UNQ467 VN1R4 ZFP36 ZNF30 ZNF331 ZNF342 ZNF473 ZNF541
16p13474711.19 C16orf35 C16orf5 C1QTNF8 CASKIN1 DNASE1L2 EME2 FBXL16 FLJ32252 FLJ34512 GNG13 HAGH HAGHL HBA1 HBAP2 HBQ1 HBZ IGFALS IL32 KIAA1924 KREMEN2 LOC146562 LOC283951 LOC342346 LOC400500 MGC2494 MMP25 MPN Magmas NME4 NOD3 NOXO1 PAQR4 PDIP PRM1 PRM2 PRM3 PRSS21 RAB11FIP3 RAB40C RGS11 RNU64 RPL3L SOCS1 SYNGR3 TNP2 TPSD1 WFIKKN1
11q13464611.19 ACY3 AIP B3GNT6 Bles03 CABP2 CABP4 CD5 CFL1 CST6 ESRRA FBXL11 FGF19 FGF3 FGF4 FIBP FLJ33790 FOLR1 FOSL1 GAL3ST3 GIF GPHA2 GPR152 HTATIP KIAA1394 LOC144097 LOC283129 LOC387778 MGC11102 MGC20410 MTL5 NXF OVOL1 SCGB1D1 SCGB1D2 SCYL1 SLC22A12 SLC29A2 SPTBN2 SSSCA1 SUV420H1 TNFRSF19L TSARG6 TSGA10IP TncRNA WNT11 YIF1
17q25323211.19 ANAPC11 CARD14 CBX8 DKFZP434P0316 EVPL EXOC7 FADS6 FASN FLJ31882 FLJ35767 FSCN2 GCGR GRIN2C HCNGP IREM2 LOC147111 LOC255275 LOC284001 LOC92659 MGC29814 OTOP3 PDE6G RAC3 RFNG RNF157 SECTM1 SPHK1 SRP68 USP36 UTS2R WBP2 ZC3HDC5
17q2137379.24 AOC2 C1QL1 CA10 CCR10 DLX3 DLX4 FLJ33318 FLJ40137 FLJ40342 G6PC GCN5L2 GIP GSDM1 HOXB1 HOXB2 HOXB7 HOXB8 IMP-1 IMP5 KRT20 LOC284067 LOC388389 LOC388394 MGC16309 MRPL10 NAGS PHOSPHO1 PLCD3 PNPO PPY PRAC PYY ProSAPiP2 SCAP1 TBX4 TREM4 ZNFN1A3
19p1361618.19 ANKRD24 ANKRD25 APC2 BRUNOL5 C19orf23 C19orf30 CAPS CASP14 CD37 CREB3L3 DF DNAJB1 EMR2 EPOR FLJ11535 FLJ25758 FLJ32416 FLJ35784 FLJ39369 FLJ40365 FLJ45910 FUT5 FUT6 GRIN3B HCN2 IER2 IL27RA ISYNA1 KIAA1086 LOC126536 LOC199675 LOC255809 LOC390874 LRG1 MADCAM1 MBD3L1 MGC15631 MGC17791 MGC23244 MGC24975 MGC39581 NANOS3 NRTN OR10H4 OR1I1 OR1M1 OR7A17 OR7C2 OR7D4 PCP2 PKN1 RAB3A RPL36 SAFB2 SHD SIRT6 THSD6 TIMM13 TMPRSS9 UNC13A VMD2L1
1q2137377.18 ANKRD34 ANXA9 APCS APOA2 AQP10 ATP8B2 C1orf45 CRP FCRH1 FCRH3 FLJ20519 FLJ37964 HAX1 IRTA1 IRTA2 KCNJ9 KCNN3 LCE1A LCE1D LCE2B LCE2C LCE3A LCE3E LCE4A LOC388698 LRRN6D MCSP MUC1 RHBG S100A2 S100A3 SH2D2A SPAP1 SPRR2A SPRR2B SV2A THHL1
12q1333335.95 AMHR2 AQP6 CACNB3 CYP27B1 FMNL3 GEFT HNRPA1 HOXC11 HUMCYT2A IGFBP6 IL23A INHBC ITGB7 K6IRS4 KRT6E KRTHB3 KRTHB5 KRTHB6 LETMD1 MGC17301 MLC1SA NXPH 4 OR10P1 OR6C1 OR6C65 OR6C68 OR6C74 OR9K2 SLC4A8 STAT2 TAC3 WNT10B ZNFN1A4
11p1543435.67 ASCL2 BRSK2 CALCA CNGA4 DKFZp761L1518 HBE1 HBG2 HCCA2 HSD-40 IFITM1 IFITM3 IGF2 IGF2AS KCNQ1DN KRTAP5-6 LDHA LOC387733 LOC439915 MRGX3 MRVI1 MUC2 MUCDHL NALP6 ODF3 OR51B4 OR51B5 OR51B6 OR51D1 OR51F2 OR51I2 OR52A1 OR52E6 OR52E8 OR52K1 OR52N1 OR52N2 OR56A4 OR56B1 SAA3P SYT9 TH USH1C ZDHHC13
20q1329295.14 BHLHB4 C20orf111 C20orf166 C20orf58 C20orf59 C20orf67 CBLN4 DNTTIP1 EDN3 FLJ30313 GATA5 GPR8 HRH3 KCNG1 KCNQ2 LOC198437 MC3R MYT1 NTSR1 PHACTR3 PPGB RIMS4 SGK2 SLC35C2 STMN3 SYCP2 UBE2C WFDC10A WFDC5
Supplementary Table 3

Gene cytogenetics band enrichment loci of methylation quantitative trait loci-related genes according to a gene annotation tool to help explain relationships (http://gather.genome.duke.edu/)

Annotation Total genes with annotation Your genes (with annotation) Your genes (no annotation) Genome (with annotation) Genome (no annotation) Ln (Bayes factor) Negative ln (P) FE: negative ln (P) FE: negative ln (FDR)
19q1311011014948572786924.3811.1931.1625.64
16p13474715573072841911.9511.1918.8213.99
11q1346461558320284069.9111.1916.7612.34
17q2532321572184285428.8511.1915.811.67
17q2137371567276284505.539.2412.338.42
19p1361611543598281284.398.1910.967.23
1q2137371567306284203.577.1810.286.71
12q1333331571277284492.355.959.085.64
11p1543431561410283162.095.678.75.38
20q1329291575240284861.645.148.375.15

FE: Fisher’s exact test; FDR: false discovery rate

Supplementary Table 4

Methylation quantitative trait loci related gene-enriched gene ontology terms according to database for annotation, visualization, and integration discovery

Term Count Percentage P List total Pop hits Pop total Fold enrichment Bonferroni Benjamini FDR
GO: 0007186~G-protein coupled receptor protein signaling pathway17910.498533729.80E-1911211123135281.9235401543.27E-153.27E-151.78E-15
Signal peptide39923.401759534.18E-1815883250191131.4776374731.58E-145.28E-157.72E-15
Topological domain: Extracellular33019.354838719.83E-1415882719191131.4607742143.73E-109.32E-111.82E-10
Membrane59935.131964815.76E-0615946256192351.1554042370.0037590972.90E-040.008605043
Cytokine341.9941348971.32E-051594181192352.2667530870.0085656016.14E-040.019654069
GO: 0007267~cell–cell signaling764.4574780061.92E-041121600135281.5285875710.4726415550.0446765860.349044181
GO: 0051302~regulation of cell division130.7624633433.35E-04112147135283.337901190.6722537930.0635120010.607711083
Growth factor241.4076246334.48E-041594131192352.2107713080.2542638940.0108072330.668115409

GO: gene ontology; FDR: false discovery rate

Supplementary Table 5

Methylation quantitative trait loci-related gene-enriched signaling pathways

Category Term Count Percentage P List total Pop hits Pop total Fold enrichment Bonferroni Benjamini FDR
KEGG_PATHWAYhsa04740:Olfactory transduction714.1642228741.22E-0945837950852.0799103592.14E-072.14E-071.49E-06
KEGG_PATHWAYhsa04080:Neuroactive ligand-receptor interaction442.5806451613.13E-0545825650851.9082628280.0054970440.002752310.038425378
KEGG_PATHWAYhsa04060:Cytokine-cytokine receptor interaction442.5806451615.55E-0545826250851.8645621520.0097193810.0032503470.068074717
KEGG_PATHWAYhsa05320:Autoimmune thyroid disease120.7038123170.0047476754585150852.6123811970.567243730.1889253235.672613309
KEGG_PATHWAYhsa00590:Arachidonic acid metabolism120.7038123170.0098644944585650852.3791328760.8253149910.29457454411.45423332
KEGG_PATHWAYhsa04610:Complement and coagulation cascades130.7624633430.0186255884586950852.0917979870.9634478880.42391653220.60338795
KEGG_PATHWAYhsa04672:Intestinal immune network for IgA production100.5865102640.0282829914584950852.2658408340.9935878210.51391178429.67691858
KEGG_PATHWAYhsa04630:Jak-STAT signaling pathway221.2903225810.03593306245815550851.5758557540.9984046160.55294787836.17726313
KEGG_PATHWAYhsa05310:Asthma70.4105571850.0407865464582950852.679942780.9993437250.55706204540.01015413
KEGG_PATHWAYhsa04950:Maturity onset diabetes of the young60.3519061580.067797274582550852.6646288210.9999956960.7093418457.74731756
KEGG_PATHWAYhsa04370:VEGF signaling pathway120.7038123170.070003434587550851.7764192140.9999971640.68688667558.95815416
KEGG_PATHWAYhsa04640:Hematopoietic cell lineage130.7624633430.0816084564588650851.6783030360.9999996890.71309231164.8191755
KEGG_PATHWAYhsa04664:Fc epsilon RI signaling pathway120.7038123170.0875350294587850851.7080953980.99999990.71067326467.50599408

FDR: false discovery rate; KEGG: Kyoto Encyclopedia of Genes and Genomes

Table 2

Number of genes identified significantly associated with clinical observations

Clinical variable Gene (n)
Primary Gleason score376
Secondary Gleason score13
Gleason score726
Nodes examined (entered N stage)58
Biochemical recurrence indicator14
PSA level25
Clinical T stage68
Pathological T stage96

N: node; PSA: prostate-specific antigen; T: tumor

Table 3

Number of CpG sites significantly associated with clinical observations

Clinical variable CpG sites (n)
Primary Gleason score4087
Secondary Gleason score6518
Gleason score11 321
Nodes examined (entered N stage)2271
Biochemical recurrence indicator7667
PSA level5350
Clinical T stage2583
Pathological T stage11 615

N: node; PSA: prostate-specific antigen; T: tumor

Supplementary Table 6

CpG sites and gene expression significantly associated with clinical information

SNPs Gene Statistic P FDR Beta CHR MAPINFO Strand PSA_most_recent_results P of methyl-clinical Clinical Type 1 P gene_exp-clinical Clinical Type 2
cg099064028519−7.9091087072.35E-141.99E-10−0.002515419111033236R2.77E-060.917349226PSA_recent_methyl0.917349226PSA_recent
cg000274992539804.4296305631.21E-050.01708860.0003218121630759507F0.0025358450.02366169gleason_methyl0.02366169gleason
cg000862663394514.1223291894.53E-050.0483834440.0002116111242895R0.0001526820.128466513gleason_methyl0.128466513gleason
cg0011251771534.2632466912.49E-050.0303593313.74E-051737783011R0.003740.973106811pathological_T_methyl0.287392771clinical_T.x
cg0011251771534.2632466912.49E-050.0303593313.74E-051737783011R1.15E-050.620386184gleason_methyl0.620386184gleason
cg0011251771534.2632466912.49E-050.0303593313.74E-051737783011R2.46E-060.591742547primary_gleason_methyl0.591742547primary_gleason
cg001398072539805.5679437164.62E-080.0001441845.38E-051629874869F0.0025358450.002673337gleason_methyl0.002673337gleason
cg00905951534711.210664441.13E-252.40E-210.0001488881623652875F0.001740.189760603pathological_T_methyl0.130660172clinical_T.x
cg00905951534711.210664441.13E-252.40E-210.0001488881623652875F3.28E-060.211537011gleason_methyl0.211537011gleason
cg00905951534711.210664441.13E-252.40E-210.0001488881623652875F6.80E-080.441059277primary_gleason_methyl0.441059277primary_gleason
cg00981594232044.0197919886.91E-050.0661945090.0005303861618813271R0.0001185410.004915781gleason_methyl0.004915781gleason
cg00981594232044.0197919886.91E-050.0661945090.0005303861618813271R0.0020539630.010918013primary_gleason_methyl0.010918013primary_gleason
cg0111095557524−4.8664504861.61E-060.003217512−1.34E-05161706924F0.0011850470.765035156biochemical_recurrence_methyl0.765035156biochemical_recurrence
cg01110955146330−5.1565665993.89E-070.000937772−0.00011792161706924F0.0006075640.004459742gleason_methyl0.004459742gleason
cg01110955197342−5.2289397512.70E-070.000684276−4.10E-05161706924F0.0001640070.004459742gleason_methyl0.004459742gleason
cg0111324657007−3.9881051487.86E-050.073139644−2.10E-052237033449F0.0059412320.506264545node_methyl0.506264545node.x
cg030543031463304.2337317962.83E-050.0334993330.000419375161662301R0.0006075640.013219481gleason_methyl0.013219481gleason
cg030543031973354.1416166074.18E-050.0454808980.000450108161662301R3.86E-070.013219481gleason_methyl0.013219481gleason
cg030543031973354.1416166074.18E-050.0454808980.000450108161662301R0.001210.205222679primary_gleason_methyl0.205222679primary_gleason
cg03228985339451−8.0978545166.22E-155.63E-11−8.10E-0511083642F0.0001526820.319957573gleason_methyl0.319957573gleason
cg0358800710541−4.1777800953.59E-050.040550739−0.0001561429100849274R2.46E-050.049332958gleason_methyl0.049332958gleason
cg0358800710541−4.1777800953.59E-050.040550739−0.0001561429100849274R0.0003990.007561488primary_gleason_methyl0.007561488primary_gleason
cg0358800710541−4.1777800953.59E-050.040550739−0.0001561429100849274R0.004780.461942007pathological_T_methyl0.461942007pathological_T
cg0378033822594.9346269591.16E-060.0024205511.39E-0513103244558F2.49E-060.958792934gleason_methyl0.958792934gleason
cg0378033822594.9346269591.16E-060.0024205511.39E-0513103244558F0.007090.550619929primary_gleason_methyl0.550619929primary_gleason
cg04007841283131−4.9459897841.10E-060.002305322−0.0013124761165307006F0.0011815420.300259221node_methyl0.300259221node.x
cg0412522393194.4140243061.29E-050.0180869091.57E-055218200R1.22E-080.006144373gleason_methyl0.006144373gleason
cg0412522393194.4140243061.29E-050.0180869091.57E-055218200R1.77E-060.099088812primary_gleason_methyl0.099088812primary_gleason
cg04588774575244.5105230128.42E-060.0126943877.04E-06163207967R0.0011850470.000893246biochemical_recurrence_methyl0.000893246biochemical_recurrence
cg04738496575245.1963953113.19E-070.0007884031.18E-05162021690R0.0011850470.067732981biochemical_recurrence_methyl0.067732981biochemical_recurrence
cg047384961973425.5264803955.77E-080.000175343.60E-05162021690R0.0001640070.454694526gleason_methyl0.454694526gleason
cg049776102539805.1924662433.25E-070.0008027038.05E-051630366665R0.0025358450.814245385gleason_methyl0.814245385gleason
cg050659262860764.015831317.03E-050.0670122661.08E-068144329806F0.0030558940.000258271gleason_methyl0.000258271gleason
cg051774377058−4.9499627911.08E-060.002267946−5.02E-056170582289F5.26E-050.060200416gleason_methyl0.060200416gleason
cg0531749857524−4.696909543.59E-060.006257313−1.21E-05161425256F0.0011850470.592952261biochemical_recurrence_methyl0.592952261biochemical_recurrence
cg05317498146330−5.7731513951.52E-085.30E-05−0.000122313161425256F0.0006075640.651898021gleason_methyl0.651898021gleason
cg05317498197335−5.8631746439.25E-093.39E-05−0.00013601161425256F3.86E-070.651898021gleason_methyl0.651898021gleason
cg05317498197342−5.3235502831.67E-070.000447261−3.89E-05161425256F0.0001640070.651898021gleason_methyl0.651898021gleason
cg05317498197335−5.8631746439.25E-093.39E-05−0.00013601161425256F0.001210.380661447primary_gleason_methyl0.380661447primary_gleason
cg059704373996644.119055224.59E-050.0489027717.31E-05191864622R0.0016679990.008596126gleason_methyl0.008596126gleason
cg06758143790004.8144601122.07E-060.0039673882.90E-06126798177R0.001310.391977245pathological_T_methyl0.085384829clinical_T.x
cg06758143790004.8144601122.07E-060.0039673882.90E-06126798177R4.95E-050.012484752gleason_methyl0.012484752gleason
cg06758143790004.8144601122.07E-060.0039673882.90E-06126798177R5.34E-090.912885436primary_gleason_methyl0.912885436primary_gleason
cg070089453394513.9528079079.07E-050.0809227940.000284181955720F0.0001526820.205918508gleason_methyl0.205918508gleason
cg088615561973354.1762766593.61E-050.0407197920.00017517716850614R3.86E-070.007086157gleason_methyl0.007086157gleason
cg0886155697274.5789852366.18E-060.0098613190.00024010316850614R4.31E-070.007086157gleason_methyl0.007086157gleason
cg0886155687864.7118022343.35E-060.0059326230.00025812916850614R0.0003827690.007086157gleason_methyl0.007086157gleason
cg088615561973354.1762766593.61E-050.0407197920.00017517716850614R0.001210.266894552primary_gleason_methyl0.266894552primary_gleason
cg1028682924914.7050998243.46E-060.0060745531.66E-06X100913838F5.02E-060.590714131gleason_methyl0.590714131gleason
cg1028682924914.7050998243.46E-060.0060745531.66E-06X100913838F0.0001070.685867114primary_gleason_methyl0.685867114primary_gleason
cg1049843471534.566013816.55E-060.0103563252.83E-051737783488F0.003740.029055459pathological_T_methyl0.088721131clinical_T.x
cg1049843471534.566013816.55E-060.0103563252.83E-051737783488F1.15E-050.970155723gleason_methyl0.970155723gleason
cg1049843471534.566013816.55E-060.0103563252.83E-051737783488F2.46E-060.72449589primary_gleason_methyl0.72449589primary_gleason
cg107054883394516.4880358432.47E-101.22E-060.00044679611550648R0.0001526820.039983988gleason_methyl0.039983988gleason
cg1157497040534.237765052.78E-050.033095959.23E-051475469471F0.0031385010.051219574gleason_methyl0.051219574gleason
cg11847126339451−8.1564217234.10E-153.77E-11−4.28E-0511248233F0.0001526820.985911022gleason_methyl0.985911022gleason
cg11855022570474.4739694319.92E-060.0145800895.27E-073145968700F0.00390.539977096node_methyl0.539977096node.x
cg124319776438665.6894765232.40E-087.99E-052.52E-051424769227R0.0020534720.001252535gleason_methyl0.001252535gleason
cg13444392677770−5.5850739884.22E-080.000132899−6.71E-0641608283R7.19E-090.003853796node_methyl0.003853796node.x
cg1350839123054.3517598541.70E-050.0225085050.000516527123068378R0.0004841240.097979899primary_gleason_methyl0.097979899primary_gleason
cg139179642221614.473164859.95E-060.0146141083.78E-05730635712F4.45E-050.955004144gleason_methyl0.955004144gleason
cg139179642221614.473164859.95E-060.0146141083.78E-05730635712F0.007310.239622641primary_gleason_methyl0.239622641primary_gleason
cg14820176801984.4804081269.64E-060.0142295716.00E-051166346739FALSE0.003730.621062934secondary_gleason_methyl0.621062934secondary_gleason
cg1482017610724.8735865531.56E-060.0031201130.0025769891166346739F0.0050051420.700418497gleason_methyl0.700418497gleason
cg14820176801984.4804081269.64E-060.0142295716.00E-051166346739F7.30E-100.700418497gleason_methyl0.700418497gleason
cg14820176801984.4804081269.64E-060.0142295716.00E-051166346739F0.001080.990656757primary_gleason_methyl0.990656757primary_gleason
cg159683077844.813589662.08E-060.0039813411.54E-051248577213R1.51E-050.103009304gleason_methyl0.103009304gleason
cg1663842557524−4.0952295325.07E-050.052583939−1.11E-05162946805F0.0011850470.394681318biochemical_recurrence_methyl0.394681318biochemical_recurrence
cg167126642539804.5860413665.98E-060.0095988230.0003479311630787355F0.0025358450.815286106gleason_methyl0.815286106gleason
cg17398515258974.1803092013.55E-050.0402178811.34E-058102092795R0.0003041680.378109272gleason_methyl0.378109272gleason
cg1769656350636−4.5844666726.02E-060.009657957−0.0001136072242756362R0.0001845010.082855296primary_gleason_methyl0.082855296primary_gleason
cg1812391133945110.898725971.66E-243.32E-200.00029896811550773F0.0001526820.670851582gleason_methyl0.670851582gleason
cg182255366438664.899087451.38E-060.0028051140.0001265441424658182F0.0020534720.493252557gleason_methyl0.493252557gleason
cg190923963394513.9306427659.92E-050.0863451697.16E-0611143751F0.0001526820.059898103gleason_methyl0.059898103gleason
cg19165854232514.181989553.52E-050.039996853.99E-061579725126R0.0040289960.17701659biochemical_recurrence_methyl0.17701659biochemical_recurrence
cg19414302339451−4.487557989.33E-060.01387107−2.41E-0511234354F0.0001526820.07787519gleason_methyl0.07787519gleason
cg1990138116094.2670567432.45E-050.0299923566.86E-0641577971F0.0029925710.147706171gleason_methyl0.147706171gleason
cg21908287576574.063248865.78E-050.05799691.14E-051155225385R2.61E-060.867305735gleason_methyl0.867305735gleason
cg21908287576574.063248865.78E-050.05799691.14E-051155225385R0.002550.954639475primary_gleason_methyl0.954639475primary_gleason
cg2206243223245.2174768292.86E-070.0007205281.36E-055180634502F0.0099018390.402124768gleason_methyl0.402124768gleason
cg230588636438665.4490346488.68E-080.0002504733.73E-051424610866F0.0020534720.005346928gleason_methyl0.005346928gleason
cg24378253339451−8.4256952725.87E-165.91E-12−0.00029815111017383R0.0001526820.046752456gleason_methyl0.046752456gleason

SNP: single nucleotide polymorphism; FDR: false discovery rate; CHR: chromosome; PSA: prostate-specific antigen

Supplementary Table 7

The details of selected 30 high-confidence CpG sites via methylation quantitative trait loci reported to associate with prostate cancer-related genes

Gene Symbol Chromosome SNPs Position FDR P References
2354 FOSB chr16cg02213678851125353.03E-443.66E-491, 2
7356 SCGB1A1 chr11cg02950427613232847.99E-034.82E-063, 4
94025 MUC16 chr19cg0474413484081281.72E-141.36E-185, 6
3569 IL-6 chr7cg05407490237496674.35E-051.22E-08 7 8 9
11197 WIF1 chr12cg05754508656725315.10E-041.94E-07 10 11 12 13
2354 FOSB chr19cg06577658462962709.10E-035.62E-061, 2
7273 TTN chr22cg08170140310907471.62E-323.26E-37 14 15 16
94025 MUC16 chr19cg0834167384082021.28E-035.56E-075, 6
7356 SCGB1A1 chr11cg08360511613229229.72E-044.06E-073, 4
23600 AMACR chr5cg09093477348391894.50E-024.11E-05 17 18 19
7356 SCGB1A1 chr13cg092075781147819495.43E-212.09E-253, 4
27232 GNMT chr6cg09804858423971217.66E-043.08E-07 20 21 22 23
51655 RASD1 chr17cg12918275181509542.28E-047.79E-0824, 25
27232 GNMT chr6cg14338887429285001.69E-082.57E-1220212223
290 ANPEP chr15cg14927321894916148.79E-035.40E-06 26 27 28
7356 SCGB1A1 chr11cg15492820613231136.78E-061.60E-093, 4
290 ANPEP chr15cg15850918899007411.28E-028.47E-06 26 27 28
4852 NPY chr19cg15854606186545488.51E-223.09E-26 29 30 31 32
3552 IL-1A chr2cg186688361131031841.79E-038.22E-0733, 34
6262 RYR2 chr1cg187703502368499664.31E-023.90E-0535, 36
2354 FOSB chr18cg19283427295233806.27E-389.96E-431, 2
51655 RASD1 chr17cg20664996173598212.22E-021.67E-0524, 25
94025 MUC16 chr19cg2288651284080837.60E-091.10E-125, 6
5179 PENK chr8cg23139584569875061.56E-036.98E-07 37 38 39
2354 FOSB chr2cg23158051654546924.02E-396.05E-441, 2
1960 EGR3 chr8cg23303108230835781.26E-028.31E-068, 40
2354 FOSB chr22cg23410069202876222.15E-393.20E-441, 2
1960 EGR3 chr8cg24278165230835511.83E-021.31E-058, 40
84419 C15orf48 chr11cg25902616477891933.59E-162.02E-20 41 42 43
7356 SCGB1A1 chrXcg270691321359915433.93E-211.50E-253, 4

PCa: prostate cancer; SNP: single nucleotide polymorphism; FDR: false discovery rate; FOSB: FosB proto-oncogene, activator protein-1 (AP-1) transcription factor subunit; SCGB1A1: secretoglobin family 1A member 1; MUC16: mucin 16; IL-6: interleukin 6; WIF1: Wnt inhibitory factor 1; TTN: titin; AMACR: alpha-methylacyl-coenzyme A racemase; GNMT: glycine N-methyltransferase; RASD1: ras-related dexamethasone induced 1; ANPEP: alanyl aminopeptidase, membrane; NPY: neuropeptide Y; IL-1A: interleukin 1A; RYR2: ryanodine receptor 2; PENK: proenkephalin; EGR3: early growth response 3; C15orf48: chromosome 15 open reading frame 48

  112 in total

1.  DNA methylation silences miR-132 in prostate cancer.

Authors:  A Formosa; A M Lena; E K Markert; S Cortelli; R Miano; A Mauriello; N Croce; J Vandesompele; P Mestdagh; E Finazzi-Agrò; A J Levine; G Melino; S Bernardini; E Candi
Journal:  Oncogene       Date:  2012-02-06       Impact factor: 9.867

2.  The important role of glycine N-methyltransferase in the carcinogenesis and progression of prostate cancer.

Authors:  Yoo Hyun Song; Masaki Shiota; Kentaro Kuroiwa; Seiji Naito; Yoshinao Oda
Journal:  Mod Pathol       Date:  2011-05-13       Impact factor: 7.842

3.  MTOR regulates the pro-tumorigenic senescence-associated secretory phenotype by promoting IL1A translation.

Authors:  Remi-Martin Laberge; Yu Sun; Arturo V Orjalo; Christopher K Patil; Adam Freund; Lili Zhou; Samuel C Curran; Albert R Davalos; Kathleen A Wilson-Edell; Su Liu; Chandani Limbad; Marco Demaria; Patrick Li; Gene B Hubbard; Yuji Ikeno; Martin Javors; Pierre-Yves Desprez; Christopher C Benz; Pankaj Kapahi; Peter S Nelson; Judith Campisi
Journal:  Nat Cell Biol       Date:  2015-07-06       Impact factor: 28.824

Review 4.  The fundamental role of epigenetic events in cancer.

Authors:  Peter A Jones; Stephen B Baylin
Journal:  Nat Rev Genet       Date:  2002-06       Impact factor: 53.242

5.  Identification of ryanodine receptor isoforms in prostate DU-145, LNCaP, and PWR-1E cells.

Authors:  Sarah E Kobylewski; Kimberly A Henderson; Curtis D Eckhert
Journal:  Biochem Biophys Res Commun       Date:  2012-07-27       Impact factor: 3.575

6.  Differentiating cutaneous squamous cell carcinoma and pseudoepitheliomatous hyperplasia by multiplex qRT-PCR.

Authors:  Albert Su; Seong Ra; Xinmin Li; Jian Zhou; Scott Binder
Journal:  Mod Pathol       Date:  2013-05-24       Impact factor: 7.842

7.  Global and focused transcriptional profiling of small molecule aminopeptidase N inhibitor reveals its mechanism of angiogenesis inhibition.

Authors:  Joong Sup Shim; Hyo Mi Park; Jiyong Lee; Ho Jeong Kwon
Journal:  Biochem Biophys Res Commun       Date:  2008-04-14       Impact factor: 3.575

8.  Prostate cancer gene expression signature of patients with high body mass index.

Authors:  S Sharad; A Srivastava; S Ravulapalli; P Parker; Y Chen; H Li; G Petrovics; A Dobi
Journal:  Prostate Cancer Prostatic Dis       Date:  2010-11-09       Impact factor: 5.554

9.  Post genome-wide association studies functional characterization of prostate cancer risk loci.

Authors:  Junfeng Jiang; Weirong Cui; Wanwipa Vongsangnak; Guang Hu; Bairong Shen
Journal:  BMC Genomics       Date:  2013-12-09       Impact factor: 3.969

10.  Pancan-meQTL: a database to systematically evaluate the effects of genetic variants on methylation in human cancer.

Authors:  Jing Gong; Hao Wan; Shufang Mei; Hang Ruan; Zhao Zhang; Chunjie Liu; An-Yuan Guo; Lixia Diao; Xiaoping Miao; Leng Han
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.