| Literature DB >> 29434467 |
Yanina Natanzon1, Madalene Earp1, Julie M Cunningham2, Kimberly R Kalli3, Chen Wang1, Sebastian M Armasu1, Melissa C Larson1, David Dl Bowtell4,5, Dale W Garsed5,6, Brooke L Fridley7, Stacey J Winham1, Ellen L Goode1.
Abstract
High-grade serous ovarian cancer (HGSOC) is a complex disease in which initiation and progression have been associated with copy number alterations, epigenetic processes, and, to a lesser extent, germline variation. We hypothesized that, when summarized at the gene level, tumor methylation and germline genetic variation, alone or in combination, influence tumor gene expression in HGSOC. We used Elastic Net (ENET) penalized regression method to evaluate these associations and adjust for somatic copy number in 3 independent data sets comprising tumors from more than 470 patients. Penalized regression models of germline variation, with or without methylation, did not reveal a role in HGSOC gene expression. However, we observed significant association between regional methylation and expression of 5 genes (WDPCP, KRT6C, BRCA2, EFCAB13, and ZNF283). CpGs retained in ENET model for BRCA2 and ZNF283 appeared enriched in several regulatory elements, suggesting that regularized regression may provide a novel utility for integrative genomic analysis.Entities:
Keywords: Elastic Net penalized regression; high-grade serous ovarian cancer; tumor DNA methylation
Year: 2018 PMID: 29434467 PMCID: PMC5802704 DOI: 10.1177/1176935118755341
Source DB: PubMed Journal: Cancer Inform ISSN: 1176-9351
Penalized regression (ENET) model[a] of gene-based DNA methylation association with gene expression in TCGA, AOCS, and Mayo Clinic data sets of high-grade serous ovarian cancer.
| Gene name | Ensembl ID | Chromosome | Transcript length, (kb) | ENET methylation model | ||
|---|---|---|---|---|---|---|
| TCGA (N = 339)[ | AOCS (N = 78)[ | Mayo Clinic (N = 54)[ | ||||
|
| ENSG00000143951 | 2 | 706.5 | .047 | .079 | .008 |
|
| ENSG00000170465 | 12 | 5.3 | .005 | .026 | .066 |
|
| ENSG00000139618 | 13 | 80.8 | .047 | .099 | .052 |
|
| ENSG00000178852 | 17 | 118 | .010 | .078 | .007 |
|
| ENSG00000167637 | 19 | 24.7 | .007 | .075 | <.001 |
|
| ENSG00000176399 | 9 | 8.9 | .047 | .014 | >.10 |
|
| ENSG00000255398 | 12 | 2.1 | .043 | .081 | >.10 |
|
| ENSG00000167914 | 17 | 24.5 | .020 | .088 | >.10 |
|
| ENSG00000260727 | 16 | 0.5 | .047 | >.10 | — |
|
| ENSG00000121270 | 16 | 80.7 | <.001 | >.10 | — |
|
| ENSG00000154265 | 17 | 82.9 | .018 | >.10 | — |
Abbreviations: AOCS, Australian Ovarian Cancer Study; ENET, Elastic Net; TCGA, The Cancer Genome Atlas.
Adjusted for gene copy number.
P values adjusted for multiple testing. Only results with <.05 P value tested in AOCS data set.
P values not adjusted for multiple testing. Only results with <.10 P value tested in Mayo Clinic data set.
P values not adjusted for multiple testing.
— indicates not tested.
Figure 1.Distribution of BRCA2 CpG locations by retention status in ENET methylation models.
CpG location represent gene region feature category describing the CpG position from UCSC: TSS1500 = 200 to 1500 bases upstream of transcription start site (TSS);
TSS200 = 0 to 200 bases upstream of TSS;
5′UTR = within the 5′ untranslated region, between the TSS and ATG start site;
1stExon = within first exon of the canonical transcript; body = between the ATG and stop codon, irrespective of the presence of introns, exons, TSS, or promoters;
3′UTR = between the stop codon and polyA signal;
Non_gene = region outside of all region listed above;
Annotation acquired from Illumina Infinium HumanMethylation450 v1.2 manifest file, column “UCSC_RefGene_Group”;
The P value presented represents results of Fisher exact test of CpG location (gene body vs other location);
CpG retention in model (retained vs not retained). CpG counts are provided in Supplemental Table 1.