Literature DB >> 22570537

Integrated Analysis Reveals hsa-miR-142 as a Representative of a Lymphocyte-Specific Gene Expression and Methylation Signature.

Bill Andreopoulos1, Dimitris Anastassiou.   

Abstract

Gene expression profiling has provided insights into different cancer types and revealed tissue-specific expression signatures. Alterations in microRNA expression contribute to the pathogenesis of many types of human diseases. Few studies have integrated all levels of gene expression, miRNA and methylation to uncover correlations between these data types. We performed an integrated profiling to discover instances of miRNAs associated with a gene expression and DNA methylation signature across multiple cancer types. Using data from The Cancer Genome Atlas (TCGA), we revealed a concordant gene expression and methylation signature associated with the microRNA hsa-miR-142 across the same samples. In all cancer types examined, we found a signature of co-expression of a gene set R and methylated sites M, which correlate positively (M+) or negatively (M-) with the expression of hsa-miR-142. The set R consistently contains many genes, such as TRAF3IP3, NCKAP1L, CD53, LAPTM5, PTPRC, EVI2B, DOCK2, LCP2, CYBB and FYB. The signature is preserved across glioblastoma, ovarian, breast, colon, kidney, lung, uterine and rectum cancer. There is 28% overlap of methylation sites in M between glioblastoma (GBM) and ovarian cancer. There is 60% overlap of genes in R between GBM and ovarian (P = 1.3e(-11)). Most of the genes in R are known to be expressed in lymphocytes and haematopoietic stem cells, while M reflects membrane proteins involved in cell-cell adhesion functions. We speculate that the hsa-miR-142 associated signature may signal haematopoietic-specific processes and an accumulation of methylation events triggering a progressive loss of cell-cell adhesion. We also observed that GBM samples belonging to the proneural subtype tend to have underexpressed hsa-miR-142 and R genes, hypomethylated M+ and hypermethylated M-, while the mesenchymal samples have the opposite profile.

Entities:  

Keywords:  cancer; correlation; gene expression; integrated analysis; methylation; microRNA

Year:  2012        PMID: 22570537      PMCID: PMC3306237          DOI: 10.4137/CIN.S9037

Source DB:  PubMed          Journal:  Cancer Inform        ISSN: 1176-9351


Introduction

Gene expression regulation through mechanisms that involve microRNAs and epigenetics (DNA methylation) has attracted much attention recently.1 MiRNAs are a class of small RNA molecules that target mRNAs, causing translation repression. MiRNAs regulate genes associated with different biological processes, such as apoptosis, stress response, or tumorigenesis.2 In the context of cancer, miRNAs have emerged as new molecular players involved in carcinogenesis. Deregulation of miRNAs has been shown in glioblastoma, colon and ovarian cancer.3,4 Recently, it was shown that miR-10b is upregulated in gliomas, even though it is not expressed in healthy human brain tissue.5 Distinguishing cancer subtypes on the basis of miRNA expression may help to personalize therapies; this would enable doctors to better match a patient with the treatment the patient is likely to respond to with fewest side effects.6–11 Expression of genes is affected by miRNA expression and DNA methylation, which are known to regulate each other in both directions.12 MiRNAs are known to be targets of epigenetic regulation, as well as to target the epigenome, allowing for self-regulatory loops. One third of all human miRNA genes have a CpG island in the upstream region, suggesting epigenetic regulation of miRNA expression.13–15 On the other hand, several miRNAs are known to drive methylation signatures via interactions with DNA methyltransferases (DNMTs) and RBL2 proteins16,17 DNMTs are responsible for the methylation of the CpG islands of genes in an RBL2-dependent manner. The over expression of DNA methyltransferases (DNMTs) is often a poor prognostic indicator for cancer, since they induce hypermethylation and underexpression of tumor suppressors.18,19 It still remains largely unknown how, exactly, miRNAs affect genome methylation at the epigenetic level. The precise epigenetic mechanisms underlying the alteration of miRNA expression also remain largely unknown. Few studies have performed an integrated analysis of gene expression, miRNA expression and methylation using data from the same samples.2,20–24 In this study we explore relations of these three data types, in terms of their correlation to one another. Our goal was to identify particular miRNAs having strong and consistent correlative relationships with gene expression and methylation. We focused on integrated cancer datasets that are available from The Cancer Genome Atlas (TCGA) repository. The TCGA Research Network was established to generate the comprehensive catalog of genomic abnormalities driving tumorigenesis.25,26 We found that hsa-miR-142 is consistently associated with a gene expression and methylation signature in all cancer datasets.

Methods

Our analysis involved a computational method to identify miRNA, methylation sites and genes of potential importance in a set of samples, based on correlations among the different data types across samples.27 We performed an integrated correlation analysis of the miRNA, methylation and gene expression data that is available for glioblastoma and ovarian cancer. Our method’s aim is to evaluate the correlations between methylation-miRNA and mRNA-miRNA. For example, the correlation of miRNA Mi’s and gene Ri’s expression (across the samples for which both miRNA and gene expression are available) would show if there is a positive or negative correlation between Mi and Ri.28 We downloaded methylation, miRNA and gene expression data for glioblastoma and ovarian cancer, which was available on The Cancer Genome Atlas (TCGA) repository in April 2011. We used the level 3 data from TCGA and filtered out the healthy samples, such that we considered only tumor samples. For gene and miRNA expression the level 3 data represent the expression for particular genes or miRNAs per sample. For methylation the level 3 data represent the methylated sites/genes per sample. Level 3 data was derived after the raw signals per probe (level 1) were normalized per probe set (level 2) and then averaged for a gene or miRNA. Supplementary file 6 shows the number of tumor samples that were available on TCGA for each combination of data types. Additionally, for breast, colon, kidney, uterine and rectum cancer we downloaded the sequencing-based RNASeq and miRNASeq level 3 data available on TCGA in November 2011. This data represents the calculated expression calls for genes and miRNAs per sample. It is derived after calculating the expression signal for all reads aligning to a particular gene or miRNA (level 1) and normalizing the reads with TMM normalization (level 2).

Discretization of gene expression, miRNAs and methylation

We performed a pair wise correlation analysis between the different data types, as Figure 1 shows. Initially, the algorithm compares the sample-specific methylation of each gene with the expression level of each miRNA, across all samples. To evaluate the pairwise Pearson correlation between different data types (gene expression, miRNA expression and methylation), we discretized all three data types. A gene’s expression (similarly, a methylation site’s or miRNA’s) took a discrete value of −1, 0, or 1 for a sample, representing underexpression (hypomethylation), moderate, or overexpression (hypermethylation) in the sample. The discretization is based on whether the gene’s (or methylation site’s) expression is at least one standard deviation away from the gene’s mean value over all samples. We represented a gene (or miRNA or methylation site) as a vector of discrete values across all samples. Discretization of the values serves the following two purposes: (1) it allows us to compare two data types on a common basis, ie, whether there is aberrant expression or methylation observed in the same samples. (2) It allows us to filter out high correlations if there is moderate (non-extreme) expression/methylation (0) in most or all of the samples.
Figure 1

Preprocessing: We discretized the expression and methylation data to determine if a gene is under, over, or moderately (non-extremely) expressed/methylated in each sample. To turn a gene (or miRNA or methylation site) into a discrete vector over all tumor samples, we evaluated the gene expression’s mean value and standard deviation over all samples. Then, if the gene had value greater than the mean plus standard deviation in a sample, we represented it as 1 (over-expressed or hypermethylated). If the gene had value lower than the mean minus standard deviation in a sample, we represented it as −1 (under-expressed or hypomethylated). Otherwise, the gene was represented as 0 in the sample. We used the discrete vector representation of each miRNA’s expression and methylation over all samples (either gbm or ovarian) from the preprocessing step. Step 1: We evaluated the Pearson correlation between all pairs of miRNAs and methylation sites in glioblastoma and ovarian cancer. Then, we ranked all pairwise correlations in descending order, as shown. Step 2: We kept the top ranked miRNAs for ovarian and gbm. A condition was that the Bonferroni-corrected P-value, derived from a two-tailed t-test that evaluated the Pearson correlation, should be less than 0.01. Step 3: We found the miRNAs appearing in the top ranks in both ovarian and gbm and we selected the best miRNA as representative. Step 4: Using the best miRNA as representative, we found the top correlated methylation sites and genes in gbm and ovarian. We refer to the resulting sets as M_gbm, M_ovarian (methylation sites) and R_gbm, R_ovarian (gene expression).

Our analysis method differs from that of29 who used z-scores for discretizing the data. The z-scores indicate how many standard deviations a value is above or below the mean; previous studies discretized a gene expression value if z-score >2 or <−2. The study then used Fisher’s exact P-value to evaluate correlations between gene expression and mutations by assigning an exact P-value to a correlation, considering only up or only down regulation, and keeping P < 0.01. Pearson correlation, on the other hand, allows us to consider both up and down regulation for a pair, offering us two benefits. Using the Pearson correlation allows us to find negative correlations, as well as positive ones, such as concordantly over-expressed genes and hypomethylated sites. Moreover, Pearson correlation allows us to find negative correlations and it allows us to find subclasses of samples defined by opposite expression/methylation patterns (such as, the M+ and M− patterns). Since we are not dealing with somatic mutations, we believe Pearson correlation is a more suitable choice than Fisher’s exact P-value for our analysis. To evaluate the statistical significance of the Pearson correlation between miRNA and gene expression (or methylation) we used a two-tailed t-test. The t-test was based on the Student’s t-distribution with n-2 degrees of freedom, where n is the number of samples. The t-test was two-tailed since either positive or negative correlation may be of interest. We performed Bonferroni-correction by multiplying the P-value with the number of genes or methylation sites. We set a threshold of 0.01 for the Bonferroni-corrected P-value. Additionally, we made one million random permutations of the samples and evaluated the correlation from the permutated data; a correlation higher than the original is a false discovery. We applied a cutoff false discovery rate of 0.01, such that if the calculated FDR was greater than 0.01, the original correlation was rejected as false. In other words, the FDR was derived by counting how often the permutated absolute correlation was at least as high as the original correlation.

Association with gene ontology

Gene ontology enrichment was assessed using the Database for Annotation, Visualization, and Integrated Discovery—DAVID.30 We also used the Gene Set Enrichment Analysis tool by the Broad Institute to evaluate the annotation enrichment of M and R. The full results are given in Supplementary files 12 and 13.31

Association with proneural and mesenchymal GBM subclasses

We used the Suppl. Info from26 to associate R and M genes to different glioblastoma subclasses, as well as tumor samples to proneural, neural, classical or mesenchymal subclasses. We used Suppl. Table S3 to retrieve the genes highly expressed in different subclasses. We used Table S7 to retrieve samples with specific subclass associations.

Results

We first examined the relative significance of all miRNAs in the context of all miRNA-gene expression (or miRNA-methylation) Pearson correlation coefficient values. For this purpose, we first matched miRNA with gene expression data (or miRNA with methylation data) on the same samples. Then, we created histograms covering all miRNA-gene expression and miRNA-methylation correlation values, including all 17,814 genes and 27,578 methylation sites, over multiple cancers. We found that hsa-miR-142 stands out significantly among other miRNAs as having higher correlation values with more genes and methylation sites. Figure 2 shows the resulting histograms after averaging the correlation coefficients for miRNA-gene expression and miRNA-methylation across all cancer types available. Highlighted in red color are the occurrences where the histogram involving hsa-miR-142 lies outside the envelope of all other histograms. A significant amount of such histogram “tails” exist at the right side of the gene expression histograms (R), and at both sides of the methylation histograms (M+ and M−). Supplementary files 1–2 give the lists of the highest and lowest correlations for hsa-miR-142- methylation and hsa-miR-142-gene expression in all cancer types (corrected P-value <0.01). Supplementary file 5 shows the miRNA-gene expression correlation histograms for individual cancer types.
Figure 2

Left: Histogram of the miRNA-gene expression correlations for cancer types that had miRNA and gene expression data (RNASeq and miRNASeq) available. We matched the miRNA-gene expression data on the same samples. Since we did not find a significant negative miRNA-gene correlation, the left graph shows just positive values. Right: the miRNA-methylation correlations for GBM and ovarian cancer (the other TCGA cancer types lacked integrated miRNA-methylation data). We matched the miRNA-methylation data on the same samples. We plotted the miRNAs that appear in all cancer types, which resulted in 680 miRNAs and 421 miRNAs, respectively. For each miRNA we included the correlation values for all 17,814 genes or 27,578 methylation sites. We averaged the correlations over all cancer types to determine if a correlation remains consistently high in all cancers. As shown, the miRNA hsa-miR-142 is highly correlated with a larger set of genes or methylation sites than other miRNAs.

Core of the signature: R and M

After we determined the significance of hsa-miR-142 in terms of high correlation with a set of genes and methylation sites across multiple cancer types, our aim is to define the sets of genes R and methylation sites M with which hsa-miR-142 is most frequently associated. In glioblastoma and ovarian cancer, we derived the sets R and M by ranking the genes and methylation sites in descending order by their correlation with hsa-miR-142. We derived R for breast, colon, kidney, uterine and rectum cancer by correlating the RNASeq with the miRNASeq sequencing-based data available on TCGA. Then, R and M consisted of the genes or methylation sites with a Bonferroni-corrected P-value less than 0.01. We derived M for breast, colon, kidney, uterine, rectum and lung cancer by using a metagene approach since sequencing-based methylation data was not available to correlate with the miRNASeq data. We first aggregated the 100 genes that had the highest average correlation with has-miR-142 in GBM and ovarian; the average value of these 100 genes over all samples in a cancer gave the first metagene. In each of 11 cancers, we ranked all genes based on their mutual information with the metagene. Then, we averaged the mutual information of the genes with the metagene across all 11 cancers and took the 15 top genes to create a second metagene. We ranked the correlation of this second metagene to all methylation sites in breast, colon, kidney, lung, rectum and uterine cancer. Supplementary files 7–10 give the metagene and the derived M.

M and R signature and overlaps between cancers

Figure 3 illustrates the M+ and M− signatures in glioblastoma and ovarian cancer. Sites positively correlated with hsa-miR-142 are M+, while negatively correlated sites are M−. The M sets shown consist of the methylation sites that have a Bonferroni-corrected P-value of less than 0.01 with hsa-miR-142 over the samples. This heatmap is the result of step4 of our analysis (see Methods section).
Figure 3

Results of step4 of our analysis method (see Methods section). Left: The methylation sites that are most correlated with hsa-miR-142, either positively or negatively, in glioblastoma (M_gbm). Right: M_ovarian in ovarian cancer. We distinguish the signature M into the methylation sites having positive correlation with hsa-miR-142 (M+) and those with negative correlation (M−). The overlap of methylation sites between M+_gbm (236) and M+_ovarian (471) is 76, while the overlap between M−_gbm (259) and M−_ovarian (126) is 63.

Tables 1–2 show the overlaps of the R and M signatures between all cancer types considered. As shown, we found a significant overlap of M and R between different cancer types. We matched the precise probes of the methylation sites in M between different cancers.
Table 1

Overlap of R between cancers.

R_GBMR_COADR_BRCAR_UCECR_READR_KIRC
R_OV (1338)67122427938282629
P = 1.3e-11P = 1.7e-11P = 4e-11P = 5.4e-11P = 3.1e-11P = 3e-11
R_GBM (1106)15519427161453
P = 0P = 0P = 2e-11P = 0P = 0
R_COAD (289)19722485254
P = 1.8e-11P = 3.3e-11P = 2.2e-12P = 0
R_BRCA (404)30578368
P = 4e-11P = 6.3e-12P = 0
R_UCEC (486)87441
P = 3.7e-11P = 3.5e-11
R_READ (101)88
P = 0
R_KIRC (1325)

Notes: The parentheses show the number of genes in R for each cancer type. The cells show the R overlap sizes between different cancer types and the p-values of the overlaps using the hypergeometric cumulative distribution function.

Table 2

Overlap of M signatures between cancers.

M_GBMM_COADM_BRCAM_UCECM_READM_KIRCM_KIRPM_LUSC
M_OV (597)139341375143682688351
P = 1e-11P = 4e-12P = 1e-11P = 1e-11P = 0P = 1e-11P = 0P = 0
M_GBM (495)19024155311334156
P = 1e-11P = 3e-12P = 3e-11P = 0P = 3e-12P = 1e-5P = 0
M_COAD (1749)7051431314039628
P = 1e-11P = 1e-11P = 0P = 1e-11P = 1e-10P = 0
M_BRCA (1787)155984667709
P = 2e-11P = 0P = 1e-11P = 4e-7P = 0
M_UCEC (184)391215149
P = 0P = 4e-11P = 3e-9P = 3e-12
M_READ (131)73486
P = 0P = 9e-8P = 0
M_KIRC (866)9361
P = 0P = 0
M_KIRP (10)7
P = 4e-8
M_LUSC (1258)

Notes: The parentheses show the number of methylation sites in M for each cancer type. The cells show the M overlap sizes between different cancer types and the P-values of the overlaps using the hypergeometric cumulative distribution function.

We used P-values (matlab’s hygecdf function) to evaluate the significance of the overlap sizes of M and R between different cancer types, getting small P-values near zero, as Tables 1–2 show. We also evaluated the significance of the correlations between miRNA, methylation and gene expression by using permutation-based P-values. This involved repeatedly shuffling the values of one of the data types over all samples and re-computing the correlations over one million trials; the P-values were always near zero, which points to the correlations’ significance. We further evaluated whether the R and M associations were specific for the particular genes and methylation sites in GBM. For this purpose, we reversed roles by looking for any correlation of expression of M genes with methylation of R genes. When we looked for correlation of CX3CL1 gene expression (rather than methylation) with the methylation of any genes in R_GBM, there was always zero correlation. This supports that the results are specific to M methylation and R gene expression and hsa-miR-142 is a good representative of the signature.

R is enriched in glioblastoma mesenchymal subclasses

We found the signature profile to reflect two genetically distinct subclasses, which in glioblastoma tend to correspond to the previously identified proneural and mesenchymal subclasses. Table 3 shows how the miRNAs and expressed genes (R) are differentially expressed between the different glioblastoma subclasses. The miRNA hsa-miR-142 and genes R were distinctly expressed between proneural-mesenchymal glioblastoma subclasses. Our integrated analysis revealed that the signature differs between glioblastoma subclasses in that the miRNA hsa-miR-142 and genes R are underexpressed in proneural and overexpressed in mesenchymal. The majority of samples with over-expressed R and hsa-miR-142 belong to the mesenchymal subclass and the P-value of the distributions illustrates the significance.
Table 3

The distribution of glioblastoma samples between proneural, neural, classical and mesenchymal classes according to R gene and miRNA expression.

R genes and hsa-miR-142 expressionProneuralNeuralClassicalMesenchymalMultinomial probability
Over-expressed242255.49178E-09
Under-expressed122500.001937516

Notes: We observed a significant difference between the classes: in proneural R is under-expressed, while in mesenchymal R is over-expressed. The multinomial probability in the last column is the total probability under the null hypothesis that at least 25 out of 33 over-expressed samples (or at least 12 out of 19 under-expressed samples) would have been classified in any one of the four classes.

We examined the overlap of the R_GBM genes and the R_ovarian genes with the mesenchymal (216), classical (162), proneural (178) and neural (129) genes from (Verhaak and others 2010).26 182 of the R_GBM genes are found in the mesenchymal genes and 122 of the R_ovarian genes are found in the mesenchymal genes. There was significantly less overlap of R_GBM and R_ovarian with proneural genes (53 and 15, respectively), neural genes (3 and 10) or classical genes (8 and 11). Therefore, the genes in R significantly overlap with the genes in26 that distinguish between proneural vs. mesenchymal subtypes in glioblastoma. The P-values of the overlaps are <0.0001.

Discussion

Table 4 gives the functional annotations of the top-ranked genes in R; we derived these genes as shown in Supplementary files 8–9. The genes R are all expressed in human hematopoietic stem cells or leukocytes. Many of these genes are involved in regulating T-cell activation; T-cell is a type of lymphocyte. The gene set R is particularly enriched in genes known to be preferentially expressed in T-cell differentiation stages.32 Some of the R genes represent membrane proteins involved in cell-cell adhesion in leukocytes. Lymphocytes are known to play a role in cancer, but their role remains quite controversial.33 Lymphocytes are generally known to detect and eliminate cancer cells, but on the other hand infiltration of lymphocytes, such as B cells and T cells, into cancer can be an indication of poor prognosis (recurrence and metastasis). It was shown that lymphocytes that make the RANKL inflammatory protein are critical for metastasis. No metastasis was found in mice without lymphocytes, but adding RANKL in mice restored the potential for metastasis compensating for the function of lymphocytes.34
Table 4

The known functions of the top-ranked genes in R.

SASH3 (CXorf9)Signaling adapter protein in lymphocytesCD2Cell adhesion molecule found on the surface of T cells and natural killer cells
CCL5Regulated upon activation, normal T-cell expressedCD37Transmembrane protein, leukocyte antigen, may play a role in T-cell-B-cell interactions
FLJ21438Proteins in B-cell exosomesDOK3Negative regulator of JNK signaling in B-cells
RgrMembrane protein, retinal G-protein coupled receptorPSCD4Plasma membrane, regulation of cell adhesion
TLR2Membrane protein, immune system signaling pathwayITGB2Leukocyte cell adhesion molecule
APBB1IPPeripheral membrane protein, mediates Rap1-induced adhesionBTKPeripheral membrane protein, plays a crucial role in B-cell development (mature B lymphocytes)
DOCK2Peripheral membrane protein, haematopoietic and lymphocyte cell-specific proteinCXCR3Membrane protein, expressed primarily on activated T lymphocytes and NK cells, regulate leukocyte trafficking
ARHGAP9Regulates adhesion of hematopoietic cells to the extracellular matrixFYBSignaling transduction in T cells, modulates the expression of interleukin-2
LCP2Lymphocyte protein promoting T cell developmentPTPRCRequired for T-cell activation. Interleukin-12-dependent in activated lymphocytes
CD53Leukocyte surface antigen, signal transduction in T cellsLAIR1Leukocyte-associated receptor, found on NK cells, T cells, and B cells
TRAF3IP3Gene expressed in t-lyphocytesURP2Cell adhesion in hematopoietic cells. Required for leukocyte adhesion to endothelial cells
CYBBGlycoprotein integral to plasma membraneAIF1Promotes the proliferation of T-lymphocytes. Enhances lymphocyte migration
NCKAP1LMembrane-associated haematopoietic proteinITGB2Leukocyte cell adhesion molecule
IL10RAInterleukin-10 receptor in membrane proteins, expressed in hemopoietic cells and lymphocytesHAVCR2T-cell membrane protein
LAPTM5Transmembrane protein associated with lysosomes, may play a role in hematopoiesisCD48Ligand for CD2. Might facilitate interaction between activated lymphocytes. Probably involved in regulating T-cell activation
PLEKHemopoietic progenitor cell differentiationSLANegatively regulates T-cell receptor (TCR) signaling
ARHGAP30Rho GTPase activating protein 30CCR5Integral membrane protein, mainly expressed on T cells
ARHGAP25Actin remodeling, cell migrationEVI2BIntegral membrane protein, bone marrow and blood expression

Note: All of the genes are expressed in leukocytes, specifically lymphocytes, or haematopoietic stem cells.

Table 5 shows the methylation sites that appear in M for both ovarian cancer and GBM. Similar to R, most of M consists of trans-membrane proteins, which are related to cell-cell adhesion functions. For example, SSTR3, SSTR5, TMEM149, are all trans-membrane proteins with cell-cell adhesion functions. A few of the proteins are also involved in signal transduction. The most prominent correlation was between hsa-miR-142 and the CX3CL1 methylation site, which was shown to control cell invasion in glioma.35 Many of the genes in R are also found in the M− set, which suggests that their undermethylation affects their expression. In recent work, hsa-miR-142 was shown to be expressed in hematopoietic and leukocytic cells, which supports the miRNA’s association with R and M.27,36–39
Table 5

Functional annotations of the methylation sites that overlap between M_GBM and M_ovarian.

AFF3 (M)Expressed in the lymphoid system, transcription regulationLAPTM5 (M−)Transmembrane receptor associated with lysosomes
ALDH3A1 (M+)Metabolism of neurotransmittersLCP2 (M−)Lymphocyte protein promoting T cell development
ARHGAP25 (M−)Actin remodeling, cell migrationLRP3 (M+)Integral membrane protein
BIN2 (M−)Breast cancer-associatedLSM7 (M+)Ribonucleoprotein complex
C10orf27 (M−)Cell differentiationLY86 (M−)Inflammatoy response, apoptosis
C16orf54 (M−)Transmembrane proteinMAMSTR (M+)Transcription regulation
C2orf40 (M−)Cancer-related augurin precursorMPHOSPH9 (M−)Peripheral membrane protein
C6orf25 (M−)Plasma membrane-bound cell surface receptorMTMR11 (M +)Protein-tyrosine phosphatase
CARD8 (M−)Apoptotic proteinNCF2 (M−)Superoxide-generating NADPH oxidase activity
CCDC80 (M+)Promotes cell adhesion and matrix assemblyNCOR2 (M+)Transcriptional repression
CD101 (M−)Leukocyte surface membrane proteinOGG1 (M+)DNA repair enzyme
CD6 (M−)Plasma membrane protein involved in T-cell activationOSM (M−)Tumor inhibitor
CD79B (M−)B lymphocyte receptorPAQR6 (M+)Integral membrane protein
CHRM1 (M+)G protein-coupled receptor membrane proteinPHKG1 (M+)Protein kinase activity
CX3CL1 (M+)T cell leukocyte adhesion and migration process at the endotheliumPLD4 (M−)Single-pass membrane protein
DAPP1 (M−)Peripheral membrane protein, B lymphocyte adapter proteinPLEKHA4 (M+)Peripheral Membrane protein
DAPK2 (M+)Cell apoptosis inducerPOR (M−)ER membrane oxidoreductase
DDAH1 (M+)Regulator of nitric oxide generationPPP2R1A (M+)Protein phosphatase
DNAI1 (M+)Dynein intermediate chain, cytoplasmicPRELP (M+)Extracellular matrix, collagen binding in connective tissue
FBN3 (M+)Extracellular matrix structural constituent, fibrillin.PTGFRN (M+)Integral membrane protein, Single-pass type I membrane protein
FAM113B (M−)Hydrolase activityPTPRCAP (M−)Transmembrane phosphoprotein, plasma membrane, integral membrane protein
FAM78A (M−)Hypothetical proteinRB1 (M−)Tumor suppressor, negative regulator of the cell cycle
FGR (M−)Cell migration and adhesionROBO4 (M−)External side of plasma membrane
FUT3 (M+)Membrane protein, tumor metastasis and adhesionRPE65 (M+)Plasma membrane protein
GGT1 (M+)Membrane proteinRUNX1 (M−)Acute myeloid leukemia 1 protein
GIPC1 (M+)Regulator cell surface receptor, traffickingSEMA3B (M−)Extracellular membrane, Neuronal development, tumor suppression by apoptosis induction
GPX2 (M−)Glutathione peroxidaseSHROOM1 (M−)Neuronal development
PLEK (M−)Hemopoietic progenitor cell differentiationSLA (M−)Negative regulator T-cell receptor (TCR) signaling
AFF3 (M−)Expressed in the lymphoid system, transcription regulationLAPTM5 (M−)Transmembrane receptor associated with lysosomes
GRIP1 (M+)Glutamate receptor-interacting protein 1SLC44A2 (M+)Plasma membrane protein
HKDC1 (M+)Hexokinase domain-containing protein 1SNCG (M+)Breast cancer-specific gene 1 protein
IL17RE (M−)Membrane protein, interleukin receptorSSTR3 (M+)Plasma membrane protein
IL18BP (M−)Extracellular bindingSSTR5 (M+)Plasma membrane protein
IL22RA1 (M+)Membrane protein, interleukin receptorTMEM149 (M−)Transmembrane protein
INCA1 (M+)Inhibitor of CDKTNFAIP8L2 (M−)Tumor necrosis factor, immune homeostasis
INPP5J (M+)Plasma membrane proteinTNKS1BP1 (M+)Tankyrase-1-binding protein, enzyme binding
KCNQ1 (M+)Potassium voltage-gated channelTRAF1 (M−)TNF receptor associated factor
KIAA0427 (M+)Regulation of translational initiationWFDC2 (M−)Extracellular region, proteolysis
KLHL34 (M−)Kelch-like protein 34ZNF205 (M+)Zinc finger protein, transcription regulation
KLHL6 (M−)Protein binding, B-lymphocyte antigen receptor signalingZNF48 (M+)Zinc finger protein, transcription regulation
LAMB2 (M+)Basement membrane protein, attachment, migration and organization of cells into tissues during embryonic developmentZNF512B (M+)Zinc finger protein, transcription regulation

Notes: Many of the genes in R are also found in the M− set and the names of these genes are highlighted in bold.

We examined the annotation enrichment of the 139 M methylation sites and 671 R genes that overlap between ovarian cancer and GBM, using the Broad Institute’s Gene Set Enrichment Analysis tool. There was an overlap with several modules that were previously found to be implicated in cancer.40 The enriched Gene Ontology annotations included plasma membrane, response to wounding, cell-cell adhesion, extracellular space, cell-surface receptor signal transduction, inflammatory response, oxidoreductase activity. The P-values of the overlaps of M and R with these modules is <1.0e-5. These results suggest the methylation of M+ and M−, as well as miRNA and gene expression, may play a role in progressive disruption of adhesion and junction molecules and loss of cell adhesion components. In several types of cancer, disruption of cell-cell adhesion molecules is a hallmark in phenotypes such as the epithelial-mesenchymal transition and invasion.41 Table 6 shows the top clusters for a functional annotation clustering of the 671 R genes that overlap between ovarian cancer and GBM, as derived with the DAVID tool.30 Inflammatory and defense response are the most significant annotations, which is consistent with the anti-inflammatory functions of the interleukin genes found in the top-ranked R genes (see Table 4). Integral membrane and transmembrane annotations are also among the most enriched annotations. Additionally, we examined the R genes in the Broad Institute’s Gene Set Enrichment Analysis (GSEA) tool, finding that one third of the genes overlap with sets of known membrane proteins. Several subsets of these genes were previously implicated in breast cancer (74 up regulated and 43 down regulated), liver cancer (41), bladder cancer (59 up regulated) and thyroid cancer (38 up regulated). The P-values of the overlaps is low. Supplementary files 12–13 give the complete functional annotation clusters for R as derived using DAVID, as well as the enriched annotations in M using the Broad Institute’s GSEA tool.
Table 6

The top three functional annotation term clusters associated with the list of 671 R genes that overlap between R_ovarian and R_GBM.

Gene ontology annotationCountP-value
GO:0006952~defense response813.31E-43
GO:0006954~inflammatory response474.55E-26
GO:0009611~response to wounding541.12E-22
Disulfide bond1509.53E-37
Disulfide bond1461.09E-35
Topological domain:Extracellular1351.61E-30
Glycoprotein1741.83E-30
Glycosylation site:N-linked (GlcNAc...)1683.25E-29
Topological domain:Cytoplasmic1471.20E-27
Signal1373.38E-24
Signal peptide1376.20E-24
Membrane2029.05E-24
Receptor884.19E-22
GO:0005886~plasma membrane1621.80E-21
Transmembrane region1702.00E-21
Transmembrane1704.00E-21
GO:0031224~intrinsic to membrane1895.47E-15
GO:0016021~integral to membrane1841.11E-14
GO:0005886~plasma membrane1621.80E-21
GO:0005887~integral to plasma membrane813.86E-20
GO:0031226~intrinsic to plasma membrane811.52E-19
GO:0044459~plasma membrane part1134.82E-19

Note: Annotations are clustered together if they have similar gene members; the more common genes annotations share, the higher the chance they will be grouped together. The count shows how frequently the particular annotation occurs in a cluster and the p-value shows the likelihood that such a count or a higher one would be observed in a random cluster. The p-value associated with each annotation term inside a cluster is statistically measured by Fisher Exact in DAVID system.

Several miRNAs are known to suppress methylation via interactions with DNMTs and RBL2 proteins. DNMTs are known to be responsible for the methylation of the CpG islands of tumor suppressors in an RBL2-dependent manner.16,17 Therefore, miRNAs and their DNMT targets are known to cause epigenetic changes (or their reversal) that contribute to malignant transformation in human tumors.19 The aberrant expression of DNA methyltransferases (DNMTs) is often a poor prognostic indicator for cancer.42 We speculate that the M signature may be partially induced because hsa-miR-142 targets the DNA methyltransferases (DNMTs).31,43 Either DNMT3L or DNMT3B was part of our M+ signature in most cancers. DNMT3L was hypermethylated in mesenchymal GBM phenotype together with hsa-miR-142 overexpression. The miRNA hsa- miR-142 is known to target the DNA methyltransferases DNMT1 and DNMT3A, which are also regulated by DNMT3L.44 DNMT3L is a regulatory factor of DNA methyltransferases, which is essential for the function of DNMT3A and DNMT3B.45,46 In previous studies, the coexpression of DNMT3L with DNMT3A resulted in a striking stimulation of de novo methylation by DNMT3A.47 Whether hsa-miR-142 influences the methylation of M via its regulation of DNMTs remains to be determined in lab experiments. Additionally, hsa-miR-142 has CpG islands embedded in its promoter region.15 Genes associated with CpG islands are believed to be targeted by DNA methyltransferases and are known to be prone to methylation in cancer14 suggesting hsa-miR-142 may be regulated by epigenetics (see Supp. File 4).

Conclusion

By performing miRNA vs. gene expression and miRNA vs. methylation analysis, our work identified hsa-miR-142 as the miRNA most strongly and consistently correlated with a set of genes and methylation sites. Our results show that miRNA expression is associated with altered expression profiles of genes in R and methylated sites in M. The hsa-miR-142 miRNA is the best representative of this signature for all cancers (see Supp. Files 5 and 11). When we used hsa-miR-142 as a proxy in several cancers the resulting M and R overlapped significantly between cancer types. The P-value of the resulting overlaps between cancer types was close to 0. This signature appears to be associated with transformation from proneural to mesenchymal subclass in glioblastoma. The results suggest that in the mesenchymal tumor subtypes R genes tend to be over-expressed, M+ is hypermethylated, M− is hypomethylated and hsa-miR-142 is over-expressed. In proneural subtypes, on the other hand, the gene and miRNA expression and methylation are the opposite. Our study is the first that associated this signature with several cancer types, showing distinct methylation and expression patterns between the proneural and mesenchymal subclasses. All of the R genes are known to be expressed in leukocytes and haematopoietic stem cells. Many of the M genes represent membrane proteins related to cell-cell adhesion functions. We speculate that in cancer the signature may contribute to a progressive loss of cell-cell adhesion. We found either DNA methyltransferase DNMT3L or DNMT3B to be methylated and associated strongly with hsa-miR-142 expression in the cancer types examined. From a previous study, DNA methyltransferases were shown to be associated with the methylation of cell adhesion-related genes in carcinomas.41 Integrated microRNA and gene expression and methylation data for healthy human samples was unavailable at the time of analysis; in the future, a human dataset of healthy samples may provide additional insights. The supplementary material for this manuscript is available from 9037SupplementaryFiles.zip. Supp 1: The R signature in ovarian cancer, GBM, colon, kidney, uterine, rectum, breast cancer. Includes Bonferroni-corrected P-values, correlations and mutual information. Derived based on correlations with hsa-miR-142. Supp 2: The M signature in ovarian cancer and GBM. Includes Bonferroni-corrected P-values, correlations and mutual information. Derived based on correlations with hsa-miR-142. Supp 3: Joint R and M signatures averaged between ovarian cancer and glioblastoma. Supp 4: The CpG islands located in the promoters of miRNAs on the genome. Supp 5: The miRNA-gene expression correlation value histograms for individual cancer types. Supp 6: The number of samples with M and R signature in glioblastoma and ovarian cancer. Supp 7: The first metagene is the list of R genes that occur in both ovarian cancer and GBM at P-value <0.01, sorted in terms of the joint ovarian/GBM average correlation with hsa-miR-142. Supp 8: The R signature is verified using the first metagene in 11 cancers (glioblastoma, ovarian, breast, colon, kidney clear/papillary cell, lung squamous cell/adenocarcinoma, lower grade glioma, uterine, rectum). In all cancers we ranked the genes based on their correlation to the first metagene. Supp 9: The second metagene. Derived by averaging the mutual information of each gene to the first metagene (weighted by the number of samples in each cancer) over 11 cancer types and ranking the genes. Supp 10: The M signature is verified using the second metagene in glioblastoma, ovarian, breast, colon, uterine, rectum, kidney and lung cancer. In all cancers we ranked the methylation sites based on their correlation to the second metagene. Supp 11: The significance of the microRNAs is verified using the second metagene in glioblastoma and ovarian cancer. In GBM and ovarian we ranked the microRNAs based on their correlation to the second metagene. Supp 12: Functional annotation clustering of the R genes using the DAVID tool. Supp 13: Annotations enriched in the M methylation sites using the Broad Institute’s Gene Set Enrichment Analysis Tool (GSEA).
  46 in total

1.  Correlation of somatic mutation and expression identifies genes important in human glioblastoma progression and survival.

Authors:  David L Masica; Rachel Karchin
Journal:  Cancer Res       Date:  2011-05-09       Impact factor: 12.701

2.  MicroRNA-126 regulates DNA methylation in CD4+ T cells and contributes to systemic lupus erythematosus by targeting DNA methyltransferase 1.

Authors:  Sha Zhao; Yu Wang; Yunsheng Liang; Ming Zhao; Hai Long; Shu Ding; Heng Yin; Qianjin Lu
Journal:  Arthritis Rheum       Date:  2011-05

3.  A developmental taxonomy of glioblastoma defined and maintained by MicroRNAs.

Authors:  Tae-Min Kim; Wei Huang; Richard Park; Peter J Park; Mark D Johnson
Journal:  Cancer Res       Date:  2011-03-08       Impact factor: 12.701

4.  Tumour-infiltrating regulatory T cells stimulate mammary cancer metastasis through RANKL-RANK signalling.

Authors:  Wei Tan; Weizhou Zhang; Amy Strasner; Sergei Grivennikov; Jin Q Cheng; Robert M Hoffman; Michael Karin
Journal:  Nature       Date:  2011-02-16       Impact factor: 49.962

5.  Altered methylation at microRNA-associated CpG islands in hereditary and sporadic carcinomas: a methylation-specific multiplex ligation-dependent probe amplification (MS-MLPA)-based approach.

Authors:  Walter Pavicic; Esa Perkiö; Sippy Kaur; Päivi Peltomäki
Journal:  Mol Med       Date:  2011-02-09       Impact factor: 6.354

6.  A network model of a cooperative genetic landscape in brain tumors.

Authors:  Markus Bredel; Denise M Scholtens; Griffith R Harsh; Claudia Bredel; James P Chandler; Jaclyn J Renfrow; Ajay K Yadav; Hannes Vogel; Adrienne C Scheck; Robert Tibshirani; Branimir I Sikic
Journal:  JAMA       Date:  2009-07-15       Impact factor: 56.272

7.  CCancer: a bird's eye view on gene lists reported in cancer-related studies.

Authors:  Sabine Dietmann; Wanseon Lee; Philip Wong; Igor Rodchenkov; Alexey V Antonov
Journal:  Nucleic Acids Res       Date:  2010-06-06       Impact factor: 16.971

8.  DNMT3L modulates significant and distinct flanking sequence preference for DNA methylation by DNMT3A and DNMT3B in vivo.

Authors:  Bethany L Wienholz; Michael S Kareta; Amir H Moarefi; Catherine A Gordon; Paul A Ginno; Frédéric Chédin
Journal:  PLoS Genet       Date:  2010-09-09       Impact factor: 5.917

9.  microRNA-associated progression pathways and potential therapeutic targets identified by integrated mRNA and microRNA expression profiling in breast cancer.

Authors:  Francesca M Buffa; Carme Camps; Laura Winchester; Cameron E Snell; Harriet E Gee; Helen Sheldon; Marian Taylor; Adrian L Harris; Jiannis Ragoussis
Journal:  Cancer Res       Date:  2011-07-07       Impact factor: 12.701

10.  A ten-microRNA expression signature predicts survival in glioblastoma.

Authors:  Sujaya Srinivasan; Irene Rosita Pia Patric; Kumaravel Somasundaram
Journal:  PLoS One       Date:  2011-03-31       Impact factor: 3.240

View more
  12 in total

1.  A potentially functional variant in the serotonin transporter gene is associated with premenopausal and perimenopausal hot flashes.

Authors:  May E Montasser; Ayelet Ziv-Gal; Jessica P Brown; Jodi A Flaws; Istvan Merchenthaler
Journal:  Menopause       Date:  2015-01       Impact factor: 2.953

2.  Learning regulatory programs by threshold SVD regression.

Authors:  Xin Ma; Luo Xiao; Wing Hung Wong
Journal:  Proc Natl Acad Sci U S A       Date:  2014-10-20       Impact factor: 11.205

3.  Pigmy MicroRNA: surveillance cops in Therapies kingdom.

Authors:  Utpal Bhadra; Pradipta Patra; Jagamohan Chhatai; Manika Pal-Bhadra
Journal:  Mol Med       Date:  2016-09-28       Impact factor: 6.354

4.  Minimax Rate-optimal Estimation of High-dimensional Covariance Matrices with Incomplete Data.

Authors:  T Tony Cai; Anru Zhang
Journal:  J Multivar Anal       Date:  2016-05-19       Impact factor: 1.473

5.  miRNA clusters as therapeutic targets for hormone-resistant breast cancer.

Authors:  Gianpiero Di Leva; Douglas G Cheung; Carlo M Croce
Journal:  Expert Rev Endocrinol Metab       Date:  2015-10-13

Review 6.  MicroRNAs--novel regulators of systemic lupus erythematosus pathogenesis.

Authors:  Nan Shen; Dong Liang; Yuanjia Tang; Niek de Vries; Paul-Peter Tak
Journal:  Nat Rev Rheumatol       Date:  2012-10-16       Impact factor: 20.543

7.  Effect of miR-142-3p on the M2 macrophage and therapeutic efficacy against murine glioblastoma.

Authors:  Shuo Xu; Jun Wei; Fei Wang; Ling-Yuan Kong; Xiao-Yang Ling; Edjah Nduom; Konrad Gabrusiewicz; Tiffany Doucette; Yuhui Yang; Nasser K Yaghi; Virginia Fajt; Jonathan M Levine; Wei Qiao; Xin-Gang Li; Frederick F Lang; Ganesh Rao; Gregory N Fuller; George A Calin; Amy B Heimberger
Journal:  J Natl Cancer Inst       Date:  2014-06-28       Impact factor: 13.506

8.  The shaping and functional consequences of the microRNA landscape in breast cancer.

Authors:  Heidi Dvinge; Anna Git; Stefan Gräf; Mali Salmon-Divon; Christina Curtis; Andrea Sottoriva; Yongjun Zhao; Martin Hirst; Javier Armisen; Eric A Miska; Suet-Feung Chin; Elena Provenzano; Gulisa Turashvili; Andrew Green; Ian Ellis; Sam Aparicio; Carlos Caldas
Journal:  Nature       Date:  2013-05-05       Impact factor: 49.962

9.  Biomolecular events in cancer revealed by attractor metagenes.

Authors:  Wei-Yi Cheng; Tai-Hsien Ou Yang; Dimitris Anastassiou
Journal:  PLoS Comput Biol       Date:  2013-02-21       Impact factor: 4.475

10.  Elucidation of molecular and functional heterogeneity through differential expression network analyses of discrete tumor subsets.

Authors:  Rutika R Naik; Nilesh L Gardi; Sharmila A Bapat
Journal:  Sci Rep       Date:  2016-05-03       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.