Literature DB >> 25537518

Genetic alterations of histone lysine methyltransferases and their significance in breast cancer.

Lanxin Liu1, Sarah Kimball1, Hui Liu1, Andreana Holowatyj1, Zeng-Quan Yang1.   

Abstract

Histone lysine methyltransferases (HMTs), a large class of enzymes that catalyze site-specific methylation of lysine residues on histones and other proteins, play critical roles in controlling transcription, chromatin architecture, and cellular differentiation. However, the genomic landscape and clinical significance of HMTs in breast cancer remain poorly characterized. Here, we conducted a meta-analysis of approximately 50 HMTs in breast cancer and identified associations among recurrent copy number alterations, mutations, gene expression, and clinical outcome. We identified 12 HMTs with the highest frequency of genetic alterations, including 8 with high-level amplification, 2 with putative homozygous deletion, and 2 with somatic mutation. Different subtypes of breast cancer have different patterns of copy number and expression for each HMT gene. In addition, chromosome 1q contains four HMTs that are concurrently or independently amplified or overexpressed in breast cancer. Copy number or mRNA expression of several HMTs was significantly associated with basal-like breast cancer and shorter patient survival. Integrative analysis identified 8 HMTs (SETDB1, SMYD3, ASH1L, SMYD2, WHSC1L1, SUV420H1, SETDB2, and KMT2C) that are dysregulated by genetic alterations, classifying them as candidate therapeutic targets. Together, our findings provide a strong foundation for further mechanistic research and therapeutic options using HMTs to treat breast cancer.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 25537518      PMCID: PMC4385864          DOI: 10.18632/oncotarget.2967

Source DB:  PubMed          Journal:  Oncotarget        ISSN: 1949-2553


INTRODUCTION

Breast cancer is the most common cancer among women worldwide, with 1.3 million women diagnosed each year and about 500,000 deaths per year from the disease. Distinct subtypes of breast carcinomas that are associated with different clinical outcomes have been identified by expression analysis using microarray-based technology [1, 2]. Five intrinsic molecular subtypes of human breast cancer include Luminal A, Luminal B, human epidermal growth factor receptor 2 (HER2/ERBB2)-positive, basal-like, and normal-like breast cancer [2, 3]. Both Luminal A and Luminal B breast cancers are estrogen receptor (ER) positive, but Luminal B cancers have poorer outcomes [4]. Basal-like breast cancer is especially aggressive as it includes tumors that lack ER, progesterone receptor (PR), and HER2 expression (hence the name “triple-negative”) [5, 6]. These characteristics render conventional therapies ineffective and lead to poor prognosis. By understanding the genetic and epigenetic abnormalities that are associated with the different types of breast cancer, we can identify new subtype-specific targets for therapy. Histone lysine methylation, which is controlled by histone lysine methyltransferases (HMTs) and demethylases, is an important player in epigenetic regulation [7, 8]. More than 50 human HMTs have been identified [9]. Structurally, the HMTs are a diverse group of proteins that can be broadly categorized into two functional enzymatic families, the SET (Suppressor of variegation, Enhancer of zeste, Trithorax)-domain-containing methyltransferases and the DOT1-like (DOT1L) lysine methyltransferases [8, 9]. HMTs catalyze the transfer of one to three methyl groups from S-adenosylmethionine to specific lysine residues on histones [7]. Depending on the site and degree of methylation (mono-, di-, or trimethylated), lysine methylation can lead to various biological outcomes, including the regulation of chromatin organization and gene transcription. Recent studies indicated that dysregulation of HMTs can lead to imbalances in histone methylation pathways and contribute to the pathogenesis of a wide array of human cancers, including breast cancer [10-12]. For example, we demonstrated that the methyltransferase gene WHSC1L1 (Wolf-Hirschhorn syndrome candidate 1-like 1) is significantly amplified and overexpressed in breast cancer. We also demonstrated that WHSC1L1 acts as a transforming gene: stable WHSC1L1 overexpression in nontumorigenic mammary epithelial MCF10A cells induced transformed phenotypes, whereas WHSC1L1 knockdown inhibited proliferation of WHSC1L1-amplified breast cancer cells in vitro. EZH2, a histone 3 lysine 27 (H3K27) methyltransferase, is also significantly overexpressed in breast cancers, and elevated expression of EZH2 protein has been associated with poor prognoses for inflammatory basal-like breast cancers [13, 14]. Emerging evidence indicates that genetic alterations of several HMTs that have oncogenic or tumor-suppressor functions play important roles in cancer initiation and progression [10, 11, 15]. Despite the extensive DNA and RNA sequencing data, such as The Cancer Genome Atlas (TCGA), in human cancer, there has been no systematic analysis of genomic anomalies and expression of HMTs in different subtypes of breast cancer. In addition, the clinical relevance of genetic alterations for each HMT in breast cancer has yet to be fully explored. Thus, our goals were to determine the genomic landscape and significance of HMTs in different types of breast cancers and to evaluate their diagnostic and prognostic potentials.

RESULTS

Genetic alterations of HMTs in breast cancer

Copy number alteration (CNA) and somatic mutation are important mechanisms that activate oncogenes or inactivate tumor suppressors in human cancers [16, 17]. We hypothesized that HMTs with recurrent CNA or mutation would be more likely to play critical roles in breast cancer. Thus, to systematically investigate genetic alterations of HMTs in breast cancer, we first analyzed the genome sequencing data of 958 breast cancer samples from the TCGA database via cBioPortal [18, 19]. In cBioPortal, copy numbers were computed using a GISTIC (Genomic Identification of Significant Targets in Cancer) algorithm, which identified the putative copy number as high-level amplification (+2), low-level gain (+1), diploid (0), heterozygous deletion (−1), or homozygous deletion (−2) [18, 19]. In 958 breast cancer samples, the average CNA rate was 0.15 (range 5.23*10−6 to 0.588), based on the segmented copy number scores of the tumor samples and the paired-normal control. The human genome encodes 51 proteins with demonstrated or predicted ability to methylate histone lysine residues [9, 20]. Except for DOT1L, the rest of the HMTs contain a characteristic SET enzymatic domain and can be divided into four subgroups according to phylogenetic analysis (Table 1 and Supplementary Figure S1) [7, 9]. We analyzed copy numbers (excluding that of KMT2B, for which data were not available in the cBioPortal database) and mutations of these 51 HMTs compiled from 958 TCGA breast cancer specimens. As shown in Table 2, we discovered distinct patterns of altered copy numbers and mutations of HMTs in breast cancer. Notably, we found that eight HMTs exhibited high-level amplification in more than 5% of breast cancers, and five of these eight HMTs (SMYD3, SETDB1, ASH1L, WHSC1L1, and SMYD2) had high-level amplification in more than 10% of samples. Two HMT genes, PRDM7 and SETDB2, showed homozygous deletion in more than 2% of breast cancers. In addition, two other HMT genes, KMT2C and KMT2D, exhibited somatic mutations in more than 2% of the 958 breast cancer samples.
Table 1

Summary of identified human HMTs and their substrates

Official SymbolOther AliasesGene IDGene LocationHistone Substrates
EHMT1GLP; GLP1; KMT1D; FP13812; EUHMTASE1; Eu-HMTase1; bA188C12.1798139q34.3H3K9; H1.2K187
EHMT2G9A; BAT8; GAT8; NG36; KMT1C; C6orf30109196p21.31H3K9; H3K27; H3K56; H1.2K187; H1.4K26;
SUV39H1MG44; KMT1A; SUV39H; H3-K9-HMTase 16839Xp11.23H3K9
SUV39H2KMT1B7972310p13H3K9
SETDB1ESET; KG1T; KMT1E; TDRD21; H3-K9-HMTase498691q21.3H3K9
SETDB2CLLD8; CLLL8; KMT1F; C13orf48385213q14.2H3K9
SETD2HYPB; SET2; HIF-1; HIP-1; KMT3A; HBP231; HSPC069; p231HBP290723p21.31H3K36
KMT2EMLL5; NKp44L; HDCMC04P559047q22.3H3K4
SETD5552093p25.3
SETMARMar1; HsMar1; METNASE64193p26.1H3K36
EZH1KMT6B214517q21.2H3K27
EZH2WVS; ENX1; EZH1; KMT6; WVS2; ENX-1; EZH2b; KMT6A21467q36.1H3K27
SETD7KMT7; SET7; SET9; SET7/9808544q31.1H3K4
KMT2BHRX2; MLL2; MLL4; TRX2; WBP7; MLL1B; WBP-7975719q13.12H3K4
KMT2CHALR; MLL3585087q36.1H3K4
SETD1ASet1; KMT2F; Set1A973916p11.2H3K4
SETD1BKMT2G; Set1B2306712q24.31H3K4
KMT2AHRX; MLL; MLL1; TRX1; ALL-1; CXXC7; HTRX1; MLL1A; WDSTS; MLL/GAS7; TET1-MLL429711q23.3H3K4
KMT2DALR; KMS; MLL2; MLL4; AAD10; KABUK1; TNRC21; CAGL114808512q13.12H3K4
SETD8SET8; KMT5A; SET07; PR-Set738789312q24.31H4K20
MECOMEVI1; MDS1; PRDM3; MDS1-EVI1; AML1-EVI-121223q26.2H3K9me1
PRDM16MEL1; LVNC8; PFM13; CMD1LL639761p36.32H3K9me1
PRDM13PFM10; MU-MB-20.220593366q16.2
PRDM8PFM5569784q21.21H3K9
PRDM1BLIMP1; PRDI-BF16396q21
PRDM2RIZ; KMT8; RIZ1; RIZ2; MTB-ZF; HUMHOXY177991p36.21H3K9
PRDM10PFM75698011q24.3
PRDM12PFM9593359q34.12
PRDM6931665q23.2
PRDM14PFM11639788q13.3
PRDM4PFM11110812q23.3
PRDM15PFM15; ZNF298; C21orf836397721q22.3
PRDM5BCS2; PFM2111074q27
PRDM7PFM4; ZNF9101110516q24.3
PRDM9PFM6; MSBP3; PRMD9; ZNF899; MEISETZ569795p14.2H3K4
PRDM11PFM85698111p11.2
SUV420H1CGI85; KMT5B; CGI-855111111q13.2H4K20
SUV420H2KMT5C8478719q13.42H4K20
ASH1LASH1; KMT2H; ASH1L1558701q22H3K4; H3K36
SETD3C14orf1548419314q32.2
SETD4C21orf18; C21orf275409321q22.12
SETD67991816q21
SMYD1BOP; KMT3D; ZMYND18; ZMYND221505722p11.2H3K4
SMYD3KMT3E; ZMYND1; ZNFN3A1; bA74P14.1647541q44H3K4
SMYD2KMT3C; HSKM-B; ZMYND14569501q32.3H3K4; H3K36
SMYD4ZMYND2111482617p13.3
SMYD5RRG1; RAI15; NN8-4AG; ZMYND23103222p13.2
NSD1STO; KMT3B; SOTOS; ARA267; SOTOS1643245q35.2H3K36
WHSC1WHS; NSD2; TRX5; MMSET; REIIBP74684p16.3H3K36; H4K20
WHSC1L1NSD3; pp14328549048p11.23H3K4; H3K27
DOT1LDOT1; KMT48444419p13.3H3K79
Table 2

Frequency of HMT copy number alterations and mutations

GeneGene LocationAmpGainDiploidHetlossHomdelMutation
SMYD31q4414.6158.8723.173.130.210.52
SETDB11q21.314.4157.8325.891.880.001.15
ASH1L1q2212.8461.1724.431.570.001.77
WHSC1L18p11.2312.8424.5337.3723.701.570.52
SMYD21q32.312.1161.5923.282.920.100.21
PRDM148q13.39.7145.3040.084.800.100.31
SUV420H111q13.25.6422.6557.4114.200.100.84
SETD1A16p11.25.0146.8739.778.350.000.42
MECOM3q26.24.3827.5662.535.530.000.63
PRDM16q212.9214.5151.3630.900.310.84
SUV39H210p132.8223.9061.4811.690.100.10
SUV420H219q13.422.7122.4460.2314.510.100.21
SETMAR3p26.12.1917.9563.7815.660.420.21
SETD53p25.32.0917.8564.6115.340.100.52
PRDM95p14.21.8832.0556.059.810.210.73
EZH117q21.21.4618.7945.0934.130.520.31
PRDM1111p11.21.4616.8164.2017.330.210.10
EHMT26p21.311.3623.8060.2314.610.000.42
PRDM136q16.21.3614.3051.5732.570.210.31
NSD15q35.21.2524.8455.7417.850.311.04
PRDM1521q22.31.2520.9861.0616.490.210.63
PRDM161p36.320.947.2052.9238.000.940.42
SUV39H1Xp11.230.9415.3467.0116.280.420.21
EZH27q36.10.8424.4357.5216.910.310.31
KMT2C7q36.10.8423.8055.5318.791.046.99
SETD421q22.120.8420.8862.3215.660.310.00
PRDM716q24.30.738.8726.1061.902.400.21
SETD812q24.310.7316.4962.9419.620.210.31
EHMT19q34.30.6314.4157.8326.620.520.94
KMT2E7q22.30.6325.3759.5014.410.101.15
PRDM129q34.120.6313.2658.5627.350.210.10
PRDM84q21.210.6311.6960.6526.930.100.42
SETD1B12q24.310.6316.7062.4220.040.210.63
SETD314q32.20.6314.3055.7429.020.310.31
PRDM1011q24.30.529.5043.6344.781.570.73
WHSC14p16.30.5210.1357.4131.210.730.42
KMT2D12q13.120.4219.7365.7614.090.002.40
PRDM54q270.4210.0260.5429.020.000.31
DOT1L19p13.30.3110.7556.3731.630.940.84
PRDM412q23.30.3116.4964.8218.270.100.31
PRDM65q23.20.3118.2757.2023.700.520.10
SETD616q210.319.0828.7161.060.840.00
SETD74q31.10.319.6060.6529.330.100.52
SMYD417p13.30.316.3734.3458.250.730.52
SMYD52p13.20.3116.1868.2715.140.100.21
PRDM21p36.210.215.3253.1340.810.520.73
SMYD12p11.20.2115.0369.1015.550.100.52
KMT2A11q23.30.109.2941.6548.020.941.67
SETD23p21.310.109.2959.7130.480.421.57
SETDB213q14.20.107.5246.3543.952.090.21

Footnote: Amp=high-level amplification; Gain=low-level gain; Hetloss=heterozygous deletion; Homdel= homozygous deletion. Genes were ranked based on the frequency of high-level amplification.

Footnote: Amp=high-level amplification; Gain=low-level gain; Hetloss=heterozygous deletion; Homdel= homozygous deletion. Genes were ranked based on the frequency of high-level amplification. To determine whether the genetic alteration of each HMT is specific to a breast cancer subtype, we performed an independent analysis of copy number and mutation in different subtypes of breast cancer. Of the 958 breast cancer samples, 492 had subtype data available, including 8 normal-like, 220 Luminal A, 121 Luminal B, 55 HER2+, and 88 basal-like breast cancers [6]. Because the normal-like subtype had such a small sample size (n=8), those samples were excluded from this analysis. As shown in Supplementary Tables S1 and S2, basal-like breast cancer had the highest frequencies of HMT gene amplification, deletion, and mutation, whereas Luminal A had the lowest frequencies among the four subtypes in every category of genetic alteration. In addition, each HMT showed different frequencies of CNA or mutation in different subtypes of breast cancer. Among the eight most frequently amplified HMTs (totaling more than 5% of 958 samples), the frequencies of ASH1L, SETDB1, and SMYD3 amplification were dramatically higher in basal-like breast cancer, with more than 25% of tumors exhibiting high-level amplification compared with the other three subtypes (Figure 1A and Supplementary Table S1). In contrast, WHSC1L1 exhibited the highest frequency of amplification in the Luminal B subtype (19%), and SETD1A had the highest frequency of amplification in the Luminal A subtype (7.27%) (Figure 1A). Of the two most common HMTs with homozygous deletions, SETDB2 exhibited the highest frequency of homozygous deletion (6.82%) in basal-like samples, and PRDM7 was most frequent (5.79%) in Luminal B breast cancer (Figure 1B). Of the most commonly mutated HMTs, KMT2C was most frequently mutated in Luminal A and KMT2D in HER+ breast cancer (Figure 1C). These data indicate that breast cancer, particularly the basal-like subtype, has a high frequency of CNAs and somatic mutations in several HMTs, including amplification of ASH1L, SETDB1, and SMYD3, as well as homozygous deletion of SETDB2.
Figure 1

Frequencies of HMTs

(A) High-level amplification of eight HMTs, (B) homozygous deletion of two HMTs, and (C) mutation of two HMTs in 492 primary breast cancer specimens from The Cancer Genome Atlas across different types of breast cancer.

Frequencies of HMTs

(A) High-level amplification of eight HMTs, (B) homozygous deletion of two HMTs, and (C) mutation of two HMTs in 492 primary breast cancer specimens from The Cancer Genome Atlas across different types of breast cancer.

Expression profiling of HMTs in breast cancer

Correlation between gene expression and copy number has been used widely to prioritize candidate driver oncogenes in human cancer, because mRNA overexpression can better translate the effect of elevated copy number to cancer initiation and progression. Therefore, we next analyzed the correlation between copy number and mRNA level of 48 HMTs from 958 sequenced breast cancer specimens. Three HMTs (KMT2B, KMT2D, and PRDM9) were excluded from analysis because they lacked RNA sequencing data. To weigh the benefits of different statistical analyses, we compared three different correlation tests—Spearman, Kendall, and Pearson. The rank correlation coefficients among the three statistical tests were similar for the HMTs, specifically by mostly keeping the relative order constant (Table 3). The highest weight was given to the Spearman correlation coefficient, as it measures the relationship between rank-ordered variables and relates the two variables while conserving the order of data points. As shown in Table 3, except for MECOM, DNA copy number versus mRNA expression correlations for 47 HMT genes were positive, with 12 of them (WHSC1L1, SETD3, SETD6, SETDB1, SMYD4, EZH1, SUV420H1, EHMT1, SETD2, SUV39H2, PRDM4, and SETDB2) having a Spearman correlation coefficient (r) greater than 0.5. Among the 48 HMTs, WHSC1L1 had the highest correlation by both Spearman (r=0.737) and Kendall (r=0.604) analyses, consistent with our previous findings that WHSC1L1 is an amplified gene in breast cancer [12, 21].
Table 3

Associations between CNA and expression, and comparison of mRNA expression between basal and nonbasal breast cancer subtypes

GeneDNA/mRNA CorrelationBasal/non-basal Comparison
SpearmanKendallPearsonT StatP value
WHSC1L10.7370.6040.460−1.8153.65E-02
SETD30.6630.5450.612−4.2912.30E-05
SETD60.6250.5030.6154.6715.41E-06
SETDB10.6240.5060.5604.1154.38E-05
SMYD40.6140.4980.626−0.3443.66E-01
EZH10.6120.4850.562−6.2826.41E-09
SUV420H10.5530.4480.599−0.7422.30E-01
EHMT10.5430.4400.547−7.3365.42E-11
SETD20.5200.4170.500−3.1021.30E-03
SUV39H20.5070.4060.5738.9502.84E-14
PRDM40.5050.4090.516−4.6316.31E-06
SETDB20.5010.3980.433−2.5416.41E-03
SETD1B0.4810.3850.451−4.1214.29E-05
EHMT20.4780.3860.4705.6211.13E-07
SETD1A0.4710.3770.4510.2943.85E-01
SETD40.4300.3470.4620.6892.46E-01
PRDM20.4280.3440.422−1.4008.25E-02
SETD50.4180.3370.4691.9312.84E-02
SMYD50.4020.3240.4513.6322.38E-04
KMT2A0.4010.3160.4040.8811.90E-01
SETMAR0.3880.3090.435−1.7194.46E-02
SETD80.3670.2920.3873.2398.50E-04
NSD10.3580.2830.387−1.9362.81E-02
PRDM110.3550.2810.3721.5576.15E-02
WHSC10.3450.2780.3653.6252.43E-04
SMYD20.3430.2700.3367.2737.23E-11
PRDM150.3390.2670.3673.8959.63E-05
ASH1L0.3170.2480.310−2.7223.92E-03
SETD70.3160.2520.325−9.2457.08E-15
SMYD30.3110.2420.213−6.0561.73E-08
EZH20.3000.2350.3178.0042.43E-12
PRDM100.2780.2200.309−3.0701.43E-03
SUV420H20.2670.2110.329−0.3843.51E-01
KMT2C0.2610.2050.262−2.5576.15E-03
PRDM130.2520.2240.1664.2093.11E-05
DOT1L0.2370.1880.2682.6055.40E-03
PRDM70.2270.2000.1772.5386.47E-03
PRDM60.1970.1550.031−6.3424.91E-09
KMT2E0.1940.1540.210−2.3511.05E-02
PRDM50.1540.1230.200−0.7572.26E-01
PRDM140.1230.1150.057−1.0541.47E-01
PRDM120.1100.0860.1131.1681.23E-01
PRDM160.0920.0740.1220.4953.11E-01
PRDM10.0680.0530.0862.2381.39E-02
SMYD10.0630.050−0.0021.4197.98E-02
PRDM80.0350.0270.0450.3803.52E-01
SUV39H10.0160.0110.0775.4152.69E-07
MECOM−0.079−0.063−0.034−0.2613.97E-01

Footnote: Genes were ranked based on the Spearman correlation coefficient. Significantly higher expression of HMTs in the basal-like subtype is highlighted in dark gray, and significantly lower expression is highlighted in little gray. T stat represents the T statistic, which is equivalent to the number of standard deviations of mRNA expression levels between the basal-like and nonbasal breast cancer samples. A positive value means that the basal-like subtype has a higher value, and a negative value means that the basal-like subtype has a lower value than the nonbasal breast cancer samples.

Footnote: Genes were ranked based on the Spearman correlation coefficient. Significantly higher expression of HMTs in the basal-like subtype is highlighted in dark gray, and significantly lower expression is highlighted in little gray. T stat represents the T statistic, which is equivalent to the number of standard deviations of mRNA expression levels between the basal-like and nonbasal breast cancer samples. A positive value means that the basal-like subtype has a higher value, and a negative value means that the basal-like subtype has a lower value than the nonbasal breast cancer samples. Basal-like breast cancer, the most aggressive subtype, is associated with higher rates of metastasis and death [22]. We next sought to compare expression levels of the 48 HMTs between basal and non-basal subtypes in the 492 TCGA breast cancer samples with subtype information. The significance of difference for each HMT between the basal-like and the other subtypes was calculated using Student's t-test. In the basal-like subtype, compared with non-basal subtypes, we found that the expression levels of 12 HMTs (SETDB1, SMYD2, SUV39H2, EHMT2, SUV39H1, EZH2, WHSC1, SMYD5, PRDM15, SETD8, PRDM13, and SETD6) were significantly higher (p<0.001), and the expression levels of 8 HMTs (SMYD3, SETD3, EZH1, EHMT1, PRDM4, SETD1B, SETD7, and PRDM6) were significantly lower (p<0.001) (Figure 2 and Table 3). Notably, our analysis validated that EZH2 exhibits significantly higher expression in basal-like breast cancer, consistent with previous results [14, 23]. Conversely, WHSC1L1 showed moderately higher expression (t=−1.815, p=0.036) in non-basal subtypes, which supports our other findings that it is amplified more in Luminal subtypes and shows high correlation between copy number and mRNA expression (Figure 1A, Table 2) [12, 21].
Figure 2

Heatmap of HMT expression profiles in different types of breast cancer

The breast cancer samples used in this analysis included 8 normal-like, 220 Luminal A, 121 Luminal B, 55 HER2+, and 88 basal-like breast cancers. Significantly higher-expressed genes (p<0.001) in basal-like tumors are shown at the top, indicated by a red box; and lower-expressed genes (p<0.001) in basal-like tumors are indicated by a blue box.

Heatmap of HMT expression profiles in different types of breast cancer

The breast cancer samples used in this analysis included 8 normal-like, 220 Luminal A, 121 Luminal B, 55 HER2+, and 88 basal-like breast cancers. Significantly higher-expressed genes (p<0.001) in basal-like tumors are shown at the top, indicated by a red box; and lower-expressed genes (p<0.001) in basal-like tumors are indicated by a blue box.

KMT2C and KMT2D mutations in breast cancer

Because KMT2C and KMT2D are the most frequently mutated HMT genes in breast cancers, at rates of 6.99% and 2.40%, respectively (Table 2), we performed a comprehensive analysis of the KMT2C and KMT2D mutation spectrum in 958 breast cancer samples. As shown in Figure 3, we identified a total of 80 KMT2C mutations, consisting of 26 missense mutations, 23 nonsense mutations, 17 frameshift deletions, 12 frameshift insertions, 1 splice, and 1 inframe insertion. Eight tumor samples had two mutations, and two samples had four mutations in the KMT2C gene. For example, sample TCGA-AC-A23H contained three missense mutations (D3264N, E3724K, and D4344H) and one nonsense mutation (Q1218*). In the KMT2D gene, 25 mutations were identified, most of them missense mutations (Figure 3A). KMT2C and 2D are large proteins (approximately 5000 amino acids) that contain the zf-HC5HC2H, PHD, FYRN, and FYRC domains and the carboxy-terminal SET domain. Figure 3B shows the distribution of KMT2C and KMT2D mutations in 958 breast cancer samples across protein domains; most of the mutations are localized to the amino-terminal end of the SET domain. Previous studies demonstrated that mice lacking the KMT2C catalytic SET domain developed ureteric tumors, which supports its hypothesized role as a tumor suppressor [24]. Therefore, we predict that mutations at the amino terminus of KMT2C and KMT2D SET domains (Supplementary Table S3) might result in the truncation of the SET domain or loss of function of KMT2C and/or KMT2D methyltransferases, subsequently contributing to breast cancer initiation and progression.
Figure 3

KMT2C and KMT2D mutational spectrum in breast cancer

(A) Frequency of each mutation type for KMT2C from 958 breast cancer samples. The data were obtained from The Cancer Genome Atlas database via cBioPortal. (B) The images show protein domains and the positions of specific mutations of KMT2C and KMT2D. A red dot indicates a nonsense mutation, frameshift deletion, insertion, or splice; a green dot indicates a missense mutation; and a black dot indicates an inframe insertion or deletion.

KMT2C and KMT2D mutational spectrum in breast cancer

(A) Frequency of each mutation type for KMT2C from 958 breast cancer samples. The data were obtained from The Cancer Genome Atlas database via cBioPortal. (B) The images show protein domains and the positions of specific mutations of KMT2C and KMT2D. A red dot indicates a nonsense mutation, frameshift deletion, insertion, or splice; a green dot indicates a missense mutation; and a black dot indicates an inframe insertion or deletion.

Amplification/overexpression of multiple HMTs from chromosome 1q

Among the five most frequently amplified HMT genes (frequency>10%) in breast cancers, four were localized on the long arm of chromosome 1, with SETDB1 at 1q21.3, ASH1L at 1q22, SMYD2 at 1q32.3, and SMYD3 at 1q44 (Figure 4). Of the 958 breast cancer samples, 215 (22.44%) contained high-level amplification in at least one locus of these four genes (Figure 4A). Of those 215 samples, 65 had amplification in all four loci, while 23 samples were amplified only at SETDB1, 8 only at ASH1L, 12 only at SMYD2, and 21 only at SMYD3 (Figure 4A). In addition, 111 of 215 samples were co-amplified at ASH1L and SETDB1, and this amplicon (1q21-22) spans approximately 4 Mb in basal-like breast cancer [6]. We assume that in the 65 samples containing co-amplification of SETDB1, ASH1L, SMYD2, and SMYD3, the whole arm of chromosome 1q is amplified. We found that amplification of the whole arm of 1q is more common in Luminal subtypes (Luminal A, 7.73%, Luminal B, 8.26%) than in HER2+ (5.45%) and basal (5.68%) subtypes.
Figure 4

High-level amplification of four HMT genes at chromosome 1q in breast cancer

(A) Copy numbers of SETDB1, ASH1, SMYD2, and SMYD3 were obtained from the Oncoprint output of The Cancer Genome Atlas breast cancer data (cBioPortal). A red rectangle indicates high-level amplification; blue rectangle, homozygous deletion; grey rectangle, unaltered. SASS represents amplification of all four (SETDB1, ASH1, SMYD2, and SYMD3) loci; SE1 represents SETDB1 locus only; AS, ASH1L only; SM2, SMYD2 only; and SM3, SMYD3 only. (B) Agilent array of comparative genomic hybridization profiles shows amplification/gain of chromosome 1q in three breast cancer cell lines (Colo824, HCC1187, and MCF7, also see Supplementary Table S4).

High-level amplification of four HMT genes at chromosome 1q in breast cancer

(A) Copy numbers of SETDB1, ASH1, SMYD2, and SMYD3 were obtained from the Oncoprint output of The Cancer Genome Atlas breast cancer data (cBioPortal). A red rectangle indicates high-level amplification; blue rectangle, homozygous deletion; grey rectangle, unaltered. SASS represents amplification of all four (SETDB1, ASH1, SMYD2, and SYMD3) loci; SE1 represents SETDB1 locus only; AS, ASH1L only; SM2, SMYD2 only; and SM3, SMYD3 only. (B) Agilent array of comparative genomic hybridization profiles shows amplification/gain of chromosome 1q in three breast cancer cell lines (Colo824, HCC1187, and MCF7, also see Supplementary Table S4). In breast cancer, cell lines mirror many of the molecular characteristics of the tumors from which they were derived, so we used a panel of breast cancer cell lines to further investigate the genetic landscape of HMTs, particularly the amplification and overexpression of the four HMTs on chromosome 1q. We first analyzed our own genomic array of comparative genomic hybridization (CGH) data as well as publicly available data for 17 breast cancer cell lines [12, 21, 25]. Seven lines (Colo824, SUM229, SUM149, HCC70, HCC1937, HCC1187, and MDB-MA-468) were of the basal-like subtype, three lines (HCC1954, SUM190, and SUM225) were of the HER2+ subtype, and seven (HCC1428, SUM44, SUM52, T47D, SUM185, ZR-75-1, and MCF7) were of the Luminal subtypes. We found that, similar to primary breast cancer samples, 16 of 17 lines showed gain or amplification in at least one locus of SETDB1, ASH1L, SMYD2, and SMYD3. Six lines (Colo824, HCC70, HCC1187, MDB-MA-468, HCC1954, and ZR-75-1) showed low-level gain across most of chromosome 1q, whereas Colo824 and HCC1187 showed high-level amplification at the SETDB1 locus (Figure 4B and Supplementary Table S4). Notably, MCF7 showed moderate amplification at the SETDB1 locus as well as heterozygous deletions at the ASH1L and SMYD2 loci (Figure 4B and Supplementary Table S4). In addition, consistent with our findings in primary breast cancers, we also found that other HMTs, such as WHSC1L1, SUV420H1, and SETD1A, were commonly gained or amplified, and SETDB2 was commonly lost or deleted in breast cancer cell lines (Supplementary Figure S2). Next, we performed quantitative RT-PCR (qRT-PCR) analysis to measure the mRNA expression level of eight HMTs (SETDB1, ASH1L, SMYD2, SMYD3, SETD1A, WHSC1L1, SUV420H1, and SETDB2) in 20 breast cancer cell lines. MCF10A, an immortalized but non-tumorigenic breast epithelial cell line, was used as the control. These eight HMTs were chosen because seven of them showed the highest frequency of amplification (>5%), and the eighth, SETDB2, exhibited homozygous deletion in more than 2% of breast cancers. Although PRDM14 and PRDM7 were also among the HMTs with the highest frequency of amplification or homozygous deletion, respectively, in breast cancer, they were excluded from the qRT-PCR analysis because their expression levels in breast cancer cells were too low for detection (data not shown). Figure 5 shows the relative expression of eight HMT genes in 20 breast cancer cell lines compared with MCF10A cells. We found that mRNA levels of SETDB1 were more than two-fold higher in 14 of 20 breast cancer cell lines. For ASH1L, 11 of 20 cell lines; for SMYD2, 14 of 20; and for SMYD3, 6 of 20 breast cancer cell lines had two-fold higher mRNA levels. Notably, Colo824 showed higher expression of four 1q HMTs, with the highest expression of SETDB1 among the 20 breast cancer cell lines. In contrast, MCF7 showed higher expression of SETDB1, but not ASH1L and SMYD2, which is consistent with amplification of SETDB1, but not ASH1L and SMYD2, in this breast cancer cell line (Figure 5). We also found that SMYD3 and WHSC1L1 were prevalently overexpressed in several Luminal breast cancer cell lines. Breast cancer cell lines with deletion of SETDB2, such as SUM229, SUM149, HCC170, and SUM190, showed dramatically lower mRNA levels than MCF10A cells (Figure 5). Similar results were observed in RNA sequencing (RNA-Seq) data from 42 breast cancer cell lines compared with five normal mammary epithelial cell lines (Supplementary Figure S3). These experiments demonstrated that breast cancer cell lines with HMT gene CNAs showed a correlated change in mRNA expression. Thus, these cell lines provide useful preclinical models in which to investigate the biological functions of HMTs and to explore novel inhibitors for targeting HMTs in breast cancer in the future.
Figure 5

mRNA expression levels of eight HMTs in a panel of 20 breast cancer cell lines measured by qRT-PCR

mRNA expression levels in the MCF10A cells, an immortalized but non-tumorigenic breast epithelial cell line, were arbitrarily set as 1. Relative expression levels are shown as fold changes compared with that in MCF10A cells.

mRNA expression levels of eight HMTs in a panel of 20 breast cancer cell lines measured by qRT-PCR

mRNA expression levels in the MCF10A cells, an immortalized but non-tumorigenic breast epithelial cell line, were arbitrarily set as 1. Relative expression levels are shown as fold changes compared with that in MCF10A cells.

HMT copy number and expression are associated with breast cancer patient survival

To investigate the clinical relevance of genetic alterations of HMTs in breast cancer, we examined the relationship between HMT copy number, mRNA expression, and overall patient survival in 770 of 958 breast cancer samples for which detailed survival data were available. To investigate DNA copy number, samples were segregated into the following three groups for each HMT: amp/gain (high-level amplification and low-level gain), diploid, or deletion (heterozygous and homozygous deletion). Log-rank statistical analysis was first used to determine whether an increase or decrease in copy number for each HMT was associated with overall patient survival. We found that for seven HMTs (KMT2C, SETDB2, SETD2, SETMAR, PRDM1, PRDM5, and PRDM8), copy number amp/gain or deletion was significantly associated (p<0.05) with shorter survival in breast cancer patients (Figure 6A and Supplementary Table S5). Both deletion and amp/gain of SETDB2 and SETMAR were associated with shorter patient survival, whereas only amp/gain of PRDM1, PRDM5, and PRDM8 was more likely associated with shorter patient survival (Supplementary Table S5). Most importantly, we discovered that deletion of KMT2C was significantly associated with shorter survival, and amp/gain of this gene was significantly associated with longer survival, compared with patients who had no change in copy number (Figure 6A). However, KMT2C, which has the highest genetic mutation rate of HMTs in breast cancer, showed no significant difference in patient survival with regard to mutation (Supplementary Figure S4).
Figure 6

Kaplan-Meier plots of overall survival associated with

(A) copy number and (B) mRNA expression levels of three HMTs (WHSC1L1, SETDB2, and KMT2C) in breast cancer

Kaplan-Meier plots of overall survival associated with

(A) copy number and (B) mRNA expression levels of three HMTs (WHSC1L1, SETDB2, and KMT2C) in breast cancer To analyze the relationships between HMT mRNA expression and overall patient survival in breast cancer, samples were divided into low (n=385) and high (n=385) groups based on the mRNA expression Z-scores [RNA-Seq V2 RSEM (RNA-Seq by Expectation-Maximization)] of each HMT. Seven HMTs were excluded from survival analysis of mRNA expression because three of them (KMT2B, KMT2D, and PRDM9) lacked RNA sequencing data, and the expression levels for four of them (PRDM7, PRDM13, PRDM14, and SMYD1) were too low to be convincing, as previously noted. Supplementary Table S6 summarizes the results of a log-rank statistical analysis for 44 HMTs in breast cancer. High mRNA levels of WHSC1L1 (p=0.0231), SETD7 (p=0.0021), and SETD5 (p=0.0456) were significantly associated (p<0.05) with shorter survival in breast cancer patients (Figure 6B, Supplementary Table S6). For WHSC1L1, higher mRNA expression had a hazard ratio (HR), a ratio of the probability of death, of 1.659 (95% confidence interval, 1.074 to 2.571) compared with lower mRNA expression in breast cancer. Additionally, high mRNA levels of SUV39H2 (p=0.0506), KMT2A (p=0.0644), NSD1 (p=0.0701), and ASH1L (p=0.0877) showed moderately significant association (p<0.1) with shorter survival in patients with breast cancer (Supplementary Table S6). In contrast, only two HMTs, SETD8 (p=0.0769) and EHMT2 (p=0.0978), displayed low mRNA levels that were moderately associated with shorter survival in patients. KMT2C, which had the highest genetic mutation rate among HMTs in breast cancer, showed no significant difference (p=0.3774) in patient survival in terms of mRNA expression levels (Figure 6B). We then performed a multivariate analysis (Cox model, n=468) to investigate whether the copy number and/or expression level of each HMT were predictive of poor prognosis compared with standard prognostic markers, including age at diagnosis, ER status, PR status, HER2 status, tumor size, lymph node status, metastasis status, and molecular subtype (basal vs. non-basal). We found that copy number amp/gain of NSD1 or PRDM1, or loss of SETDB2 or PRDM10 was significantly associated (p<0.05) with shorter survival in breast cancer patients (Supplementary Table S7). In addition, we found that high mRNA levels of SETD4 (p=0.0468, HR=1.92), SETD5 (p=0.00231, HR=2.79), or SETD7 (p=0.0391, HR=1.97) were significantly associated (p<0.05) with shorter survival in breast cancer patients (Supplementary Table S8). However, amp/gain or high expression level of SMYD3 was negatively correlated with shorter survival in breast cancer patients (Supplementary Tables S7 and S8).

Integrative identification of critical HMTs in breast cancer

The preceding results suggested the possibility of ranking the importance of HMTs in breast cancer according to CNAs, mutation, mRNA expression, and clinical relevance. We calculated a score for each HMT, where every category counted as one point: when an HMT had high-level amplification (frequency >5%), homozygous deletion (frequency >2%), or mutation (frequency >2%); CNA associated with patient survival (log-rank test p<0.1); DNA/mRNA correlation (r > 0.5, p<0.001); altered expression in aggressive basal-like breast cancer (p<0.001); and mRNA associated with shorter patient survival (log-rank test p<0.1). We then ranked the HMTs, as shown in Table 4, observing that three HMTs (SETDB1, WHSC1L1, and SETDB2) had a score of 3, and five HMTs (ASH1L, SMYD2, SMYD3, SUV420H1, and KMT2C) had a score of 2. These results suggest that these eight HMTs act as drivers of oncogenic processing or as tumor suppressors and have critical roles in breast cancer initiation and progression.
Table 4

Integrative identification of critical HMTs in breast cancer

GeneCNA/MutationsCNA/SurvivalDNA/mRNAExpressionmRNA/SurvivalScore
ASH1L++2
SETDB1+++3
SMYD2++2
SMYD3++2
WHSC1L1+++3
PRDM14+1
SUV420H1++2
SETD1A+1
PRDM7+1
SETDB2+++3
KMT2C++2
KMT2D+1

Footnote: CNA/Mutation: high-level amplification, homozygous deletion, or mutation; CNA/Survival: CNA associated with patient survival; DNA/mRNA: DNA/mRNA correlation; Expression: altered expression in aggressive basal-subtype ; and mRNA/Survival: mRNA associated with patient survival.

Footnote: CNA/Mutation: high-level amplification, homozygous deletion, or mutation; CNA/Survival: CNA associated with patient survival; DNA/mRNA: DNA/mRNA correlation; Expression: altered expression in aggressive basal-subtype ; and mRNA/Survival: mRNA associated with patient survival.

DISCUSSION

We performed comprehensive genomic and transcriptomic analysis of 51 human HMTs in a panel of breast cancer cell lines and in primary breast cancer samples. Our main findings include the following: (1) we identified 12 HMTs with the highest frequency of genetic alterations, including eight with high-level amplification, two with putative homozygous deletion, and two with somatic mutation; (2) there was a correlation between gene expression and copy number: most HMTs had a positive correlation, and WHSC1L1 had the highest correlation coefficient in breast cancer; (3) different subtypes of breast cancer have different patterns of copy number and expression for each HMT gene; several, including SETDB1 and SMYD2, were highly amplified and overexpressed in basal-like breast cancer; (4) the two most mutated HMTs, KMT2C and KMT2D, exhibited various types of mutation across protein domains, possibly resulting in loss of their methyltransferase functions; (5) chromosome 1q contains four HMTs that are concurrently or independently amplified/overexpressed in breast cancer cell lines and primary samples; (6) we identified several HMTs, including WHSC1L1, SETDB2, and KMT2C, whose DNA copy number or mRNA expression level was significantly associated with shorter survival in breast cancer patients; and (7) integrative analysis prioritized SETDB1, WHSC1L1, and SETDB2 as the most critical HMTs in breast cancer. The HMTs constitute a large class of enzymes that catalyze site-specific methylation of lysine residues on histones and other proteins [7-9]. Previously, oncogenic alterations, including amplification, mutation, and translocation of several HMTs, were identified in various human tumors, including breast cancer [8, 10, 11, 26, 27]. A well-studied example is EZH2, the catalytic subunit of the polycomb repressive complex 2. Hyperactivation of EZH2, by amplification/overexpression or mutation, was documented in breast and prostate cancers, lymphoma, and other types of tumors [28]. Tissue microarray analysis revealed that EZH2 protein levels were strongly associated with breast cancer aggressiveness [13, 29]. Another example is WHSC1L1, a commonly amplified gene at 8p11-12 in breast and lung cancers; WHSC1L1 is fused with NUP98 in acute myeloid leukemia [12, 30-32]. Evidence indicated that the 8p11-12 amplicon contained several candidate oncogenes in breast cancer [12]. As shown in Supplementary Figure S5, we found that most WHSC1L1-amplified breast cancer samples also showed the amplification of other candidate oncogenes at the 8p11-12 amplicon. Notably, our previous study demonstrated the oncogenic potential of WHSC1L1, particularly in ER+ Luminal breast cancer [12]. Although previous studies revealed the dysregulation of individual HMTs in breast cancer, to our knowledge, this is the first report showing comprehensive genomic and transcriptomic analysis of 51 HMTs in different types of breast cancer [10-12]. This study validates previous findings in individual HMTs, namely that WHSC1L1 is highly amplified/overexpressed in Luminal subtypes, and EZH2 is highly expressed in basal-like breast cancer [12, 23, 27]. Importantly, our results reveal the genomic landscape for many HMTs in different subtypes of breast cancer. Previous studies demonstrated a prevalent gain/amplification of chromosome 1q in breast cancer [33-35]. Of the five HMTs that have high-level amplification in more than 10% of breast cancer samples, four are clustered on chromosome 1q, from 1q21 to 1q44. We found that, of 215 breast cancer samples in which chromosome 1q is amplified, 65 (30%) had concurrent high-level amplification in all four genetic loci; this amplification is more common in Luminal breast cancer. Furthermore, detailed genomic analysis showed that 1q amplifications were heterogeneous in most breast cancer cell lines and primary samples. The SETDB1 gene (1q21.3) encodes the H3K9 methyltransferase and showed independent amplification in 23 of 215 primary breast cancers as well as the MCF7 breast cancer cell line. We found that amplification or overexpression of SETDB1 is more common in basal-like breast cancer (Figure 1A, Figure 2, and Supplementary Table S9). Recent studies demonstrated that SETDB1 is frequently amplified in lung and urothelial cancers as well as melanoma [36-38]. The inhibition of SETDB1 expression in SETDB1-gene-amplified lung cancer cells reduced tumor growth in cell culture and nude mice models, whereas its overexpression increased tumor invasiveness [36]. SETDB1 significantly accelerated melanoma formation in a zebrafish model [38]. SMYD2 and SMYD3, which share a high degree of sequence homology, are localized at 1q32 and 1q44, respectively. We queried published TCGA breast cancer data and found that the size of the 1q44 (SMYD3) amplicon spans approximately 1.5 Mb in breast cancer [6]. Recent studies demonstrated that SMYD2 and SMYD3 can methylate both histone and nonhistone proteins, such as the tumor suppressors p53 and Rb as well as the ERα protein [39-41]. Enzymatic analysis shows that SMYD2 is a monomethyltransferase that prefers nonmethylated p53 peptide as a substrate among different histones and protein substrates tested in vitro [42]. Conversely, another study demonstrated the role of SMYD3 in ER-mediated transcription via its histone methyltransferase activity [43]. Furthermore, SMYD3 activity regulates cytoplasmic oncogenic signaling; specifically, SMYD3-mediated MAP3K2 methylation activates RAS signaling and drives carcinogenesis in vivo [44]. Additionally, ASH1L is an H3K4 and H3K36 histone methyltransferase that occupies the transcribed region of active genes [45]. Notably, H3K4 demethylase KDM5B (also known as JARID1B), which is localized at 1q32.1, was recently shown to be an oncogene in Luminal breast cancer by regulating lineage-specific genes [46]. Thus, chromosome 1q contains multiple oncogenic histone lysine modifiers that are concurrently or independently amplified or overexpressed in breast cancer. It is necessary to investigate whether or how these five histone lysine modifiers contribute, independently or cooperatively, to breast tumorigenesis. Two HMT genes demonstrated high (>2%) rates of homozygous deletion among breast cancer samples, implying their potential roles as tumor suppressors. Of the two, PRDM7 is the more mysterious gene. Most studies of PRDM7 to date investigated its phylogenetic and evolutionary ancestry [47, 48]. The other gene, SETDB2, which was concurrently deleted with another tumor suppressor, RB1, at 13q14 (Supplementary Figure S5A), is a homologue of SETDB1 and participates in the distribution of trimethylated H3K9 (H3K9me3) marks and contributes to chromosome segregation during mitosis [49]. However, although SETDB1 and SETDB2 are homologues, they seem to have opposing functions with regard to breast cancer. Structurally, the most significant difference between the two proteins is the presence or absence of a Tandem-Tudor domain. The Tandem-Tudor domain functions essentially as the guiding module that allows a protein to recognize and bind to the appropriate histone, where it can regulate methylation reactions. For example, the Tandem-Tudor domain of 53BP1 has a direct role in recognizing and binding H4K20 dimethylation (H4K20me2) to promote DNA repair [50]. In addition, the Tandem-Tudor domain of histone H3K9 and H3K36 demethylase KDM4A recognizes and binds H3K4me3 and H4K20me3/me2 marks [51, 52]. However, the histone site and marks that the SETDB1 Tandem-Tudor domain recognizes are still not known. We still need to investigate how the presence of the Tandem-Tudor domain in SETDB1 differentiates it from SETDB2 so dramatically. We identified the two most mutated HMT genes in breast cancer. KMT2C was mutated in 67 of 958 breast cancer specimens (6.99%), which is almost three times more than KMT2D with a rate of 2.40%. KMT2C and KMT2D belong to a set of genes known as mixed-lineage leukemia (MLL) genes and have the aliases MLL3 and MLL4, respectively. Translocation mutations cause the generated MLL proteins to lose their SET domain and fuse with other proteins to create fusion proteins, which have been identified as direct causes of aggressive leukemia [53]. In terms of function, KMT2C and KMT2D are monomethylases for the lysine residue H3K4 and are required to generate H3K4me1 at various enhancer regions throughout the genome [54]. It has been proposed that cancer-associated mutations in KMT2C and KMT2D exert their properties through the malfunction of KMT2C/KMT2D-dependent enhancers [55]. In addition, KMT2C/D can associate with activating signal cointegrator-2 (ASC-2), a multifunctional coactivator, to form an ASC complex, which has been identified as a key factor in the DNA damage response and in p53 activation [56]. Our data, which demonstrate that KMT2C is commonly mutated and that its deletion is significantly associated with shorter patient survival, suggests that KMT2C might function as a tumor suppressor in breast cancer. Because epigenetic changes are reversible and HMTs are druggable, targeting HMTs provides a unique opportunity for pharmacologic intervention by means of designing inhibitors that represent a novel class of anti-cancer drugs. Recent evidence shows that aberrant activity of HMTs, due to amplification, deletion, or mutation of their corresponding genes, contributes to cancer initiation and progression. Consequently, a promising strategy could be to target patient populations with those alterations. Selective inhibitors of several HMTs have been reported to show antitumor effects in vitro and in vivo [57]. Furthermore, an EZH2 inhibitor (EPZ-6438) has entered phase I human clinical trials. Here, we reported a systematic and integrative analysis of HMTs and identified eight HMTs (SETDB1, SMYD3, ASH1L, SMYD2, WHSC1L1, SUV420H1, SETDB2, and KMT2C) that have the highest frequencies of genetic alterations and most clinical relevance. Our findings for these HMTs represent a strong foundation for further mechanistic research and therapeutic advances in breast cancer.

MATERIALS AND METHODS

Cell culture

The cultures for the SUM series of breast cancer cell lines and nontransformed human mammary epithelial cell MCF10A line have been described in detail previously [21, 58]. The Colo824 cell line was obtained from DMSZ, and the cell lines HCC70, HCC1187, HCC1428, HCC1937, HCC1954, MDB-MA-468, T47D, and ZR-75-1 were obtained from ATCC. These lines were maintained in RPMI with 10% FBS (Atlanta Biologicals, Flowery Branch, GA, USA) according to DMSZ and ATCC protocols.

Genomic array CGH

Genomic array CGH experiments were done using the Agilent human genome CGH microarray chip (Agilent Technologies, Palo Alto, CA, USA) as previously described in detail [21]. Agilent's CGH Analytics software was used to calculate various measurement parameters, including log2 ratios of total integrated Cy-5 and Cy-3 intensities for each probe. Array data have been posted in the NCBI GEO database (GEO accession numbers: GSM718287, GSM718288, GSM718290).

Semiquantitative RT-PCR reactions

mRNA was prepared from human breast cancer cell lines and the MCF10A cell line by using an RNeasy Plus Mini Kit (QIAGEN). mRNA was mixed with qScript cDNA SuperMix (Quanta Biosciences, Gaithersburg, MD, USA), then converted into cDNA through a reverse-transcription reaction for real-time PCR reactions. Primer sets for HMT genes were ordered from Life Technologies (Carlsbad, CA, USA). A PUM1 primer set was used as a control. Semiquantitative RT-PCR was done using the FastStart Universal SYBR Green Master (Rox) (Roche Diagnostics Indianapolis, IN, USA).

Genomic and clinical data

The DNA copy number, mutation, and overall survival datasets of 958 breast cancer samples used in this research were obtained from the cBio Cancer Genomics Portal at http://www.cbioportal.org. The copy number for each HMT was generated from the copy number analysis algorithms GISTIC and indicates the copy number level per gene. “-2” is a deep loss (possibly a homozygous deletion), “-1” is a heterozygous deletion, “0” is diploid, “1” indicates a low-level gain, and “2” is a high-level amplification. For mRNA expression data, the relative expression of an individual gene and the gene's expression distribution in a reference population were analyzed. The reference population was either all tumors that are diploid for the gene in question, or, when available, normal adjacent tissue. The returned value indicates the number of standard deviations away from the mean of expression in the reference population (Z-score). The breast cancer subtype information was from a previous publication [6].

Statistical analysis

Statistical analyses were performed using the R software (http://www.r-project.org) and Graphpad Prism (version 6.03). The correlations between copy numbers and mRNA levels of each HMT from 958 sequenced breast cancer specimens were analyzed using Spearman, Kendall, and Pearson correlation tests. The Spearman and Kendall tests are rank correlations—the Spearman coefficient relates the two variables conserving the order of data points, and the Kendall coefficient measures the number of ranks that match in the data set. Although the Pearson correlation coefficient is the most widely used, it was deemed the least relevant to our study, as it measures only the strength of linear relationships and ignores all others. We used the “cor” function in R statistical software for computation, specifying in the code which type of test we wanted (Spearman, Kendall, or Pearson). A systematic approach was used to analyze the correlations between the mRNA levels and DNA copy numbers for each of the 48 HMTs for all three of the methods. The difference in mRNA expression level for each HMT between the basal-like and the other cancer subtypes was calculated using Student's t-test. The association between the clinical outcome and individual HMT copy number and expression level was evaluated using a log-rank test. Multivariate survival analysis was conducted using the Cox regression function in R statistical software.
  57 in total

1.  NUP98 is fused to the NSD3 gene in acute myeloid leukemia associated with t(8;11)(p11.2;p15).

Authors:  Roberto Rosati; Roberta La Starza; Angelo Veronese; Ana Aventin; Christine Schwienbacher; Teresa Vallespi; Massimo Negrini; Massimo F Martelli; Cristina Mecucci
Journal:  Blood       Date:  2002-05-15       Impact factor: 22.113

2.  Structural basis for the methylation state-specific recognition of histone H4-K20 by 53BP1 and Crb2 in DNA repair.

Authors:  Maria Victoria Botuyan; Joseph Lee; Irene M Ward; Ja-Eun Kim; James R Thompson; Junjie Chen; Georges Mer
Journal:  Cell       Date:  2006-12-29       Impact factor: 41.582

Review 3.  Gene expression profiling in breast cancer: classification, prognostication, and prediction.

Authors:  Jorge S Reis-Filho; Lajos Pusztai
Journal:  Lancet       Date:  2011-11-19       Impact factor: 79.321

4.  Molecular characterization of breast cancer with high-resolution oligonucleotide comparative genomic hybridization array.

Authors:  Fabrice Andre; Bastien Job; Philippe Dessen; Attila Tordai; Stefan Michiels; Cornelia Liedtke; Catherine Richon; Kai Yan; Bailang Wang; Gilles Vassal; Suzette Delaloge; Gabriel N Hortobagyi; W Fraser Symmans; Vladimir Lazar; Lajos Pusztai
Journal:  Clin Cancer Res       Date:  2009-01-15       Impact factor: 12.531

Review 5.  Enhancer malfunction in cancer.

Authors:  Hans-Martin Herz; Deqing Hu; Ali Shilatifard
Journal:  Mol Cell       Date:  2014-03-20       Impact factor: 17.970

Review 6.  Histone methyltransferases in cancer.

Authors:  Mareike Albert; Kristian Helin
Journal:  Semin Cell Dev Biol       Date:  2009-11-03       Impact factor: 7.727

Review 7.  Mutations in regulators of the epigenome and their connections to global chromatin patterns in cancer.

Authors:  Christoph Plass; Stefan M Pfister; Anders M Lindroth; Olga Bogatyrova; Rainer Claus; Peter Lichter
Journal:  Nat Rev Genet       Date:  2013-10-09       Impact factor: 53.242

Review 8.  Chromosome aberrations in solid tumors.

Authors:  Donna G Albertson; Colin Collins; Frank McCormick; Joe W Gray
Journal:  Nat Genet       Date:  2003-08       Impact factor: 38.330

9.  Molecular and Epigenetic Mechanisms of MLL in Human Leukemogenesis.

Authors:  Erica Ballabio; Thomas A Milne
Journal:  Cancers (Basel)       Date:  2012-09-10       Impact factor: 6.639

10.  Family expansion and gene rearrangements contributed to the functional specialization of PRDM genes in vertebrates.

Authors:  Irene Fumasoni; Natalia Meani; Davide Rambaldi; Gaia Scafetta; Myriam Alcalay; Francesca D Ciccarelli
Journal:  BMC Evol Biol       Date:  2007-10-04       Impact factor: 3.260

View more
  65 in total

1.  Discovery of Isoxazole Amides as Potent and Selective SMYD3 Inhibitors.

Authors:  Dai-Shi Su; Junya Qu; Mark Schulz; Chuck W Blackledge; Hongyi Yu; Jenny Zeng; Joelle Burgess; Alexander Reif; Melissa Stern; Raman Nagarajan; Melissa Baker Pappalardi; Kristen Wong; Alan P Graves; William Bonnette; Liping Wang; Patricia Elkins; Beth Knapp-Reed; Jeffrey D Carson; Charles McHugh; Helai Mohammad; Ryan Kruger; Juan Luengo; Dirk A Heerding; Caretha L Creasy
Journal:  ACS Med Chem Lett       Date:  2019-12-27       Impact factor: 4.345

2.  Two Loops Undergoing Concerted Dynamics Regulate the Activity of the ASH1L Histone Methyltransferase.

Authors:  David S Rogawski; Juliano Ndoj; Hyo Je Cho; Ivan Maillard; Jolanta Grembecka; Tomasz Cierpicki
Journal:  Biochemistry       Date:  2015-08-25       Impact factor: 3.162

Review 3.  H3K36 methyltransferases as cancer drug targets: rationale and perspectives for inhibitor development.

Authors:  David S Rogawski; Jolanta Grembecka; Tomasz Cierpicki
Journal:  Future Med Chem       Date:  2016-08-22       Impact factor: 3.808

4.  Differences between BCL2-break positive and negative follicular lymphoma unraveled by whole-exome sequencing.

Authors:  A Zamò; J Pischimarov; M Schlesner; P Rosenstiel; R Bomben; H Horn; T Grieb; T Nedeva; C López; A Haake; J Richter; L Trümper; C Lawerenz; W Klapper; P Möller; M Hummel; D Lenze; M Szczepanowski; L Flossbach; M Schreder; V Gattei; G Ott; R Siebert; A Rosenwald; E Leich
Journal:  Leukemia       Date:  2017-08-21       Impact factor: 11.528

5.  Genetic alterations of KDM4 subfamily and therapeutic effect of novel demethylase inhibitor in breast cancer.

Authors:  Qin Ye; Andreana Holowatyj; Jack Wu; Hui Liu; Lihong Zhang; Takayoshi Suzuki; Zeng-Quan Yang
Journal:  Am J Cancer Res       Date:  2015-03-15       Impact factor: 6.166

6.  Smad3-mediated recruitment of the methyltransferase SETDB1/ESET controls Snail1 expression and epithelial-mesenchymal transition.

Authors:  Dan Du; Yoko Katsuno; Dominique Meyer; Erine H Budi; Si-Han Chen; Hartmut Koeppen; Hongjun Wang; Rosemary J Akhurst; Rik Derynck
Journal:  EMBO Rep       Date:  2017-12-12       Impact factor: 8.807

7.  The molecular genetic make-up of male breast cancer.

Authors:  Cathy B Moelans; Joep de Ligt; Petra van der Groep; Pjotr Prins; Nicolle J M Besselink; Marlous Hoogstraat; Natalie D Ter Hoeve; Miangela M Lacle; Robert Kornegoor; Carmen C van der Pol; Wendy W J de Leng; Ellis Barbé; Bert van der Vegt; John Martens; Peter Bult; Vincent T H B M Smit; Marco J Koudijs; Isaac J Nijman; Emile E Voest; Pier Selenica; Britta Weigelt; Jorge S Reis-Filho; Elsken van der Wall; Edwin Cuppen; Paul J van Diest
Journal:  Endocr Relat Cancer       Date:  2019-10       Impact factor: 5.678

8.  Identification and characterization of PKF118-310 as a KDM4A inhibitor.

Authors:  Gianluigi Franci; Federica Sarno; Angela Nebbioso; Lucia Altucci
Journal:  Epigenetics       Date:  2016-10-21       Impact factor: 4.528

9.  Residual expression of SMYD2 and SMYD3 is associated with the acquisition of complex karyotype in chronic lymphocytic leukemia.

Authors:  Wilson Oliveira-Santos; Doralina Amaral Rabello; Antônio Roberto Lucena-Araujo; Fábio Morato de Oliveira; Eduardo Magalhaes Rego; Fábio Pittella Silva; Felipe Saldanha-Araujo
Journal:  Tumour Biol       Date:  2016-01-20

10.  Novel role of ASH1L histone methyltransferase in anaplastic thyroid carcinoma.

Authors:  Bin Xu; Tingting Qin; Jingcheng Yu; Thomas J Giordano; Maureen A Sartor; Ronald J Koenig
Journal:  J Biol Chem       Date:  2020-05-12       Impact factor: 5.157

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.