Literature DB >> 30279964

Intratumor heterogeneity of HMCN1 mutant alleles associated with poor prognosis in patients with breast cancer.

Chie Kikutake1, Minako Yoshihara1,2, Tetsuya Sato1,2, Daisuke Saito1,2, Mikita Suyama1,2.   

Abstract

Human breast cancers comprise a complex and highly heterogeneous population of tumor cells. Intratumor heterogeneity is an underlying cause of resistance to effective therapies and disease recurrence. To explore prognostic factors based on intratumor heterogeneity, we analyzed genomic mutations in breast cancer patients registered in The Cancer Genome Atlas. We calculated the variant allele frequency (VAF) at each mutation site and evaluated the associations of VAFs with the prognosis of breast cancer. VAFs of HMCN1 correlated with the prognosis and lymph node status. Although the detailed function of HMCN1 remains unknown, it is located in extracellular matrix and the mutation in the gene might be associated with cancer cell invasion and metastasis. This finding suggests that HMCN1 is a potential metastatic factor and can be a candidate gene for targeted breast cancer therapy.

Entities:  

Keywords:  breast cancer; genetic variant; lymph node metastasis; next-generation sequencing; variant allele frequency

Year:  2018        PMID: 30279964      PMCID: PMC6161790          DOI: 10.18632/oncotarget.26071

Source DB:  PubMed          Journal:  Oncotarget        ISSN: 1949-2553


INTRODUCTION

Breast cancer is the most common type of cancer affecting women worldwide. In 2012, approximately 1.7 million cases of breast cancer were newly diagnosed [1]. Changes in dietary habits and a reduced birth rate can increase the risk of breast cancer. Breast cancer is a clinically heterogeneous disease for which four basic therapeutic or molecular subtypes have been classified based on the expression status of three receptors: estrogen receptor, progesterone receptor, and human epidermal growth factor receptor 2 [2, 3]. Immunohistochemistry is used to classify these four tumor subtypes and ensure that effective treatment is provided to each patient. Despite recent therapeutic advances, tumor recurrence and drug resistance remain major challenges in the field of breast cancer. These challenges are mainly attributed to intratumor heterogeneity [4], which is characterized by subclonal diversity within a tumor that originates from the accumulation of various somatic mutations during cell division and proliferation [5-7]. Intratumor heterogeneity has already been identified in several types of cancer, including breast, prostate, kidney, brain, liver, and lung cancers [8]. Drug-resistant subclones may develop via clonal evolution and reside at low frequencies within a tumor; after drug therapy, however, these subclones become the main population, leading to recurrence [9-11]. Intratumor heterogeneity can be most directly evaluated from DNA sequences using next-generation sequencing (NGS). One of the commonly used methods to analyze heterogeneity is the sequencing of samples from multiple regions of the same tumor [10, 12]. Ultra-deep sequencing can also be used to detect mutations with extremely low allele frequencies. Variant allele frequency (VAF), calculated as the proportion of reads with mutations at the variant site, is used as an index of heterogeneity [13, 14]. VAFs in a tumor can be used to determine the cellular prevalence of a mutation within a sample and estimate subpopulation frequencies and the tumor evolutionary process [15-17]. For example, deep sequencing was used to evaluate mutational processes of 21 breast cancers, leading to the finding that every tumor harbored a distinct subclonal lineage [10]. The use of NGS and analytical methods to define clonal heterogeneity has also provided insights into the genetic processes underlying breast cancer metastasis [18]. Recent studies also showed that clonal distribution based on VAF correlated with prognosis [19, 20]. Additionally, heterogeneity can be evaluated using large datasets generated by The Cancer Genome Atlas (TCGA) or the International Cancer Genome Consortium [21, 22]. Despite these advances, it remains highly challenging to identify tumor genetic factors associated with tumor growth or metastasis, as tumors exhibit considerable heterogeneity. Several recent studies based on TCGA data have focused on the identification of driver genes and identified pathways containing potential drug targets [23]. These studies have accelerated the development of pathway-specific inhibitory drugs. Although mutations in breast cancer driver genes such as TP53, PIK3CA, and GATA3 have been extensively investigated, somatic alterations in other genes are also believed to be associated with breast cancer [24]. To gain better insights into the extent of intratumor heterogeneity, we analyzed breast cancer genome sequencing data from TCGA. In this study, we focused on genes associated with breast cancer prognosis.

RESULTS

Identification of genes with high frequencies of mutations

We sought genes with one of four types of mutation (missense mutations, nonsense mutations, frameshift insertions, and frameshift deletions) in ≥ 50 samples derived from the 1,044 breast cancer datasets in TCGA. We identified 17 such genes (Table 1) and calculated the mean VAF for each in the sample containing mutations (Figure 1). All VAFs were adjusted for tumor purity taken from the previous study [25]. The mean VAFs of already known driver genes in breast cancer, such as TP53, PIK3CA, and CDH1 were found to be relatively high.
Table 1

Frequently mutated genes and mean variant allele frequencies and hazard ratios

GeneSample countMean value of VAFHazard Ratio (95% CI)P-valueFDRa
PIK3CA3040.4711.78 (0.729–4.348)0.2060.696
TP532930.6261.276 (0.455–3.581)0.6430.994
TTN1930.3111.85 (0.843–4.06)0.1250.531
MUC161130.3051.768 (0.619–5.048)0.2870.696
CDH11040.4681.019 (0.263–3.951)0.9790.999
GATA31020.3980.859 (0.245–3.01)0.8130.999
KMT2C860.3701.327 (0.478–3.687)0.5870.994
MAP3K1770.4360.114 (0.013–0.985)0.0480.287
HMCN1640.25111.441 (2.065–63.406)0.0050.090*
USH2A630.2831.185 (0.245–5.74)0.8330.999
RYR2620.2910.059 (0.003–1.008)0.0510.287
SYNE1560.2721.635 (0.11–24.181)0.7210.999
FLG530.2640.342 (0.04–2.923)0.3270.696
SPTA1520.2641.965 (0.531–7.274)0.3120.696
DMD510.2611.87 (0.396–8.831)0.4290.811
NEB500.2781.124 (0.135–9.342)0.9140.999
ZFHX4500.264---

Abbreviations: 95% CI, 95% confidence interval; VAF, variant allele frequency; FDR, false discovery rate.

aAstarisk indicates FDR < 0.1.

bZFHX4 VAF could not be analyzed because sample size is small.

Figure 1

Frequently mutated genes and mean variant allele frequencies (VAFs)

The scatter plot depicts 17 genes harboring mutations in > 50 samples. The x-axis indicates the mean VAF, and the y-axis indicates the number of samples with mutations. The plot at right is an enlargement of the square area enclosed by dotted lines in the left plot.

Abbreviations: 95% CI, 95% confidence interval; VAF, variant allele frequency; FDR, false discovery rate. aAstarisk indicates FDR < 0.1. bZFHX4 VAF could not be analyzed because sample size is small.

Frequently mutated genes and mean variant allele frequencies (VAFs)

The scatter plot depicts 17 genes harboring mutations in > 50 samples. The x-axis indicates the mean VAF, and the y-axis indicates the number of samples with mutations. The plot at right is an enlargement of the square area enclosed by dotted lines in the left plot. To examine the association of overall survival (OS) with VAFs of these 17 genes, we applied a Cox proportional hazards regression analysis with the covariates of age, tumor grade, and VAFs. In this analysis, the samples were divided into two groups using a VAF of 0.30 (i.e., 30%) as a cutoff. We used this cutoff because a previous study, which focused on samples with high tumor purity (≥ 70%), considered that a VAF of ≥ 0.25 was more likely to be clonal, whereas lower values were more likely to be subclonal [20]. Assuming that average purity is 85% (range 70–100%) then the cutoff should be 0.3 (0.25/0.85 = 0.294). We conducted this analysis without adjusting for other covariates just for a screening of genes that are possibly associated with breast cancer prognosis. We corrected P values for multiple testing using Benjamini and Hochberg false discovery rate (FDR) [26]. VAFs of HMCN1 was found to be possibly associated with breast cancer prognosis (FDR < 0.1) (Table 1), and we focused on HMCN1, for which no association with breast cancer has previously been reported. A total of 78 somatic mutations in HMCN1 were detected in 6.1% (64/1,044) of samples (Table 2 and Figure 2). Among samples with detectable HMCN1 mutations, 9.4% (6/64) contained two distinct mutations and 3.1% (2/64) contained more than two mutations. Of the 78 HMCN1 mutations, 82.1% (64/78) were missense, whereas 10.3% (8/78) were nonsense and 7.7% (6/78) were indels. Furthermore, 69.2% (54/78) of the mutations were clustered in the Ig-like C2-type domains of HMCN1.
Table 2

Distribution of HMCN1 mutations

HMCN1 DomainsMissense mutationNonsense mutationDeletionInsertionTotal
VWFA domain20002
Ig-like C2-type domains4552254
TSP type-1 domains81009
Nidogen G2 beta-barrel domain11002
EGF-like domains30003
Other51118
Total6483378

Abbreviations: EGF, epidermal growth factor; Ig, immunoglobulin; TSP, thrombospondins; VWFA, von Willebrand factor type A.

Figure 2

A schematic of the domains of human HMCN1 (hemicentin-1)

The types and positions of 78 somatic mutations are indicated above the diagram.

Abbreviations: EGF, epidermal growth factor; Ig, immunoglobulin; TSP, thrombospondins; VWFA, von Willebrand factor type A.

A schematic of the domains of human HMCN1 (hemicentin-1)

The types and positions of 78 somatic mutations are indicated above the diagram. To evaluate the existence of any association between the mutation type and VAFs, we applied a one-way ANOVA to the data and found that the mean VAF values did not significantly differ among the four types of mutations (P = 0.430) (Supplementary Figure 1). We further evaluated the associations of the four molecular breast cancer subtypes with VAFs of HMCN1. However, an ANOVA indicated that the mean VAF values did not significantly differ among the four subtypes (P = 0.060) (Supplementary Figure 2).

Expression of HMCN1

We compared the relative HMCN1 mRNA expression levels among samples with higher (VAF of ≥ 0.30, n = 19) and lower VAFs (VAF of < 0.30, n = 45). As a result, we found that HMCN1 expression levels did not significantly differ between the two groups (P = 0.343) (Supplementary Figure 3A). Additionally, we compared TP53 and PIK3CA expression levels between the VAF groups and found no significant differences in either (P = 0.515 and 0.300, respectively) (Supplementary Figure 3B and 3C). We also found no significant differences in the relative HMCN1 mRNA expression levels between HMCN1 mutant and wild-type samples (P = 0.984) (Supplementary Figure 3D). To identify genes for which the expression levels were associated with the HMCN1 VAF, we analyzed mRNA expression levels of all annotated genes. Among the annotated genes, only two significantly exhibited different expression in terms of the HMCN1 VAF. A high CA9 and CASP14 expression level (P = 0.043 and 0.024, respectively) and low MTRNR2L1 and TCN1 expression level (P = 0.024 and 0.043, respectively) were found to be significantly associated with a higher VAF (Figure 3). CA9 encodes carbonic anhydrase IX, an endogenous marker of hypoxic cells in breast cancers. CASP14 encodes caspase14, which is one of the apoptosis-related cysteine peptidase. MTRNR2L1 encodes human MT-RNR2-like 1, for which detailed functions remain unknown and TCN1 encodes a member of the vitamin B12-binding protein family, named “transcobalamin 1”.
Figure 3

CA9 and MTRNR2L1 mRNA expression according to HMCN1 variant allele frequencies (VAFs)

Samples were divided into two groups using a VAF cutoff of 0.30 (< 0.30, n = 45 and ≥ 0.30, n = 19). The asterisk indicates statistical significance.

CA9 and MTRNR2L1 mRNA expression according to HMCN1 variant allele frequencies (VAFs)

Samples were divided into two groups using a VAF cutoff of 0.30 (< 0.30, n = 45 and ≥ 0.30, n = 19). The asterisk indicates statistical significance.

Relationship between intratumor heterogeneity and HMCN1 VAFs

To investigate intratumor heterogeneity in individuals with HMCN1 mutations, we measured the number of subclones across each sample. We performed this analysis using the SciClone [16] with VAFs from somatic SNVs and copy number estimates. We compared between the higher and the lower HMCN1 VAF groups in terms of the number of subclones by Fisher's exact test. The distributions of the number of subclones were not significantly different between the two groups (P = 0.347) (Supplementary Figure 4A). There is another index for intratumor heterogeneity, mutant-allele tumor heterogeneity (MATH), which can be calculated from VAF distribution in a sample [27]. Previous studies have shown that higher MATH score was correlated with poor prognosis in head and neck squamous cell carcinoma and colon cancer [27, 28]. We compared between the higher and the lower HMCN1 VAF groups in terms of MATH scores by Wilcoxon signed rank test. In the 19 samples with higher HMCN1 VAF, the mean MATH was 35.014 (SD = 11.300). Meanwhile, in the 45 samples with lower HMCN1 VAF, the mean MATH was 33.919 (SD = 10.557). No significant differences between the two groups were detected (P = 0.771) (Supplementary Figure 4B). These findings indicate that the prevalence of the mutations in HMCN1 might not be involved in the status of intratumor heterogeneity.

Associations of HMCN1 with common driver genes

As TP53 and PIK3CA mutations are among the most common genetic aberrations in breast cancers [23], we compared the VAFs of these two driver genes with those of HMCN1. Among the 64 samples harboring mutations in HMCN1, 22 and 23 also harbored mutations in TP53 and PIK3CA, respectively, and five harbored mutations in both genes. In the 22 samples with both TP53 and HMCN1 mutations, the mean TP53 VAF was 0.697 (SD = 0.249) and the mean HMCN1 VAF was 0.288 (SD = 0.201). Meanwhile, in the 23 samples with both PIK3CA and HMCN1 mutations, the mean PIK3CA VAF was 0.442 (SD = 0.269) and the mean HMCN1 VAF was 0.230 (SD = 0.148). A paired t-test showed that VAFs of the two driver genes were significantly higher than that of HMCN1 (TP53; P < 0.01, PIK3CA; P < 0.01) (Supplementary Figure 5), indicating that mutations in HMCN1 occurred later in the tumor evolutionary process than the mutations in TP53 and PIK3CA. This finding suggests that the mutations in HMCN1 might be involved in breast cancer progression.

HMCN1 mutations and clinical outcomes

Next, we evaluated the association between HMCN1 mutation status and clinical variables by χ2 test or Fisher's exact test. We found tumor size (P = 0.028) and molecular subtype (P = 0.021) were related with the HMCN1 mutation (Table 3). To assess the relationship of the HMCN1 VAF with prognosis, the 64 samples harboring HMCN1 mutations were divided into two groups according to VAFs and subjected to an OS analysis. These groups were also compared with individuals without HMCN1 mutations (wild-type; WT). The resulting Kaplan–Meier plot shows that a higher HMCN1 VAF significantly correlated with poor prognosis (log-rank test: vs. WT; P = 0.022 and vs. VAF of < 0.30; P = 0.015) (Figure 4). Concordantly, in a multivariate Cox proportional hazards regression analysis adjusted for the covariates of lymph node status, tumor grade, tumor size, and age, the VAF (P = 0.036) and lymph node status (P = 0.012) were significantly associated with poor prognosis (Table 4).
Table 3

Clinical data of 1,044 breast cancer patients

VariablesOverallWTMTP-valuea
n = 1,044n = 980n = 64
No. (%)No. (%)No. (%)
Lymh node status0.175
Negative485 (46.5)450 (45.9)35 (54.7)
Positive540 (51.7)513 (52.3)27 (42.2)
Unknown19 (1.8)17 (1.7)2 (3.1)
Tumor grade0.080
1172 (16.5)163 (16.6)9 (14.1)
2582 (55.7)536 (54.7)46 (71.9)
3239 (22.9)231 (23.6)8 (12.5)
420 (1.9)19 (1.9)1 (1.6)
Unknown31 (3.0)31 (3.2)0
Tumor size (cm)0.028*
< 2267 (25.6)255 (26.0)12 (18.8)
2–5603 (57.8)556 (56.7)47 (73.4)
≧ 5171 (16.4)166 (16.9)5 (7.8)
Unknown3 (0.3)3 (0.3)0
Molecular subtype0.021*
Luminal A401 (38.4)385 (39.3)16 (25.0)
Luminal B171 (16.4)163 (16.6)8 (12.5)
HER2-enriched65 (6.2)58 (5.9)7 (10.9)
Basal-like132 (12.6)123 (12.6)9 (14.1)
Normal23 (2.2)19 (1.9)4 (6.3)
Unknown252 (24.1)232 (23.7)20 (31.3)
Age (year)0.450
Median (range)59 (27–90)59 (27–90)59 (34–90)
< 50276 (26.4)261 (26.6)15 (23.4)
≧ 50743 (71.2)649 (66.2)49 (76.6)
Unknown25 (2.4)25 (2.6)0 (0)

Abbreviations: 95% CI, 95% confidence interval; MT; HMCN1 mutant, VAF; variant allele frequency; WT, wild-type.

aAstarisk indicates statistical significance.

Figure 4

Kaplan–Meier analysis of overall survival according to HMCN1 variant allele frequencies (VAFs)

Samples were divided into three groups using a VAF cutoff of 0.30 (< 0.30, red, n = 45 and ≥ 0.30, blue, n = 19) or WT (black, n = 968). The log-rank test was used to evaluate the statistical significance of the difference between the two survival curves (VAF of ≥ 0.30 vs. VAF of < 0.30, VAF of < 0.30 vs. WT and VAF of ≥ 0.30 vs WT).

Table 4

Multivariate Cox proportional hazards regression analysis of overall survival according to the clinical characteristics of 64 breast cancer patients

VariablesHazard Ratio (95% CI)P-valuea
Lymph node status
 Positive vs Negative97.931 (2.709–3539.805)0.012*
Tumor grade
 3–4 vs 1–29.468 (0.042–2147.487)0.417
Tumor size, cm
 2–5 vs ≤ 21.281 (0.159–10.302)0.816
 > 5 vs ≤ 229.032 (0.284–2966.692)0.154
Age
 ≥ 50 vs < 500.114 (0.011–1.169)0.068
VAF
 ≥ 0.30 vs < 0.3017.950 (1.216–264.976)0.036*

Abbreviations: 95% CI, 95% confidence interval; VAF, variant allele frequency.

aAstarisk indicates statistical significance.

Abbreviations: 95% CI, 95% confidence interval; MT; HMCN1 mutant, VAF; variant allele frequency; WT, wild-type. aAstarisk indicates statistical significance.

Kaplan–Meier analysis of overall survival according to HMCN1 variant allele frequencies (VAFs)

Samples were divided into three groups using a VAF cutoff of 0.30 (< 0.30, red, n = 45 and ≥ 0.30, blue, n = 19) or WT (black, n = 968). The log-rank test was used to evaluate the statistical significance of the difference between the two survival curves (VAF of ≥ 0.30 vs. VAF of < 0.30, VAF of < 0.30 vs. WT and VAF of ≥ 0.30 vs WT). Abbreviations: 95% CI, 95% confidence interval; VAF, variant allele frequency. aAstarisk indicates statistical significance. To exclude the possibility that this significant association is not attributable to bias in the impact of mutations in the two groups, we evaluated the impact of single nucleotide variants in HMCN1 on protein structure and function using PolyPhen-2 scores [29]. The Pearson correlation coefficient between VAFs and PolyPhen-2 scores of HMCN1 was −0.204 (P = 0.151), indicating no significant correlation. To analyze the relationship between prognosis and PolyPhen-2 scores, the 64 samples were divided into two groups using a PolyPhen-2 score of 0.85 as a cutoff (higher, n = 27 and lower, n = 24); this score ranges from 0 to 1 and yields predictions of “probably damaging” (> 0.85), “possibly damaging” (0.85–0.15), or “benign” (< 0.15). We found that the Polyphen-2 score of nonsynonymous HMCN1 mutations did not significantly associate with breast cancer prognosis (PolyPhen-2 scores of < 0.85 vs. WT; P = 0.801 and PolyPhen-2 scores of ≥ 0.85 vs WT; P = 0.671) (Supplementary Figure 6). These results suggest that the HMCN1 VAF is an independent prognostic factor for OS, such that a higher VAF may be associated with poor survival in patients with breast cancer.

Correlations with potential prognostic factors

We next evaluated the association of the HMCN1 VAF with individual clinical characteristics (lymph node status, tumor grade, tumor size, and age) in the 64 tumor samples, which were divided into three groups by lymph node status: N0, N1, and N2–N3. Samples were also divided into three groups by tumor grade: grades 1, 2, and 3–4. Regarding lymph node status, tumor grade, and tumor size, we examined whether a higher VAF was associated with significantly higher stages of clinical features using the Cochran–Armitage trend test. We found a significant association of a higher VAF with a much higher lymph node status (P = 0.029) (Figure 5A). By contrast, the tumor grade (P = 0.151) and tumor size (P = 0.283) were not significantly associated with the HMCN1 VAF (Figure 5B and 5C). The mean ages of patients (n = 64) in the higher and lower VAF groups were 58.05 (SD = 17.95) years and 61.41 (SD = 11.85) years, respectively. A t-test revealed no significant difference in the mean age between the groups (P = 0.461) (Figure 5D).
Figure 5

Associations of the HMCN1 variant allele frequency (VAF) with clinical characteristics of (A) lymph node status, (B) tumor grade, (C) tumor size, and (D) patient age. Samples were divided into two groups using a VAF cutoff of 0.30 (< 0.30, red, n = 45 and ≥ 0.30, blue, n = 19). Blue and red squares in mosaic plots indicate sample counts from the higher and lower VAF groups, respectively. In the TCGA dataset, the lymph node status for two cases were not provided. The asterisk indicates statistical significance.

Associations of the HMCN1 variant allele frequency (VAF) with clinical characteristics of (A) lymph node status, (B) tumor grade, (C) tumor size, and (D) patient age. Samples were divided into two groups using a VAF cutoff of 0.30 (< 0.30, red, n = 45 and ≥ 0.30, blue, n = 19). Blue and red squares in mosaic plots indicate sample counts from the higher and lower VAF groups, respectively. In the TCGA dataset, the lymph node status for two cases were not provided. The asterisk indicates statistical significance. We also used other 15 types of cancer dataset from TCGA and examined the association between HMCN1 VAFs and OS. In the 15 types of cancer, only cervical squamous cell carcinoma and endocervical adenocarcinoma (CESC) samples showed significantly poorer OS in the samples with higher HMCN1 VAFs (n = 7, VAF ≥ 0.30) than those with lower VAFs (n = 15, VAF < 0.30) (log-rank test: P = 0.048) (Supplementary Figure 7). Although, we applied a Cox proportional hazards regression analysis with the covariates of age, tumor grade, and HMCN1 VAFs, the VAFs were not associated with poor prognosis (HR = 5.436, 95% CI: 0.543–54.432, P = 0.150).

DISCUSSION

Breast cancers are known to exhibit a large degree of genetic heterogeneity. In this study, we analyzed intratumor heterogeneity using VAFs calculated from a set of breast cancer cases registered in TCGA. Through a VAF-based analysis of mutations, we showed that VAFs of HMCN1 was possibly associated with breast cancer prognosis. Although the detailed function of HMCN1 in humans remains unknown, mutations in HMCN1 might be associated with cancer cell invasion and metastasis. Using the data of the DRIVE dataset, the CIMBA dataset, and the Foundation One dataset from breast cancer patients, we could not validate our result because the HMCN1 VAF and survival time information could not be obtained. In cervical cancer from TCGA, however, survival time between the two groups of HMCN1 VAF values were also significantly different. Cervical cancer, like breast cancer, is known to associated with hormone estrogen. Therefore, this result may support the prognostic impact of HMCN1 on breast cancer. It will be possible to further evaluate the validity of our results by accumulating more cohorts. HMCN1 encodes a large extracellular protein belonging to the immunoglobulin superfamily and comprises several distinct domains, including the von Willebrand factor and Ig-like C2-type domains [30]. HMCN1 mutations are believed to correlate with age-related macular degeneration [31]. According to another recent report, HMCN1 acts as a suppressor of gallbladder cancer metastasis [32] and is commonly mutated in certain samples of head and neck squamous cell carcinoma [33]. There are at least three possible functional implications of the mutations in HMCN1 in breast cancer metastasis. First, HMCN1 is also known as FBLN6 (fibulin 6) and one of the extra cellular matrix (ECM) proteins [34, 35]. The fibulins are shown to be involved in basement membrane and formation of stable cell-to-cell interactions, leading to organization and stabilization to ECM structure [36]. When HMCN1 does not function properly in cancer cell, sufficient cell adhesion might be inhibited and as a result of promoting cancer invasion due to instability of HMCN1 caused by the variants in the gene. For example, previous study reported that epigenetically silenced fibulin 5 promotes invasion and metastasis in lung cancer [37]. Second, HMCN1, which contains estrogen receptor binding site, seems to be associated with postpartum depression symptoms [38], and one of its functions is suggested to be cell adhesion [39]. Therefore, HMCN1 mutations may be associated to cancer proliferation and metastasis because of the disruption of these functions. Finally, other studies have shown that HMCN1 might interact with DDX1 [40], a DEAD box protein with RNA helicase activity [41, 42]. Notably, the expression of DDX1 was reported to decrease under hypoxic conditions [43], and intratumor hypoxia is associated with cancer metastasis and, consequently, patient mortality [44, 45]. Indeed, an earlier report found that DDX1 correlated with ovarian tumor metastasis and progression [46]. In the current study, we found that CA9, the expression of which is also associated with tumor hypoxia [43, 47], was expressed at significantly higher levels in patients with higher HMCN1 VAFs than in those with lower VAFs. Therefore, the HMCN1 VAF may indicate the metastatic potential of a breast cancer. Although metastasis is the main cause of death among breast cancer patients, factors involved in metastasis remain poorly characterized. It is more difficult to identify genetic factors associated with metastasis, a complex process, than to identify driver genes [48]. Differences in genetic heterogeneity between metastatic and primary tumors may affect treatment efficacy and thus represent one of the biggest obstacles toward cure for breast cancer. Using VAFs to explicitly address heterogeneity, we successfully identified HMCN1 as a possible metastatic factor. Although further experimental validation is needed to determine the involvement of HMCN1 in metastasis, this approach could be used to screen genes that have not previously been investigated. In this study, we focused on nonsynonymous mutations and indels. However, intratumor heterogeneity may also be caused by copy number variants or mutations in noncoding regions, such as those in cis-regulatory elements and splice sites [49-52]. Moreover, epigenetic alterations can also promote cancer progression [53]. As genome-wide epigenetic datasets from normal cells grow rapidly [54], epigenome data analyses of cancer cells will allow evaluations of the impacts of epigenetic factors on intratumor heterogeneity. In conclusion, to our knowledge, this is the first study to identify HMCN1 as a potential metastatic factor in breast cancer using a comparative analysis of genomic and transcriptomic data registered in TCGA. In addition to the standard classification of breast tumors based on the four molecular types, the use of VAFs, which reflect tumor evolution, might provide further genetic profile information that can be used to characterize tumor samples. Our approach allows us to identify new diagnostic markers or candidate genes for targeted therapy and is therefore expected to facilitate precision medicine.

MATERIALS AND METHODS

Datasets

A total of 1,080 RNA-seq and variant datasets from breast cancers were downloaded from TCGA (https://portal.gdc.cancer.gov/). For variant data, we used VCF files generated by comparing matched tumor–normal pairs using the Mutect2 software package. We also downloaded the associated clinical patient data. Similarly, dataset from other 15 types of cancer were obtained from TCGA: Bladder urothelial carcinoma (BLCA; n = 416), cervical squamous cell carcinoma and endocervical adenocarcinoma (CESC; n = 307), colon adenocarcinoma (COAD; n = 605), glioblastoma multiforme (GBM; n = 938), head and neck squamous cell carcinoma (HNSC; n = 512), kidney renal clear cell carcinoma (KIRC; n = 697), lower grade glioma (LGG; n = 938), liver hepatocellular carcinoma (LIHC; n = 378), lung adenocarcinoma (LUAD; n = 587), lung squamous cell carcinoma (LUSC; n = 503), ovarian serous cystadenocarcinoma (OV; n = 443), prostate adenocarcinoma (PRAD; n = 503), skin cutaneous melanoma (SKCM; n = 472), thyroid carcinoma (THCA; n = 504), and uterine corpus endometrial carcinoma (UCEC; n = 604).

Mutation analysis

In this study, we only considered mutations with a coverage depth of ≥ 20. We extracted gene mutations [i.e., nonsynonymous substitutions (missense and nonsense mutations) and indels (frameshift insertions and frameshift deletions)] observed in ≥ 50 samples. We used this cutoff because the lower limit of the average mutation rate for significantly mutated genes was approximately 2–4% [23]. VAFs were calculated as the proportion of variant allele reads to total reads at the mutation site. When a sample harbored multiple mutations in the same gene, the larger VAF was used as the VAF for the gene. The VAF was adjusted for tumor purity estimate. This estimate, which was derived from immunohistochemistry analysis, was downloaded from the previous study [25]. The number of subclones in a tumor cell were inferred by both VCF files and DNA copy number variation data using the R package SciClone with default settings [16].

Statistical analysis

Statistical analysis was conducted using the R software, version 3.3.1 (R Project for Statistical Computing, Vienna, Austria), and JMP Pro, version 13.0 (SAS Institute Inc., Cary, NC, USA). A χ2 test or Fisher's exact test (when ≥ 1 cells had an expected frequency of ≤ 5 in any clinical group) was used to evaluate the relationships between the mutation status and clinical variables. We also used the test for comparison the number of subclones. OS was estimated using the Kaplan–Meier method in the R survival package (version 2.41–3). For the multivariate analysis, adjusted hazard ratios (HRs) with 95% confidence intervals (95% CIs) were calculated using a Cox proportional hazards regression model. We used edgeR (version 3.16.5), a Bioconductor package (http://www.bioconductor.org/packages/release/bioc/html/edgeR.html), to detect genes differentially expressed between two groups. For each gene, the R exactRankTests package (version 0.8–28) was used to evaluate the difference in both expression levels and MATH between groups of samples. For categorical data such as tumor grade, tumor size category, and lymph node status, we used a one-sided Cochran–Armitage trend test to evaluate the existence of a linear relationship in terms of VAFs. We used Welch's t-test to compare continuous data between the two groups. An analysis of variance (ANOVA) model was used to compare the mean values of more than two groups. P-values were considered statistically significant at < 0.05 (*P < 0.05, **P < 0.01).
  53 in total

1.  Association of human DEAD box protein DDX1 with a cleavage stimulation factor involved in 3'-end processing of pre-MRNA.

Authors:  S Bléoo; X Sun; M J Hendzel; J M Rowe; M Packer; R Godbout
Journal:  Mol Biol Cell       Date:  2001-10       Impact factor: 4.138

Review 2.  Tumor metastasis: molecular insights and evolving paradigms.

Authors:  Scott Valastyan; Robert A Weinberg
Journal:  Cell       Date:  2011-10-14       Impact factor: 41.582

3.  Expression of fibulin-6 in failing hearts and its role for cardiac fibroblast migration.

Authors:  Arpita Chowdhury; Christine Herzog; Lisa Hasselbach; Houra Loghmani Khouzani; Jinli Zhang; Matthias Hammerschmidt; Carsten Rudat; Andreas Kispert; Matthias Gaestel; Manoj B Menon; Igor Tudorache; Denise Hilfiker-Kleiner; Christian Mühlfeld; Jan Dieter Schmitto; Martin Müller; Gregor Theilmeier
Journal:  Cardiovasc Res       Date:  2014-06-20       Impact factor: 10.787

4.  Hypoxia-inducible expression of tumor-associated carbonic anhydrases.

Authors:  C C Wykoff; N J Beasley; P H Watson; K J Turner; J Pastorek; A Sibtain; G D Wilson; H Turley; K L Talks; P H Maxwell; C W Pugh; P J Ratcliffe; A L Harris
Journal:  Cancer Res       Date:  2000-12-15       Impact factor: 12.701

Review 5.  Role of non-coding sequence variants in cancer.

Authors:  Ekta Khurana; Yao Fu; Dimple Chakravarty; Francesca Demichelis; Mark A Rubin; Mark Gerstein
Journal:  Nat Rev Genet       Date:  2016-01-19       Impact factor: 53.242

6.  International network of cancer genome projects.

Authors:  Thomas J Hudson; Warwick Anderson; Axel Artez; Anna D Barker; Cindy Bell; Rosa R Bernabé; M K Bhan; Fabien Calvo; Iiro Eerola; Daniela S Gerhard; Alan Guttmacher; Mark Guyer; Fiona M Hemsley; Jennifer L Jennings; David Kerr; Peter Klatt; Patrik Kolar; Jun Kusada; David P Lane; Frank Laplace; Lu Youyong; Gerd Nettekoven; Brad Ozenberger; Jane Peterson; T S Rao; Jacques Remacle; Alan J Schafer; Tatsuhiro Shibata; Michael R Stratton; Joseph G Vockley; Koichi Watanabe; Huanming Yang; Matthew M F Yuen; Bartha M Knoppers; Martin Bobrow; Anne Cambon-Thomsen; Lynn G Dressler; Stephanie O M Dyke; Yann Joly; Kazuto Kato; Karen L Kennedy; Pilar Nicolás; Michael J Parker; Emmanuelle Rial-Sebbag; Carlos M Romeo-Casabona; Kenna M Shaw; Susan Wallace; Georgia L Wiesner; Nikolajs Zeps; Peter Lichter; Andrew V Biankin; Christian Chabannon; Lynda Chin; Bruno Clément; Enrique de Alava; Françoise Degos; Martin L Ferguson; Peter Geary; D Neil Hayes; Thomas J Hudson; Amber L Johns; Arek Kasprzyk; Hidewaki Nakagawa; Robert Penny; Miguel A Piris; Rajiv Sarin; Aldo Scarpa; Tatsuhiro Shibata; Marc van de Vijver; P Andrew Futreal; Hiroyuki Aburatani; Mónica Bayés; David D L Botwell; Peter J Campbell; Xavier Estivill; Daniela S Gerhard; Sean M Grimmond; Ivo Gut; Martin Hirst; Carlos López-Otín; Partha Majumder; Marco Marra; John D McPherson; Hidewaki Nakagawa; Zemin Ning; Xose S Puente; Yijun Ruan; Tatsuhiro Shibata; Michael R Stratton; Hendrik G Stunnenberg; Harold Swerdlow; Victor E Velculescu; Richard K Wilson; Hong H Xue; Liu Yang; Paul T Spellman; Gary D Bader; Paul C Boutros; Peter J Campbell; Paul Flicek; Gad Getz; Roderic Guigó; Guangwu Guo; David Haussler; Simon Heath; Tim J Hubbard; Tao Jiang; Steven M Jones; Qibin Li; Nuria López-Bigas; Ruibang Luo; Lakshmi Muthuswamy; B F Francis Ouellette; John V Pearson; Xose S Puente; Victor Quesada; Benjamin J Raphael; Chris Sander; Tatsuhiro Shibata; Terence P Speed; Lincoln D Stein; Joshua M Stuart; Jon W Teague; Yasushi Totoki; Tatsuhiko Tsunoda; Alfonso Valencia; David A Wheeler; Honglong Wu; Shancen Zhao; Guangyu Zhou; Lincoln D Stein; Roderic Guigó; Tim J Hubbard; Yann Joly; Steven M Jones; Arek Kasprzyk; Mark Lathrop; Nuria López-Bigas; B F Francis Ouellette; Paul T Spellman; Jon W Teague; Gilles Thomas; Alfonso Valencia; Teruhiko Yoshida; Karen L Kennedy; Myles Axton; Stephanie O M Dyke; P Andrew Futreal; Daniela S Gerhard; Chris Gunter; Mark Guyer; Thomas J Hudson; John D McPherson; Linda J Miller; Brad Ozenberger; Kenna M Shaw; Arek Kasprzyk; Lincoln D Stein; Junjun Zhang; Syed A Haider; Jianxin Wang; Christina K Yung; Anthony Cros; Anthony Cross; Yong Liang; Saravanamuttu Gnaneshan; Jonathan Guberman; Jack Hsu; Martin Bobrow; Don R C Chalmers; Karl W Hasel; Yann Joly; Terry S H Kaan; Karen L Kennedy; Bartha M Knoppers; William W Lowrance; Tohru Masui; Pilar Nicolás; Emmanuelle Rial-Sebbag; Laura Lyman Rodriguez; Catherine Vergely; Teruhiko Yoshida; Sean M Grimmond; Andrew V Biankin; David D L Bowtell; Nicole Cloonan; Anna deFazio; James R Eshleman; Dariush Etemadmoghadam; Brooke B Gardiner; Brooke A Gardiner; James G Kench; Aldo Scarpa; Robert L Sutherland; Margaret A Tempero; Nicola J Waddell; Peter J Wilson; John D McPherson; Steve Gallinger; Ming-Sound Tsao; Patricia A Shaw; Gloria M Petersen; Debabrata Mukhopadhyay; Lynda Chin; Ronald A DePinho; Sarah Thayer; Lakshmi Muthuswamy; Kamran Shazand; Timothy Beck; Michelle Sam; Lee Timms; Vanessa Ballin; Youyong Lu; Jiafu Ji; Xiuqing Zhang; Feng Chen; Xueda Hu; Guangyu Zhou; Qi Yang; Geng Tian; Lianhai Zhang; Xiaofang Xing; Xianghong Li; Zhenggang Zhu; Yingyan Yu; Jun Yu; Huanming Yang; Mark Lathrop; Jörg Tost; Paul Brennan; Ivana Holcatova; David Zaridze; Alvis Brazma; Lars Egevard; Egor Prokhortchouk; Rosamonde Elizabeth Banks; Mathias Uhlén; Anne Cambon-Thomsen; Juris Viksna; Fredrik Ponten; Konstantin Skryabin; Michael R Stratton; P Andrew Futreal; Ewan Birney; Ake Borg; Anne-Lise Børresen-Dale; Carlos Caldas; John A Foekens; Sancha Martin; Jorge S Reis-Filho; Andrea L Richardson; Christos Sotiriou; Hendrik G Stunnenberg; Giles Thoms; Marc van de Vijver; Laura van't Veer; Fabien Calvo; Daniel Birnbaum; Hélène Blanche; Pascal Boucher; Sandrine Boyault; Christian Chabannon; Ivo Gut; Jocelyne D Masson-Jacquemier; Mark Lathrop; Iris Pauporté; Xavier Pivot; Anne Vincent-Salomon; Eric Tabone; Charles Theillet; Gilles Thomas; Jörg Tost; Isabelle Treilleux; Fabien Calvo; Paulette Bioulac-Sage; Bruno Clément; Thomas Decaens; Françoise Degos; Dominique Franco; Ivo Gut; Marta Gut; Simon Heath; Mark Lathrop; Didier Samuel; Gilles Thomas; Jessica Zucman-Rossi; Peter Lichter; Roland Eils; Benedikt Brors; Jan O Korbel; Andrey Korshunov; Pablo Landgraf; Hans Lehrach; Stefan Pfister; Bernhard Radlwimmer; Guido Reifenberger; Michael D Taylor; Christof von Kalle; Partha P Majumder; Rajiv Sarin; T S Rao; M K Bhan; Aldo Scarpa; Paolo Pederzoli; Rita A Lawlor; Massimo Delledonne; Alberto Bardelli; Andrew V Biankin; Sean M Grimmond; Thomas Gress; David Klimstra; Giuseppe Zamboni; Tatsuhiro Shibata; Yusuke Nakamura; Hidewaki Nakagawa; Jun Kusada; Tatsuhiko Tsunoda; Satoru Miyano; Hiroyuki Aburatani; Kazuto Kato; Akihiro Fujimoto; Teruhiko Yoshida; Elias Campo; Carlos López-Otín; Xavier Estivill; Roderic Guigó; Silvia de Sanjosé; Miguel A Piris; Emili Montserrat; Marcos González-Díaz; Xose S Puente; Pedro Jares; Alfonso Valencia; Heinz Himmelbauer; Heinz Himmelbaue; Victor Quesada; Silvia Bea; Michael R Stratton; P Andrew Futreal; Peter J Campbell; Anne Vincent-Salomon; Andrea L Richardson; Jorge S Reis-Filho; Marc van de Vijver; Gilles Thomas; Jocelyne D Masson-Jacquemier; Samuel Aparicio; Ake Borg; Anne-Lise Børresen-Dale; Carlos Caldas; John A Foekens; Hendrik G Stunnenberg; Laura van't Veer; Douglas F Easton; Paul T Spellman; Sancha Martin; Anna D Barker; Lynda Chin; Francis S Collins; Carolyn C Compton; Martin L Ferguson; Daniela S Gerhard; Gad Getz; Chris Gunter; Alan Guttmacher; Mark Guyer; D Neil Hayes; Eric S Lander; Brad Ozenberger; Robert Penny; Jane Peterson; Chris Sander; Kenna M Shaw; Terence P Speed; Paul T Spellman; Joseph G Vockley; David A Wheeler; Richard K Wilson; Thomas J Hudson; Lynda Chin; Bartha M Knoppers; Eric S Lander; Peter Lichter; Lincoln D Stein; Michael R Stratton; Warwick Anderson; Anna D Barker; Cindy Bell; Martin Bobrow; Wylie Burke; Francis S Collins; Carolyn C Compton; Ronald A DePinho; Douglas F Easton; P Andrew Futreal; Daniela S Gerhard; Anthony R Green; Mark Guyer; Stanley R Hamilton; Tim J Hubbard; Olli P Kallioniemi; Karen L Kennedy; Timothy J Ley; Edison T Liu; Youyong Lu; Partha Majumder; Marco Marra; Brad Ozenberger; Jane Peterson; Alan J Schafer; Paul T Spellman; Hendrik G Stunnenberg; Brandon J Wainwright; Richard K Wilson; Huanming Yang
Journal:  Nature       Date:  2010-04-15       Impact factor: 49.962

7.  The RNA-binding protein DDX1 promotes primary microRNA maturation and inhibits ovarian tumor progression.

Authors:  Cecil Han; Yunhua Liu; Guohui Wan; Hyun Jin Choi; Luqing Zhao; Cristina Ivan; Xiaoming He; Anil K Sood; Xinna Zhang; Xiongbin Lu
Journal:  Cell Rep       Date:  2014-08-28       Impact factor: 9.423

8.  Analysis of the ARMD1 locus: evidence that a mutation in HEMICENTIN-1 is associated with age-related macular degeneration in a large family.

Authors:  Dennis W Schultz; Michael L Klein; Andrea J Humpert; Christina W Luzier; Vesna Persun; Mitchell Schain; Alison Mahan; Charles Runckel; Maria Cassera; Vasavi Vittal; Trudy M Doyle; Tammy M Martin; Richard G Weleber; Peter J Francis; Ted S Acott
Journal:  Hum Mol Genet       Date:  2003-10-21       Impact factor: 6.150

9.  Intratumor heterogeneity and branched evolution revealed by multiregion sequencing.

Authors:  Marco Gerlinger; Andrew J Rowan; Stuart Horswell; James Larkin; David Endesfelder; Eva Gronroos; Pierre Martinez; Nicholas Matthews; Aengus Stewart; Charles Swanton; M Math; Patrick Tarpey; Ignacio Varela; Benjamin Phillimore; Sharmin Begum; Neil Q McDonald; Adam Butler; David Jones; Keiran Raine; Calli Latimer; Claudio R Santos; Mahrokh Nohadani; Aron C Eklund; Bradley Spencer-Dene; Graham Clark; Lisa Pickering; Gordon Stamp; Martin Gore; Zoltan Szallasi; Julian Downward; P Andrew Futreal
Journal:  N Engl J Med       Date:  2012-03-08       Impact factor: 91.245

10.  Mutational landscape and significance across 12 major cancer types.

Authors:  Cyriac Kandoth; Michael D McLellan; Fabio Vandin; Kai Ye; Beifang Niu; Charles Lu; Mingchao Xie; Qunyuan Zhang; Joshua F McMichael; Matthew A Wyczalkowski; Mark D M Leiserson; Christopher A Miller; John S Welch; Matthew J Walter; Michael C Wendl; Timothy J Ley; Richard K Wilson; Benjamin J Raphael; Li Ding
Journal:  Nature       Date:  2013-10-17       Impact factor: 49.962

View more
  8 in total

1.  Prognostic impact of intra-field heterogeneity in oral squamous cell carcinoma.

Authors:  Andrea Gabusi; Davide Bartolomeo Gissi; Lucio Montebugnoli; Sofia Asioli; Achille Tarsitano; Claudio Marchetti; Tiziana Balbi; Timothy R Helliwell; Maria P Foschini; Luca Morandi
Journal:  Virchows Arch       Date:  2019-08-29       Impact factor: 4.064

2.  Breast Cancer Prognosis Prediction and Immune Pathway Molecular Analysis Based on Mitochondria-Related Genes.

Authors:  Weixu Luo; Yuanshan Han; Xin Li; Zhuo Liu; Pan Meng; Yuhong Wang
Journal:  Genet Res (Camb)       Date:  2022-05-31       Impact factor: 1.375

3.  Integrative analysis of genomic, epigenomic and transcriptomic data identified molecular subtypes of esophageal carcinoma.

Authors:  Mingyang Ma; Yang Chen; Xiaoyi Chong; Fangli Jiang; Jing Gao; Lin Shen; Cheng Zhang
Journal:  Aging (Albany NY)       Date:  2021-02-26       Impact factor: 5.682

4.  Identification of Predictive Biomarkers for Lymph Node Involvement in Obese Women With Endometrial Cancer.

Authors:  Vanessa M López-Ozuna; Liron Kogan; Mahmood Y Hachim; Emad Matanes; Ibrahim Y Hachim; Cristina Mitric; Lauren Liu Chen Kiow; Susie Lau; Shannon Salvador; Amber Yasmeen; Walter H Gotlieb
Journal:  Front Oncol       Date:  2021-07-07       Impact factor: 6.244

5.  Patterns of Mutation Enrichment in Metastatic Triple-Negative Breast Cancer.

Authors:  César H Saravia; Claudio Flores; Luis J Schwarz; Leny Bravo; Jenny Zavaleta; Jhajaira Araujo; Silvia Neciosup; Joseph A Pinto
Journal:  Clin Med Insights Oncol       Date:  2019-08-09

6.  Pan-Cancer Analysis of the Genomic Alterations and Mutations of the Matrisome.

Authors:  Valerio Izzi; Martin N Davis; Alexandra Naba
Journal:  Cancers (Basel)       Date:  2020-07-24       Impact factor: 6.639

7.  Comprehensive Analysis of HMCN1 Somatic Mutation in Clear Cell Renal Cell Carcinoma.

Authors:  Ziqi Gong; Xiaowen Wu; Qian Guo; Haizhen Du; Fenghao Zhang; Yan Kong
Journal:  Genes (Basel)       Date:  2022-07-20       Impact factor: 4.141

8.  Gene Expression Analysis Reveals Prognostic Biomarkers of the Tyrosine Metabolism Reprogramming Pathway for Prostate Cancer.

Authors:  Wei Li; Zhe Lu; Dongqing Pan; Zejian Zhang; Hua He; Jiacheng Wu; Naixiong Peng
Journal:  J Oncol       Date:  2022-07-06       Impact factor: 4.501

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.