Literature DB >> 28659724

Identification of Sets of Cytoskeletal Related and Adhesion-related Coding Region Mutations in the TCGA Melanoma Dataset that Correlate with a Negative Outcome.

John M Yavorski1, Rebecca J Stoll1, Mohammad D Samy1, James A Mauro1, George Blanck1.   

Abstract

BACKGROUND: Relatively little cancer genome atlas data has been associated with clinically relevant stratifications of individual cancers.
RESULTS: Mutations in two subsets of a cytoskeletal related and adhesion-related protein coding region set (CAPCRs) were determined to have strong associations with a negative outcome for melanoma, in-cluding a subset constituted by: DSCAM, FAT3, MUC17 and PCDHGC5 (p < 0.0001).
CONCLUSION: Roles for CAPCR mutations in cancer progression raise a question about the potential dominant negative impact of these mutations for multi-meric subcellular and extra-cellular protein struc-tures.

Entities:  

Keywords:  Cancer genome atlas; Cytoskeletal and cell-adhesion genomics; Extra-cellular matrix; Melanoma metastasis; Melanoma stratification

Year:  2017        PMID: 28659724      PMCID: PMC5476947          DOI: 10.2174/1389202918666170105093953

Source DB:  PubMed          Journal:  Curr Genomics        ISSN: 1389-2029            Impact factor:   2.236


Introduction

One of the most important goals for understanding somatic mutations in cancer is to readily be able to identify the mutations, particularly in circulating tumor cells, which are associated with metastasis and negative prognoses. Once a reasonable subset of mutations is available for such an association, relatively simple, cost-effective and very rapid tests could be developed to assay for these mutations. Availability of these tests would thereby provide for regular monitoring routines, more accurate prognoses, and more refined treatment courses. In addition, certain mutations may represent new targets for drug development to limit metastasis. There is a very high mutation rate among cytoskeletal related and adhesion-related coding regions (CAPCRs) in a variety of cancers [1], and in particular, mutations in this set occur at a higher mutational density, as do oncoproteins, and in rare cases, tumor suppressor proteins [2]. The disproportionately higher level of CAPCR mutations, as well oncoprotein mutations, in cancer samples with relatively few mutations, indicates that the CAPCR and oncoprotein mutations are candidate driver mutations [2]. In addition, in both cases, a wealth of empirical literature has supported a role for these two classes of coding regions in oncogenesis. The cancer genome atlas (TCGA) has provided a large amount of conveniently accessible primary tumor mutations, facilitating the above conclusions regarding driver candidates, among many related conclusions. However mutations from metastatic samples have been less available, most likely because in the case of most cancers, accessing metastatic lesions would require surgery that is not indicated or would be difficult in other ways. However, there have likely been other, paradigm-oriented hindrances to understanding “metastasis driver” mutations. For example, so-called metastasis suppressor proteins differ little if at all, from the standpoint of biochemistry or molecular mechanisms, from classical tumor suppressor proteins [3]. Metastasis suppressor proteins are significantly smaller than classical tumor suppressor proteins, which is consistent with their inactivation late in the tumorigenesis process, i.e, inactivation that is occurring largely due to the stochastic process of mutagenesis that “hits” small targets less frequently [1]. Other work favors signal amplification as a feature of poor cancer prognoses [4], but this work has not been well developed, with the on/off switch of cancer signaling pathways a more commonly researched paradigm. TCGA has established numerous raw sequence files representing melanoma metastases, an efficient goal, given the accessible nature of many melanoma metastases. We have analyzed these files, along with patient outcome data, and determined that CAPCRs are more heavily mutated in metastatic melanoma samples and are very likely associated with a negative patient outcome.

Methods

Analysis of the Mutations in the TCGA Processed Data

The basic processing approach is described in Fig. (). From the TCGA data portal (https://tcga-data.nci.nih.gov/), clinical and somatic mutation data were downloaded for the SKCM dataset, with NIH approval (request #27073-3 for project #6300). The clinical drug file from the clinical data (Biotab) was used to sort barcodes into negative and positive outcome categories based on best treatment response. Clinical progressive disease and stable disease were assigned to the Negative outcome group, while partial response and complete response were assigned to the Positive outcome group. These assignments were made on the basis that a recorded response to treatment, either complete or partial, was a better outcome than either progressive disease or lack of any detectable response to treatment (i.e., stable disease). These groupings do not rule out the possibility that long-term results, not available via the TCGA clinical data files, would not align would negative or positive outcome. The BI_IlluminaGA_DNASeq file was used from the somatic mutation data and was sorted by HUGO symbol. The tumor sample barcodes were truncated to contain only the following characters, TCGA-##-####. The clinical drug and mutation files were then compared based on the clinical barcodes and truncated tumor sample barcodes. Barcodes in the Negative and Positive outcome groups, respectively, that did not appear in the mutation file were eliminated. A list of 42 barcodes was generated; with 28 barcodes in the Negative outcome group and 14 barcodes in the Positive outcome group. The grouping of the barcodes into Negative outcome, from progressive disease and stabile disease, and into positive outcome, from partial and complete response, was necessitated by the low number of samples for the indicated subcategories, e.g., progressive disease. Five sets of coding regions, for analyses of mutation occurrence in the Negative and Positive outcome groups, were established: (i) Cytoskeletal related and adhesion-related proteins coding regions (CAPCRs), established as a group because of previous indications of cancer driver status and because coding regions in this group fall into the top 25 most mutated coding regions for five specific, solid-tissue cancers, particularly indicated in refs [1, 2, 5]. Also, three additional cytoskeletal related and adhesion-related proteins coding regions (DNAH7, GPR98, DSCAM) were added due to their occurrence in among the top 25 most mutated coding regions in SKCM (melanoma). (ii) “other CR (coding regions), commonly mutated”, which represents some proteins involved in cytoskeletal related and adhesion-related functions but which are not part of the cancer-frequent CAPCR set, because these “commonly mutated” coding regions are not among the top most 25 mutated coding regions in the original five cancers in ref. [1] or in SKCM. The other CR, commonly mutated set also includes many other coding regions with no relationship to the cytoskeleton or to adhesion. (iii) A set of large coding regions equal in number and approximately equal to the overall coding region size of the CAPCRs. The members of this latter set (termed, “CAPCR-coding region size control”) have no specific connection to the TCGA SKCM mutation dataset and were selected randomly from a Uniprot list of coding regions and their AA lengths. As with the other CR, commonly mutated set, the CAPCR-coding region size control set also represents proteins involved in cytoskeletal related and adhesion-related functions, but these proteins are not part of the cancer-frequent CAPCR set, i.e., are not found heavily mutated in the five cancers of ref. [1] or in SKCM. Additional gene sets for this study are: (iv) oncoproteins; and (v) tumor suppressor proteins, lists established in refs. [1, 2]. As noted in the previous paragraph, the CAPCR set was previously established in ref. [1], based on the occurrence of each coding region in the top 25 most commonly mutated coding regions among the five TCGA datasets studied in ref. [1], namely, BLCA, COAD, LUAD, GBM, STAD. And, for this project three additional CAPCRs were included in the set due to their occurrence among the top 25 most commonly mutated coding regions in SKCM. The HUGO symbols for the coding regions for all sets are in (Table ). See also SOM file, “SOM Table source file”. A file for each HUGO symbol in each of the above indicated, five coding region sets was created, i.e. the information, from the comprehensive mutation file, for each of the HUGO symbols was copied to a newly generated “individual gene file”. This newly generated individual gene file (examples of which are indicated as “SOM file Fig. ()” in the SOM) includes the truncated tumor sample barcode and mutation type (amino acid altering or silent). The above indicated list of 42 barcodes, representing negative outcomes (28 barcodes) and positive (14 barcodes) outcomes, was copied into each individual gene file and a COUNTIF function was used to determine the number of mutations in the individual coding region for any one barcode in the Negative and Positive outcome groups’ barcode list. A final results file was created for each coding region set. The TCGA barcodes from each individual gene file, that contained at least one mutation that represented the coding region for that gene file, were copied and transferred to a single column in the final results file. The Negative and Positive outcome groups’ barcode lists indicated above were copied into (Column F of) the file used to generate the statistical analyses, termed “SOM Fig. ()”. There is a “SOM Fig. ()” file for each coding region set (in the SOM). A COUNTIF function was used to determine the number of HUGO symbols, from the particular coding region set, that were mutated for each barcode in the Negative and Positive outcome groups. The TTEST function was used to determine the statistical significance of coding region mutations for the Negative and Positive outcome groups (for each coding region set).

Analysis of the Mutations in the Raw Exome Files

The basic processing approach is indicated in Fig. (), and the detailed steps are provided in the SOM file, “SOM Fig. (), detailed protocol”. The original programming code and several output files are also provided in the SOM. Note, two SNP databases (All SNPs (142) and 1000G) were used to filter the originally identified variants in the raw sequence files representing the metastatic and primary melanoma samples, as indicated in (Fig. ). Because there is no SNP information in the 1000G SNP database for GRP98, this coding region is not included in any analyses using the mutations representing the metastatic melanoma, TCGA barcodes.

Results

Association of CAPCR and Oncoprotein Coding Regions with Negative Outcome

The TCGA designations, (i) progressive and stable disease and (ii) complete and partial remission, were grouped as Negative and Positive outcome, respectively (Table ), and the total number of mutations for each barcode in each of the two categories were determined. The Negative outcome group had more mutations, but without a statistically significant difference from the positive outcome group (Table ). The algorithm for establishing the assessment of the mutations in the coding region sets discussed below is shown as a flow chart in Fig. (), with the legend to Fig. () indicating the availability of related documents in the supporting online material (SOM). Keeping in mind the functional degeneracy of many mutations and proteins facilitating cancer [6-8], we established the following five sets of coding regions for analyses of mutation occurrence in the Negative and Positive outcome groups: (i) cytoskeletal related and adhesion-related proteins coding regions (CAPCRs); (ii) other CR (coding regions), commonly mutated; (iii) and a set of large coding regions equal to the coding region size of the cytoskeletal related and adhesion-related proteins but otherwise having no specific, functional connection to the TCGA SKCM dataset (CAPCR-coding region size control); (iv) oncoproteins; (v) tumor suppressor proteins. The HUGO symbols for these five sets are listed in Table . The tumor suppressor set includes both classical tumor suppressor protein coding regions, such as p16, and BRCA1, and coding regions representing what have been previously termed metastasis suppressor proteins. However, there is little biochemical, mechanistic distinction between classical tumor suppressor and metastasis suppressor proteins [3], and thus, for the purpose of applying the functional degeneracy principle, the tumor suppressor set used here includes both types of coding regions. (For further details on the establishment of the above, five coding regions sets, see Methods.) CAPCRs have a long but contradictory history in cancer development [9-15]. In addition, CAPCRs are often very large and are thus subject to extensive mutation, presumably traceable to the highly stochastic aspect of mutagenesis, as indicated by a number of different types of analyses [1, 16]. Thus, the CAPCR-coding region size control was established in an attempt to appreciate the specificity of the CAPCR mutations in the TCGA SKCM data set. As indicated in Table , the CAPCR mutations (in the TCGA processed data) are significantly associated with negative outcome while the CAPCR-coding region size control is not. In the Table analysis, the mutated, coding region-member of a set is counted one time, regardless of the actual number of mutations in that coding region. However, if the Negative and Positive outcome groups are distinguished based on differences in the total number of mutations in their respective CAPCR sets, the results are the same, i.e, CAPCR mutations are significantly associated with the Negative outcome group (data not shown). We also determined that the oncoprotein coding region mutations, and the other CR, commonly mutated set were associated with negative outcome. However, no such association was detected for the tumor suppressor set. The above associations hold with or without consideration of silent mutations in the coding regions, not surprising given the fact that silent mutations consistently represent about 25% of the total mutations in cancer cells, even when only a relatively small number of mutations has occurred [1]. To be certain this was the case for the current study, the ratios of silent mutations to total mutations was determined for the entire SKCM dataset, for negative and positive outcome groups, and for the CAPCR mutations for the preceding two groups, respectively, with the results indicated in Table . The ratios of silent mutations, as predicted, were found to be consistent across all datasets accounting for about 30% of the total mutations. Therefore, the distinction of the functional impact of the mutation type is statistically unnecessary in distinguishing the Negative and Positive outcome groups. However, each gene set for the two groups was compared with the silent mutations removed, with the results indicated in the final row of Table . As expected, given previous studies [1] and the data of Table , the removal of silent mutations does not impact the statistical significance of the association, or lack thereof, of any gene set with the Negative and Positive outcome group. In short, there is a strong stochastic aspect to mutagenesis as represented by the TCGA datasets, as indicated by the very consistent ratio of silent to non-silent mutations over a wide range of TCGA datasets and mutation frequencies represented by the different samples within those datasets [1]. Thus, the silent mutations do not interfere with observing a statistically significant association of “mutation biomarkers” of the above indicated gene sets with Negative outcome.

More CAPCR Mutations in TCGA, Metastatic Melanoma Samples, Compared with Primary Melanoma Samples

Because of the association of CAPCR mutations with the Negative outcome group, we determined whether there were more CAPCR mutations in metastatic melanoma samples vs. primary melanoma samples. To accomplish this goal, we downloaded raw exome sequence files representing both primary and metastatic melanoma samples from the CGHub browser (https://cghub.ucsc.edu/) as described in Methods; in Fig. (); and in the SOM file, “SOM Fig. , detailed protocol”. The exome sequence files were processed as indicated in Fig. (). The range of mutation occurrences is indicated in Table (totaling all mutations for every coding region within the respective CAPCR sets), and a summary of the results for each member of the CAPCR set is indicated in Fig. (). There are indeed about twice as many CAPCR mutations in the metastatic samples (p < 0.0008), consistent with more CAPCR mutations being associated with the Negative outcome group. It is likely that this result is due to an increase in the overall number of mutations in the metastatic samples and the highly stochastic nature of mutagenesis as revealed by previous studies, for example, ref. [1].

ECM-related Proteins are Significantly Mutated in the Negative Outcome Group Representing the Processed TCGA Mutation Data

In the course of the above analyses of the metastatic samples, we noticed a range of ratios of mutation occurrence to number of amino acids in the coding region (Fig. ). We reasoned that, more mutations normalized to coding regions size could suggest cancer driver mutations, i.e., could suggest a slight skewing away from the random impact of the likely, overall increase in mutations for the metastatic samples. Thus, we ranked the CAPCRs with respect to mutations per amino acid lengths (Fig. ) and observed the shift in rank order between the primary and metastatic samples. The CAPCRs with the ten largest rank shifts were found to have significantly higher mutation rates in the Negative outcome group defined and studied above Table . However, coding regions with a decreased rank, from primary to metastatic, showed no statistically significant difference between the Positive and Negative outcome groups. The two groups defined by the rank order shifts also represented different numbers of deleterious amino acids (AA), as determined by use of the PROVEAN tool (http://provean.jcvi.org/index.php), as follows: the average number of deleterious AA replacements for the Negative outcome group was 4.0 per barcode versus 2.29 deleterious AA replacements for the Positive outcome group. However, for the eleven CAPCR coding regions with a decrease in rank (going from primary to metastatic; Table ), the average number of deleterious AA was 0.79 and 0.50 for the Negative and Positive outcome groups, respectively. The use of the PROVEAN tool and the calculation of the averages are available in “SOM file deleterious AA, source info”. A second subset of CAPCR’s was tentatively identified by inspection of the frequencies of individual, mutated CAPCRs in the Negative and Positive outcome groups. This subset was subsequently defined such that a CAPCR was included in this subset if the CAPCR was mutated in a minimum of 50% of the Negative outcome samples and a maximum of 50% in the Positive outcome samples. This subset of four CAPCRs showed a very strong association with the Negative outcome group (Table , p < 1.0 E-4). Furthermore, the average number of deleterious AA replacements, in this CAPCR subset (Table ), was 9.96 for the Negative outcome group and 5.93 for the Positive outcome group.

Discussion

The above data demonstrate an opportunity to establish an association of a subset of cytoskeletal and adhesion-related coding regions with subcategories of melanoma. The subcategory clinical definitions in this case are minimal, due to the TCGA limitations on clinical information, and the approach does not provide a cause and effect connection between cytoskeletal and adhesion-related coding region mutations and a specific course of disease for melanoma. However, the results do strongly suggest that the CAPCR sets and subsets indicated are biomarkers of melanoma subcategories. Furthermore, the CAPCR coding regions studied in this report represent a specificity of association in comparison to control sets, although it is possible, even likely, that other groupings of melanoma would reveal other types of associations, presumably in comparison to other types of control coding region sets. Also, the other-CR, commonly mutated set, effectively and unexpectedly representing a second large set of coding region mutations associated with negative outcome (Table ), includes additional cytoskeletal related and adhesion related coding regions that did not emerge as among the top 25 most mutated coding regions in SKCM. However, it is possible that these coding regions, if mutated, can have a driver effect in SKCM. This is not unexpected, in that certain oncoproteins are rarely mutated in certain cancer types, but can readily replace original oncoproteins as a result of designer drug resistance [8]. We would hypothesize a similar phenomenon here, whereby certain CAPCRs are commonly mutated in SKCM for unknown reasons, perhaps related to exposure of specific segments of DNA to mutagens. And, that less common mutations in the coding regions of other cytoskeletal and adhesion-related proteins could be selected during tumor evolution or during drug treatment and essentially substitute for the cancer driving impact of more commonly mutated CAPCRs. Further specificity will come from rotating CAPCR set members in and out of the above CAPCR group when studying even more detailed clinical parameters not yet available through TCGA. In particular, Guan et al. identified specific, single driver mutations in melanoma, as opposed to the above “either/or” (or subset) approach, and identified one CAPCR not included in any of the sets defined above, as a driver, namely CDH9, a cadherin involved in cell-cell adhesion [17]. However, given the nature of polymer formation and the functional importance of protein-protein interaction in the proper functioning of the cytoskeleton and in cell adhesion, we would expect that the opportunities to identify single mutations as drivers among CAPCR proteins to require an unusually high sample size and to be rarely detectable. And as a consequence, we would expect such mutations to appear as drivers in very, very few cases. However, cooperative protein-protein function does imply successful opportunities to identify driver groups, particularly among CAPCRs [2]. Indeed, it is important to note that CAPCRs are not the only category where grouping can be of use. In particular, both oncoproteins and tumor suppressor proteins can be represented by alternatives within the same cancer datasets [2-4], as has been recently confirmed by the very common selection for alternative oncogenic signaling pathways upon the application of designer drugs [8], as noted above. Cytoskeletal and adhesion-related proteins have long been a topic of cancer research, with early reports indicating that a disorganized cytoskeleton is associated with tumorigenesis [14, 18]. However, later reports have led to contradictory conclusions, particularly with an emphasis on the idea that cytoskeletal organization is a crucial aspect of cell migration and metastasis [13]. Understanding the role of cytoskeletal disorganization in prognoses will likely require a better understanding of distinct sub-functions of the cell cytoskeleton. For example, metastasis has been associated with the cell’s ability to form, Rho-regulated invadopodia, to migrate in experimental settings [11], and to form the components of the cytoskeleton required for migration. On the other hand, the complete lack of dystrophin [19, 20] can lead to a tumor phenotype in mice. There is virtually no understanding of the impact of CAPCR mutations, and ensuing mutant CAPCR proteins, on patient cancer prognosis, not surprising, given the contradictions discussed above and the challenges of studying mutant forms of very large proteins in many experimental settings. Nevertheless, CAPCRs are among the most commonly mutated coding regions in cancer, likely owing to their large sizes [1]. And, protein products of CAPCRs in many cases interact, making it possible that any number of mutant proteins could have a dominant negative effect, akin to the dominant negative effect seen in many other, better studied polymeric or multimeric protein complex-related disorders, e.g., the dominant negative effect seen in osteogenesis imperfecta, due to collagen protein mutations. Thus, in addition to signaling pathway connections to the cytoskeleton, and other broad-based analysis approaches, such as microscopic analyses of the staining of cytoskeletal components, understanding mutated protein components of the cytoskeleton is likely to shed new light on the connection between the cytoskeletal function and cancer. Due to a lack of clinical information overlapping molecular information, TCGA has not facilitated many connections between the two sets of parameters, particularly within a given cancer type. However, in the case of the above study and results, there is an indication that increased numbers of mutations of CAPCRs are inconsistent with a positive outcome in melanoma. Although as presented, and given the available quantity of information, the results are consistent, the results are based on modest clinical information available via TCGA databases and would need to be further investigated with a more controlled clinical setting. For example, it is possible that patients representing the Negative outcome group were treated more aggressively from the beginning of cancer detection with alkylating agents, likely to mutate more, and in particular mutate large coding regions more. In this case, the association between mutation of large coding regions and a negative outcome could be a by-product of the treatment, rather than a biomarker or functional aspect of cancer progression. However, it remains possible that disruption of the cytoskeleton or cell adhesion via expression of mutant cytoskeletal or adhesion related proteins could have effects that lead to more aggressive cancers. While the regulated, or abnormally regulated disassembly and re-formation of the cytoskeleton or ECM may provide explanations for migration or other aspects of cell function, the disruption of the cytoskeleton, particularly via mutant proteins, raises the question of what cell-shape changes could be physically associated with cancer development? Cells that lack the capacity to form an extensive cytoskeleton are likely to be more spheroid in shape, and due to physical attributes alone, less likely to remain attached to surrounding tissue. Also, it remains intriguing that almost all cells selected for drug-resistance in culture are spherical, a result that is often connected to epithelial-mesenchymal transition. However, the very common CAPCR mutations raise the question of whether cell shape changes lead to reduced surface to volume ratios, increased diffusion co-efficients, and therefore less sensitivity to drugs? In the above results, we observed a trend towards a higher number of mutations, normalized to coding region size, in the metastatic samples, for many CAPCRs that function in adhesion, i.e., as extra-cellular matrix (ECM) proteins, e.g, DSCAM and PCDHGC5. Mutation of the ECM subclass of the CAPCRs has a statistically significant association with the Negative outcome group (Table ). A further reduced subset of mutated CAPCRs (DSCAM, FAT3, MUC17 and PCDHGC5; Table ) also had a statistically significant association with the Negative outcome group. This subset also represented ECM components, and overlapped with the subset defined by increased mutation rates in the metastatic samples, with the exception of MUC17 being included in the smaller subset. Interestingly, the deleterious AA rates, as determined by use of the PROVEAN tool, track the mutation rates almost exactly. As would be expected for a functional impact, there are more deleterious AA in the ECM coding regions associated with the Negative outcome group, although there is no apparent selection for deleterious AA independently of the mutation rates (unpublished observations), i.e., there is no apparent increase in the number of deleterious AA per mutation. However, there is an apparent selection for a higher mutation rate in the indicated CAPCR sets (Tables , , ). This mutation rate is apparently largely, random, leading to deleterious AA replacements at the same rate, whether there is a high rate of mutagenesis or a low rate of mutagenesis. This randomness of “deleterious AA related mutagenesis” is consistent with previous work indicating almost no differences between the ratio of silent mutations to AA replacement mutations in cancer samples with high rates of mutations versus low rates of mutation [1], and is consistent with verification of that analysis in this report (Table ). Presumably, with a high enough rate of mutagenesis, there is a high enough accumulation of deleterious AA to have a functional, in this case, negative impact. The increased number of deleterious AA among the ECM subset of the CAPCRs is consistent with the possibility that tissue detachment or tissue instability facilitating metastasis is at the heart of the association of the mutated CAPCRs with a negative outcome. The result also raises questions about the overlap of mutations in ECM proteins and other ECM signatures reported as associated with metastasis, such as the higher level of expression of particular ECM components [21]. For example, is it possible that metastasis selects for the higher level expression of mutant ECM proteins because the increased level of expression of the mutant form would further enhance the disruption of the ECM caused by a structurally defective subunit?

Conclusion

Many CAPCRs are mutated in melanomas that have high numbers of overall mutations, consistent with the significant stochastic effect of mutagenesis and the vulnerability of large coding regions. The CAPCRs can be subdivided into groups of CAPCRs that have a strong association with a negative outcome and relatively higher rates of mutations in metastatic melanoma samples.
Table 1

Establishment of clinical outcome categories from the cancer genome atlas (TCGA), skin cancer cutaneous melanoma (SKCM), clinical data (detailed in the SOM labeled “SOM file Fig1D, eliminate barcodes”).

Clinical Outcome Categories Best Treatment Response Categories from TCGA SKCM Clinical Drug File Number of Barcodes in Each Group
Negative outcomeStable disease4
Progressive disease24
SubTotal28
Positive outcomePartial response1
Complete response13
SubTotal14
Total number of barcodes42
Table 2

Overview of mutations in the negative and positive, cancer genome atlas, skin cancer cutaneous melanoma, outcome groups (detailed in the supporting online material file labeled “SOM file Fig1D, eliminate barcodes”).

Total number of mutations for the Negative outcome group15088
Total number of mutations for the Positive outcome group4479
Average number of mutations for the Negative outcome group538.86
Average number of mutations for the Positive outcome group319.93
p-value0.1111 (Not significant)
Table 3

HUGO symbols for the indicated gene sets (additional information regarding categorization of individual coding regions in methods; more detailed gene information in the supporting online material file “SOM Table 3 source file”.) CAPCR, cytoskeletal-related and adhesion-related protein coding region.

CAPCRs Other-CR, Commonly Mutated CAPCR-coding Region Size Control Oncoprotein Tumor Suppressor
ANK2MUC4abPartsMYH1ABCA13KIAA1109ACVR1AKAP12KISS1R
APCNEBANK3MYH2AHNAKLPAALKAXIN1KLF6
COL11A1NEFHANKRD30AMYO18BAHNAK2LRP1ARAFBMP2LATS2
DNAH10NF1APOBOBSCNAPOBLRP1BBRAFBMPR1BLIMD1
DNAH11PCDH15C15orf2PAPPA2BIRC6MACF1CTNNB1BMPR2MAP2K4
DNAH3PCDHAC2CACNA1EPKHD1L1DNAH17MDN1EGFRBRCA1MED23
DNAH5PCDHGC5CSMD1PTPRTDNAH2MUC2FGFR2BRCA2PBRM1
DNAH7PCLOCSMD2RP1DNAH9MUC5ACFLT3BRMS1PEBP1
DNAH8PKHD1CSMD3RYR1DNHD1MUC5BFRKCASZ1PPAPDC1B
DSCAMPLECDNAH17SCN10ADYNC1H1MYCBP2HRASCDKN2APRDM2
DSTRELNDNAH9SCN11AEPPK1OBSCNJAK2CHD5PTEN
FAT3SPTA1DSPSPHKAPFAT1RNF213KRASCHEK2RB1
FAT4SPTAN1GRIN2ATHSD7BFCGBPRYR1MTORCTCFRECK
FBN2SSPOHYDINTNXBFSIP2RYR2NRASDLC1SMAD4
FLGSYNE1LRP1BTRANK1HERC1RYR3PRKACADOK2SMAD7
GPR98SYNE2MAGEC1UNC13CHERC2SACSRAF1FLCNSMARCB1
MUC16TTNMGAMUSH2AHMCN1UBR4FOXP3SP100
MUC17XIRP2MXRA5ZFHX4HYDINUSH2AGPR68TFPI2
ING1TMPRSS11A
ING4TXNIP
INPP4BVHL
KISS1WWOX
Table 4

Association of mutations in the indicated gene sets with TCGA SKCM negative outcome. (The source data for each coding region set (column) is present in the following SOM files: (i) “SOM file Fig. 1F, CAPCR, neg outcome stats”, (ii) “SOM file Fig1F, Non-CAPCR, common mut., neg outcome stats”, (iii) “SOM file Fig1F, Non-CAPCR, size control, neg outcome stats”, (iv) “SOM file Fig1F, Oncoprotein, neg outcome stats”, (v) “SOM file Fig1F, Tumor Suppressor, neg outcome stats”. The source data for the “silent removed” p-values are present in SOM files: (i) “SOM file Fig1F, CAPCR, neg outcome stats, silents removed”, (ii) “SOM file Fig1F, Non-CAPCR, common mut., neg outcome stats, silents removed”, (iii) “SOM file Fig1F, Non-CAPCR, size control, neg outcome stats, silents removed”, (iv) “SOM file Fig1F, Oncoprotein, neg outcome stats, silents removed”, (v) “SOM file Fig1F, Tumor Suppressor, neg outcome stats, silents removed”). NS = not significant.

Table 4 . Coding Region Sets
CAPCRs Other-CR, Commonly Mutated CAPCR-coding Region Size Control Oncoprotein Tumor Suppressor
Total number of coding regions in set3636361644
Total number of amino acids in coding region set1912261075841867061305834582
Table 4. Coding Region Sets
CAPCRsOther-CR, Commonly MutatedCAPCR-coding Region Size ControlOncoproteinTumor Suppressor
Average amino acid length per coding region531329885186816786
Average number of coding regions per barcode with at least one mutation (Negative outcome group)12.9314.939.041.641.86
Average number of coding regions per barcode with at least one mutation (Positive outcome group)8.869.296.1411.07
p-Value for association of gene set mutations with negative outcome0.04660.04440.1175(NS)0.01660.1127(NS)
p-Value for association of gene set mutations with negative outcome (silent mutations removed)0.04550.04740.2044 (NS)0.03530.0642 (NS)
Table 5

Ratios of silent mutations to total number of mutations (Source data present in “SOM Table. 5, source file”). The data below are in very close agreement with Parry et al. [1] and support the conclusion that mutagenesis in cancer samples has a large stochastic component and the rate of silent mutations largely tracks the rate of amino acid replacement mutations.

Ratio of Silent Mutations to Total Mutations
Entire SKCM dataset0.323
Negative outcome group mutations0.319
Positive outcome group mutations0.304
CAPCR mutations negative outcome0.274
CAPCR mutations positive outcome0.297
Table 6

Range of CAPCR mutations in primary and metastatic, TCGA SKCM raw exome sequence files. (Additional detail, including the TCGA barcodes for each sample, is present in the SOM file labeled “Figure 2G -Total mutations, prim and Met”).

Sample number →Primary 1Primary 2Primary 3Primary 4Primary 5Primary 6Primary 7Primary 8Primary 9Primary 10
Number of mutations24291582295316331433
Sample number →Metastatic 1Metastatic 2Metastatic 3Metastatic 4Metastatic 5Metastatic 6Metastatic 7Metastatic 8Metastatic 9Metastatic 10
Number of Mutations674397154794021287234
Table 7

Association of a reduced CAPCR set, based on ranking shift between the mutations per amino acid ratios of the primary and metastatic samples, with the negative outcome group. (Additional detail is present in the SOM file labeled “SOM Table. 7, source file”).

Increased Rank Shift in mut/AA Ratio From Primary to Metastatic Sample Decreased Rank Shift in mut/AA Ratio From Primary to Metastatic Sample
DNAH7NEFH
FAT3PCDHAC2
SPTAN1SSPO
DNAH8FBN2
DSCAMNF1
PKHD1XIRP2
PLECSYNE2
TTNSPTA1
DNAH5PCDH15
PCDHGC5DNAH10
ANK2
p-value for association with negative outcome0.00120.0602
Table 8

Association of reduced CAPCR set from inspection of negative outcome group. CAPCR coding regions were determined using selection criteria of any coding region with greater than 50% of negative outcome barcodes and less than 50% of positive outcome barcodes having a mutation in that coding region. (Source data present in “SOM Table 8, source file”)

Description CAPCR Coding Regions with Greater Than 50% of Negative Outcome Barcodes and Less Than 50% of Positive Outcome Barcodes with a Mutation
DSCAM
FAT3
MUC17
PCDHGC5
p-value for association with negative outcome8.65E-05
  21 in total

Review 1.  Tumor adaptation and resistance to RAF inhibitors.

Authors:  Piro Lito; Neal Rosen; David B Solit
Journal:  Nat Med       Date:  2013-11       Impact factor: 53.440

2.  Size matters: sequential mutations in tumorigenesis may reflect the stochastic effect of mutagen target sizes.

Authors:  Kimberly Long; Toaa Abuelenen; Libia Pava; Maya Bastille; George Blanck
Journal:  Genes Cancer       Date:  2011-10

3.  Actin cytoskeleton and motility in rat sarcoma cell populations with different metastatic potential.

Authors:  E Pokorná; P W Jordan; C H O'Neill; D Zicha; C S Gilbert; P Veselý
Journal:  Cell Motil Cytoskeleton       Date:  1994

4.  Defective organization of actin in cultured skin fibroblasts from patients with inherited adenocarcinoma.

Authors:  L Kopelovich; S Conlon; R Pollack
Journal:  Proc Natl Acad Sci U S A       Date:  1977-07       Impact factor: 11.205

5.  Mice lacking dystrophin or alpha sarcoglycan spontaneously develop embryonal rhabdomyosarcoma with cancer-associated p53 mutations and alternatively spliced or mutant Mdm2 transcripts.

Authors:  Karen Fernandez; Yelda Serinagaoglu; Sue Hammond; Laura T Martin; Paul T Martin
Journal:  Am J Pathol       Date:  2009-12-17       Impact factor: 4.307

6.  Cytoskeletal F-actin patterns quantitated with fluorescein isothiocyanate-phalloidin in normal and transformed cells.

Authors:  M Verderame; D Alcorta; M Egnor; K Smith; R Pollack
Journal:  Proc Natl Acad Sci U S A       Date:  1980-11       Impact factor: 11.205

7.  Flat cells come full sphere: Are mutant cytoskeletal-related proteins oncoprotein-monsters or useful immunogens?

Authors:  Michele L Parry; George Blanck
Journal:  Hum Vaccin Immunother       Date:  2015-07-30       Impact factor: 3.452

8.  Actin filament organization of the Dunning R3327 rat prostatic adenocarcinoma system: correlation with metastatic potential.

Authors:  J M Zachary; G Cleveland; L Kwock; T Lawrence; R M Weissman; L Nabell; F A Fried; E V Staab; M A Risinger; S Lin
Journal:  Cancer Res       Date:  1986-02       Impact factor: 12.701

Review 9.  ACTB in cancer.

Authors:  Chunmei Guo; Shuqing Liu; Jiasheng Wang; Ming-Zhong Sun; Frederick T Greenaway
Journal:  Clin Chim Acta       Date:  2012-12-22       Impact factor: 3.786

10.  Dystrophin is a tumor suppressor in human cancers with myogenic programs.

Authors:  Yuexiang Wang; Adrian Marino-Enriquez; Richard R Bennett; Meijun Zhu; Yiping Shen; Grant Eilers; Jen-Chieh Lee; Joern Henze; Benjamin S Fletcher; Zhizhan Gu; Edward A Fox; Cristina R Antonescu; Christopher D M Fletcher; Xiangqian Guo; Chandrajit P Raut; George D Demetri; Matt van de Rijn; Tamas Ordog; Louis M Kunkel; Jonathan A Fletcher
Journal:  Nat Genet       Date:  2014-05-04       Impact factor: 38.330

View more
  1 in total

1.  Somatic synonymous mutations in regulatory elements contribute to the genetic aetiology of melanoma.

Authors:  Di Zhang; Junfeng Xia
Journal:  BMC Med Genomics       Date:  2020-04-03       Impact factor: 3.063

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.