Literature DB >> 22057237

Insertional mutagenesis identifies multiple networks of cooperating genes driving intestinal tumorigenesis.

H Nikki March1, Alistair G Rust, Nicholas A Wright, Jelle ten Hoeve, Jeroen de Ridder, Matthew Eldridge, Louise van der Weyden, Anton Berns, Jules Gadiot, Anthony Uren, Richard Kemp, Mark J Arends, Lodewyk F A Wessels, Douglas J Winton, David J Adams.   

Abstract

The evolution of colorectal cancer suggests the involvement of many genes. To identify new drivers of intestinal cancer, we performed insertional mutagenesis using the Sleeping Beauty transposon system in mice carrying germline or somatic Apc mutations. By analyzing common insertion sites (CISs) isolated from 446 tumors, we identified many hundreds of candidate cancer drivers. Comparison to human data sets suggested that 234 CIS-targeted genes are also dysregulated in human colorectal cancers. In addition, we found 183 CIS-containing genes that are candidate Wnt targets and showed that 20 CISs-containing genes are newly discovered modifiers of canonical Wnt signaling. We also identified mutations associated with a subset of tumors containing an expanded number of Paneth cells, a hallmark of deregulated Wnt signaling, and genes associated with more severe dysplasia included those encoding members of the FGF signaling cascade. Some 70 genes had co-occurrence of CIS pairs, clustering into 38 sub-networks that may regulate tumor development.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 22057237      PMCID: PMC3233530          DOI: 10.1038/ng.990

Source DB:  PubMed          Journal:  Nat Genet        ISSN: 1061-4036            Impact factor:   38.330


It is estimated that on average 7-15 somatic mutations are required for a normal human cell to undergo malignant transformation[1-3]. Colorectal cancer progression appears to have an extended clinical course, with mutation of genes including APC, K-RAS, TP53 and SMAD4, being frequent events, whilst genomic analysis has implicated many more genes in the pathogenesis of this disease[4-6]. The diversity of the mutagenic processes that create tumors in humans results in many different types of lesions in the cancer genome such that discriminating cancer driver genes from passengers is often not possible using genome analysis alone[7]. Recently the Sleeping Beauty (SB) DNA transposon system has been used to identify genes involved in the development of intestinal tumorigenesis[8]. Insertional mutagenesis provides the advantage of using engineered elements to initiate mutagenesis largely through the loss or gain of gene function, while tagging potential cancer genes[9]. In the intestine, loss of APC function is widely believed to be an initiating event in the formation of colorectal cancer, with 80% of sporadic tumors having an APC mutation, while germline APC mutations are associated with a predisposition to familial polyposis [4,6,10]. APC normally functions to sequester and target β-catenin for degradation through phosphorylation by GSK-3β. In the absence of APC, β-catenin can freely translocate to the nucleus where it signals an intracellular cascade impacting upon the transcription of hundreds of genes[6]. Here, in mice with somatic or germline mutation of Apc, we performed large-scale insertional mutagenesis using the Sleeping Beauty transposon system to create a comprehensive catalog of new candidate drivers of intestinal tumorigenesis [8,9,11-17]. Several hundred candidate driver genes are implicated, allowing novel genetic interactions between genes and conserved pathways to be explored.

RESULTS

Generation of Mice and Histological Analysis

A cohort of 67 transgenic animals (AhCre;Apc;Rosa26) underwent concomitant somatic heterozygous loss of Apc and activation of SB transposase following Cre induction. Thirty-seven control mice were Apc deficient but unable to mobilize the transposons (AhCre;Apc or AhCre;Apc;Rosa26) (Supplementary Fig. 1-2). All Apc mice eventually developed intestinal tumors, but those with mobilized transposons showed a significantly earlier onset of disease compared to controls (Fig. 1). Tumors mostly occurred in the small intestine, with the occasional lesion found in the caecum and colon (Fig. 1c). In Apc mice the experimental group [Apc;Rosa (10 mice)] developed disease approximately 30 days earlier than control Apc littermates, and had a higher proportion of colonic lesions (Fig. 1b,d).
Figure 1

SB transposon mobilization increases morbidity and tumor burden of Apc-deficient mice

(a) Kaplan-Meier survival curve comparing Apc experimental (Ah-Cre/Apc/Rosa26 (black line) and control mice (Ah-Cre/Apc and Ah-Cre/Apc/Rosa26 (grey line) (log-rank P<0.001). (b) Kaplan-Meier survival curve comparing Apc experimental (Apc/Rosa (black line), Apc control mice (Apc and Apc (grey line) and Rosa control mice (green line) (log-rank P<0.0057). (c) Average polyp size and number in Apc experimental mice (n=3), and matched littermate Apc control mice (n=3) at 18 weeks post-induction. All polyps visible under the dissecting microscope at 6x magnification were counted and grouped into those that were non-excisable (black bars) (<3mm) and those that could be excised for further analysis (>3mm) (grey bars). Average polyp number for small intestine and caecum/colon is shown. (d) Average number of excisable polyps (>3mm) from small intestine, caecum and colon of Apc experimental mice (n=10) and Apc control mice (n=7) at 10-22 and 17-30 weeks of age respectively. β-catenin immunohistochemistry (upper panel of each) and hematoxylin and eosin staining (lower panel of each) of representative tissue from Apc mice showing (e) a tubulovillous adenoma at 18 weeks post-induction, (f) an adenoma at 28 weeks post-induction, (g) an adenoma at 17 weeks post-induction with the arrow indicating the region of adenocarcinoma in the submucosa. (h) Lysozyme staining of representative tissue from an Apc mouse at 22 weeks post-induction, with the arrow indicating Paneth cell differentiation. All scale bars = 100μm.

A total of 446 tumors (of which 367 underwent histopathological review) were excised for analysis of transposon insertion sites from the experimental groups; 391 and 55 from the somatic Apc and germline Apc models, respectively. In addition, 106 (46 somatic, 60 germline) tumors from animals without mobilized transposon were taken for histopathological review. Tumors were larger and more numerous in Apc animals with mobilized transposons (Fig. 1c). The majority of tumors were oligocryptal and tubulovillous adenomas (Supplementary Table 1). However, a proportion of tumors from the somatic model with mobilized transposons showed moderate or severe dysplasia (n=40, 23 mice), while 10 showed evidence of invasion (9 mice) (Fig. 1e-g). Both somatic and germline models produced microsatellite stable tumors (Supplementary Fig. 3).

Identification of Sleeping Beauty Common Insertion Sites (CISs)

In total, 146,249 non-redundant transposon insertion sites (an average of 328 per tumor) were identified after 454 sequencing and mapping to the mouse genome. We employed two statistical frameworks to determine common insertion sites (CISs): Monte Carlo (MC) simulations, and a Gaussian kernel convolution (GKC) algorithm that was optimized to take into account the TA bias of the Sleeping Beauty transposon[8,18,19]. To maximize statistical power, we pooled insertions from both the Apc and Apc models. The concordance between the two statistical methods was 60%, based on overlapping CIS regions; Monte Carlo analysis identified 749 CISs, while GKC analysis with 30 Kb and 120 Kb kernel sizes identified 919 and 641 CISs, respectively, equating to 997 unique CIS regions (867 genes) (Supplementary Tables 2a-d). The results of the GKC analysis were used to define a discovery set for downstream analyses and refinement by comparison to other datasets. Importantly the T2Onc transposon can generate gain-of-function mutations by integrating in or near genes and driving over-expression from the MSCV promoter in the transposon, and also loss-of-function mutations by integrating in genes and hijacking transcription by use of splice acceptors and polyA signals in the transposon. Insertions may also generate neomorphic alleles, or potentially disrupt regulatory elements around cancer gene loci. To identify CISs that may be the result of preferential insertion hotspots, we mobilized transposons as described above and collected a total of 48 tissue samples for insertion site analysis 2 weeks and 10 months after mobilization. Sequencing of the insertions from these tissues yielded 17,358 unique insertions and 35 CISs using GKC analysis. Only 1.8% of CISs called from the tumors overlapped with these control CISs (Supplementary Table 2e). Analysis of CISs associated with known cancer genes allowed us to establish a set of rules that were used to assign insertions to genes. These rules are outlined in the supplementary methods (Supplementary Fig. 4-9, Supplementary Tables 3-4).

Tumor Etiology

The implication that there is positive selection for hundreds of candidate cancer genes following insertional mutagenesis initially appeared surprising. Consideration of the tumor etiology is important in understanding this observation. First, we determined that multiple tumors obtained from individual mice were biologically independent because their insertions were no more likely to be in the same genomic locations as tumors from different mice (Supplementary Fig. 10). Therefore, separate lesions taken from the same mouse intestine evolve independently. Second, consideration of Apc (mutated in 91.3% of tumours) demonstrates not only that mutation of the second allele is a near obligate event in this model, but also that it is recurrently mutated (65% of tumors had 3 or more insertions). This must arise from selection for serial truncating mutations within Apc during tumor development, as has been shown for the attenuated form of FAP – the ‘three hit’ hypothesis[20]. This observation also suggests tumor oligoclonality, supported by the number of non-redundant insertions per tumor being 10-fold greater than the number of transposons available for mobilization (328 versus 30). A corollary is that transposons are inefficiently mobilized following Cre induction. This was confirmed by measuring the copy number of transposons retained at the donor locus: around 40% of the original 30 copies of T2Onc are retained (n=24, mean number of unmobilized transposons = 11.19 copies, SD = 2.611) (Supplementary Methods). Since 50% survival is reached at around 16 weeks post-induction (Fig 1a), the rate of insertional mutagenesis is less than 1 event/cell/week (the presence of contaminating stromal cells in which the transposase is not activated may mean that this rate is an underestimate). Thus, transposon mobilization is inefficient and ongoing throughout tumorigenesis, with the extensive accumulated mutational burden arising from selective forces. This leads to the development of oligoclonal tumours, and explains in part the number of insertions we identify. As a consequence many mutated genes make a relatively small contribution to tumor growth. Overall, the average number of insertions per tumor that contributed to a CIS was 1.14, demonstrating that some tumors contained multiple insertions contributing to a CIS (Supplementary Tables 5a-d). For example, 5.4% of tumors had 2 or more insertions in Ctnnd1, 4.7% of tumors had 2 or more insertions in Kcnq1, and 4.0% of tumors had 2 or more insertions in Lass6. This may be due to the presence of multiple independent tumor sub-clones within the tumor bulk, suggesting that the mutation of these genes may represent a bottleneck in tumor evolution whereby individual cells within the tumor acquire different mutations in order to progress. This observation is also consistent with these genes functioning as tumor suppressors, requiring biallelic mutations before they become operative in cancer development[21]. Given the above, we cannot distinguish between these possibilities. Other frequently mutated CIS genes previously implicated in intestinal cancer include Pten, Smad4 and Bmpr1a, which were mutated in a significant number of tumors (13.2%, 15.7% and 7.6% respectively (P<1.51 × 10−5)) demonstrating that transposons mutate the same genes that are disrupted in the human disease. In 48 tumors with single insertions in Pten, Smad4 and Bmpr1a we sequenced the coding regions of these genes and found no further spontaneous mutations, indicating that the insertions in these genes may be acting in a haploinsufficient manner (data not shown).

Transposons Drive Intestinal Tumorigenesis via Conserved Pathways

To evaluate the types of genes involved, and to facilitate the filtering of the discovery set of CIS genes, we used Ingenuity Pathway Analysis (IPA) (Ingenuity Systems Inc.). We determined that the CISs in this screen were significantly enriched in cancer-associated signaling pathways, accounting for 113 CIS genes. For example genes in the PI3K/Akt, ERK/MAPK, Wnt and integrin signaling pathways were mutated with P-values of 2.69 × 10−8, 2.76 × 10−8, 2.13 × 10−7 and 1.39 × 10−7, respectively (Supplementary Table 6). Loss of function mutations, characterized by intragenic transposon insertions in both orientations, appeared to predominate. Insertions near K-Ras, mutated in around 40% of sporadic human colorectal carcinomas, were present in only 5% of tumors[22,23]. In addition, mutations in p53 and Smad4 were found at a frequency of only 5.2% and 15.7%, respectively. To assess the ability of our insertional mutagenesis screen to exploit the K-Ras, p53 and TGFβ signaling pathways, we examined insertions in CISs associated with genes immediately up or downstream of these key cancer genes in each signaling cascade, and determined that they are frequently mutated (all with P ≤2.23 × 10−5) (Fig. 2). In total, 38.6% of tumors carried mutations in genes in the K-Ras pathway, 48.0% in the TGFβ pathway, and 61.4% in the p53 pathway. Just under half of all tumors carried mutations in CISs associated with genes in more than one of these pathways, with 15.9% carrying mutations in all three (Fig. 2). Consideration of tumor pathology with the number or type of pathways mutated showed no correlation with more dysplastic disease (data not shown). However, a proportion of CIS genes were over-represented in tumors carrying insertions in these pathways; around 40, 43 and 36 CIS genes for the K-Ras, p53 and TGFβ signaling pathways, respectively (Supplementary Tables 7-9). Therefore, transposon insertions drive tumorigenesis using the same conserved pathways as are operative in human cancers, and these are associated with additional gene insertions.
Figure 2

Comparison of Sleeping Beauty insertions in K-Ras, p53 and TGFβ signaling pathways

CIS genes immediately upstream or downstream of K-Ras, p53 or Smad4 were identified and analyzed for the occurrence of transposon insertions. Percentages of tumors with insertions in genes are indicated in italics. Percentages in bold show the overall proportion of tumors with insertions in each pathway.

Cross-Species Comparison

Human cancer genomes acquire mutations that not only drive tumorigenesis, but that also reflect the signatures of the mutagenic processes that have driven their evolution[7]. Using cross-species oncogenomics, we compared the CISs identified in this study to human colorectal cancer datasets to implicate up to 234 CIS genes in the human disease (Fig. 3a-d, Supplementary Table 10-14). Genes overlapping in two or more of the datasets (34 genes) were distributed throughout the CIS list and include important known CRC genes such as Fbxw7, K-Ras and Ctnndb1 (Fig. 3e, Supplementary Tables 15-16). In addition to comparisons with human tumor studies, we also identified 55/77 of the CIS genes identified in a recent SB transposon screen in the intestine, but by using a significantly larger number of tumors (446 versus 135) and deeper sequencing, we extend this CIS list by 12-fold (Supplementary Table 17)[8]. The 55 CISs identified by Starr et al.[8], were distributed throughout our entire CIS list (Supplementary Table 18), with 7 identified in the top 20 genes of the 30 Kb and 120 Kb GKC analyses (Supplemental Figure 11, Supplementary Table 19). Interestingly, an additional 10 proteins encoded by CIS genes overlap with over-expressed proteins and transcripts identified in a screen of adenomas from Apc mice, including Cul1 and Pdlim1[24,25]. Furthermore, recent studies using intercrosses with Apc mice have shown that Pparg, Pten, Rb1, Mapk1, Stat3, Ephb4, Myb, Cd44 and Smad3 co-operate with Apc loss in intestinal tumorigenesis, all of which were detected in our analysis[26-34]. Finally, comparison of our data with a CGH dataset from spontaneously evolving tumors from ApcMin mice showed a significant overlap of CIS genes with regions rearranged in these tumors (P = 6.75 × 10−7) (Supplementary Methods)[35], suggesting that transposons identify the same genes that are disrupted in spontaneous cancers. Importantly, 224 genes were predicted to be disrupted in each of these spontaneous mouse tumors, a number 3-fold higher than the average number of CISs per tumor (76).
Figure 3

Cross-species comparison of CISs with cancer datasets

(a) 47 CIS orthologs of gene products found to be up-regulated or down-regulated in a proteomic comparison of nuclear matrix fractions from human adenomas and adenocarcinomas (Supplementary Table 14) [59]. (b) 43 CIS orthologs of genes identified through exon re-sequencing of human colorectal tumors, 13 of which were highlighted as potential drivers by the authors (P= 0.003) (Supplementary Table 13) [47]. (c) 236 CIS orthologs of genes reported in the Catalogue of Somatic Mutations in Cancer (COSMIC) as harboring mutations in human sporadic colorectal cancers (36 orthologues in colorectal cancer) (Supplementary Table 12) [60]. (d) 143 CIS orthologs of genes found in deleted or amplified regions in 123 human sporadic colorectal cancers through CGH analysis (q<25%) (P = 0.025) (Supplementary Tables 10 & 11) [44]. Genes in black indicate CIS orthologs that overlap in two or more datasets. (e) Distribution of CIS gene orthologs from the 30 Kb Gaussian Kernel Convolution analysis. Genes are shown in rank order according to the percentage of tumours containing a given CIS (and then by height of the CIS peak). Some genes such as Mll3 contain two CIS regions and are therefore shown twice. Genes in bold indicate well-known CRC genes.

Association of CIS with the Wnt Pathway

The observation that there was enrichment for subsets of CIS genes in tumors with mutations affecting the p53, K-Ras and Wnt pathways suggested that one explanation for so many potential cancer drivers is their modulation of known cancer pathways. To test this functionally, we explored one such pathway, namely the Wnt signaling pathway. The presence of TCF/β-catenin consensus binding sites in promoter regions of 183 CIS genes suggested that they are potential Wnt targets (P = 1.1×10−64) (TRANSFAC 2009.3, BIOBASE GmBH)[36]. Of these, 37 genes showed significant copy number changes in a CGH analysis of human colorectal cancers (Fig. 3; Supplementary Table 20), and were assessed to determine if they were potential modulators of Wnt signaling. The CellSensor Wnt reporter system in SW480 colorectal carcinoma cells was used in conjunction with siRNA knockdown, and revealed 16 genes as being negative regulators of Wnt signaling, and 4 genes as positive regulators (Fig. 4). To control for off-target effects, 9 randomly selected siRNAs were tested and were found to have no effect on Wnt reporter activity (Supplemental Fig. 12). Overall this represents a significant enrichment for genes that regulate canonical Wnt signaling in our screen (P<0.005, Fisher’s exact test) (Supplementary Methods). The E3 ubiquitin-protein ligase NEDD4, known to polyubiquinate the tumor suppressor PTEN, appears to negatively regulate Wnt signaling, suggesting a potential link between two distinct tumor suppressor mechanisms involving APC and PTEN. Rcor1, Onecut2 and Zfpm1 are transcription factors, knockdown of which caused a robust increase in Wnt signaling, thus confirming a regulatory role for these genes in the Wnt pathway.
Figure 4

Identification of novel Wnt targets with tumourigenic potential

Potential regulators of Wnt signaling were identified by analysis with TRANSFAC 2009.3 (BIOBASE GmbH). The results were then cross-referenced with CIS corresponding to orthologs of genes identified from the CGH analysis of human sporadic colorectal cancers[44] to generate a final set of CIS genes for further study. SW480 cells containing a stably integrated β-lactamase reporter gene under the control of the LEF/TCF binding consensus sequence were transiently transfected with siRNA (final concentration 20nM) and processed after 72 h. FRET read-out was measured using the manufacturer’s standard protocol. The fold change represented in the graph is calculated with respect to GC control. Error bars: s.d, n=4. Data were pooled from three independent experiments to determine statistical significance by Student’s t-test (***P<0.001, **P<0.01, *P<0.05).

A subset of human colorectal cancers containing an increased proportion of Paneth cells has been described previously [37]. In the present study a significant number of tumors (83/367) contained an enlarged proportion of these cells (Fig. 1h). Mouse models in which germline Apc mutations are engineered to retain a single domain of β-catenin binding/degradation and thereby retain some residual functionality from the truncated allele (1322T mutation) have also been shown to carry tumors with a higher proportion of Paneth cells and a more dysplastic phenotype [38]. Thirty-six CISs were identified as being over-represented in this group when compared to non-Paneth cell tumors (Table 1). Of these, eight have previously been identified as Wnt targets[36]. Interestingly Eps8, which we show to be a positive regulator of the Wnt pathway (Fig. 4), is also a CIS gene enriched in the Paneth cell-containing tumors. Three of the 36 genes (Usp10, Eps8 and Cask) have been shown to be upregulated in intestines showing an increase in Paneth cell number due to deletion of Klf9[39]. These results further suggest an important role in the fine-tuning of Wnt signaling throughout tumor development, with one outcome being altered tumor pathology.
Table 1

Genes over-represented in tumors containing differentiated Paneth cells.

Gene IDTotalinsertionsInsertions inPaneth tumorsSample specificP-value
Usp10 1380.001
Tnpo3 25120.002
Slmap 1790.003
Pak2 21100.004
Ppm1b 38150.005
Mbnl1 26110.008
Ncoa4 26110.008
Ppp1r2 48170.009
Tax1bp1 30120.010
CIS7:143946337_30k 1470.012
Bspry 1880.016
Frs2 32120.017
Zfx 1260.019
Sfrs7 26100.023
Spin1 1980.023
Ppp1r12a 93270.024
9130404D08Rik|Gatad2a 41140.024
Fubp3 1670.027
Eps8 38130.029
Fat1 38130.029
Bclaf1|Mtap7 46150.031
Styk1 27100.031
Cask 2080.032
Rps27 2080.032
Ash1l 39130.036
CISX:39487624_30k 2490.036
Nipbl 47150.037
Mlxip 28100.039
CIS9:21215498_30k 2180.043
H2afy 2180.043
Ocln 2180.043
Nacc1 2590.047
Tomm70a 2590.047
Akap9 29100.049
CIS11:50153019_30k 29100.049
Nfib 61180.050

CIS that are not located within ± 150K base pairs of a gene are labeled identified as ‘CIS’ followed by the chromosome, the peak location of the Gaussian kernel and the kernel size.

We also compared tumors histopathologically classified as presenting with either mild or moderate atypia (n=332) against those with moderate dysplasia and severe dysplasia (n=35); 23 genes were over-represented in the tumors with a more severe phenotype suggesting that these genes contribute to disease progression (Table 2). Importantly, genes involved in FGF signaling (Frs2 and Grb2) feature; it has been proposed that co-activation of this pathway with Wnt signaling results in a more malignant phenotype[40]. From this list Uhrf2 has been associated with an increased risk of colon cancer[41], while a gene fusion of Tmprss2 (Tmprss2-Erg1) has been linked with a poor prognosis in prostate cancer[42]. Slk is a serine-threonine protein kinase, which, when complexed with Asap1, promotes CRC metastasis[43]. Therefore, these over-represented genes may provide new insights into CRC progression.
Table 2

Genes over-represented in tumors with severe pathology.

Gene IDTotalinsertionsInsertions insevere tumors[1]Sample specificP-value
Uhrf2 22140.002
Kdelc2 64490.004
Fgfr2|Oat 1270.007
Frs2 28200.010
Msl2 23160.011
Tmed10 18120.012
Fnbp1l 40300.012
Tpm1 24170.014
Atp1a1 37280.020
6430598A04Rik 20140.020
Chn2 15100.021
Grb2 26190.022
Agpat3 38290.023
Tmprss2 38290.023
Chd6 39300.027
C77080 1170.028
Cnot2 53420.033
Brwd1 28210.033
Rtn4rl1 28210.033
Tnrc6c 23170.038
Bspry 18130.045
CIS12:88752362_30k 30230.046
Slk 30230.046

Tumors showing severe atypia, moderate dysplasia or severe dysplasia

Co-occurrence of Genes

Co-occurring CISs (co-CISs), where two CISs were co-mutated at a higher frequency than expected by chance, were identified. As expected the majority of co-CISs we identified were with Apc, since mutation of the wild type allele of Apc is a near obligate event in adenomagenesis in our model. We focused on other co-CIS identifying 70 CIS genes involved in statistically significant pairwise co-occurrences constituting 38 networks, the largest containing 28 genes (Supplementary Table 21, Fig. 5). Several of the interactions identified were with CISs associated with known tumor suppressor genes, such as Lrrc41 with Pten, and Lamp1 with Smad4 (or Mex3c) (Fig. 4). Analysis of human CGH data provided independent support for two of the co-CISs: those between Mex3c/Smad4 and Lamp1 and between Sfi1 and Adnp (Supplementary Table 20)[44]. At the centre of the largest network we found the CIS Atp6v0a2/Tctn2; Tctn2 has previously been shown to be a gene mutated in both mouse and human colorectal cancers[35]. Intriguingly, these networks include genes that co-occur with well-established cancer drivers such as Pten, Smad4 and Rac1.
Figure 5

Co-occurring CISs can be grouped into interacting networks

CISs generated from the CIMPLR 30 Kb and 120 Kb analyses were merged and pairwise comparisons between each CIS were performed. Contingency tables were constructed for each comparison and a Fisher-Exact test was performed. To account for multiple testing a q value was generated for each test and adjusted P values of P<0.05 were reported. This generated 70 co-occurring relationships in 38 networks, three of which are shown. CIS are represented by the nearest associated gene and the thickness of each interconnecting line between two genes represents the level of significance of the co-occurrence.

DISCUSSION

We have performed an extensive forward genetic screen in the intestinal tract using mouse models of somatic and germline neoplasia. These studies reveal hundreds of genes in addition to 38 genetic networks between these genes that are candidate drivers of carcinogenesis. The value of this discovery set will ultimately be established by the overlaying of high-throughput mutation data from human colon cancers. However, it is worth noting that our screen identified novel genes recently found to be mutated in human colorectal cancers including, MLL3 and CDK8[44,45]. In line with previous commentaries that implicate selection pressures and not elevated mutation rates in driving tumorigenesis, we confirm that the Sleeping Beauty transposon is inefficiently mobilized, generating a very low rate of cellular mutagenesis[46]. The high number of accumulated mutations observed is thus a consequence of selection acting on increasingly diverse populations to create highly oligoclonal tumours. Using cross-species oncogenomics, we can independently implicate around a third of our CIS genes in colorectal neoplasia from human studies. Functionally, we identify 20 novel regulators of Wnt signaling that implicates these genes in colorectal tumorigenesis through this pathway. Collectively, our analysis provides a rich catalogue of candidate genes that represent potential diagnostic, prognostic and therapeutic targets, and defines the breadth of genes that may contribute to cancer of the intestine. Our screen is in agreement with a previous exome re-sequencing study of human colorectal cancers, which predicted many hundreds of functional mutations in human cancer[47]. The number of genes identified in our screen suggests that many more genes than previously thought can contribute to colorectal cancer development if appropriately altered. Others have suggested that around 10 genes in any one cancer may drive cancer development, with each cancer carrying common driver mutations and also rarer mutations that may also contribute to cancer growth[1-3,7]. Determining which genes are drivers is complicated by a lack of knowledge of the functional significance of the many hundreds of infrequent somatic alterations that are acquired during cancer development, and because at present no study has systematically analyzed the landscape of somatic mutations in a human cancer type in a large number of samples at base-pair resolution. In this screen, we use a transposon that must integrate proximally to a gene to function, and in this way loci-containing candidate cancer genes were identified when they were mutated at a frequency higher than expected by chance. While genes that are infrequently mutated in human cancer and in our screen are less likely to be major players in tumorigenesis, population theory suggests that any genetic alteration that causes a fractional increase in the ability of a cell to outgrow other cells may contribute to tumorigenesis[48,49]. The classical model for this is yeast, in which as many as 6% of mutations that alter the growth kinetics of a population of cells can result in the outgrowth of a more vigorous population[50]. Likewise, in human studies there is evidence to suggest that mutations that fractionally increase the fitness of a cell within a population can predispose that cell to evolve into a cancer[51]. In this way, many of the genes that we have discovered in this screen may contribute independently or combinatorially to tumor development. This interpretation of our study is supported by several lines of evidence. First, we have shown that the majority of cancers in this screen carry mutations in Apc, and furthermore, that 20 genes can modify canonical Wnt signaling, even in tumor cell lines in which Apc is already disrupted. There is a long standing observation that the second somatic hit identified as affecting APC in sporadic and familial colorectal cancers is influenced by the nature of the first mutation, such that at least one β-catenin binding domain is retained in one allele. The interpretation is that the complete loss of Apc function is not tolerated by the normal epithelium (the ‘just-right’ phenomenon). Hence, the initial transformation event does not give ‘maximal’ dysregulation of the pathway [52]. Loss of function mutations in SFRPs (Wnt antagonists) in human colorectal cancers have been found to increase Wnt signaling over that resulting from the initiating APC mutation [53,54]. Moreover, one of the key effects of both Kras and Braf mutations is to synergize with Apc mutations by enhancing Wnt signaling [55]. In short, it is well established that the dysregulation of Wnt signaling is not a single event, but is subject to modulation throughout cancer development. Fine-tuning of Wnt signaling is one likely substrate for achieving fractional increases in fitness. Second, many of the genes discovered in this screen are associated with known colorectal cancer pathways, such as the Wnt, PI3K and Ras cascades. Genes associated with these pathways are significantly enriched for mutations over the level that would be expected by chance. This suggests that many of the genes recurrently mutated by transposons are functioning through these pathways to optimize tumor cell growth. Finally, many CIS genes can be clustered into other novel pathways using pathway analysis tools, suggesting in addition to the main oncogenic routes other pathways are also operative in bowel cancer, and that if appropriately altered can contribute to tumorigenesis. Follow-up functional analysis will help us gain a better understanding of how the CIS genes we have discovered contribute to CRC. The co-occurring networks of mutated genes discovered from this screen suggest that the mutation of many genes is important in colorectal cancer development. The presence of co-mutated genes in a cancer does not however necessarily mean that mutations of these genes function in a cell autonomous manner to promote cancer growth. Each of the tumors we analyzed is composed of multiple tumor cell clones, and this heterogeneity may favor cellular interactions that result in some of the co-occurrences we observe. In conclusion, we have demonstrated that insertional mutagenesis in the mouse, performed on a hitherto unprecedented scale, reveals hundreds of candidate cancer driver genes that promote colorectal cancer development.

METHODS

Generation of mice

All experiments were performed on a C57BL/6J background. : Apc mice have been described previously[56]. To model sporadic cancer formation we used Ah-Cre/Apc mice in which the first initiating event (loss of exon 14 of Apc and a concomitant frameshift mutation in exon 15 is induced somatically following administration of β-naphthoflavone (bNF)[17]. In the CYP1A1 promoter drives expression of Cre throughout the intestinal tract[57]. T2Onc mice: We used the mouse line LC76 with a transposon donor locus located on chromosome 1[9]. These mice have been described previously[12]. Conditional (Rosa26 The conditional Sleeping Beauty transposase allele was generated by cloning the HSB5 variant of the Sleeping Beauty transposase in the antisense orientation between Lox66 and Lox71 sites. This cassette was targeted to the Rosa locus in E14Tg2a ES cells, transmitted onto a C57BL/6J background and then backcrossed to C57BL/6J. Expression of Cre in these mice catalyzes recombination between the Lox66 and Lox71 sites inverting the transposase cDNA so it can be expressed from the CAGGS promoter. The CAGGS promoter was used in the transposase allele because it drives reproducible expression in a broad range of cells and tissues. Ah-Cre(hom)/T2Onc(het) mice were maintained by crosses to Ah-cre(hom). Rosa26/Apc mice were maintained as obligate homozygotes for both alleles. Experimental and control cohorts were obtained in a single generation by crossing Ah-Cre(hom)/T2Onc(het) mice to Rosa26/Apc which gave obligate inheritance of Ah-Cre, Rosa26 alleles and segregation of the T2Onc allele. For the germline model Apc mice were intercrossed with Rosa26ase mice, which express the SB transposase ubiquitously, and offspring from this cross were in turn mated to T2Onc mice to obtain experimental and control cohorts.

Tumor Watch Analysis

Animals aged between 6-11 weeks received intraperitoneal injections of 80mg/kg β-naphthoflavone daily for 5 days. Experimental mice were monitored on ‘tumor watch’ and sacrificed when moribund. Normal and tumor tissue was collected from each mouse and samples were divided into two parts; one section was snap frozen in liquid nitrogen for splinkerette PCR, while the other section was stored in 10% NBF, transferred to 70% EtOH and processed for histopathological analysis. All animal procedures were approved by the UK Home Office.

Splinkerette PCR

Genomic DNA was extracted from snap frozen tumor samples using Gentra Puregene kits (Qiagen). Splinkerette PCRs were performed as previously described[19].

Generation and Pre-processing of 454 Reads

Pools of transposon-genome junction PCRs from the 446 tumor samples were sequenced on the “454 FLX” platform generating a total of 1,763,374 DNA sequence reads. The raw reads were processed removing SB transposon sequence, DNA barcode tags, and sequence beyond the restriction enzyme site. Processed sequences shorter than 20 nucleotides were discarded from the subsequent genome mapping process since they have an increased tendency to map to multiple genomic locations. Following this filtering process 1,316,365 clipped sequences remained for alignment to the mouse genome.

Mapping Clipped Sequences to the Mouse Genome and Identifying Insertion Sites

The set of clipped sequences were mapped to the mouse genome (NCBI Build 37 assembly) using the ssaha2 alignment algorithm (version 2.5)[58]. Of the 1,316,365 clipped sequences aligned using ssaha, 898,082 unique alignments were reported (50.93% of the original reads). All aligned sequences were checked to confirm that the two nucleotides adjacent to the start of each alignment were TA. Aligned sequences from the same tumor, that were generated using the same restriction enzyme and that mapped to the same genomic location, were merged together to report a single insertion site. Chromosome 1 contains the donor site of the T2Onc transposon and insertion site reads mapping to this chromosome were removed to negate CIS calls resulting from “local hopping”. Further, insertions were filtered from the loci of the genes En2 (chr5:28492236-28498706) and Foxf2 (chr13:31717648-31723271) as elements from these loci are found in the T2Onc transposon. These filtering steps resulted in 146,249 insertion sites which were used for CIS analysis (Supplementary Materials Table S2a).

Identification of Common Insertion Sites

Common insertions sites (CISs) were identified using two complementary approaches:

(1) Gaussian Kernel Convolution: CIMPLR

The Common Insertion Mapping Platform (CIMPLR) is a Gaussian kernel convolution (GKC) method developed in a previous study to identify CISs in retroviral insertional mutagenesis screens[18]. An enhanced version of the method was developed for SB screens which considers the local density of TA sites within the genome. Kernel widths of 30 Kb and 120 Kb nucleotides were selected for CIS detection. For each CIS, the genomic location of its kernel peak was used as the reference point to assign gene annotations. A CIS peak was associated with a gene if either it lay within the coding region of a gene or was within +/− 150 Kb nucleotides of its nearest gene. If the distance to the nearest gene was greater than 150 Kb nucleotides, no gene name was assigned. Using 30 Kb and 120 Kb kernels 919 and 641 CISs were reported, respectively (Supplementary Materials Tables S2a,b). To summarise the CIS predictions from the two independent kernel widths, a method of combining predictions across the kernels was devised, such that where CISs from the two kernels overlapped, those CISs with the smallest “footprints” on the genome were reported. Using this procedure, a total of 997 cross-scale CISs were predicted (Supplementary Materials Table S2c). Depending on the size and location of CISs, genes may be associated with multiple CISs. Consequently the 997 cross-scales CISs are associated with 867 unique genes.

(2) Monte Carlo Simulations

In a previous study Starr et al. developed a Monte Carlo (MC) simulation method to identify CISs in a colorectal cancer screen[8]. This method determines the local density of insertions in a given window that exceed the expectation of observing this density by chance. The 146,429 insertions were analysed using a Perl script that implemented the Starr et al. MC method (personal communication: Kevin A.T. Silverstein, Univ of Minnesota). Five independent simulations were performed of 100 iterations each. The range of insertions to be detected in a CIS varied from 5 to 20, and the size of windows examined in the simulations ranged from 10 Kb to 150 Kb, in increments of 1 Kb. Using a similar approach for gene annotation as in the CIMPLR method, described above, CISs were annotated according to the nearest mouse gene. A total of 749 CISs were reported by the MC simulation method.

Discovery Set vs Genome-wide adjusted CISs

The analysis described in our paper used a discovery set of CIS calls that were compared to other datasets to help filter passengers from drivers and to identify other collections of genes of biological interest. This approach (described above) identified statistically significant CISs on a chromosome-by-chromosome basis, and was motivated by the observation that there were significant differences in the number of insertions, and hence CISs, per chromosome. In Supplementary Table 2c we also present a set of CISs calculated using genome-wide GKC with more stringent genome-wide cut-offs.

Co-occurrence

Pairwise comparisons were made between the 997 combined CISs from the GKC analyses, recording counts for all 446 tumor samples for the following cases: shared insertions between the two CISs, insertions unique to each CIS and samples that had no insertions in either CIS (Fig. 5). Where tumor samples had multiple insertions within a CIS, a single occurrence was recorded. The counts were converted into contingency tables and Fisher Exact Tests were performed. To account for multiple testing on the 496,506 comparisons, a q value was generated for each test. CIS pairwise comparisons with adjusted p-values of P < 0.05 were used to construct the resultant co-occurrence networks.

Software

Cytoscape software (version 2.7.0, www.cytoscape.org) was used to create the CIS interaction network.

Cell Culture

CellSensor LEF/TCF-bla SW480 cell line (Invitrogen - further details included in Supplementary Methods) was cultured in RPMI medium 1640 with GlutaMAX (Invitrogen) supplemented with dialyzed FBS (Invitrogen), non-essential amino acids (Invitrogen), sodium pyruvate (Invitrogen), blasticidin (Invitrogen) and penicillin/streptomycin (Invitrogen) in a 37 °C, 5% CO2 incubator.

CellSensor Experiments

CellSensor LEF/TCF-bla SW480 cells (Invitrogen) were seeded at 2.5 × 105 cells in 96-well plates and transfected with 240nm siRNA (Dharmacon) or medium GC RNAi oligo negative control duplex (GC control) (Invitrogen) or AllStars Neg. Control siRNA (siNT control) (Qiagen) for a final concentration of 20 nm, using Lipofectamine RNAimax (Invitrogen). After 48 h cells were treated with Liveblazer FRET B/G loading kit (Invitrogen), incubated for 2 h at RT and read on a spectrophotometer (Tecan) at an excitation wavelength of 409 nm and emission wavelengths of 460 nm and 530 nm. Emission ratios were calculated from these values. Experiments were carried out in triplicate on three separate occasions and data were statistically analysed using a Student’s t-test.
  60 in total

1.  The receptor tyrosine kinase EPHB4 has tumor suppressor activities in intestinal tumorigenesis.

Authors:  Higinio Dopeso; Silvia Mateo-Lozano; Rocco Mazzolini; Paulo Rodrigues; Laura Lagares-Tena; Julian Ceron; Jordi Romero; Marielle Esteves; Stefania Landolfi; Javier Hernández-Losa; Julio Castaño; Andrew J Wilson; Santiago Ramon y Cajal; John M Mariadason; Simo Schwartz; Diego Arango
Journal:  Cancer Res       Date:  2009-09-08       Impact factor: 12.701

2.  A conditional transposon-based insertional mutagenesis screen for genes associated with mouse hepatocellular carcinoma.

Authors:  Vincent W Keng; Augusto Villanueva; Derek Y Chiang; Adam J Dupuy; Barbara J Ryan; Ilze Matise; Kevin A T Silverstein; Aaron Sarver; Timothy K Starr; Keiko Akagi; Lino Tessarollo; Lara S Collier; Scott Powers; Scott W Lowe; Nancy A Jenkins; Neal G Copeland; Josep M Llovet; David A Largaespada
Journal:  Nat Biotechnol       Date:  2009-02-22       Impact factor: 54.908

3.  Intestinal adenoma formation and MYC activation are regulated by cooperation between MYB and Wnt signaling.

Authors:  D Ciznadija; R Tothill; M L Waterman; L Zhao; D Huynh; R M Yu; M Ernst; S Ishii; T Mantamadiotis; T J Gonda; R G Ramsay; J Malaterre
Journal:  Cell Death Differ       Date:  2009-07-17       Impact factor: 15.828

4.  Paneth cell differentiation in colonic epithelial neoplasms: evidence for the role of the Apc/beta-catenin/Tcf pathway.

Authors:  Mee Joo; Aliakbar Shahsafaei; Robert D Odze
Journal:  Hum Pathol       Date:  2009-03-09       Impact factor: 3.466

Review 5.  The cancer genome.

Authors:  Michael R Stratton; Peter J Campbell; P Andrew Futreal
Journal:  Nature       Date:  2009-04-09       Impact factor: 49.962

6.  Prevalence of TMPRSS2-ERG fusion prostate cancer among men undergoing prostate biopsy in the United States.

Authors:  Juan-Miguel Mosquera; Rohit Mehra; Meredith M Regan; Sven Perner; Elizabeth M Genega; Gerri Bueti; Rajal B Shah; Sandra Gaston; Scott A Tomlins; John T Wei; Michael C Kearney; Laura A Johnson; Jeffrey M Tang; Arul M Chinnaiyan; Mark A Rubin; Martin G Sanda
Journal:  Clin Cancer Res       Date:  2009-07-07       Impact factor: 12.531

7.  A modified sleeping beauty transposon system that can be used to model a wide variety of human cancers in mice.

Authors:  Adam J Dupuy; Laura M Rogers; Jinsil Kim; Kishore Nannapaneni; Timothy K Starr; Pentao Liu; David A Largaespada; Todd E Scheetz; Nancy A Jenkins; Neal G Copeland
Journal:  Cancer Res       Date:  2009-10-06       Impact factor: 12.701

8.  The Apc 1322T mouse develops severe polyposis associated with submaximal nuclear beta-catenin expression.

Authors:  Patrick Pollard; Maesha Deheragoda; Stefania Segditsas; Annabelle Lewis; Andrew Rowan; Kimberley Howarth; Lisa Willis; Emma Nye; Amy McCart; Nikki Mandir; Andrew Silver; Robert Goodlad; Gordon Stamp; Matthew Cockman; Philip East; Bradley Spencer-Dene; Richard Poulsom; Nicholas Wright; Ian Tomlinson
Journal:  Gastroenterology       Date:  2009-02-25       Impact factor: 22.682

9.  A transposon-based genetic screen in mice identifies genes altered in colorectal cancer.

Authors:  Timothy K Starr; Raha Allaei; Kevin A T Silverstein; Rodney A Staggs; Aaron L Sarver; Tracy L Bergemann; Mihir Gupta; M Gerard O'Sullivan; Ilze Matise; Adam J Dupuy; Lara S Collier; Scott Powers; Ann L Oberg; Yan W Asmann; Stephen N Thibodeau; Lino Tessarollo; Neal G Copeland; Nancy A Jenkins; Robert T Cormier; David A Largaespada
Journal:  Science       Date:  2009-02-26       Impact factor: 47.728

10.  A high-throughput splinkerette-PCR method for the isolation and sequencing of retroviral insertion sites.

Authors:  Anthony G Uren; Harald Mikkers; Jaap Kool; Louise van der Weyden; Anders H Lund; Catherine H Wilson; Richard Rance; Jos Jonkers; Maarten van Lohuizen; Anton Berns; David J Adams
Journal:  Nat Protoc       Date:  2009-04-30       Impact factor: 13.491

View more
  111 in total

Review 1.  Review: In vivo models for defining molecular subtypes of the primitive neuroectodermal tumor genome: current challenges and solutions.

Authors:  Jon D Larson; David A Largaespada
Journal:  In Vivo       Date:  2012 Jul-Aug       Impact factor: 2.155

2.  Differential WNT activity in colorectal cancer confers limited tumorigenic potential and is regulated by MAPK signaling.

Authors:  David Horst; Justina Chen; Teppei Morikawa; Shuji Ogino; Thomas Kirchner; Ramesh A Shivdasani
Journal:  Cancer Res       Date:  2012-02-08       Impact factor: 12.701

3.  Novel strategies for gene trapping and insertional mutagenesis mediated by Sleeping Beauty transposon.

Authors:  Guili Song; Zongbin Cui
Journal:  Mob Genet Elements       Date:  2013-10-02

4.  Class III phosphatidylinositol-3-OH kinase controls epithelial integrity through endosomal LKB1 regulation.

Authors:  Fergal O'Farrell; Viola Hélène Lobert; Marte Sneeggen; Ashish Jain; Nadja Sandra Katheder; Eva Maria Wenzel; Sebastian Wolfgang Schultz; Kia Wee Tan; Andreas Brech; Harald Stenmark; Tor Erik Rusten
Journal:  Nat Cell Biol       Date:  2017-10-30       Impact factor: 28.824

5.  Sleeping Beauty screen reveals Pparg activation in metastatic prostate cancer.

Authors:  Imran Ahmad; Ernest Mui; Laura Galbraith; Rachana Patel; Ee Hong Tan; Mark Salji; Alistair G Rust; Peter Repiscak; Ann Hedley; Elke Markert; Carolyn Loveridge; Louise van der Weyden; Joanne Edwards; Owen J Sansom; David J Adams; Hing Y Leung
Journal:  Proc Natl Acad Sci U S A       Date:  2016-06-29       Impact factor: 11.205

6.  NUMB phosphorylation destabilizes p53 and promotes self-renewal of tumor-initiating cells by a NANOG-dependent mechanism in liver cancer.

Authors:  Hifzur R Siddique; Douglas E Feldman; Chia-Lin Chen; Vasu Punj; Hiroshi Tokumitsu; Keigo Machida
Journal:  Hepatology       Date:  2015-08-28       Impact factor: 17.425

7.  Transposon mutagenesis identifies genetic drivers of Braf(V600E) melanoma.

Authors:  Michael B Mann; Michael A Black; Devin J Jones; Jerrold M Ward; Christopher Chin Kuan Yew; Justin Y Newberg; Adam J Dupuy; Alistair G Rust; Marcus W Bosenberg; Martin McMahon; Cristin G Print; Neal G Copeland; Nancy A Jenkins
Journal:  Nat Genet       Date:  2015-04-13       Impact factor: 38.330

8.  The homophilic receptor PTPRK selectively dephosphorylates multiple junctional regulators to promote cell-cell adhesion.

Authors:  Gareth W Fearnley; Katherine A Young; James R Edgar; Robin Antrobus; Iain M Hay; Wei-Ching Liang; Nadia Martinez-Martin; WeiYu Lin; Janet E Deane; Hayley J Sharpe
Journal:  Elife       Date:  2019-03-29       Impact factor: 8.140

9.  Transposon mutagenesis reveals cooperation of ETS family transcription factors with signaling pathways in erythro-megakaryocytic leukemia.

Authors:  Jian Zhong Tang; Catherine L Carmichael; Wei Shi; Donald Metcalf; Ashley P Ng; Craig D Hyland; Nancy A Jenkins; Neal G Copeland; Viive M Howell; Zhizhuang Joe Zhao; Gordon K Smyth; Benjamin T Kile; Warren S Alexander
Journal:  Proc Natl Acad Sci U S A       Date:  2013-03-26       Impact factor: 11.205

Review 10.  Mouse models of cancer: Sleeping Beauty transposons for insertional mutagenesis screens and reverse genetic studies.

Authors:  Barbara R Tschida; David A Largaespada; Vincent W Keng
Journal:  Semin Cell Dev Biol       Date:  2014-01-24       Impact factor: 7.727

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.