Recent genome sequencing studies have identified several classes of complex genomic rearrangements that appear to be derived from a single catastrophic event. These discoveries identify ways that genomes can be altered in single large jumps rather than by many incremental steps. Here we compare and contrast these phenomena and examine the evidence that they arise "all at once." We consider the impact of massive chromosomal change for the development of diseases such as cancer and for evolution more generally. Finally, we summarize current models for underlying mechanisms and discuss strategies for testing these models.
Recent genome sequencing studies have identified several classes of complex genomic rearrangements that appear to be derived from a single catastrophic event. These discoveries identify ways that genomes can be altered in single large jumps rather than by many incremental steps. Here we compare and contrast these phenomena and examine the evidence that they arise "all at once." We consider the impact of massive chromosomal change for the development of diseases such as cancer and for evolution more generally. Finally, we summarize current models for underlying mechanisms and discuss strategies for testing these models.
The evolution of a normal cell to a cancer cell is generally thought of as a sequential accumulation of many independent lesions to the genome (Nowell 1976; Greaves and Maley 2012; Yates and Campbell 2012). In addition to somatic point mutations, these lesions include copy number alterations, such as chromosomal deletions or duplications (Fig. 1A,B), and balanced structural rearrangements, such as translocations and inversions (Mitelman et al. 2007; Rabbitts 2009; Stephens et al. 2009; Stratton et al. 2009). By chance, these somatic alterations in the genome will hit cancer genes—disrupting tumor suppressors or activating proto-oncogenes—affecting cells and causing their progeny to gradually accumulate the necessary “hallmarks of cancer” (Hanahan and Weinberg 2011). Although there is ample evidence that cancer can develop through this type of gradual mutational accrual, there are reasons to think that there could be advantages if the oncogenic changes could be acquired all at once (at least from the vantage point of the aspiring cancer cell). One obvious advantage is speed, with the possibility of rapidly accruing a large phenotypic effect from combinatorial acquisition of smaller-effect genetic changes. Another potential advantage of all-at-once or “punctuated equilibrium” (Gould and Eldredge 1977) mechanisms is that intermediate steps in tumor development can actually be deleterious. For example, oncogene activation can trigger stress responses, such as p53 activation, that can inhibit proliferation or promote cellular senescence (Lowe et al. 1994; Fearnhead et al. 1997; Kastan and Bartek 2004; Halazonetis et al. 2008; Jackson and Bartek 2009). In principle, an all-at-once mutational mechanism could circumvent this problem by simultaneously generating an active oncogene together with changes that eliminate the stress response.
Figure 1.
Characteristics of simple and complex chromosomal rearrangements. (A) Copy number losses or gains can be detected by array-based or sequencing-based technologies from abrupt changes in the read depth signal (sequencing) or the intensity of fluorescent probes (array). Loss or gain generally affects only one homolog; therefore, a copy number loss also results in LOH. (B) Copy number alterations are often generated by simple chromosomal rearrangements, such as deletions or tandem duplications. (C) Chromothripsis is characterized by an alternating copy number profile with loss and retention of heterozygosity. Minus signs in C and D denote chromosomal segments that are inverted. Fluorescent in situ hybridization (FISH) characterization confirms that only one chromatid (allele B) is affected. (D) Chromoanasynthesis is characterized by a copy number profile that alternates between euploid and higher ploidy. In contrast to chromothripsis, chromoanasynthesis does not necessarily exhibit LOH but reflects resynthesis of segments from one chromatid. Chromoanasynthesis also contains frequent insertions of short sequences between the rearrangement junctions (red arrows) that are copied from the rearranged segments (templated insertions). Both DNA resynthesis and templated insertions are suggestive of replication-based mechanisms. (E) Chromoplexy is characterized by a closed chain of translocations, with little or no copy number alteration.
Characteristics of simple and complex chromosomal rearrangements. (A) Copy number losses or gains can be detected by array-based or sequencing-based technologies from abrupt changes in the read depth signal (sequencing) or the intensity of fluorescent probes (array). Loss or gain generally affects only one homolog; therefore, a copy number loss also results in LOH. (B) Copy number alterations are often generated by simple chromosomal rearrangements, such as deletions or tandem duplications. (C) Chromothripsis is characterized by an alternating copy number profile with loss and retention of heterozygosity. Minus signs in C and D denote chromosomal segments that are inverted. Fluorescent in situ hybridization (FISH) characterization confirms that only one chromatid (allele B) is affected. (D) Chromoanasynthesis is characterized by a copy number profile that alternates between euploid and higher ploidy. In contrast to chromothripsis, chromoanasynthesis does not necessarily exhibit LOH but reflects resynthesis of segments from one chromatid. Chromoanasynthesis also contains frequent insertions of short sequences between the rearrangement junctions (red arrows) that are copied from the rearranged segments (templated insertions). Both DNA resynthesis and templated insertions are suggestive of replication-based mechanisms. (E) Chromoplexy is characterized by a closed chain of translocations, with little or no copy number alteration.Although all-at-once massive genomic alterations might accelerate the acquisition of growth-promoting mutations, they have an obvious downside: They could also potentially generate massive deleterious effects that could overwhelm any growth-promoting mutations. One way to minimize this collateral damage would be to have the massive genomic change be restricted to a portion of the genome. Although it is common to think of mutation rates as numbers that apply evenly across the genome (Durrett 2008; Bozic et al. 2010; Tomasetti et al. 2013), there is a growing appreciation of localized variability in these rates (Hodgkinson and Eyre-Walker 2011; Hodgkinson et al. 2012; Roberts et al. 2012; Schuster-Böckler and Lehner 2012). Because of transcription-coupled DNA repair, transcriptionally silent portions of the genome have approximately threefold higher mutation rates (Fousteri and Mullenders 2008; Lawrence et al. 2013). Likewise, late-replicating regions of the genome exhibit a similar fold increase in mutation rates (Stamatoyannopoulos et al. 2009; Chen et al. 2010; Koren et al. 2012; Donley and Thayer 2013). Localized hypermutation has also been associated with chromosomal rearrangements. In budding yeast, DNA breaks repaired by mitotic gene conversion are accompanied by surprisingly high mutation rates (>1000-fold higher than spontaneous mutations) in the vicinity of the break (Hicks et al. 2010; Malkova and Haber 2012). Similarly, localized regions of hypermutation (“kataegis”) that often occurred at the site of somatic rearrangements were discovered in breast cancer genomes (Nik-Zainal et al. 2012; Alexandrov et al. 2013). Even more dramatic local genomic change can be generated from critical telomere shortening, resulting in breakage–fusion–bridge (BFB) cycles (McClintock 1941) that occur over several cell division cycles (and hence are not “all-at-once” events). Analysis of genome copy number alterations in cancer also identified “firestorms” of local amplifications and deletions whose origin is uncertain but might originate from BFB cycles (Hicks et al. 2006).Recently, whole-genome sequencing has led to the discovery of three new classes of complex catastrophic chromosomal rearrangement: chromothripsis, chromoanasynthesis, and chromoplexy (Fig. 1C–E; Liu et al. 2011; Stephens et al. 2011; Rausch et al. 2012; Baca et al. 2013). Although the underlying mechanisms for these phenomena are unknown and likely to be different, all of these patterns of rearrangements appear to have originated from a single event, simultaneously generating tens or even hundreds of translocations and/or DNA copy number changes (Meyerson and Pellman 2011; Tubio and Estivill 2011; Forment et al. 2012; Jones and Jallepalli 2012; Maher and Wilson 2012). To encompass the potentially all-at-once origin of these mutational mechanisms, a unifying term, chromoanagenesis, has been proposed (Holland and Cleveland 2012). Two of these classes of chromosomal rearrangement, chromothripsis and chromoanasynthesis, affect limited portions of the genome (often a single chromosome or chromosome arm), raising the mechanistic question of how extensive chromosomal rearrangements can be so highly localized. The frequency of these types of events in cancer remains unclear, but many recent studies suggest that they may be more common than initially suspected (Malhotra et al. 2013; Yang et al. 2013; Zack et al. 2013). These new classes of mutagenesis are not restricted to human cancer and have turned up in such exotic places as the Tasmanian devil's transmissible facial tumor (Deakin et al. 2012) and human congenital diseases (Kloosterman et al. 2011a, 2012; Liu et al. 2011; Chiang et al. 2012; Nazaryan et al. 2013). Although these new classes of chromosome rearrangement have mostly been found in the context of disease, it seems likely that they reflect very general mechanisms for genome alteration that could be important for organismal evolution.
A new game at the genome casino
Genome instability is not only a hallmark but also an enabling characteristic of cancer (Negrini et al. 2010; Hanahan and Weinberg 2011; Gordon et al. 2012). Disruption of genome maintenance and elevated rates of somatic mutations (e.g., due to disruption of DNA repair machineries or increased damage) accelerate the transformation from precancerous tissue into malignant tumors (Gorgoulis et al. 2005; Halazonetis et al. 2008; Vogelstein et al. 2013). Despite accelerated tumor evolution, individual mutation events are usually regarded as independent of each other, acquired randomly and gradually. In addition to site-specific mutation events (point mutations, chromosomal translocations, and copy number variations), doubling of the genome content due to cytokinesis failure is a one-off event that can generate aneuploidy and structural alterations of chromosomes (Fujiwara et al. 2005; Holland and Cleveland 2009; Carter et al. 2012; Ganem and Pellman 2012; Zack et al. 2013).In studying the tumor genome of a patient with chronic lymphocytic leukemia (CLL), Stephens et al. (2011) observed a striking pattern of chromosomal rearrangements that was at odds with all known models of genomic alteration. The tumor from this patient contained 45 tumor-specific translocations, 42 of which were concentrated on the long arm of chromosome 4. Furthermore, the rearrangement events were clustered within multiple focal regions in chromosome 4, raising the question of how the damage could be so highly localized. Did this result from a genuinely new mechanism for genome damage, or was the genome center playing the conventional game of random chance but had been dealt a rare hand?Several lines of evidence suggested this was, in fact, a novel phenomenon. First, despite the large number of rearrangements, SNP array copy number analysis showed that the altered region of chromosome 4 existed in only two copy number states, with many transitions between these two states. (As in the study by Stephens et al. (2011), we use the term copy number states because of the highly aneuploid nature of cancer genomes, where each of the parental haplotypes can be present in multiple copies.) This pattern of having only two copy number states is in sharp contrast to conventional clusters of complex rearrangements (Hicks et al. 2006; Bignell et al. 2007; Zhang et al. 2009a). Known causes of complex rearrangements such as BFB cycles characteristically result in multiple amplified copy number states that are assumed to derive from events occurring over many rounds of cell division (McClintock 1941; Stark et al. 1989; Gisselsson et al. 2000; Bignell et al. 2007; Greenman et al. 2012b).The alternation between two copy number states was accompanied by loss and preservation of heterozygosity (Fig. 1A,C). Whole-genome sequencing further revealed a very complex pattern of segmental rearrangements at the boundaries of high copy number states, with events that were head to tail (deletion type), head to head/tail to tail (inversion type), and tail to head (tandem duplication type) (Fig. 2A). However, despite this chaotic pattern of rearrangement, loci that preserved parental copy number also preserved heterozygosity of the parental genotypes (Fig. 1C). As discussed below, this maintenance of islands of heterozygosity among many large regions with loss of heterozygosity (LOH) is difficult to explain by multiple independent deletions.
Figure 2.
Signatures of chromosomal rearrangements from paired-end sequencing. (A) Four major types of chromosomal rearrangement signatures deduced from paired-end sequencing: deletion type, tandem duplication type, inversion type, and long-range type (including interchromosomal). Each type of read pair shows aberrations from proper pairs when aligned to the reference assembly, which implies a new junction in the test chromosome. A deletion (read pairs shown as red arrows) or tandem duplication (shown as green arrows) only introduce one fusion and hence can be unambiguously inferred from a single type of signature (in the absence of other overlapping events). Inversion events, however, introduce at least two rearranged junctions (shown here is a simple inversion); hence, the rearranged genome can only be constructed by considering all related events. The same is true for long-range events with pair mates residing in different chromosomes or separated by centromeres, assembly gaps, or other rearrangements. In general, it is necessary to incorporate copy number information together with rearrangement signatures to derive the structure of the rearranged genome (for a computational framework, see Greenman et al. 2012a). (B) Examples of complex chromosomal rearrangements exhibiting multiple overlapping rearrangement signatures that do not reflect the same events as the “typing.” The discordant read pair signatures (color arrows) are shown both in the rearranged genome and according to their alignment to the reference genome. (i) A complex rearrangement resulting in the deletion of segment A and swapping of segments B and C generates two “tandem duplication” and one “deletion” read pair signatures, none of which corresponds to a simple event. (ii) Two inversions alter a deletion signature into a tandem duplication signature (green arrows).
Signatures of chromosomal rearrangements from paired-end sequencing. (A) Four major types of chromosomal rearrangement signatures deduced from paired-end sequencing: deletion type, tandem duplication type, inversion type, and long-range type (including interchromosomal). Each type of read pair shows aberrations from proper pairs when aligned to the reference assembly, which implies a new junction in the test chromosome. A deletion (read pairs shown as red arrows) or tandem duplication (shown as green arrows) only introduce one fusion and hence can be unambiguously inferred from a single type of signature (in the absence of other overlapping events). Inversion events, however, introduce at least two rearranged junctions (shown here is a simple inversion); hence, the rearranged genome can only be constructed by considering all related events. The same is true for long-range events with pair mates residing in different chromosomes or separated by centromeres, assembly gaps, or other rearrangements. In general, it is necessary to incorporate copy number information together with rearrangement signatures to derive the structure of the rearranged genome (for a computational framework, see Greenman et al. 2012a). (B) Examples of complex chromosomal rearrangements exhibiting multiple overlapping rearrangement signatures that do not reflect the same events as the “typing.” The discordant read pair signatures (color arrows) are shown both in the rearranged genome and according to their alignment to the reference genome. (i) A complex rearrangement resulting in the deletion of segment A and swapping of segments B and C generates two “tandem duplication” and one “deletion” read pair signatures, none of which corresponds to a simple event. (ii) Two inversions alter a deletion signature into a tandem duplication signature (green arrows).A third feature of the observation was that the pattern of end-joining strongly suggested an origin from a DNA double-strand break (DSB). As highlighted in later studies (Kloosterman et al. 2011a, 2012; Chiang et al. 2012), the preponderance of rearrangements involved ligation of both ends of a single DNA break to other genomic segments (ends being operationally defined as sequences within ∼1 kb of each other; hereafter referred to as a pairing of ends). This contrasted with complex genomic rearrangements that arise by replication-based mechanisms (Lee et al. 2007; Carvalho et al. 2009, 2011) where the rearrangement is often seen only on one side of the initial lesion (Haber and Debatisse 2006; see below for more details).The final defining feature of this new mutagenesis event was the nonrandom, highly localized clustering of chromosomal breakpoints. These four hallmarks—alternating copy number profiles, loss and preservation of heterozygosity, pairing of the breakpoint ends, and clustering of rearrangements—define the class of chromosomal alteration termed chromothripsis (Greek for shattering of chromosomes).
All at once?
The defining hallmarks of chromothripsis cannot be explained by a gradual accumulation of independent events. First, truly independent chromosomal rearrangements are expected to be randomly distributed across the genome rather than clustered in focal regions (Beroukhim et al. 2007, 2010). Second, the gradual accumulation of such a large number of overlapping rearrangements, both copy number neutral and copy number changing, would almost certainly produce a final outcome with more than two copy number states. Finally, maintenance of islands of heterozygosity interspersed between regions that have lost heterozygosity is hard to explain by multiple sequential events. This is illustrated by the fact that a deletion early in the process will remove sequences that cannot be regained. Thus, if the observed pattern resulted from sequential mutation events, deletions would need to be restricted to a later time period, when they could not interrupt the acquisition of balanced rearrangements—a complex “programmed” event for which it is hard to envision a mechanism (one example is shown in Fig. 3).
Figure 3.
Reconstruction of a local complex rearrangement and its mutational history from read pair signatures and copy number information. (A) Copy number alterations can be detected by array methods or read depths in sequencing data; rearrangement signatures can be detected from discordant read pairs in sequencing data. (B) By combining copy number information and rearrangement signatures, one can construct the configuration of the rearranged segments in the chromothriptic chromosome. (C) The history of mutation events that could have generated the outcome in B is often not unique. However, all solutions have to break four junctions in the reference assembly (L|A, A|B, B|C, and C|R) and generate three new junctions (L|C, C|B, and B|R). A catastrophe model can easily solve this puzzle because all of the breaks and translocations occur all at once. By comparison, a solution from the gradual model of simple events requires at least three simple events to generate the final configuration (one for each new junction). As each event causes two breakpoints, three events need six breakpoints, which requires two additional breaks in addition to the four original junctions to be broken. These two additional breaks arise from either within the lost A segment (lost history) or hitting the same junction multiple times (recurrent breaks). Shown here is one possible solution that involves recurrent breaks at both B|C and C|R junctions. Both lost history and recurrent breaks are highly unlikely. As the number of rearranged fragments increases, the need to include these types of unlikely events further increases, strongly arguing against the gradual model.
Reconstruction of a local complex rearrangement and its mutational history from read pair signatures and copy number information. (A) Copy number alterations can be detected by array methods or read depths in sequencing data; rearrangement signatures can be detected from discordant read pairs in sequencing data. (B) By combining copy number information and rearrangement signatures, one can construct the configuration of the rearranged segments in the chromothriptic chromosome. (C) The history of mutation events that could have generated the outcome in B is often not unique. However, all solutions have to break four junctions in the reference assembly (L|A, A|B, B|C, and C|R) and generate three new junctions (L|C, C|B, and B|R). A catastrophe model can easily solve this puzzle because all of the breaks and translocations occur all at once. By comparison, a solution from the gradual model of simple events requires at least three simple events to generate the final configuration (one for each new junction). As each event causes two breakpoints, three events need six breakpoints, which requires two additional breaks in addition to the four original junctions to be broken. These two additional breaks arise from either within the lost A segment (lost history) or hitting the same junction multiple times (recurrent breaks). Shown here is one possible solution that involves recurrent breaks at both B|C and C|R junctions. Both lost history and recurrent breaks are highly unlikely. As the number of rearranged fragments increases, the need to include these types of unlikely events further increases, strongly arguing against the gradual model.In contrast to the gradual model, Stephens et al. (2011) proposed a much more compelling catastrophic model of chromothripsis that offers a parsimonious explanation of the observation: One chromosome (or regions from one or more chromosome arms) is shattered into segments of different lengths in a single catastrophic event; the segments are then stitched together, most likely by nonhomologous end-joining, resulting in a derivative chromosome containing a subset of the shattered segments in random orientations. The retained segments maintain the parental copy number and heterozygosity, yielding the high copy number state. The segments that are not incorporated into the derivative chromosome are either lost, yielding the low copy number state, or incorporated into a double-minute chromosome, as discussed below.
Historiography of the cancer genome
These general arguments make intuitive sense but ideally would be backed up by a detailed molecular assembly of the evolutionary path taken to generate the chromothriptic chromosome. Stephens et al. (2011) faced the obvious challenge of inferring prior history from a terminal snapshot of the present, which is already difficult for cancer genomes with simple rearrangement patterns (Greenman et al. 2012a) and quickly becomes impractical for a complex case such as chromothripsis (Yates and Campbell 2012).The first complication for inferring an evolutionary trajectory is purifying selection: The mature tumor is biased to preserve genomic changes that are beneficial to tumor development and to have lost deleterious changes (Beroukhim et al. 2007; Greenman et al. 2007; Bignell et al. 2010). By viewing only the endpoint of selection, we could miss intermediate steps in the development of the rearranged chromosome whose signatures were lost during tumor evolution. An additional technical complication for solving the genomic Rubik's cube for the chromothriptic chromosome is that one cannot interpret paired-end sequences in a simple straightforward way when they occur in the context of other overlapping rearrangements. For example, a discordant pair of sequencing reads that is inferred to be a “tandem duplication type” event when directly mapped onto the reference genome sequence (Fig. 2A; Raphael et al. 2003; Tuzun et al. 2005; Korbel et al. 2007; Campbell et al. 2008; Medvedev et al. 2009) has a completely different interpretation under a rearranged genome when additional rearrangement events have occurred between the pair mates (Fig. 2B). This technical issue is less problematic for simple genomes where rearrangements seldom overlap and can be decoupled and interpreted unambiguously (Greenman et al. 2012a,b; Quinlan and Hall 2012). However, it becomes prohibitively complex in chromothripsis, where tens or hundreds of rearrangements overlap, resulting in an enormous number of possible permutations. This provides an important cautionary note for aspiring genome analysts: An accurate interpretation of a highly rearranged cancer genome from paired-end sequencing data requires an integrative analysis of rearrangement signatures (such as deletion type or tandem duplication type) (Fig. 2A) and copy number data. In other words, context matters.The following back-of-the-envelope calculations illustrate the magnitude of the problem. The number of different outcomes created by random permutations of 10 overlapping rearrangements is 10! = 3,628,800 and the number grows to ∼1064 for 50 rearrangements. Typical chromothripsis harbors tens or even hundreds of overlapping rearrangements, and it is clearly impossible to enumerate all possible outcomes. Additional complications are caused by intermediate events being obscured by later events; e.g., a later deletion event that removes the signature of an earlier translocation. This estimate also does not account for rearrangements that are not detectable due to technical issues such as uncertainties in the reference genome assembly or inaccuracies in short-read alignment.The most difficult challenge stemming from the vast number of outcomes due to permutation complexity, however, is finding not just one solution, but the most likely solution based on a certain background statistical model of tumor evolution. Ultimately, we want to know the likelihood of the multievent solution to chromothripsis based on the background model of tumor evolution to dismiss the gradual model in favor of a single catastrophe. (Fig. 3 illustrates this problem by a simple example.) To answer this last question without enumerating all possible outcomes of a gradual model, Stephens et al. (2011) focused on the number of copy number states during genome evolution, essentially reducing the multidimensional space of chromosome configurations to a single dimension of “copy number.” They set up simulations in which individual rearrangement events were applied to the genome sequentially, starting from the reference sequence. Instead of brute force enumeration of all genomic configurations, the eventual number of copy number states was scored, making a testable prediction about the structure of the chromothriptic chromosome without the need for a complete molecular reconstruction of the rearranged chromosome. The gradual model rapidly generated multiple high copy number states because gradual accumulation allows for overlapping tandem duplications to affect the same region multiple times. The discrepancy between the large copy number complexity implied by the gradual model and the hallmark two copy number states observed in chromothripsis provided a strong statistical argument against a gradual model for chromothripsis. It is worth noting that chromothriptic chromosomes can sometimes exhibit multiple copy number states (Stephens et al. 2011; Rausch et al. 2012). One explanation is the overlap between chromothripsis and other independently generated copy number gains; another possibility is that chromothripsis that occurred during or after DNA replication can result in a shattering and assembly of both sister chromatids to generate copy number gains. In either scenario, however, the low copy number states (three or four) are still in stark contrast to the rapid accumulation of multiple copy number states, as the gradual model suggests.The focal pattern of breakpoints in chromothripsis provides yet another argument against the gradual model. Independent rearrangements are generally assumed to be randomly distributed across the genome. The large size of the human genome implies that it is extremely unlikely to see two events that are very close to each other (∼1 kb apart). The likelihood of seeing tens or hundreds of breakpoints within one chromosome (or chromosome arm) with few events elsewhere in the genome is vanishingly small if these events were independent and not due to specific targeted mechanisms. This statistical argument suggests that clustered rearrangements in chromothripsis very likely occurred simultaneously. A generalization of this statistical argument into a systematic way of inferring co-occurring translocations (“chained” translocations) was recently proposed (Baca et al. 2013). This new approach, discussed further below, potentially provides a complementary method to validate the idea that chromothriptic lesions occurred at the same time.In summary, chromothripsis is best explained by a catastrophic all-at-once model and cannot be reconciled with the gradual model. The catastrophic model yields a simple explanation of the low number of observed copy number states. It also provides an explanation for the many overlapping tandem duplication type events that do not increase copy number (see Fig. 2B, i, for an example): These duplication type events are instead explained by the random orientations at which the segments are stitched together. Similar explanations can apply to deletion and other “type” events (Fig. 2B).
When is it chromothripsis?
The discovery of chromothripsis in cancer immediately raises the question of how frequently it occurs. After the initial discovery, there have been a plethora of studies confirming that chromothripsis is indeed a widespread and previously unappreciated phenomenon in both primary tumors (Kloosterman et al. 2011b; Magrangeas et al. 2011; Berger et al. 2012; Rausch et al. 2012) and cancer cell lines such as HeLa (Adey et al. 2013; Landry et al. 2013). Many cancer genomes reported in earlier studies also exhibited features that resemble chromothripsis (Stephens et al. 2009; Campbell et al. 2010). So how are we to recognize chromothripsis? How many rearrangements “make” the definition, and how focal is focal? Because copy number data are the most available, a common sense approach was to adopt operational definitions that would require, say, 10, 20, or 50 oscillating copy number changes (Magrangeas et al. 2011). In some cases, corrections were applied to normalize the number of focal events to the general frequency of events on that chromosome (Kim et al. 2013).Although an operational criterion is reasonable for a first-pass screening and many instances with multiple oscillations between two copy number states likely are bona fide chromothripsis, the field is moving away from operational definitions and adopting criteria that incorporate information from paired-end sequencing. Through more precise analysis of the structure of the derivative chromosome, one can better distinguish chromothripsis from a large number of progressively acquired rearrangements in tumors with high levels of genetic instability. Because an excellent review by Korbel and Campbell (2013) has recently covered this topic in depth, we only briefly review the main elements of these criteria here.As discussed above, one hallmark of chromothripsis is the clustering of rearrangements. The degree of clustering can be assessed by simple statistical methods because the null hypothesis of random breakpoints predicts that the distance between breakpoints should be distributed exponentially (Beroukhim et al. 2010; Malhotra et al. 2013). However, it is important to note that clustering of breakpoints can also be generated by other processes involving multiple steps over several cell division cycles. For example, clustering can result from BFB cycles (Gisselsson et al. 2000), chromosome fragile sites (Debatisse et al. 2012), and strong positive selection for local oncogene amplifications (Bignell et al. 2007). Thus, clustering by itself is a necessary but not sufficient criterion for chromothripsis. An additional criterion is provided by the fact that chromothripsis is characterized by regions with interspersed retention and LOH, which can be verified from genotype information from either SNP array or sequencing. Finally, if restricted to a single chromosome, the regions of rearrangement and segment loss should be confined to a single parental haplotype because the shattering and stitching back together occurs (mostly) on a single chromatid. This can be directly verified by cytogenetic methods, such as fluorescence in situ hybridization (FISH), or inferred from phased haplotypes in trio family studies (Browning and Browning 2011).Additional information from sequencing can evaluate the model that a chromosome has been shattered and randomly stitched together in more detail. First, the model predicts that the order of the fragments on the derivative chromosome would be random or near random with respect to their positions on the reference genome. Second, if reassembly of the derivative chromosome is random, the orientations of fragments on the derivative chromosome should also be random. This means that there should be no bias for fragments to be joined head to tail (deletion type), tail to head (tandem duplication type), head to head, or tail to tail (inversion type). Such randomness in the orientation of fragments contrasts with, for example, BFB cycles, which are characterized by inverted segments biased to be head to head or tail to tail (Bignell et al. 2007). Finally, if the shattering and restitching indeed happen all at once, then each rearranged fragment will have a unique head and tail sequence (with respect to the reference sequence). Thus, when “walking” down a chromothriptic chromosome, there will be an invariant alternation between head and tail (or tail and head) sequences, reflecting a linear connection (Korbel and Campbell 2013). This contrasts with the prediction for a mechanism involving sequential acquisition of rearrangements, particularly those involving amplifications. In this latter case, rearranged segments could be “reused” in ways that would generate heads followed by heads and tails followed by tails when amplifications are nested.Clearly, within a short period of time, the field has advanced far beyond “know it when you see it” operational definitions for chromothripsis. The above-delineated criteria each address a different hallmark feature of chromothripsis and can be subjected to rigorous statistical evaluation. Nevertheless, challenges remain. How exactly to combine multiple criteria and weigh them properly to make the most probable case for or against chromothripsis needs further development. There are also numerous technical challenges. Short sequence reads can only be aligned to the mappable regions of the genome, and the interesting question of what happens within highly repetitive sequences remains to be resolved. Complications also come from other factors, such as inaccuracies in short-read alignments, the obscuring effect of other structural rearrangements (e.g., simple deletions/amplifications/translocations) layered on top of chromothripsis, tumor heterogeneity, and aneuploidy. All of these technical factors may lead to an underestimate of the frequency of chromothripsis.
How frequently does chromothripsis happen?
So what, then, is our best guess for the frequency of chromothripsis? Unfortunately, the largest volume of cancer genomic data (such as in The Cancer Genome Atlas [TCGA] project) is array-based and does not have the power to provide an accurate estimate of the overall frequency of chromothripsis. However, it is becoming evident that the frequency of chromothripsis tends to vary between different tumor tissue types. In a survey of whole-genome sequencing data for 64 samples of seven tumor types from TCGA, Malhotra et al. (2013) found no chromothriptic rearrangements in basal-like breast cancers, colon adenocarcinomas, ovarian cancers, or renal adenomas but found rearrangements consistent with chromothripsis in two of six (33%) lung adenocarcinomas, two of 13 (15%) lung squamous cell carcinomas, and 39% of glioblastomas (n = 18). In a recent survey of 4934 cancers from TCGA, Zack et al. (2013) employed multiple criteria to model chromothripsis by SNP array copy number alone and suggested that chromothripsis occurred in 5% of all samples, with frequencies ranging from 0% in head and neck squamous carcinoma to a maximum of 16% in glioblastoma. In addition, chromothripsis has been observed in colorectal cancer (Kloosterman et al. 2011b), melanoma (Berger et al. 2012; Hammond et al. 2013; Hirsch et al. 2013), small-cell lung cancer (Iwakawa et al. 2013), prostate cancer (Wu et al. 2012), and a variety of brain tumors (Molenaar et al. 2012; Northcott et al. 2012; Brastianos et al. 2013) and blood malignancies (MacKinnon and Campbell 2013; Morin et al. 2013; Nagel et al. 2013). It is interesting to point out that the rate of chromothripsis seems to be unrelated to the overall rate of somatic copy number alterations (SCNAs) (Zack et al. 2013).The high frequency of chromothripsis in certain tumor types suggests that chromothripsis depends on the genetic and environmental background of cancers. Among the most frequently mutated genes in human cancer is TP53 (Olivier et al. 2010), which plays central roles in maintaining genome stability (Zhu et al. 2002; Meek 2004; Thompson and Compton 2010). The correlation between TP53 mutation and chromothripsis was first demonstrated in Sonic Hedgehog (SHH) medulloblastoma. Rausch et al. (2012) applied high-throughput whole-genome sequencing and long-fragment mate-pair sequencing to characterize genomic rearrangements in four patients of SHH medulloblastoma who also had germline-inactivating TP53 mutations (Li-Fraumeni syndrome). Sequencing of each of these tumors not only confirmed clustering of chromosomal breaks indicative of chromothripsis but also revealed high-level amplifications due to double-minute chromosomes that originated from the shattered fragments. Thus, TP53 mutations are permissive or conducive for either the initiating chromothriptic event, the selective advantage of cells bearing the rearranged chromosome, or both. Rausch et al. (2012) further identified strong correlations between TP53 mutation and chromothripsis in acute myeloid leukemia, demonstrating that the effect of TP53 mutation can manifest in multiple tissue types.How accurate are the estimates of the frequency of chromothripsis? Although there could be some overestimation due to ambiguities in the interpretation of genomic data, we must also consider underestimation due to the fact that the overwhelming majority of chromothriptic events are almost certainly lost because of deleterious fitness effects. Negatively selected events in normal or premalignant tissues could be detected only at a single-cell level immediately after the chromothriptic rearrangement occurs, which is currently not feasible on the appropriate scale. Ultimately, understanding the mechanisms that generate chromothripsis will be integral to estimating its frequency. If the underlying cellular defects could lead to, but not cause, chromothripsis every time they occur, then the observed cases of chromothripsis only represent the tip of the iceberg for less dramatic but possibly more common mutational events.
Does chromothripsis cause cancer?
In general, we expect that the disruption of chromosomal integrity in a catastrophe such as chromothripsis will be negatively selected or even trigger apoptosis. However, this does not exclude the possibility that rare catastrophes could promote cellular transformation, which could happen in at least two ways through copy number alterations. First, the massive DNA loss can include one or more tumor suppressor genes (Stephens et al. 2011; Yang et al. 2013). In their survey of the TCGA SNP array data, Zack et al. (2013) found that 72% of chromothripsis events resulted in SCNAs in regions that are frequently disrupted in cancer, with the most dramatic case in glioblastoma showing that 20 out of 22 samples with chromothripsis had one event disrupting the CDKN2A locus. Second, and perhaps most importantly, shattered chromosomal segments not incorporated into the derivative chromosome can be linked to form a double-minute (circular) chromosome. These double minutes are readily amplified, which commonly occurs if they harbor an oncogene. An illustration of this comes from the original reports on chromothripsis (Stephens et al. 2011; Rausch et al. 2012). Stephens et al. (2011) described a small-cell lung cancer cell line where a chromothriptic event generated a derivative chromosome 8 and a double minute containing MYC and other segments of chromosome 8. Most notably, the sequences on the derivative chromosome and the double minute were exclusive, providing compelling evidence that early in the development of this tumor, chromosome 8 was shattered, with some bits being contemporaneously incorporated into the derivative and others going into the double minute.Having recognized the association between chromothripsis, double minutes, and cancer-causing mutations, one can infer chromothripsis as the most likely underlying mechanism in the formation of high-level focal amplifications that incorporate multiple disconnected segments (Iwakawa et al. 2013; Sanborn et al. 2013). The multiple focal amplifications can be either extrachromosomal (double minutes) or intrachromosomal (homogeneously staining regions [HSRs]) due to chromosomal integration of double minutes (Storlazzi et al. 2010; Gibaud et al. 2013). Indeed, data linking the formation of double minutes to the formation of derivative chromosomes containing complex rearrangements were known prior to the first description of chromothripsis. Gibaud et al. (2010) characterized five different extrachromosomal amplifications of double-minute chromosomes in a glioma sample: One double-minute chromosome consisted of multiple segments from chromosome 5 and chromosome 9, which, in retrospect, was likely to have resulted from chromothripsis. FISH analysis verified that the extrachromosomal amplicon sequences were not present in the derivative chromosomes derived from chromosome 5 and chromosome 9. We further note that copy number profiles of both chromosomes showed the hallmark copy number patterns suggestive of chromothripsis. In addition, the rearrangements also created oncogenic lesions. The double minute contained the TERT (telomere reverse transcriptase) gene from a fragment in chromosome 5. TERT is amplified or overexpressed in many cancer types (Beroukhim et al. 2010). The functional significance of TERT overexpression is remarkably underscored by the recent discovery that ∼80% of glioblastomas harbored mutations in the 5′ flanking region of TERT that activate its expression (Horn et al. 2013; Huang et al. 2013; Killela et al. 2013). Interestingly, the glioma sample studied by Gibaud et al. (2010) also had deletions encompassing the CDKN2A locus on the chromosome 9 derivatives. Thus, one episode of chromothripsis appears to have generated two critical oncogenic lesions (TERT amplification and CDKN2A deletion).Besides copy number alterations, translocations in chromothripsis can also lead to gene fusions or dysregulated gene expression. Mehine et al. (2013) suggested that chromothripsis is a frequent driver event in uterine leiomyomas, resulting in increased expression of the translocated HMGA1 and HMGA2 genes. However, the probability of generating in-frame gene fusions—the major class of oncogenic translocations—through random chromosomal translocations is generally small (Stephens et al. 2009), and the observed frequency of in-frame fusions due to chromothripsis is comparable with those resulting from simple rearrangement events (Stephens et al. 2011). Nonetheless, although each rearranged segment in chromothripsis has a small probability of generating a functionally important fusion gene (Teles Alves et al. 2013), it is reasonable to speculate that chromothripsis, because it generates many rearrangements, will increase the frequency of such fusions. This is not because chromothripsis has any unique ability to generate gene fusions but simply reflects the fact that in chromothripsis, more shots are taken on goal.Although there is good reason to think that chromothripsis can generate oncogenic alterations and potentially multiple mutations at one fell swoop, we are just starting to understand the impact of chromothripsis on cancer pathogenesis or prognosis. Recent studies have shown that chromothripsis can independently arise in either the primary tumor or its metastasis (Kloosterman et al. 2011b) and does not necessarily predict resistance to drug treatment (Bassaganyas et al. 2013). Even though chromothripsis has been suggested as a poor prognostic factor (Molenaar et al. 2012; Hirsch et al. 2013), direct experimental characterization of chromothripsis-derived mutations is lacking.One important question is when chromothripsis occurs during tumor development: Can it be an early driving event, or is it just another consequence of genomic chaos in late stage cancers? The latter possibility is intuitive, but there is also interesting evidence for chromothripsis occurring in the absence of massive genetic instability. First, allele frequencies of chromothriptic rearrangements suggest that chromothripsis generally occurs earlier than simple rearrangements (Malhotra et al. 2013). Moreover, the high frequency of chromothripsis identified in tumors with low to moderate mutation rates (neuroblastoma, medulloblastoma, glioblastoma, and blood and bone marrow malignancies) suggests that chromothripsis does not, in any obligatory way, require a background of elevated genomic instability. On the other hand, tumor types with frequent copy number alterations, such as ovarian or lung cancers, exhibit a larger number of chromosomal alterations, but they are often independently acquired rather than correlated as in chromothripsis (Malhotra et al. 2013; Zack et al. 2013). Additionally, as described in more detail below, de novo complex rearrangements that resemble chromothripsis have also been observed in patients with congenital disorders. So far, there has been no evidence that these patients have a cancer predisposition or defects in the maintenance of genome stability like patients with germline mutations in the TP53 or BRCA genes. Together, these findings imply that chromothripsis can originate in cells with normal or near-normal genetic stability.
Chromothripsis in congenital disorders
Strong selection during embryonic development ensures that only mild genomic alterations can be tolerated through development. It was therefore very surprising when recent studies reported massive constitutional chromosomal rearrangements similar to those of chromothripsis in cancer in some patients with congenital developmental disorders (Kloosterman et al. 2011a, 2012; Chiang et al. 2012; Nazaryan et al. 2013). In comparison with chromothripsis in cancer, these instances of apparent germline chromothripsis showed minimal DNA loss, presumably reflecting the selection against haploinsufficiency or complete loss of gene function due to monoallelic gene expression. Array-based copy number analysis, such as array comparative genome hybridization (aCGH) or SNP arrays, cannot, as expected, detect these copy-neutral chromosomal alterations, necessitating whole-genome sequencing.
Chromoanasynthesis
A different pattern of constitutional complex genome rearrangements was reported by Liu et al. (2011) in their investigation of patients with congenital developmental defects. The investigators combined aCGH and FISH to characterize copy number alterations and chromosomal rearrangements in these patients and sequenced the PCR products at rearrangement junctions to resolve breakpoint sequences. In contrast to chromothripsis, Liu et al. (2011) observed frequent copy number gains with interspersed regions containing duplication or triplication of one parental allele but no LOH (Fig. 1D). Breakpoint junction sequences exhibited microhomology but also included templated sequence insertions (100–1000 base pairs [bp]) on the rearranged chromosome. The short template insertions suggested a replication-based mechanism for incorporating these segments during the repair of DNA DSBs (Lee et al. 2007; Hastings et al. 2009). The observation of microhomology further led Liu et al. (2011) to propose microhomology-mediated breakage-induced repair (MMBIR) as the major mechanism for generating the observed translocations (Carvalho et al. 2009; Zhang et al. 2009b; Liu et al. 2012). In this model, the extra allele copies are incorporated due to template-switching events in DNA replication instead of ligation (via end-joining) of shattered fragments. To emphasize the difference in mechanisms, Liu et al. (2011) termed this phenomenon chromoanasynthesis (synthesis of new chromosomes). They also suggested that the same replicative mechanism may play a role in chromothripsis due to similar features in both phenomena, such as localized clusters of rearrangements and copy number alternations.One of the hallmark features of chromothripsis is interspersed loss and preservation of heterozygosity, which suggests a “shatter-and-stitch” process of one chromatid. This is in sharp contrast to chromoanasynthesis, which reflects extra synthesis of DNA segments. Although extra synthesis can lead to alternating DNA copy numbers, it does not necessarily result in LOH: LOH occurs by DNA deletion or gene conversion. Another difference between chromoanasynthesis and chromothripsis is that adjacent chromosomal ends in chromothripsis (especially in the germline) often result from one DNA DSB, whereas segmental breaks in chromoanasynthesis are separated by longer distances (0.01∼1 Mb) and appear to result from template-switching events. Although replication-based DNA repair can contribute to chromothripsis, as evidenced from examples of templated insertion sequences (Chiang et al. 2012), it is difficult to explain the proximity of adjacent chromosomal breaks based purely on template switching without DNA DSBs. Finally, chromoanasynthesis results in one aberrant chromosome with duplicated segments due to extra DNA synthesis; in contrast, chromothripsis generally gives rise to one or multiple derivative chromosomes with either massive DNA loss (in cancer) or no apparent DNA copy number alterations (in the germline). Based on these differential characteristics, we agree with the conclusion that chromoanasynthesis is a distinct phenomenon from chromothripsis (Holland and Cleveland 2012; Liu et al. 2012; Kloosterman and Cuppen 2013).The distinction between chromothripsis and chromoanasynthesis is also evident in a more recent report of germline genomic rearrangements. Kloosterman et al. (2012) identified eight cases that resemble chromothripsis, exhibiting random stitching of segments from multiple chromosomes with little DNA loss, and two cases with apparent gain or deletion of DNA segments that are suggestive of replication errors. One interesting observation is that the rearranged alleles in both chromothripsis and chromoanasynthesis are preferentially of paternal origin; this was attributed to errors that result from the higher number of mitotic divisions in gametogenesis in males relative to females (Kloosterman and Cuppen 2013). In contrast, most human germline aneuploidies are maternally derived (Handyside et al. 2012; Nagaoka et al. 2012), and de novo chromosomal instability in early embryonic development does not show a preferred origin (Vanneste et al. 2009; Voet et al. 2011).
Chromoplexy
Chromothripsis and chromoanasynthesis are two examples of complex genomic rearrangement triggered or generated by a single event. A single catastrophic event was also invoked to explain a recently described pattern of rearrangement found in prostate cancers (Fig. 1E; Berger et al. 2011; Baca et al. 2013). Termed chromoplexy (Greek for braid of chromosomes), this event is characterized by “chains” of translocations involving multiple chromosomes—up to eight so far described. The sequence signatures of chromoplexy suggest that all of the partner segments involved in the translocations originate from DNA DSBs, with both ends of each break contributing to the daisy chain cycle of translocations (Fig. 4A). This organization of breakpoint ends bears some similarity to the pairing of the ends described above in chromothripsis. As with chromothripsis, the pairing pattern strongly implies that chromoplexy is another example of an all-at-once mechanism for chromosomal rearrangement: If the breakpoint ends originate from DSBs and are joined into an integrated pattern of translocations, then the breaks must have occurred simultaneously.
Figure 4.
Chromoplexy and ChainFinder. (A) Chromoplexy is an extension of balanced rearrangements to multiple partners. The different colors of linked discordant pair mates reflect different chromosomes or loci to which each mate is aligned. DNA resection during the repair of DSBs often results in loss of a small segment between the two ends. (B) Adjacent translocations of two breakpoint ends can arise in two different ways. (Left) In a single-event model, a single DNA DSB generates two broken ends with a small deletion due to DNA resection; joining of the two broken ends with ends from other DNA breaks produces the observed translocations. (Right) In a multiple-event model, two independent balanced translocations occur independently at different times; the inner DNA segment could have been lost in a subsequent deletion event (shown in the dashed box) or due to DNA resection. The distance between the two adjacent translocation ends, D, provides a statistical measure of the likelihood of either scenario. The two-event model implies that two independent breaks must have occurred within distance D. This likelihood is approximately given by 1 − exp(−μD), where μ is the average density of chromosomal breaks that can be estimated for each sample at any given locus (≲10−6 per base pair). When the ends are very close (100–1000 bp), this probability is vanishingly small, favoring the single-event model and rejecting the multievent model. The central idea of ChainFinder is to find “chained” translocations that form a closed cycle and where the multiple-event model can be rejected for any two pairs of adjacent translocations (i.e., no reciprocal translocation can be postulated as an intermediate state, connecting two smaller chains).
Chromoplexy and ChainFinder. (A) Chromoplexy is an extension of balanced rearrangements to multiple partners. The different colors of linked discordant pair mates reflect different chromosomes or loci to which each mate is aligned. DNA resection during the repair of DSBs often results in loss of a small segment between the two ends. (B) Adjacent translocations of two breakpoint ends can arise in two different ways. (Left) In a single-event model, a single DNA DSB generates two broken ends with a small deletion due to DNA resection; joining of the two broken ends with ends from other DNA breaks produces the observed translocations. (Right) In a multiple-event model, two independent balanced translocations occur independently at different times; the inner DNA segment could have been lost in a subsequent deletion event (shown in the dashed box) or due to DNA resection. The distance between the two adjacent translocation ends, D, provides a statistical measure of the likelihood of either scenario. The two-event model implies that two independent breaks must have occurred within distance D. This likelihood is approximately given by 1 − exp(−μD), where μ is the average density of chromosomal breaks that can be estimated for each sample at any given locus (≲10−6 per base pair). When the ends are very close (100–1000 bp), this probability is vanishingly small, favoring the single-event model and rejecting the multievent model. The central idea of ChainFinder is to find “chained” translocations that form a closed cycle and where the multiple-event model can be rejected for any two pairs of adjacent translocations (i.e., no reciprocal translocation can be postulated as an intermediate state, connecting two smaller chains).To put the intuitive argument into a more rigorous framework, Baca et al. (2013) developed an algorithm called ChainFinder to identify the chained translocations of chromoplexy. The assumption is that the probability that the observed translocations of two adjacent breakpoint ends occurred simultaneously is related to the probability that two breakpoint ends originate from a single DSB. The inference of a DSB is not straightforward because the breakpoint ends originating from a single DSB due to resection and other processing are not precisely connected in the reference sequence (see Fig. 4B for an illustration); a null model for independently generated breaks is therefore necessary for a statistical inference. To create the null model of independently generated chromosomal breaks, ChainFinder first estimates the density of chromosomal breaks at any locus by averaging over multiple tumor samples and then multiplying this by the total number of chromosomal breaks detected in a particular sample. This generates a sample-specific, locus-dependent frequency of breaks. Based on the null model of local breakage frequency, ChainFinder calculates the probability that two adjacent translocation ends have originated from independent breaks (Fig. 4B). This probability monotonically decreases with the separation between the two ends and becomes vanishingly small when the two ends are very close (100∼1000 bp), enabling a rejection of the null model that the two translocations occurred independently. ChainFinder then searches for all “cycles” of rearrangements (closed chains of rearrangements) and tests the null model for every two pairs of adjacent breakpoints: If both pairs of ends have a reasonable chance of having originated from independent breaks, then these two pairs of ends can arise from two independent events, and the original cycle is reduced to two smaller cycles. Chromoplexy events are therefore defined as chains of translocations that cannot be further reduced.The ChainFinder algorithm is conceptually different from and potentially complementary to the Monte Carlo simulations used by Stephens et al. (2011) to infer that complex rearrangements have resulted from a single catastrophic event. The Monte Carlo simulations monitor DNA segment copy number as a surrogate, enabling rejection of the gradual model without directly testing the all-at-once model. In contrast, ChainFinder directly examines the interdependency of rearrangements and thus gets closer to a direct test of the all-at-once hypothesis. The complex patterns of rearrangements in chromothripsis are almost certainly “chained”; it is thus possible to apply ChainFinder to deconvolute chromothripsis and chromoplexy from other independent rearrangements that occur on the same chromosome or its homologous counterpart. Malhotra et al. (2013) adopted a similar heuristic idea to search for clusters of translocations to infer chromothripsis. A more rigorous generalization of the idea of “two simultaneous translocations due to one DSB” as judged from the spatial proximity of breakpoint ends could be a powerful method to deconvolute the evolution history of rearranged genomes.The following example illustrates the potential for a more general application of this type of approach to resolve complex rearrangements into smaller chains and decipher the history of events during tumor evolution. In an early report of germline chromothripsis (Kloosterman et al. 2011a), we noticed that the complex rearrangements of multiple chromosomal segments in three chromosomes (1, 4, and 10) can possibly be reduced to two cycles: the first cycle containing three DSBs originating from each chromosome (Fig. 5A,B) and the second containing only DSBs in chromosome 4 and chromosome 10 (Fig. 5B,C). Because the pairs of adjacent breakpoint ends are close to each other (<1000 bp), the translocations are almost certainly chained together in each cycle. Interestingly, if the first chain of three translocations occurred all at once, the result would have been a functional centromere-containing derivative chromosome, t(4*,10q), which is retained in the final chromosomal configuration, plus two mitotically unstable chromosomes: a dicentric chromosome, t(10*,1*), and an acentric chromosome fragment, t(1p,4q) (Fig. 5B). The two mitotically unstable derivatives would either be lost or undergo additional rearrangement. It is therefore possible that unstable chromosomes generated by the original chain of translocations (chromoplexy) could result in subsequent catastrophes that add chromothripsis on top of the original translocations. This speculative model, which we term translocation-induced chromothripsis, is one potential mechanism for chromothripsis involving multiple chromosomes.
Figure 5.
Possible decomposition of rearrangements reported by Kloosterman et al. (2011a) into two chains. The first chain is plausibly a chromoplexy event that swaps partners between six chromosomal arms, shown in A. Asterisks denote chromosomal segments containing a centromere. Newly formed junctions are shown as red links. This chromoplexy episode would create one functional (single centromere-containing) derivative chromosome, t(4p*, 10q), plus two aberrant ones, t(10p*,1q*) (dicentric) and t(1p,4q) (acentric), as shown in B. The unstable dicentric and acentric chromosomes could then have triggered the remaining rearrangements shown in C, which are interlinked and resemble a chromothripsis. The outcome of this second chain is the swap of partners between the two nonfunctional chromosomes 10A*–1B* and 1A–4B, which results in two new derivative chromosomes, 10A*–(-1A) and (-4B)–1B*. This event is accompanied by fragmentation and repair of multiple segments near the junctions from the first chromoplexy event (10a, 10b, and 10c; and 4b, 4c, 4d, 4e, and 4f).
Possible decomposition of rearrangements reported by Kloosterman et al. (2011a) into two chains. The first chain is plausibly a chromoplexy event that swaps partners between six chromosomal arms, shown in A. Asterisks denote chromosomal segments containing a centromere. Newly formed junctions are shown as red links. This chromoplexy episode would create one functional (single centromere-containing) derivative chromosome, t(4p*, 10q), plus two aberrant ones, t(10p*,1q*) (dicentric) and t(1p,4q) (acentric), as shown in B. The unstable dicentric and acentric chromosomes could then have triggered the remaining rearrangements shown in C, which are interlinked and resemble a chromothripsis. The outcome of this second chain is the swap of partners between the two nonfunctional chromosomes 10A*–1B* and 1A–4B, which results in two new derivative chromosomes, 10A*–(-1A) and (-4B)–1B*. This event is accompanied by fragmentation and repair of multiple segments near the junctions from the first chromoplexy event (10a, 10b, and 10c; and 4b, 4c, 4d, 4e, and 4f).
Mechanisms for complex genome rearrangements
The discovery of chromothripsis, chromoanasynthesis, and chromoplexy suggests the existence of novel processes impacting genome architecture. The underlying mechanisms are likely to be different for each phenomenon. However, any mechanisms need to explain the common theme of single-event-mediated extensive damage and, for chromothripsis and chromoanasynthesis, the fact that the damage is highly localized. Little is known about these underlying mechanisms, but we briefly review current thinking below.For chromoanasynthesis, the sequence signatures at translocation junctions strongly implicate replication-based mechanisms (Liu et al. 2011, 2012). The fact that chromoanasynthesis is characterized by apparent template insertions from nearby genomic regions, often contains microhomology at the breakpoint junctions, and lacks the sequence signatures of DSBs strongly implicates replication template-switching mechanisms. These mechanisms have been termed fork stalling and template switching (FoSTeS) and MMBIR (Slack et al. 2006; Lee et al. 2007; Carvalho et al. 2009; Hastings et al. 2009; Zhang et al. 2009a,b). Best characterized in yeast (for recent reviews, see Anand et al. 2013; Lambert and Carr 2013), these mechanisms can account for duplications (Narayanan et al. 2006; Hicks et al. 2010) and fold-back inversions (Mizuno et al. 2013). The template switches can arise by recombination involving invasion of homologous sequences or the use of microhomologies to prime ectopic DNA synthesis. Also consistent with a role for replication defects in this type of errors, replication stress induced by hydroxyurea or aphidicolin can induce copy number variation in human cells (Arlt et al. 2009, 2012). Replication defects also can explain the regional bias for the abnormalities of chromoanasynthesis: There is evidence that breakpoint ends of copy number abnormalities tend to be in physical proximity (Lieberman-Aiden et al. 2009; De and Michor 2011; Fudenberg et al. 2011); hence, different template-switching events could have occurred in loosely defined nuclear territories. Nevertheless, without direct experimental recapitulation of chromoanasynthesis, the inferred models for the mechanism, although supported by strong indirect evidence, remain provisional (Currall et al. 2013; Kloosterman and Cuppen 2013).For chromoplexy, no mechanistic information is currently available. It is clear that many breaks must be present around the same time for chromoplexy to occur. Thus any event that can induce multiple breaks, such as DNA replication stress (Donley and Thayer 2013) or active transcription (for review, see Kim and Jinks-Robertson 2012), could lead to concurrent DSBs in multiple chromosomes. Chained translocations to repair these breaks could then occur by chance, with selection acting to preserve copy number balance and oncogene activation at the breakpoint junctions. It is also possible that chained translocations are promoted by physical clustering of the breakpoints in the nucleus (Roix et al. 2003; Meaburn et al. 2007; Misteli 2010; Alt et al. 2013). This could occur by clustering of commonly regulated transcriptional sites or other clustering events, such as the assembly of DNA replication factories. In the case of prostate cancer, colocalization of transcriptional sites could be mediated by colocalization of androgen-responsive elements (Lin et al. 2009), which could cluster topoisomerase II-mediated DSBs (Haffner et al. 2010).The origin of chromothripsis is also unclear, but plausible hypotheses have been recently advanced (Forment et al. 2012; Holland and Cleveland 2012; Jones and Jallepalli 2012; Kloosterman and Cuppen 2013). The major puzzle is how a large number of DSBs can be so highly localized. One proposal is that shattering occurs during mitosis when the chromosomes are highly compacted. At this point in the cell cycle, it is conceivable that external damage such as from ionizing radiation could slice through multiple segments of a single chromosome (Stephens et al. 2011). Critical telomere shortening leading to BFB cycles has also been considered because the induced rearrangements are local and chromothripsis is often observed near the telomeres (Stephens et al. 2011). However, disfavoring this explanation is the fact that BFB cycles can result in gene amplification and should produce a bias for inverted rearrangements.One simple solution to generate massive local mutagenesis is when one or two chromosomes (or an isolated acentric chromosome arm) are physically isolated from the other chromosomes. A simple and appealing hypothesis for how this could occur with fairly high frequency in human cancers is the partitioning of whole chromosomes or acentric chromosome fragments into micronuclei (Fig. 6; Crasta et al. 2012). It has become apparent that chromosomes in micronuclei can be subject to massive spontaneous DNA damage (Terradas et al. 2009; Forment et al. 2012; Holland and Cleveland 2012), and this damage occurs in the first cell cycle after micronuclei are formed (Crasta et al. 2012). Importantly, damaged micronuclei can be reincorporated into the main nucleus during subsequent cell division (Crasta et al. 2012; Hatch et al. 2013). This means that damaged micronuclei can, at least in principle, generate mutations that can be integrated back into the genome.
Figure 6.
Extensive DNA damage of chromosomes in micronuclei (MN). Micronuclei can form by missegregation of whole chromosomes after mitotic errors. Pictured here is a “merotelic” kinetochore attachment where a single intact chromatid is caught in a tug-of-war between microtubules from opposite poles. Micronuclei can also originate from chromosome breaks, generating acentric chromosome fragments, or be generated from chromosome bridges by a mechanism that has not been determined (Hoffelder et al. 2004). (Left) Newly generated micronuclei can undergo irreversible loss of nuclear membrane integrity (Hatch et al. 2013). If this occurs during S phase, chromosomes in micronuclei will acquire massive DNA damage. (Right) Alternatively, late-replicating chromosomes in micronuclei may undergo DNA damage if the cell enters mitosis before micronuclear replication is complete (Crasta et al. 2012), subjecting the chromosome to premature chromosome compaction (PCC). The extent and mechanism of DNA damage during PCC remains to be established. During the subsequent cell division cycle, the micronuclear genome can be reincorporated into the primary nucleus, remain isolated in a micronucleus, or possibly be lost due to autophagy or extrusion.
Extensive DNA damage of chromosomes in micronuclei (MN). Micronuclei can form by missegregation of whole chromosomes after mitotic errors. Pictured here is a “merotelic” kinetochore attachment where a single intact chromatid is caught in a tug-of-war between microtubules from opposite poles. Micronuclei can also originate from chromosome breaks, generating acentric chromosome fragments, or be generated from chromosome bridges by a mechanism that has not been determined (Hoffelder et al. 2004). (Left) Newly generated micronuclei can undergo irreversible loss of nuclear membrane integrity (Hatch et al. 2013). If this occurs during S phase, chromosomes in micronuclei will acquire massive DNA damage. (Right) Alternatively, late-replicating chromosomes in micronuclei may undergo DNA damage if the cell enters mitosis before micronuclear replication is complete (Crasta et al. 2012), subjecting the chromosome to premature chromosome compaction (PCC). The extent and mechanism of DNA damage during PCC remains to be established. During the subsequent cell division cycle, the micronuclear genome can be reincorporated into the primary nucleus, remain isolated in a micronucleus, or possibly be lost due to autophagy or extrusion.Micronuclei are structurally similar to regular nuclei in that they have a double-membrane structure and nuclear pores but contain only one or a few chromosomes or chromosomal segments. Spatial isolation of individual chromosomes in micronuclei provides a natural setup for localized chromosomal damage, but how does damage happen?One possible mechanism is that newly generated micronuclei acquire their DNA damage as they progress into the S phase and G2 phase of the subsequent cell cycle (Crasta et al. 2012; Hatch et al. 2013). Micronuclei replicate their DNA asynchronously with the primary nucleus, with many micronuclei showing DNA replication in G2 (Fig. 6; Crasta et al. 2012). This delayed replication means that some micronuclei might enter mitosis and undergo premature chromosome compaction (PCC) (Obe and Beek 1975) before they have completed DNA replication (Fig. 6; Donley and Thayer 2013). PCC is believed to cause DNA damage, as evidenced by gaps observed on chromosome spreads (Kato and Sandberg 1967; Johnson and Rao 1970) and the fact that PCC can induce breakage at chromosome fragile sites (Smith et al. 2001; El Achkar et al. 2005; Ravi et al. 2013). Indeed, our group has found that in chromosome spreads, late-replicating micronuclear DNA develops the pulverized appearance that is characteristic of PCC (Crasta et al. 2012).A recent study from Hetzer's laboratory (Hatch et al. 2013) has provided important new insight into additional defects in micronuclei that could cause DNA damage. Using live-cell imaging, the group had previously shown that cancer cell nuclei occasionally undergo spontaneous loss of nuclear integrity (as assayed by cytoplasmic spilling of a nuclear green fluorescent protein) that is reversible (Vargas et al. 2012). They have now reported that many micronuclei undergo irreversible loss of nuclear envelope (NE) integrity, as evidenced by fluorescent imaging and electron microscopy. The basis for the defect in NE integrity is still not clear, but it is correlated with abnormal and asymmetric nuclear lamina assembly. Most strikingly, the rupture of micronuclei is tightly correlated with the appearance of massive damage (Fig. 6). Although it has not been possible to directly image the timing of damage acquisition relative to the timing of rupture, fixed cell experiments strongly suggest that damage occurs close to the time of rupture.How NE rupture contributes to DNA damage is not known. We found that micronuclei in serum-starved cells readily undergo rupture but acquire little or no damage (A Spektor and D Pellman, unpubl.). Thus, rupture seems to be required for damage but not sufficient to generate damage. One possibility would be that NE rupture during DNA replication would abruptly dilute replication factors and cause DNA replication fork stalling and thus generate damage. Alternatively, rupture during S phase might enable enzymes such as endonucleases or topoisomerases to act aberrantly on micronuclear chromosomes.
Conclusion
The discovery of chromothripsis by Stephens et al. (2011) opened a floodgate of genome sequencing studies of complex, potentially all-at-once genome rearrangement. This has led to the identification of chromothripsis in many cancer types and in some congenital diseases. The flurry of genome sequencing has also uncovered other classes of complex rearrangements (chromoanasynthesis and chromoplexy), which also appear to originate from single catastrophic events. These discoveries have expanded our view of the mechanisms by which genomes can evolve. Although we have no evidence at present that these mechanisms contribute to speciation, this is certainly plausible.These unexpected discoveries were the fruits of new technical advances, particularly next-generation sequencing and the accompanying novel bioinformatics tools. The recent studies illustrate the fact that these events are more common than could have been expected, and further sequencing efforts will clarify their true frequency. However, as discussed above, the phenomena identified to date could be the extreme examples of even more frequent but less dramatic events. Thus, understanding the underlying mechanism is not only interesting in its own right but may impact our understanding of how common these events really are.In this review, we tried to illustrate the importance, power, and limitations of bioinformatic and computational approaches to reconstruct the history of complex chromosomal rearrangements (Stephens et al. 2011; Greenman et al. 2012a; Baca et al. 2013). Although reconstructing history from a snapshot of the genome is a challenging task, we hope that we conveyed the tremendous progress toward solving this open problem.The understanding of catastrophic chromosomal rearrangements is certainly incomplete until the biological mechanisms underlying these phenomena are defined. Indeed, one exciting fallout from the genome sequencing efforts is the potential to discover new biological mechanisms. For example, if it can be shown that damage occurring in micronuclei can lead to chromothripsis, this could provide an important example of the emerging role of nuclear architecture and subnuclear organization in maintaining the stability of the genome (Meaburn et al. 2007; Misteli and Soutoglou 2009; Misteli 2010; Alt et al. 2013). Ultimately, linking postulated biological mechanisms to these new classes of genome rearrangement will require an experimental recapitulation of the phenomena in the laboratory. As the physicist Richard Feynman famously said, “What I cannot create, I do not understand.” Combining experimental live-cell imaging approaches with emerging single-cell genome sequencing technologies (Shapiro et al. 2013) may be one way to recreate and thus understand complex chromosomal rearrangements. Thus, the use of genome sequencing as a hypothesis-generating discovery tool may be ushering in a new era where genome sequencing is harnessed for hypothesis-testing experimental biology.
Authors: Graham R Bignell; Thomas Santarius; Jessica C M Pole; Adam P Butler; Janet Perry; Erin Pleasance; Chris Greenman; Andrew Menzies; Sheila Taylor; Sarah Edkins; Peter Campbell; Michael Quail; Bob Plumb; Lucy Matthews; Kirsten McLay; Paul A W Edwards; Jane Rogers; Richard Wooster; P Andrew Futreal; Michael R Stratton Journal: Genome Res Date: 2007-08-03 Impact factor: 9.043
Authors: Inês Teles Alves; Saskia Hiltemann; Thomas Hartjes; Peter van der Spek; Andrew Stubbs; Jan Trapman; Guido Jenster Journal: Hum Genet Date: 2013-04-25 Impact factor: 4.132
Authors: Feng Zhang; Mehrdad Khajavi; Anne M Connolly; Charles F Towne; Sat Dev Batish; James R Lupski Journal: Nat Genet Date: 2009-06-21 Impact factor: 38.330
Authors: Michael F Berger; Michael S Lawrence; Francesca Demichelis; Yotam Drier; Kristian Cibulskis; Andrey Y Sivachenko; Andrea Sboner; Raquel Esgueva; Dorothee Pflueger; Carrie Sougnez; Robert Onofrio; Scott L Carter; Kyung Park; Lukas Habegger; Lauren Ambrogio; Timothy Fennell; Melissa Parkin; Gordon Saksena; Douglas Voet; Alex H Ramos; Trevor J Pugh; Jane Wilkinson; Sheila Fisher; Wendy Winckler; Scott Mahan; Kristin Ardlie; Jennifer Baldwin; Jonathan W Simons; Naoki Kitabayashi; Theresa Y MacDonald; Philip W Kantoff; Lynda Chin; Stacey B Gabriel; Mark B Gerstein; Todd R Golub; Matthew Meyerson; Ashutosh Tewari; Eric S Lander; Gad Getz; Mark A Rubin; Levi A Garraway Journal: Nature Date: 2011-02-10 Impact factor: 49.962
Authors: Tao Zhang; James R Paulson; Muhammed Bakhrebah; Ji Hun Kim; Cameron Nowell; Paul Kalitsis; Damien F Hudson Journal: Chromosome Res Date: 2016-03-23 Impact factor: 5.239