Literature DB >> 25238782

Are common fragile sites merely structural domains or highly organized "functional" units susceptible to oncogenic stress?

Alexandros G Georgakilas1, Petros Tsantoulis, Athanassios Kotsinas, Ioannis Michalopoulos, Paul Townsend, Vassilis G Gorgoulis.   

Abstract

Common fragile sites (CFSs) are regions of the genome with a predisposition to DNA double-strand breaks in response to intrinsic (oncogenic) or extrinsic replication stress. CFS breakage is a common feature in carcinogenesis from its earliest stages. Given that a number of oncogenes and tumor suppressors are located within CFSs, a question that emerges is whether fragility in these regions is only a structural "passive" incident or an event with a profound biological effect. Furthermore, there is sparse evidence that other elements, like non-coding RNAs, are positioned with them. By analyzing data from various libraries, like miRbase and ENCODE, we show a prevalence of various cancer-related genes, miRNAs, and regulatory binding sites, such as CTCF within CFSs. We propose that CFSs are not only susceptible structural domains, but highly organized "functional" entities that when targeted, severe repercussion for cell homeostasis occurs.

Entities:  

Mesh:

Substances:

Year:  2014        PMID: 25238782      PMCID: PMC4232749          DOI: 10.1007/s00018-014-1717-x

Source DB:  PubMed          Journal:  Cell Mol Life Sci        ISSN: 1420-682X            Impact factor:   9.261


Introduction

Activated oncogenes are a key feature of cancer development from its earliest stages [1]. One of their major effects is the induction of DNA damage via replication stress (RS) [2]. Specifically, oncogene-induced DNA replication stress (OIRS) leads to the formation of DNA double-strand breaks (DSBs), due to replication forks (RFs) collapse, fueling genomic instability (GI) [2, 3]. In early precancerous lesions, the collapse of DNA RFs occurs preferentially at specific loci termed fragile sites (FSs) [4]. As a result, FSs exhibit breaks, gaps, and rearrangements, collectively termed FSs expression. This is due to the activation of pathways responsible for fork collapse resolution and completion of DNA replication, which involve recombinogenic processes and DNA DSBs production [2]. An important issue is whether the instability manifested at these sites has any wider biological impact on cancer development. As further discussed in the manuscript, although FSs are heterogeneous in their expression patterns, they possess unique features that make them vulnerable to structural destabilization under RS conditions. Are these regions simply prone to DNA damage due to their intrinsic characteristics, conferring only to GI? Sparse evidence indicates that FSs enclose genes and non-coding RNAs, like microRNAs (miRs), while their expression could be epigenetically modulated by histones, implying that they are regions of the genome of a higher organization level (Fig. 1) [5-9]. Important bioinformatic resources are currently available and can be exploited to define potential topological associations between CFSs and these elements. Notably, the miRbase is constantly expanding while the ENCODE project [10] has deposited information on a vast range of binding elements and genomic modifications, including histone marks (like H3K79me2, H3K9ac, H3K4me3, and H3K27ac) that have a prominent influence on the expression process of the genome. As the pattern of instability at FSs in human tumors is variable, suggesting that it also depends on the cell type, this further complicates the role of FSs in malignancy. Last but not least, if such sites contribute to cancer development, why have they not been evolutionary selected for elimination? Could there be a higher reason that makes them at the same time vulnerable “units” of the genome with potentially meaningful function? Attempting to address these questions, in the current work we conducted an extensive review on the nature of their heterogeneity that accounts for their preferential instability. Next, by applying bioinformatic tools on data from the latest miRbase and the ENCODE project, we reveal that these sites are enriched in various (coding and non-coding) elements, such as cancer-related genes, miRs, and binding elements, as well as specific variations in histone modifications (Fig. 1). Based on these findings, we propose that these sites may represent unique “functional” units of the genome that may have a complex role upon OIRS with implications both in normal cell survival and cancer progression.
Fig. 1

CFSs are not only vulnerable structural domains but may also be functional units of the genome that are sensitive to replication stress. CFS’s stability is affected by replications stress (RS). During cancer development, they are affected from the earliest precancerous lesions due to oncogene-induced replication stress (OIRS). Breakage at CFSs (broken red rectangle) may not only confer to genomic instability (GI) (dashed black rectangle), but could also have wider biological implications by affecting elements located within them (question mark). DSBs DNA double-strand breaks

CFSs are not only vulnerable structural domains but may also be functional units of the genome that are sensitive to replication stress. CFS’s stability is affected by replications stress (RS). During cancer development, they are affected from the earliest precancerous lesions due to oncogene-induced replication stress (OIRS). Breakage at CFSs (broken red rectangle) may not only confer to genomic instability (GI) (dashed black rectangle), but could also have wider biological implications by affecting elements located within them (question mark). DSBs DNA double-strand breaks

Heterogeneity of fragile sites

FSs have been assigned in two classes, defined as rare fragile sites (RFSs) and common fragile sites (CFSs). Rare FSs are mainly induced by folate deficiency, correspond to dinucleotide or trinucleotide repeats, usually CGGn, and are found in less than 5 % of the human population and in specific families [11]. Their fragility is due to expansions of the micro- or mini-satellites sequences that they contain and in some cases are responsible for inherited diseases [11]. Therefore, RFSs will not be further discussed in this work. Historically, CFSs were recognized as recurrent hotspots of double-stranded DNA breaks in cultured lymphocytes from healthy individuals [12]. They are present in all individuals, are part of the normal chromosomes, but exhibit different frequencies of expression in a population (reviewed in [13, 14]). CFSs are typically vulnerable to extrinsic replication stress, most notably to aphidicolin (APH), an inhibitor of DNA polymerase α, δ and ε, but are otherwise quiescent under normal conditions. This observation has been gradually broadened to include breakage patterns resulting from various replication inhibitors, such as nucleotide analogs (5-azacytidine, bromodeoxyuridine) or antitumor antibiotics (distamycin) and RS resulting from folate deficiency. Dietary and environmental factors like caffeine, cigarette smoke, and hypoxia may also enhance FS expression [15]. Recently, the induction of stress during the early S phase in B lymphocytes by hydroxyurea has been found to provoke DNA damage in a distinct pattern, corresponding to a new class of “early replicating fragile sites” (ERFSs) [16]. They occur primarily in early replicating DNA, close to replication origins, and are mainly situated in actively transcribed gene clusters (coding regions) [17]. This contrasts with CFSs, like FRA3B, which are most sensitive during their replication in late S phase [18]. Nevertheless, ERFSs seem also to arise from RF collapse and are similarly sensitive to ATR inhibition and oncogene-induced stress (see A. Nussenzweig chapter in this issue). OIRS is expected to induce instability at both ERFS and CFS, as suggested in two independent studies [16, 19]. Recent reports using the phosphorylated form of histone H2AX, the γ-H2AX, as a marker of DSB induction showed that ERFS were enriched for H2AX and γ-H2AX, while CFSs and heterochromatin lacked both, also suggesting differential DNA damage response at these sites [20]. Notably, both ERFSs and CFSs are rich in CpG-rich regions [17, 19], implying that these classes of FSs may either share structural similarities or the exact classification of their members as early and late replicating ones may need further re-assessment. Surprisingly, telomeric regions also appear to exhibit fragility in a similar manner as CFSs upon replication stress, including APH treatment [21]. It has been shown that CFS expression patterns depend not only on culture conditions but also on cell type [22]. Although traditionally studied almost exclusively in lymphocytes, different CFSs have been observed in fibroblasts [22, 23], breast, and colon epithelial cell lines [24, 25] and erythroid cell lines [25]. Considerable overlap exists between experiments, but the relative frequency of CFS breaks varies significantly. For example, FRA3B is the most frequent fragile locus in lymphocytes, but does not seem to be fragile in epithelial cell lines [24, 26], possibly due to the plasticity of replication programs in different cell lineages or because of a putative “housekeeping” role of FHIT. The second most fragile site, FRA16D, is very frequently affected in epithelial breast cancer cell lines (20–25 %), but only occasionally in colon epithelial cells (~5 %). Clearly, a complete characterization of CFS breakage probabilities will require a panel of different cell types. CFSs are found in different individuals and are conserved across different species, including the mouse, the rat, and many mammals [27, 28], and across kingdoms, such as in the yeast S. cerevisiae [29]. Evolutionary conservation could argue in favor of a meaningful function if CFSs are considered outliers compared with the overall fragility of the genome in general. Nevertheless, variation between individuals can be significant. In a study of 20 normal adults [30], only FRA3B and FRA16D were found to be fragile in all individuals, and only 42 % of CFSs (19 of 45 identified) were present in the majority of individuals. In the earliest studies, less than 20 CFSs would explain more than 80 % of gaps and breaks [12]. A similar distribution was found in a population study of Deer mice, where high-frequency CFSs constituted approximately 26 % of the population total breaks and 38 % of CFSs were only found in single individuals.

Fragility of CFS

The issue of fragility at CFSs is a matter of intensive investigation. CFSs replicate either late in S-phase or initiate replication in mid-S phase, but exhibit a significant delay in completing it. Under conditions of RS, they may remain unreplicated even during G2-phase and up to mitosis leading eventually to their instability (as discussed in next section and reviewed in [13, 14]). Several features responsible for their replication sensitivity have not been revealed until now. These include intrinsic structural characteristics, the presence and overlap with large genes, differences in replication features, and epigenetic modulation [13, 14]. At the structural level, CFSs have the propensity to form secondary non-B structures that interfere with the movement of the replication fork thus leading to its collapse and associated DNA breaks [31]. Specifically, at sequence level, CFS are enriched in long stretches of AT dinucleotide-rich repeats that may form stable secondary cruciform DNA structures inducing fork stalling during DNA replication and in general incomplete or delayed DNA replication [31, 32]. In an earlier study, we performed a whole-genome analysis of CFS sequences and observed that they are on average rich in GC and Alu sequences [19]. The Alu family is a family of short interspersed repetitive elements (SINE) of about 300 bp containing mid and terminal poly A-stretches [33]. Interestingly, these elements are the most abundant mobile elements, and thus potentially recombinogenic in the human genome, and are implicated in various inherited human diseases and in cancer [34]. CFSs have been associated with genes extending over long genomic regions (“large genes”) [5]. The FHIT gene in FRA3B and WWOX in FRA16D are striking examples, measuring approximately 1.5 and 1.1 Mb respectively, compared with a mean of 10–15 kb for protein coding genes. The PARK2 gene, at approximately 1.4 Mb is also associated with FRA6E and may be down-regulated in ovarian tumors [30]. Intriguingly, genes over 800 kb may be prone to form RNA:DNA hybrid loops (termed R-loops) at sites of replication–transcription collision [35]. R-loops are structures formed by the association of the nascent transcript with the DNA template strand leaving unpaired the complementary non-coding DNA strand. Replication of large genes is time consuming and exposes the replication machinery to a risk of collision with the transcriptional machinery. In such an occurrence, the elongating RNA polymerase is blocked, leading to increased R-loop formation at Pol II pause sites. As a result, collision events may induce CFS breakage and a consequent enhancement of genomic instability [35]. Recent data have shed new light on the dynamics of the replication process at CFSs, providing several new mechanistic aspects explaining their instability (reviewed in [13, 14]). In the first one, it was shown that stability of FRA16C is perturbed under RS, as RFs progress more slowly and stall upon accounting AT-rich regions within this site. While in the bulk genome dormant origins are activated to complete replication, these are not available within FRA16C leading to delayed replication and instability [14] (also see B. Kerem chapter in this issue). A second report studying the mechanistic fragility of FRA3B, showed that fork speed slowing and stalling is similar with respect to the bulk genome, even under RS [26] (also see M. Debatisse chapter in this issue). In this case, the inability to complete replication was attributed to a large 700-kb core within this site that was found to be poor in origins. To accomplish replication of this site, origins from a long distance, located in the flanking regions, are required to come in and cover its length. The density and timing of origin firing events in the flanking regions seem to dictate the timing of FRA3B replication completion, thus influencing its stability. In a third mechanistic model regarding the FRA6E site, both replication arrest and paucity of origin activation lead to RS sensitivity, providing a functional combination of the two previous models [36]. Interestingly, some CFSs like the FRA3B do not exhibit stably these replication features in each cell type, suggesting that CFSs may demonstrate different patterns of instability, which are tissue specific. This may also explain why in various malignancies distinct profiles of GI are observed at CFSs that characterize each type of cancer. It would be interesting in the future to define the replication behavior of each CFS according to the specific cell type. That would be helpful in defining and possibly predicting the precise patterns of GI that take place during cancer development. The replication density and timing of the genome has been proposed to be highly flexible and epigenetically controlled rather than directed by specific sequence motifs [13]. Therefore, an epigenetic control of CFSs replication stability may also apply. In support of this is the observed H3K9/14 hypoacetylation pattern displayed by the six most expressed CFSs in lymphoblastoid cells [9]. This histone modification has been reported to be associated with chromatin compactness and increased breakage. Also, regions with evenly spaced nucleosomes, an unusual chromatin structure preferentially formed at promoters and regulatory binding sites, have also been observed in FRA3B [37]. Overall, it seems that there is no single mechanism that can explain the fragility of CFSs but rather a multitude. They depend on several characteristics, including structural properties of the FSs as well as dynamic features governing their replication that apply in a given cell type. Interestingly, they are not necessarily mutually exclusive and often can function in complementary ways [13, 14]. The only common shared aspect by all these mechanisms is that they can eventually lead to a mechanical breakage of CFSs.

Maintenance of fragile site integrity

Instability at FSs is a recognized signature of DNA damage induced by replication stress [2] and it is detected from the earliest premalignant stages [3, 4, 19]. Replication checkpoints are activated in response to the stress induced at CFSs [38]. Central to these checkpoints are the DNA damage response kinases, ATM and ATR, which respectively sense DNA double-strand breaks and RF integrity [38]. Specific targeting of these kinases in cellular models revealed that ATR disruption or hypomorphic mutations lead to chromosomal instability within CFS even under normal replication, a phenomenon that is aggravated after low doses of APH [39]. In addition, dual inhibition of ATM and ATR using caffeine has been found to significantly increase CFS breakage compared with ATR deficiency alone, denoting also a role for ATM and a possible interplay with ATR in CFS protection [40, 41]. Inactivation of several down-stream components of the ATR network like Chk1, HUS1, Claspin, and SMC1 revealed similar effects, although not as efficient as ATR loss (Table 1) [31, 42–44].
Table 1

Factors involved in control of CFSs stability and genome integrity

FactorFunctionReference
ATRMain kinase-activating replication checkpoint[39, 75]
ATMComplementary acting kinase[76]
Chk1Main ATR-downstream kinase involved in activation of the replication checkpoint[31]
Chk2ATM-downstream kinase[76]
HUS1Participates in the Rad9-Rad1-Hus1 (9-1-1) complex and is homologous to the PCNA clamp. The complex phosphorylates ATR substrates upon loading to sites of DNA damage[42]
ClaspinEncodes for an adaptor protein which binds to BRCA1 and Chk1 and facilitates the ATR-dependent phosphorylation of both proteins during DNA replication stress in human cells[43]
SMC1Member of the family of “structural maintenance of chromosomes” proteins participating in chromosome condensation, sister chromatid cohesion, and DNA repair. Prevents the collapse of stalled replication fork in an ATR-dependent manner and is required for S-phase checkpoint activation[77]
BRCA1ATR substrate, implicated in the activation of the G2/M checkpoint, homologous recombination, and DSB repair[78]
FANCD2Component of the Fanconi anemia (FA) pathway, phosphorylated by ATR. Plays a role not only in HR-dependent replication recovery, but also in regulating CFSs stability[79]
Polymerase ηInvolved in DNA synthesis of complex sequences, like repetitive and secondary structure that impede replication performing, the so-called ‘by-pass’ function[80]
Rev3Catalytic subunit of Polζ that is required for maintaining fragile site stability in human cells[81]
Polymerase κInvolved in DNA synthesis of complex sequences, like repetitive and secondary structure that impede replication performing[82]
WRNRecQ helicase regulated in an ATR and ATM-dependent manner, which prevents DSBs formation at perturbed forks after replication stress. Promotes stability of arrested RFs and their efficient restart[83]
BLMRecQ helicase contributing to restarting stalled forks through unwinding DNA structures and/or homologous recombination, to maintenance of pyrimidine pools balance, regulation of fork speed and decatenation of UFBs at CFSs[49, 50]
RECQ1Member of the RecQ family of DNA helicases. Promotes fork recovery and repair[84]
Topoisomerase I and IIAlleviate DNA secondary structures during replication by cleavage and re-ligation. Can facilitate oncogenic rearrangements at induced CFSs[85]
MUS81-EME1Endonuclease involved in resolving HJ-dependent replication intermediates. Required for fork repair and resolving UFBs[41, 50]
PICHPICH (Plk1 interacting checkpoint) is a helicase/translocase involved in resolving UFBs in mitosis[49]
SNM1B/APOLLOA nuclease component of the FA pathway involved in homology-directed repair. Also facilitates DNA localization of FANCD2 and BRCA1[86]
Rad51Component of the HR pathway involved in DSB repair and HJ mediated RF restart[47]
DNA-PKcsNHEJ pathway component[47]
Ligase IVNHEJ pathway component[47]
Factors involved in control of CFSs stability and genome integrity These checkpoints are vital as they ensure that DNA is replicated and chromosomes are prepared for mitosis [38]. Nevertheless, CFSs exhibit an increased vulnerability to RS leading to the activation of repair mechanisms [31]. The frequently observed presence of sister chromatid exchanges (SCEs) at the majority of CFSs breaks after APH treatment suggests that homologous recombination (HR) plays a major role in response to DSBs induced under conditions of RS [45]. The FANCD2 component of the Fanconi anemia (FA) pathway has been shown to play a role not only in HR-dependent replication recovery, but also in regulating CFSs stability (Table 1) (also reviewed in [31]). Similarly, activation of BRCA1 and other DSB repair proteins like RAD51 have also been found to be vital for maintaining CFSs stability (Table 1) [46]. Apart from HR the non-homologous end join pathway is also essential for chromosomal stability at these sites [47]. Specifically, by knocking down Rad51, DNA-PKcs, or Ligase IV, a significantly increased expression of CFSs under RS has been demonstrated. Notably, MDC1 and γ-H2AX foci were formed and co-localized with those of Rad51 and DNA-PKcs, while γ-H2AX and phospho-DNA-PKcs foci localized at expressed FSs on metaphase chromosomes. Other components implicated in resolving replication over specific CFSs region include specialized polymerases (Table 1) [48]. These polymerases, like DNA polymerase eta (Pol η), mainly deal with DNA synthesis of complex sequences, like repetitive and secondary structures that impede replication, by performing the so-called ‘by-pass’ function. Depletion of these specialized polymerases has been shown to lead to persistence of unreplicated CFSs in mitosis. Various helicases/translocases have been proposed to promote fork restart at CFSs in non-redundant ways, like the BLM, WRN, and RECQ1 through Holliday junction-mediated fork remodeling that is independent of DSB formation (Table 1) [41]. Their main purpose seems to be resetting of structural intermediates arising from HR as well as unwinding of DNA secondary structures, in order to facilitate replication fork restart. Alternatively, nucleases, such as the structural endonuclease MUS81-EME1 and the FA pathway nuclease SNM1B/APOLLO, are responsible for DSB-mediated fork restart and/or elimination of permanently collapsed forks by cleavage of replication intermediates and consequent DNA synthesis (Table 1) [41]. Many of the above-described factors have been shown to stabilize CFSs during S-phase replication. Nevertheless, recent observations have shown that under RS, non-fully replicated or interlinked DNA at CFSs may escape S and G2–M checkpoints and enter mitosis (reviewed in [13, 49, 50]). Attempts to segregate these intermediates lead to sister chromatid entanglement followed by non-disjunction, ultimately leading to formation of ultra-fine bridges (UFBs) in anaphase cells. UFBs are defined by FANCD2/FANCI FA proteins binding to their edges, while BLM and PICH (Plk1-interacting checkpoint helicase) attach along the bridge. These persistent replication intermediates have been shown to be processed by the MUS81-EME1 nuclease in early M-phase, possibly with the help of ERCC1, to provide a controlled production of DNA breaks, aiming to allow undisturbed disjunction of sister chromatids. In case of failure, UFBs are formed as mentioned during anaphase. At this stage, BLM helicase and PICH translocase assisted by topoisomerase IIIa (TOPIIIa) and the BLM-associated proteins RMI1/2 function as a second line of defense by decatenating these structures and permitting chromatid segregation [50]. If unresolved UFBs still persist, they will eventually lead to chromosome miss-segregation by uneven distribution of DNA between the daughter cells and micronuclei formation. The transmitted errors at CFSs will be shielded in 53BP1 nuclear bodies in the emerging G1-daughter cells and possibly replicated in the S-phase by high-fidelity polymerases. These results pose a new light on CFSs cleavage, and led to the proposal that apart from being detrimental in initiating genomic instability, it can also serve as a mechanism for controlled production of DNA breaks that rather maintain than compromise genome integrity. Several questions though emerge from this model that has been established with extrinsic factors (chemical inhibitors) that induce RS. How does this model apply and/or differentiate pre-malignant cells that are known to undergo OIRS? Which of the two types of damage, MUS81-EME1 cleavage or the decatenation inability, confers to the cancer-associated genome instability? A tempting but speculative model is that during premalignant stages, MUS81-EME1 cleavage activity is probably aberrantly increased, leading to a high frequency of DSBs, particularly at CFSs. This could be mediated by the active oncogenes that are present at such stages [2] and which in turn increase the activity of CDKs that regulate MUS81-EME1 expression [50]. As long as the checkpoints and their p53 effector are intact, the damage is repaired or the antitumor barriers of apoptosis and senescence eliminate such cells [2]. A similar scenario regarding gross genome damage due to decatenation inability may apply. Notably, it has been recently shown that under moderate RS, CFSs breaks can escape from efficient ATR checkpoint surveillance, leading to mitotic tolerance of such aberrations [44]. Such a pool of cells could undergo selection for loss of checkpoint function(s) and eventually accumulate DNA damage, probably through both MUS81-EME1 processing and chromatid non-disjunctions, provided that they are compatible with survival. Eventually, progression to full malignancy will ensue. This scenario fully concurs with our previous findings showing a prevalence of CFSs breakage along with the presence of UFBs and micronuclei in U2OS cells experiencing OIRS due to sustained expression of the replication licensing factor (RLF) Cdt1 [51]. Importantly, clones of these cells that “escaped” from the antitumor barriers, after prolonged Cdt1 expression, acquired a highly invasive potential. In a paradoxical way, this recently described model of deliberate CFSs controlled breakage to protect genome integrity of cells, may apply to malignant cells in the sense that it allows them to survive at the expense of genome integrity. Further expanding on this model, an emerging question concerns the effect exerted from the CFSs’ instability on the various elements like genes and non-coding RNAs that are located within them.

Functional elements in fragile sites

Several major publications arising from the ENCODE project [10] have underlined the importance of non-coding DNA. Non-coding regions of the genome have been found to participate in biochemical reactions with regulatory potential, such as transcription factor binding, epigenetic modifications, or long-distance interactions. Numerous functions have been attributed to non-coding RNA, including the expanding and best understood family of microRNAs (miRs) that are involved in post-transcriptional regulation of messenger RNA [52]. As a result of this progress, CFSs content may be now understood at a finer scale and the implications of CFS breakage will have to be reexamined carefully.

Fragile sites and cancer-associated genes

While available data point to an overlap between genes and CFSs [5], there is only one report showing that CFSs are denser in protein coding genes, with their distribution among fragile versus non-fragile regions varying among the chromosomes [7]. At the same time, a systematic review for the density of genes present within these genome areas is not available. To address this question, we retrieved a list of 327 genes participating in pathways in cancer from the Kyoto Encyclopedia of Genes and Genomes (KEGG)1 and investigated their association with both cytogenetically and molecularly mapped CFSs. We found that 110 cancer-related genes (33.6 % of all cancer-related genes) are located within CFSs (Table 2). Based on this, the density of cancer-related genes in a cytogenetically defined CFS compared to the rest of the genome is 37.2 % higher (Fig. 2a).
Table 2

Cytogenetic defined common fragile sites (CFSs) association with cancer genes and miRs

FRAChrStartEndBandTypeFrequencyCancer genesmiRNAsReference
FRA1A112,8000,0001p36AphidicolinCommonDVL1, PIK3CD, MTOR, CASP9, CDC42, WNT4, E2F2hsa-mir-6859-1, hsa-mir-1302-2, hsa-mir-6723, hsa-mir-200b, hsa-mir-200a, hsa-mir-429, hsa-mir-6726, hsa-mir-6727, hsa-mir-6808, hsa-mir-4251, hsa-mir-551a, hsa-mir-4417, hsa-mir-4689, hsa-mir-4252, hsa-mir-6728, hsa-mir-34a, hsa-mir-5697, hsa-mir-1273d, hsa-mir-6729, hsa-mir-7846, hsa-mir-4632, hsa-mir-6730, hsa-mir-3675, hsa-mir-3972, hsa-mir-4695, hsa-mir-1290, hsa-mir-6084, hsa-mir-1256, hsa-mir-4418, hsa-mir-6127, hsa-mir-4684, hsa-mir-4253, hsa-mir-3115, hsa-mir-4419a, hsa-mir-378f, hsa-mir-6731, hsa-mir-4425, hsa-mir-3917, hsa-mir-1976[90]
FRA1B15,0700,0016,1300,0001p32AphidicolinCommonJUNhsa-mir-4421, hsa-mir-6500, hsa-mir-761, hsa-mir-1273f, hsa-mir-5095, hsa-mir-1273g, hsa-mir-4781, hsa-mir-4422, hsa-mir-4711[8]
FRA1C168,900,00169,700,0001p31.2AphidicolinCommon
FRA1D184,900,00194,700,0001p22AphidicolinCommonhsa-mir-4423, hsa-mir-7856, hsa-mir-760[91]
FRA1E199,700,001102,200,0001p21.2AphidicolinCommonhsa-mir-553[92]
FRA1F1142,600,001155,000,0001q21AphidicolinCommonARNT, TPM3, CKS1Bhsa-mir-3118-1, hsa-mir-3118-2, hsa-mir-3118-3, hsa-mir-6077-1, hsa-mir-6736, hsa-mir-5087, hsa-mir-6077-2, hsa-mir-6878, hsa-mir-4257, hsa-mir-554, hsa-mir-8083, hsa-mir-6737, hsa-mir-5698, hsa-mir-190b, hsa-mir-4258[91]
FRA1G1172,900,001176,000,0001q25.1AphidicolinCommon
FRA1H1224,100,001236,600,0001q425-AzacytidineCommonWNT9A, WNT3A, EGLN1hsa-mir-320b-2, hsa-mir-4742, hsa-mir-6741, hsa-mir-5008, hsa-mir-3620, hsa-mir-6742, hsa-mir-4666a, hsa-mir-1182, hsa-mir-4427, hsa-mir-4671, hsa-mir-4753, hsa-mir-1537[93]
FRA1I1243,700,001249,250,6211q44AphidicolinCommonhsa-mir-3916, hsa-mir-3124
FRA1J1128,900,001142,600,0001q125-AzacytidineCommon[94]
FRA1K1185,800,001198,700,0001q31AphidicolinCommonTPR, PTGS2hsa-mir-4426, hsa-mir-1278, hsa-mir-4735[95]
FRA1L161,300,00184,900,0001p31AphidicolinCommonJAK1hsa-mir-3116-1, hsa-mir-3116-2, hsa-mir-6068, hsa-mir-4794, hsa-mir-3671, hsa-mir-101-1, hsa-mir-3117, hsa-mir-1262, hsa-mir-186, hsa-mir-7156
FRA1M194,700,00199,700,0001p21.3Folic acidRarehsa-mir-378g, hsa-mir-2682, hsa-mir-137
FRA2A296,800,001102,700,0002q11.2Folic acidRarehsa-mir-3127, hsa-mir-5696
FRA2B2110,200,001114,400,0002q13Folic acidRarePAX8hsa-mir-4267, hsa-mir-4436b-1, hsa-mir-4436b-2, hsa-mir-4435-2, hsa-mir-4771-2, hsa-mir-1302-3[96]
FRA2C216,700,00119,200,0002p24.2AphidicolinCommon[97]
FRA2D252,900,00155,000,0002p16.2AphidicolinCommonhsa-mir-4431, hsa-mir-3682[27]
FRA2E268,600,00175,000,0002p13AphidicolinCommonTGFAhsa-mir-3126, hsa-mir-1285-2[98]
FRA2F2135,100,001136,800,0002q21.3AphidicolinCommonhsa-mir-5590, hsa-mir-128-1[8]
FRA2G2169,700,00118,3000,0002q31AphidicolinCommonITGA6hsa-mir-933, hsa-mir-10b, hsa-mir-7704, hsa-mir-1246, hsa-mir-4444-1, hsa-mir-3128, hsa-mir-6512, hsa-mir-1258, hsa-mir-4437[99, 100]
FRA2H2183,000,001189,400,0002q32.1AphidicolinCommonITGAVhsa-mir-548ae-1, hsa-mir-561[100]
FRA2I2197,400,001209,000,0002q33AphidicolinCommonCASP8, FZD7, FZD5hsa-mir-3130-2, hsa-mir-3130-1, hsa-mir-2355, hsa-mir-7845, hsa-mir-1302-4, hsa-mir-4775[101]
FRA2J2237,300,001243,199,3732q37.3AphidicolinCommonhsa-mir-6811, hsa-mir-4440, hsa-mir-4441, hsa-mir-4269, hsa-mir-2467, hsa-mir-4786, hsa-mir-149, hsa-mir-3133[102]
FRA2K2144,100,001148,700,0002q22.3Folic acidRare[103]
FRA2L283,300,00190,500,0002p11.2Folic acidTCF7L1hsa-mir-6071, hsa-mir-4779, hsa-mir-4771-1, hsa-mir-4435-1, hsa-mir-4780, hsa-mir-4436a[104]
FRA2S2144,100,001154,900,0002q22.3-2q23.3AphidicolinCommonhsa-mir-4773-2, hsa-mir-4773-1[105]
FRA3A323,900,00126,400,0003p24.2AphidicolinCommonRARBhsa-mir-4792, hsa-mir-4442
FRA3B358,600,00163,700,0003p14.2AphidicolinCommon[106]
FRA3C3182,700,001187,900,0003q27AphidicolinCommonDVL3hsa-mir-4448, hsa-mir-1224, hsa-mir-5588, hsa-mir-548aq, hsa-mir-1248[107]
FRA3D3148,900,001160,700,0003q25AphidicolinCommonhsa-mir-5186, hsa-mir-3919, hsa-mir-15b, hsa-mir-16-2[107]
FRA4A46,000,00111,300,0004p16.1AphidicolinCommonhsa-mir-4798, hsa-mir-4274, hsa-mir-95, hsa-mir-548i-2, hsa-mir-3138
FRA4B4527,00,00159,500,0004q12BrdUCommonPDGFRA, KIThsa-mir-4449
FRA4C4139,500,001141,500,0004q31.1AphidicolinCommon[108]
FRA4D411,300,00135,800,0004p15AphidicolinCommonhsa-mir-572, hsa-mir-5091, hsa-mir-218-1, hsa-mir-7978, hsa-mir-573, hsa-mir-4275
FRA4F488,000,00198,800,0004q22AphidicolinCommonhsa-mir-5705[109]
FRA5A528,900,00142,500,0005p13BrdUCommonSKP2hsa-mir-4279, hsa-mir-579, hsa-mir-580, hsa-mir-3650
FRA5B592,300,00198,200,0005q15BrdUCommonhsa-mir-2277, hsa-mir-583
FRA5C5130,600,001136,200,0005q31.1AphidicolinCommonTCF7hsa-mir-6830, hsa-mir-3936, hsa-mir-1289-2, hsa-mir-3661, hsa-mir-4461, hsa-mir-5692c-1[110]
FRA5D592,300,00198,200,0005q15AphidicolinCommonhsa-mir-2277, hsa-mir-583
FRA5E518,400,00128,900,0005p14AphidicolinCommon[95]
FRA5F598,200,001109,600,0005q21AphidicolinCommonhsa-mir-548p
FRA5G5168,500,001180,915,2605q35Folic acidRareFGF18, MAPK9hsa-mir-585, hsa-mir-378e, hsa-mir-3912, hsa-mir-5003, hsa-mir-8056, hsa-mir-4634, hsa-mir-1271, hsa-mir-4281, hsa-mir-1229, hsa-mir-340, hsa-mir-8089, hsa-mir-4638
FRA6A613,400,00115,200,0006p23Folic acidRare[111]
FRA6B64,200,00171,000006p25.1AphidicolinCommonhsa-mir-3691, hsa-mir-7853, hsa-mir-5683
FRA6C625,200,00127,000,0006p22.2AphidicolinCommon[112]
FRA6D670,000,00175,900,0006q13BrdUCommonhsa-mir-30c-2, hsa-mir-30a, hsa-mir-4282
FRA6E616,1000,001164,500,0006q26AphidicolinCommon[113]
FRA6F6105,500,001114,600,0006q21AphidicolinCommonLAMA4, HDAC2hsa-mir-587[114]
FRA6G688,000,00193,100,0006q15AphidicolinCommonhsa-mir-4464, hsa-mir-4643
FRA6H630,400,00146,200,0006p21AphidicolinCommonRXRB, PPARD, CDKN1A, VEGFA, HSP90AB1hsa-mir-877, hsa-mir-4640, hsa-mir-6891, hsa-mir-6832, hsa-mir-4646, hsa-mir-1236, hsa-mir-6721, hsa-mir-6833, hsa-mir-3135b, hsa-mir-219a-1, hsa-mir-6873, hsa-mir-6834, hsa-mir-5004, hsa-mir-3934, hsa-mir-7159, hsa-mir-1275, hsa-mir-6835, hsa-mir-7111, hsa-mir-5690, hsa-mir-3925, hsa-mir-4462, hsa-mir-4641, hsa-mir-6780b, hsa-mir-4647, hsa-mir-4642, hsa-mir-586[115]
FRA7A754,000,00158,000,0007p11.2Folic acidRareEGFRhsa-mir-4283-1, hsa-mir-3147
FRA7B717,300,0007p22AphidicolinCommonPDGFA, RAC1hsa-mir-339, hsa-mir-4655, hsa-mir-6836, hsa-mir-4648, hsa-mir-4656, hsa-mir-589, hsa-mir-6874, hsa-mir-3683[116]
FRA7C735,000,00137,200,0007p14.2AphidicolinCommonhsa-mir-1200
FRA7D743,300,00145,400,0007p13AphidicolinCommonhsa-mir-6837, hsa-mir-6838, hsa-mir-4649, hsa-mir-4657
FRA7E791,100,00192,800,0007q21.2AphidicolinCommonCDK6hsa-mir-1285-1[117]
FRA7F798,000,001107,400,0007q22AphidicolinCommonPIK3CGhsa-mir-3609, hsa-mir-25, hsa-mir-93, hsa-mir-106b, hsa-mir-4658, hsa-mir-6840, hsa-mir-6875, hsa-mir-4653, hsa-mir-4285, hsa-mir-548o, hsa-mir-5090, hsa-mir-4467
FRA7G7114,600,001117,400,0007q31.2AphidicolinCommonMET, WNT2hsa-mir-6132[118]
FRA7H7130,400,001132,600,0007q32.3AphidicolinCommonhsa-mir-29a, hsa-mir-29b-1[119]
FRA7I7147,900,001159,138,6637q36AphidicolinCommonSHHhsa-mir-671, hsa-mir-3907, hsa-mir-153-2, hsa-mir-595, hsa-mir-5707[120]
FRA7J759,900,00177,500,0007q11AphidicolinCommonFZD9hsa-mir-4283-2, hsa-mir-6839, hsa-mir-4650-1, hsa-mir-3914-1, hsa-mir-3914-2, hsa-mir-4650-2, hsa-mir-4284, hsa-mir-590, hsa-mir-4651
FRA7K7107,400,001114,600,0007q31.1AphidicolinCommonLAMB1, LAMB4hsa-mir-3666[121]
FRA8A8101,600,001106,200,0008q22.3Folic acidRareFZD6hsa-mir-7705, hsa-mir-5680, hsa-mir-3151, hsa-mir-548a-3
FRA8B893,300,00199,000,0008q22.1AphidicolinCommonCCNE2hsa-mir-8084, hsa-mir-378d-2, hsa-mir-3150b, hsa-mir-3150a
FRA8C8117,700,001127,300,0008q24.1AphidicolinCommonhsa-mir-3610, hsa-mir-548az, hsa-mir-4663, hsa-mir-548aa-1, hsa-mir-548d-1, hsa-mir-6844, hsa-mir-4662b, hsa-mir-4662a[91]
FRA8D8139,900,001146364,0228q24.3AphidicolinCommonPTK2hsa-mir-151a, hsa-mir-1302-7, hsa-mir-4472-1, hsa-mir-4664, hsa-mir-937, hsa-mir-6845, hsa-mir-661, hsa-mir-6846, hsa-mir-6847, hsa-mir-7112-1, hsa-mir-7112-2, hsa-mir-6848, hsa-mir-939, hsa-mir-1234, hsa-mir-6849, hsa-mir-6893, hsa-mir-6850[66]
FRA8E8117,700,001127,300,0008q24.1Distamycin ARarehsa-mir-3610, hsa-mir-548az, hsa-mir-4663, hsa-mir-548aa-1, hsa-mir-548d-1, hsa-mir-6844, hsa-mir-4662b, hsa-mir-4662a[122]
FRA9A919,900,00133,200,0009p21Folic acidRareCDKN2A, CDKN2Bhsa-mir-4473, hsa-mir-4474, hsa-mir-491, hsa-mir-31, hsa-mir-876, hsa-mir-873[123]
FRA9B9114,900,001117,700,0009q32Folic acidRarehsa-mir-455
FRA9C919,900,00133,200,0009p21BrdUCommonCDKN2A, CDKN2Bhsa-mir-4473, hsa-mir-4474, hsa-mir-491, hsa-mir-31, hsa-mir-876, hsa-mir-873
FRA9D990,400,00191,800,0009q22.1AphidicolinCommonhsa-mir-4289[91]
FRA9E9114,900,001117,700,0009q32AphidicolinCommonhsa-mir-455[124]
FRA9F950,700,00165,900,0009q125-AzacytidineCommon
FRA9G916,600,00118,500,0009p22.2AphidicolinCommon[125]
FRA10A1089,500,00197,000,00010q23.3Folic acidRarePTEN, FAShsa-mir-4679-2, hsa-mir-4679-1, hsa-mir-107[126]
1099,300,001101,900,00010q24.2hsa-mir-1287, hsa-mir-4685, hsa-mir-6507
FRA10AC11089,500,00197,000,00010q23.3Folic acidRarePTEN, FAShsa-mir-4679-2, hsa-mir-4679-1, hsa-mir-107[127]
1099,300,001101,900,00010q24.2hsa-mir-1287, hsa-mir-4685, hsa-mir-6507
FRA10B10111,900,001114,900,00010q25.2BrdURarehsa-mir-4680, hsa-mir-548e, hsa-mir-6715b, hsa-mir-6715a, hsa-mir-4295[128]
FRA10C1052,900,00170,600,00010q21BrdUCommonCCDC6, CTNNA3hsa-mir-605, hsa-mir-548f-1, hsa-mir-3924, hsa-mir-1296, hsa-mir-7151, hsa-mir-1254-1[129]
FRA10D1070,600,00174,900,00010q22.1AphidicolinCommonhsa-mir-7152, hsa-mir-4676[126]
FRA10E10111,900,001114,900,00010q25.2AphidicolinCommonhsa-mir-4680, hsa-mir-548e, hsa-mir-6715b, hsa-mir-6715a, hsa-mir-4295[126]
FRA10F10119,100,001127,500,00010q26.1AphidicolinCommonFGFR2, CTBP2hsa-mir-4681, hsa-mir-4682, hsa-mir-3941, hsa-mir-4296[130]
FRA10G1042,300,00152,900,00010q11.2AphidicolinCommonRET, MAPK8, NCOA4hsa-mir-5100, hsa-mir-3156-1, hsa-mir-4294[129]
FRA11A1168,400,00170,400,00011q13.3Folic acidRareCCND1, FGF19, FGF4, FGF3, FADDhsa-mir-3164, hsa-mir-548k[131]
FRA11B11114,500,001121,200,00011q23.3Folic acidRareCBLhsa-mir-6716, hsa-mir-4492, hsa-mir-3656, hsa-mir-6756[132, 133]
FRA11C1116,200,00121,700,00011p15.1AphidicolinCommonhsa-mir-3159, hsa-mir-4486, hsa-mir-4694[91]
FRA11D1126,100,00127,200,00011p14.2AphidicolinCommon
FRA11E1131,000,00136,400,00011p13AphidicolinCommonhsa-mir-1343, hsa-mir-3973[134]
FRA11F1185,600,00188,300,00011q14.2AphidicolinCommonFZD4hsa-mir-6755, hsa-mir-3166[135]
FRA11G11114,500,001121,200,00011q23.3AphidicolinCommonCBLhsa-mir-6716, hsa-mir-4492, hsa-mir-3656, hsa-mir-6756[133]
FRA11H1163,400,00177,100,00011q13AphidicolinCommonVEGFB, BAD, RELA, GSTP1, CCND1, FGF19, FGF4, FGF3, FADD, WNT11hsa-mir-7155, hsa-mir-1237, hsa-mir-192, hsa-mir-194-2, hsa-mir-6750, hsa-mir-6749, hsa-mir-6879, hsa-mir-6751, hsa-mir-612, hsa-mir-4690, hsa-mir-4489, hsa-mir-3163, hsa-mir-6860, hsa-mir-6752, hsa-mir-7113, hsa-mir-4691, hsa-mir-6753, hsa-mir-3164, hsa-mir-548k, hsa-mir-3664, hsa-mir-6754, hsa-mir-3165, hsa-mir-139, hsa-mir-4692, hsa-mir-548al, hsa-mir-4696, hsa-mir-326[136]
FRA11I1116,200,00121,700,00011p15.1Distamycin ARarehsa-mir-3159, hsa-mir-4486, hsa-mir-4694
FRA12A1246,400,00154,900,00012q13.1Folic acidRareWNT10B, WNT1hsa-mir-4698, hsa-mir-4494, hsa-mir-6505, hsa-mir-1291, hsa-mir-4701, hsa-mir-1293, hsa-mir-6757, hsa-mir-196a-2, hsa-mir-615, hsa-mir-3198-2, hsa-mir-148b[137]
FRA12B1280,300,00192,600,00012q21.3AphidicolinCommonKITLGhsa-mir-617, hsa-mir-618, hsa-mir-4699
FRA12C12114,300,001120,700,00012q24.2BrdURarehsa-mir-620, hsa-mir-4472-2, hsa-mir-1178, hsa-mir-4498[138]
FRA12D12112,300,001114,300,00012q24.13Folic acidRarehsa-mir-3657, hsa-mir-6861, hsa-mir-1302-1, hsa-mir-7106, hsa-mir-6762
FRA12E12109,000,001133,851,89512q24AphidicolinCommonFZD10hsa-mir-4496, hsa-mir-619, hsa-mir-4497, hsa-mir-6760, hsa-mir-6761, hsa-mir-3657, hsa-mir-6861, hsa-mir-1302-1, hsa-mir-7106, hsa-mir-6762, hsa-mir-620, hsa-mir-4472-2, hsa-mir-1178, hsa-mir-4498, hsa-mir-4700, hsa-mir-7107, hsa-mir-4304, hsa-mir-8072, hsa-mir-3908, hsa-mir-6880, hsa-mir-5188, hsa-mir-4419b, hsa-mir-3612, hsa-mir-6763
FRA13A1334,000,00135,500,00013q13.2AphidicolinCommon[139]
FRA13B1355,300,00173,300,00013q21BrdUCommonhsa-mir-5007, hsa-mir-3169, hsa-mir-548x-2, hsa-mir-4704
FRA13C1359,600,00162,300,00013q21.2AphidicolinCommonhsa-mir-3169[140]
FRA13D1395,000,001101,700,00013q32AphidicolinCommonhsa-mir-4501, hsa-mir-3170, hsa-mir-623, hsa-mir-4306
FRA13E1373,300,00179,000,00013q22AphidicolinCommonhsa-mir-3665[115]
FRA14B1458,100,00167,900,00014q23AphidicolinCommonHIF1A, MAXhsa-mir-5586, hsa-mir-548h-1, hsa-mir-7855, hsa-mir-4706, hsa-mir-4708, hsa-mir-625
FRA14C1467,900,00170,200,00014q24.1AphidicolinCommonhsa-mir-5694
FRA15A1559,100,00167,500,00015q22AphidicolinCommonDAPK2, MAP2K1, SMAD3hsa-mir-2116, hsa-mir-8067, hsa-mir-6085, hsa-mir-190a, hsa-mir-422a, hsa-mir-1272, hsa-mir-4511, hsa-mir-4311, hsa-mir-4512[141]
FRA16A1614,800,00116,800,00016p13.11Folic acidRarehsa-mir-3179-1, hsa-mir-3670-1, hsa-mir-3180-1, hsa-mir-6511a-1, hsa-mir-6770-1, hsa-mir-1972-1, hsa-mir-6511b-2, hsa-mir-3180-4, hsa-mir-6506, hsa-mir-484, hsa-mir-3179-2, hsa-mir-3670-2, hsa-mir-3180-2, hsa-mir-6511a-2, hsa-mir-6770-2, hsa-mir-6511a-3[142]
FRA16B1666,700,00170,800,00016q22.1Distamycin ARareCDH1hsa-mir-328, hsa-mir-6773, hsa-mir-1538, hsa-mir-140, hsa-mir-1972-2[143]
FRA16C1666,700,00170,800,00016q22.1AphidicolinCommonCDH1hsa-mir-328, hsa-mir-6773, hsa-mir-1538, hsa-mir-140, hsa-mir-1972-2[144]
FRA16D1679,200,00181,700,00016q23.2AphidicolinCommonhsa-mir-4720, hsa-mir-7854, hsa-mir-6504[143]
FRA16E1624,200,00128,100,00016p12.1Distamycin ARarehsa-mir-1273h, hsa-mir-548w[145]
FRA17A1710,700,00116,000,00017p12Distamycin ARarehsa-mir-744, hsa-mir-1269b, hsa-mir-548h-3, hsa-mir-4731[123]
FRA17B1757,600,00158,300,00017q23.1AphidicolinCommonhsa-mir-21, hsa-mir-4737[112]
FRA18A1832,700,00137,200,00018q12.2AphidicolinCommonhsa-mir-3975, hsa-mir-187, hsa-mir-3929, hsa-mir-4318[146]
FRA18B1853,800,00161,600,00018q21.3AphidicolinCommonBCL2hsa-mir-122, hsa-mir-3591[147]
FRA18C1866,800,00168,700,00018q22.2AphidicolinCommon[148]
FRA19A1932,400,00159,128,98319q135-AzacytidineCommonCEBPA, AKT2, EGLN2, TGFB1, CBLC, FGF21, BAX, FLT3LG, KLK3, BIRC8, PRKCGhsa-mir-6887, hsa-mir-5196, hsa-mir-4530, hsa-mir-6719, hsa-mir-641, hsa-mir-6796, hsa-mir-6797, hsa-mir-4323, hsa-mir-8077, hsa-mir-4531, hsa-mir-8085, hsa-mir-6088, hsa-mir-330, hsa-mir-642a, hsa-mir-642b, hsa-mir-769, hsa-mir-320e, hsa-mir-3190, hsa-mir-3191, hsa-mir-6798, hsa-mir-4324, hsa-mir-150, hsa-mir-5088, hsa-mir-6799, hsa-mir-6800, hsa-mir-4749, hsa-mir-4750, hsa-mir-4751, hsa-mir-8074, hsa-mir-99b, hsa-let-7e, hsa-mir-125a, hsa-mir-6801, hsa-mir-643, hsa-mir-512-1, hsa-mir-512-2, hsa-mir-1323, hsa-mir-498, hsa-mir-520e, hsa-mir-515-1, hsa-mir-519e, hsa-mir-520f, hsa-mir-515-2, hsa-mir-519c, hsa-mir-1283-1, hsa-mir-520a, hsa-mir-526b, hsa-mir-519b, hsa-mir-525, hsa-mir-523, hsa-mir-518f, hsa-mir-520b, hsa-mir-518b, hsa-mir-526a-1, hsa-mir-520c, hsa-mir-518c, hsa-mir-524, hsa-mir-517a, hsa-mir-519d, hsa-mir-521-2, hsa-mir-520d, hsa-mir-517b, hsa-mir-520g, hsa-mir-516b-2, hsa-mir-526a-2, hsa-mir-518e, hsa-mir-518a-1, hsa-mir-518d, hsa-mir-516b-1, hsa-mir-518a-2, hsa-mir-517c, hsa-mir-520h, hsa-mir-521-1, hsa-mir-522, hsa-mir-519a-1, hsa-mir-527, hsa-mir-516a-1, hsa-mir-1283-2, hsa-mir-516a-2, hsa-mir-519a-2, hsa-mir-371a, hsa-mir-371b, hsa-mir-372, hsa-mir-373, hsa-mir-935, hsa-mir-4752, hsa-mir-8061, hsa-mir-7975, hsa-mir-6804, hsa-mir-6802, hsa-mir-6803, hsa-mir-6805, hsa-mir-6806, hsa-mir-4754, hsa-mir-6807
FRA19B19120,000,00019p13Folic acidRareFGF22, APC2, DAPK3, MAP2K2, PIK3R2hsa-mir-1302-10, hsa-mir-4745, hsa-mir-3187, hsa-mir-1909, hsa-mir-1227, hsa-mir-6789, hsa-mir-4321, hsa-mir-7108, hsa-mir-7850, hsa-mir-637, hsa-mir-4746, hsa-mir-7-3, hsa-mir-4747, hsa-mir-6885, hsa-mir-6790, hsa-mir-3940, hsa-mir-6791, hsa-mir-6792, hsa-mir-4999, hsa-mir-5589, hsa-mir-4322, hsa-mir-1181, hsa-mir-1238, hsa-mir-638, hsa-mir-4748, hsa-mir-199a-1, hsa-mir-6793, hsa-mir-6886, hsa-mir-7974, hsa-mir-5684, hsa-mir-6794, hsa-mir-5695, hsa-mir-6515, hsa-mir-24-2, hsa-mir-27a, hsa-mir-23a, hsa-mir-181c, hsa-mir-181d, hsa-mir-1199, hsa-mir-639, hsa-mir-6795, hsa-mir-1470, hsa-mir-3188, hsa-mir-3189, hsa-mir-640
FRA20A2017,900,00121,300,00020p11.23Folic acidRarehsa-mir-3192
FRA20B20920,000112,100,00020p12.2AphidicolinCommonhsa-mir-6870
FRA22A2237,600,00151,304,56622q13Folic acidRareRAC2, PDGFB, RBX1, EP300, WNT7Bhsa-mir-658, hsa-mir-659, hsa-mir-6820, hsa-mir-4534, hsa-mir-4766, hsa-mir-1281, hsa-mir-6889, hsa-mir-33a, hsa-mir-378i, hsa-mir-1249, hsa-mir-4762, hsa-mir-3619, hsa-let-7a-3, hsa-mir-4763, hsa-let-7b, hsa-mir-3201, hsa-mir-4535, hsa-mir-3667, hsa-mir-6821[149]
FRA22B2229,600,00132,200,00022q12.2AphidicolinCommonhsa-mir-3653, hsa-mir-6818, hsa-mir-3200, hsa-mir-3928, hsa-mir-7109
FRAXAX142,100,001147,100,000Xq27.3Folic acidRarehsa-mir-892c, hsa-mir-890, hsa-mir-888, hsa-mir-892a, hsa-mir-892b, hsa-mir-891b, hsa-mir-891a, hsa-mir-513c, hsa-mir-513b, hsa-mir-513a-1, hsa-mir-513a-2, hsa-mir-506, hsa-mir-507, hsa-mir-508, hsa-mir-514b, hsa-mir-509-2, hsa-mir-509-3, hsa-mir-509-1, hsa-mir-510, hsa-mir-514a-1, hsa-mir-514a-2, hsa-mir-514a-3[150]
FRAXBX6,000,0019,500,000Xp22.31AphidicolinCommonhsa-mir-4770, hsa-mir-4767, hsa-mir-651[119]
FRAXCX98,300,001102,600,000Xq22.1AphidicolinCommon[119]
FRAXDX140,300,001142,100,000Xq27.2AphidicolinCommon[151]
FRAXEX147,100,001155,270,560Xq28Folic acidRareIKBKGhsa-mir-2114, hsa-mir-4330, hsa-mir-224, hsa-mir-452, hsa-mir-105-1, hsa-mir-767, hsa-mir-105-2, hsa-mir-3202-1, hsa-mir-3202-2, hsa-mir-718, hsa-mir-6858, hsa-mir-664b, hsa-mir-1184-1, hsa-mir-1184-2, hsa-mir-1184-3[152]
FRAXFX147,100,001155,270,560Xq28Folic acidRareIKBKGhsa-mir-2114, hsa-mir-4330, hsa-mir-224, hsa-mir-452, hsa-mir-105-1, hsa-mir-767, hsa-mir-105-2, hsa-mir-3202-1, hsa-mir-3202-2, hsa-mir-718, hsa-mir-6858, hsa-mir-664b, hsa-mir-1184-1, hsa-mir-1184-2, hsa-mir-1184-3[153]

A total of 125 CFSs were collected from GeneCards V3 [87] and from manual bibliographic search. Data for 1,871 human miRs were downloaded from miRBase 20 [53]. Genomic data of exon genes were downloaded from Ensembl Biomart [88]. A list of 327 genes that participate in Pathways in cancer (KEGG PATHWAY: hsa05200) was retrieved from KEGG [89]. PHP scripts parsed the data to populate a MySQL (http://www.mysql.com) relational database. Other scripts queried the database to produce a list of CFSs with their associated cancer-related genes and microRs. The main results found here are: (i) the total length of the cytogenetic bands of the 125 CFSs is 835.22 Mbps (27 % of the entire haploid genome), while 110 cancer-related genes (33.6 % of all cancer-related genes) and 686 miRs (36.7 % of all miRs) are located within these CFSs, (ii) the total length of the 26 CFSs with precise genomic boundaries is 79.3 Mbps (2.6 % of the entire haploid genome), and (iii) nine cancer-related genes (2.8 % of all cancer-related genes) and 44 miRs (2.4 % of all miRNAs) are located within these molecularly defined CFSs (Table 2). Based on the above, the density of cancer-related genes and miRs in cytogenetically defined CFS compared to the rest of the genome is 37.2 and 56.7 % higher, respectively

Fig. 2

Frequency of cancer-related genes, repetitive elements, miRs, binding elements, and histone marks in CFSs. a CFSs exhibit a higher density of cancer-related genes (obtained from Kyoto Encyclopedia of Genes and Genomes), Alu repetitive elements [19], miRs, and the CTCF binding element relative to non-fragile regions. b CFSs exhibit a differential density of the histone marks (i) Histone 3 lysine 27 acetylation (H3K27ac) and (ii) Histone 3 lysine 4 trimethylation (H3K4me3), relative to non-fragile regions that is cell type origin-dependent (data concerning histone modifications derived from ChIP-seq experiments belonging to the ENCODE project were downloaded from the UCSC server (http://hgdownload.cse.ucsc.edu/goldenPath/hg19/encodeDCC/wgEncodeRegMarkH3k4me3/, http://hgdownload.cse.ucsc.edu/goldenPath/hg19/encodeDCC/wgEncodeRegMarkH3k27ac/). Specifically, we obtained bigWig files for H3k4me3 and H3k27ac modifications in the GM12878, H1-hesc, HSMM, HUVEC, K562, NHEK, and NHLF cell types. Information concerning regions of interest was extracted with the bigWigSummary utility, also available from the UCSC server. Specifically, the average signal was calculated for every chromosome in every cell line by repeatedly invoking: bigWigSummary-type = mean “bigwigfile” chrN start end. Similarly, the average signal was calculated for cytogenetically and molecularly mapped fragile sites. For every defined fragile site, the mean histone modification signal of the corresponding chromosome was subtracted from the mean signal of the fragile region. Signal difference from mean for site i = mean(FS) − mean(chromosomeFS). The chromosome means varied within each cell type (data not shown), as did the histone modifications between cell types) (S histone signal). c Frequency of histone marks per CFSs. Each CFS exhibits a differential density of the histone marks. (i) Histone 3 lysine 4 trimethylation (H3K4me3) and (ii) Histone 3 lysine 27 acetylation (H3K27ac), relative to non-fragile regions averaged over all cell types presented in Fig. 2 (using the data generated for Fig. 2b, we also plotted a boxplot for individual cytogenetically defined CFSs in all 7 cell lines mentioned above with respect to H3K4me3 and H3K27ac. Significant heterogeneity between CFSs can be observed)

Cytogenetic defined common fragile sites (CFSs) association with cancer genes and miRs A total of 125 CFSs were collected from GeneCards V3 [87] and from manual bibliographic search. Data for 1,871 human miRs were downloaded from miRBase 20 [53]. Genomic data of exon genes were downloaded from Ensembl Biomart [88]. A list of 327 genes that participate in Pathways in cancer (KEGG PATHWAY: hsa05200) was retrieved from KEGG [89]. PHP scripts parsed the data to populate a MySQL (http://www.mysql.com) relational database. Other scripts queried the database to produce a list of CFSs with their associated cancer-related genes and microRs. The main results found here are: (i) the total length of the cytogenetic bands of the 125 CFSs is 835.22 Mbps (27 % of the entire haploid genome), while 110 cancer-related genes (33.6 % of all cancer-related genes) and 686 miRs (36.7 % of all miRs) are located within these CFSs, (ii) the total length of the 26 CFSs with precise genomic boundaries is 79.3 Mbps (2.6 % of the entire haploid genome), and (iii) nine cancer-related genes (2.8 % of all cancer-related genes) and 44 miRs (2.4 % of all miRNAs) are located within these molecularly defined CFSs (Table 2). Based on the above, the density of cancer-related genes and miRs in cytogenetically defined CFS compared to the rest of the genome is 37.2 and 56.7 % higher, respectively Frequency of cancer-related genes, repetitive elements, miRs, binding elements, and histone marks in CFSs. a CFSs exhibit a higher density of cancer-related genes (obtained from Kyoto Encyclopedia of Genes and Genomes), Alu repetitive elements [19], miRs, and the CTCF binding element relative to non-fragile regions. b CFSs exhibit a differential density of the histone marks (i) Histone 3 lysine 27 acetylation (H3K27ac) and (ii) Histone 3 lysine 4 trimethylation (H3K4me3), relative to non-fragile regions that is cell type origin-dependent (data concerning histone modifications derived from ChIP-seq experiments belonging to the ENCODE project were downloaded from the UCSC server (http://hgdownload.cse.ucsc.edu/goldenPath/hg19/encodeDCC/wgEncodeRegMarkH3k4me3/, http://hgdownload.cse.ucsc.edu/goldenPath/hg19/encodeDCC/wgEncodeRegMarkH3k27ac/). Specifically, we obtained bigWig files for H3k4me3 and H3k27ac modifications in the GM12878, H1-hesc, HSMM, HUVEC, K562, NHEK, and NHLF cell types. Information concerning regions of interest was extracted with the bigWigSummary utility, also available from the UCSC server. Specifically, the average signal was calculated for every chromosome in every cell line by repeatedly invoking: bigWigSummary-type = mean “bigwigfile” chrN start end. Similarly, the average signal was calculated for cytogenetically and molecularly mapped fragile sites. For every defined fragile site, the mean histone modification signal of the corresponding chromosome was subtracted from the mean signal of the fragile region. Signal difference from mean for site i = mean(FS) − mean(chromosomeFS). The chromosome means varied within each cell type (data not shown), as did the histone modifications between cell types) (S histone signal). c Frequency of histone marks per CFSs. Each CFS exhibits a differential density of the histone marks. (i) Histone 3 lysine 4 trimethylation (H3K4me3) and (ii) Histone 3 lysine 27 acetylation (H3K27ac), relative to non-fragile regions averaged over all cell types presented in Fig. 2 (using the data generated for Fig. 2b, we also plotted a boxplot for individual cytogenetically defined CFSs in all 7 cell lines mentioned above with respect to H3K4me3 and H3K27ac. Significant heterogeneity between CFSs can be observed)

Fragile sites and microRNA genes

According to an early study, miRs are particularly frequent in CFSs [6]. Out of the 186 miRs known at the time, 35 were found in, or very close (<3 Mb) to, CFSs, occurring at a density (number of miRs per length) that was estimated to be approximately 9 % higher than in non-FSs. A newer analysis has found approximately 33.8 % of 715 miRs within CFSs, corresponding to a relative 50 % (26–85 %) higher density in regard to non-FSs, but the relation seems to vary between chromosomes [7] with some, like chromosome 16 and 19, having many more fragile than non-fragile miRAs, and others, like chromosome 14, having a lower incidence of miRNAs in fragile regions. Given that the number of known miRs has more than doubled since then, we have repeated this analysis with the recent version of miRBase (v20, [53]) and have found 686 miRs out of 1,871 (36.7 %) within cytogenetically defined fragile sites (Table 2) (corresponding incidence in molecularly mapped CFSs is shown in Table 3). Thus, the relative density of miR in cytogenetically defined CFSs is 57 % higher than in the rest of the genome (Fig. 2a; Table 2). Specific pertinent examples include tumor suppressors like hsa-mir-34a in FRA1A and oncomirs like hsa-mir-21 in FRA17B. In addition, more than 50 % of microRNAs seem to be clustered in relatively short regions, up to 50 kb, often containing multiple miR isoforms belonging to the same family [54]. When mapped, approximately 28 % of miR clusters overlap with known FSs. Rearrangements within these regions can disrupt multiple miRs in a single hit and produce complex phenotypic changes.
Table 3

A list of molecularly delimited common fragile sites (CFSs) obtained from a manual search of the literature

FRAChrStartEndBAC/STS fromBAC/STS toTypeFrequencyCancer genesmiRNAsReferences
FRA1E197,749,96198,119,925RP4-538N4RP11-128G13AphidicolinCommon[92]
FRA1H1216,110,179226,742,379RP11-22M7RP11-118H45AZACommonTGFB2hsa-mir-215, hsa-mir-194-1, hsa-mir-664a, hsa-mir-320b-2, hsa-mir-4742, hsa-mir-6741[154]
FRA2Ccen218,524,76019,272,430RP11-720L11RP11-78J22AphidicolinCommon[97]
FRA2Ctel214,934,88915,681,539RP11-526G2RP11-32P22AphidicolinCommon[97]
FRA2G2169,498,510170,313,244RP11-285F23RP11-724O16AphidicolinCommon[155]
FRA2H2186,716,111187,251,132RP11-561J1RP11-639N24AphidicolinCommon[156]
FRA3B359,623,63263,846,635RP11-70P20RP11-50F24AphidicolinCommon[157]
FRA4F490,208,19197,312,924RP11-549C16RP11-145G20AphidicolinCommon[109]
FRA6E6160,275,245166,079,958D6S1581D6S1719AphidicolinCommon[158]
FRA6F6111,577,970112,568,919RP5-1112D6RP1-142L7AphidicolinCommon[159]
FRA6H627,804,03837,156,663RP1-193B12RP11-588I14AphidicolinCommonRXRB, PPARD, CDKN1Ahsa-mir-877, hsa-mir-4640, hsa-mir-6891, hsa-mir-6832, hsa-mir-4646, hsa-mir-1236, hsa-mir-6721, hsa-mir-6833, hsa-mir-3135b, hsa-mir-219a-1, hsa-mir-6873, hsa-mir-6834, hsa-mir-5004, hsa-mir-3934, hsa-mir-7159, hsa-mir-1275, hsa-mir-6835, hsa-mir-7111, hsa-mir-5690, hsa-mir-3925[115]
FRA7B7112,392,296TelomereRP11-507C1AphidicolinCommonPDGFA, RAC1hsa-mir-339, hsa-mir-4655, hsa-mir-6836, hsa-mir-4648, hsa-mir-4656, hsa-mir-589, hsa-mir-6874, hsa-mir-3683[116]
FRA7E780,508,75184,935,939D7S1934SHGC-104456AphidicolinCommonHGF[117]
FRA7G7115,894,865116,072,877D7S486D7S522AphidicolinCommon[160]
FRA7H7130,413,791130,857,950D7S786D7S649AphidicolinCommonhsa-mir-29a, hsa-mir-29b-1[161]
FRA7I7144,671,106146,121,417SHGC-153624sWSS2627AphidicolinCommon[120]
FRA7K7110,657,856111,031,412WI-5281SHGC-78648AphidicolinCommon[121]
FRA8C8124,285,269128,421,245RP11-468O2RP11-382A18AphidicolinCommonhsa-mir-548aa-1, hsa-mir-548d-1, hsa-mir-6844, hsa-mir-4662b, hsa-mir-4662a[162]
FRA9G917,135,03817,503,917Within c9orf39/CNTLNAphidicolinCommon[125]
FRA10F10125,391,503128,256,865RP11-391M7RP11-179O22AphidicolinCommonCTBP2hsa-mir-4296, hsa-mir-4484[130]
FRA11E1132,086,45834,028,916RP1-17K7RP13-786C16AphidicolinCommon[134]
FRA11G11113,688,799118,157,704RP11-667M19RP11-832A4AphidicolinCommonZBTB16[133]
FRA13A1335,546,08836,184,897RP11-307O13RP11-270C18AphidicolinCommon[139]
FRA13E1373,285,77476,386,498RP11-342J4RP11-29G8AphidicolinCommon[115]
FRA16D1678,420,35978,731,553AH009490.2FRA16D seqAphidicolinCommon[163]
FRAXBX6,595,1117,548,235DXS1130DXS237AphidicolinCommonhsa-mir-4767[164]

The list of molecularly mapped CFSs was compiled by performing a systematic search of the literature for each one of the known cytogenetically mapped fragile sites (n = 125). Whenever the placement of STS markers or BACs on GRC37/hg19 was unknown, it was verified by megaBLAST (default parameters) of the complete sequence against the human chromosome sequences. Only matches with greater than 90 % coverage were considered for CFS placement. Twenty-six (26) CFSs with a precise mapping were identified (see Table 4), whose coordinates were then mapped to the reference genome version GRC37. Whenever the precise location of BACs was unknown, it was verified by alignment with NCBI Megablast (default settings). The annotation of the reference human genome was obtained from UCSC (URL: hgdownload.cse.ucsc.edu/goldenPath/hg19/) in December 2013. The utility “bigWigSummary” was used to extract mean scores for molecularly mapped fragile regions and randomly selected non-fragile control regions. The BAC/STS limits have been extracted from the cited publication and coordinates mapped on GRC37. Note that FRA2C has been divided into two separate, molecularly defined hotspots

A list of molecularly delimited common fragile sites (CFSs) obtained from a manual search of the literature The list of molecularly mapped CFSs was compiled by performing a systematic search of the literature for each one of the known cytogenetically mapped fragile sites (n = 125). Whenever the placement of STS markers or BACs on GRC37/hg19 was unknown, it was verified by megaBLAST (default parameters) of the complete sequence against the human chromosome sequences. Only matches with greater than 90 % coverage were considered for CFS placement. Twenty-six (26) CFSs with a precise mapping were identified (see Table 4), whose coordinates were then mapped to the reference genome version GRC37. Whenever the precise location of BACs was unknown, it was verified by alignment with NCBI Megablast (default settings). The annotation of the reference human genome was obtained from UCSC (URL: hgdownload.cse.ucsc.edu/goldenPath/hg19/) in December 2013. The utility “bigWigSummary” was used to extract mean scores for molecularly mapped fragile regions and randomly selected non-fragile control regions. The BAC/STS limits have been extracted from the cited publication and coordinates mapped on GRC37. Note that FRA2C has been divided into two separate, molecularly defined hotspots
Table 4

Data from ENCODE [10] with respect to molecularly mapped common fragile sites (CFSs) from Table 3

NameCTCF bs (count)CTCF bs (%)CTCF bs/kbpH3K27ac99-way cons. score
FRA1E970.0020.2611.320.133
FRA1H17,3380.3721.6313.07
FRA2Ctel7000.0150.9382.530.071
FRA2Ccen1,1080.0241.4861.560.111
FRA2G1,5010.0321.6984.220.114
FRA2H1320.0030.2551.440.103
FRA3B4,1560.0890.9821.570.105
FRA4F4,7650.1020.6711.630.085
FRA6H33,7350.7243.6078.140.12
FRA6F2,1040.0452.1232.930.126
FRA6E6,2740.1351.0812.470.062
FRA7B23,3310.51.9073.13
FRA7E4,2530.0910.9611.60.082
FRA7K2650.0060.7092.560.097
FRA7G2650.0061.4892.460.098
FRA7H1,4580.0313.2834.380.089
FRA7I4350.0090.31.980.065
FRA8C8,5990.1842.0793.260.082
FRA9G1680.0040.4551.20.142
FRA10F5,8780.1262.0512.570.071
FRA11E4,4940.0962.3144.050.123
FRA11G9,5450.2052.1362.570.126
FRA13A1930.0040.3021.230.108
FRA13E2,9500.0630.9511.780.104
FRA16D3780.0081.2151.870.118
FRAXB7180.0150.7531.540.056

CTCF binding sites, studied in 89 cell lines, are shown respectively as absolute counts, with respect to the total number of CTCF binding sites in the whole genome and as a frequency per kb. Average H3k27 acetylation scores from ChIP-seq analysis of the K562 cell line have been calculated for each fragile site. Average conservation score between human and 99 vertebrates was obtained from the UCSC browser (http://genome.ucsc.edu/cgi-bin/hgTrackUi?db=hg19&g=cons100way). We compiled a list of molecularly mapped CFSs by performing a systematic search of the literature for each one of the known cytogenetically mapped CFSs (n = 125). Twenty-six CFSs with a precise mapping were identified (see Table 3), and their coordinates were then mapped to the reference genome version GRC37. Whenever the precise location of BACs was unknown, it was verified by alignment with NCBI Megablast (default settings). The annotation of the reference human genome was obtained from UCSC (URL: hgdownload.cse.ucsc.edu/goldenPath/hg19/) in December 2013. The utility “bigWigSummary” was used to extract mean scores for molecularly mapped fragile regions and randomly selected non-fragile control regions

Fragile sites and regions with regulatory potential

Other DNA elements, such as regions with regulatory potential, may also be contained or overlap with CFSs. As an example, CTCF binding sites from ENCODE ChIP-seq are distributed throughout the genome. CTCF is a critical “weaver” of chromatin structure and function, and can provide an anchor point for nucleosome positioning [55]. Indirectly, CTCF can influence the accessibility of chromatin and plays diverse roles in chromatin insulation, gene regulation, imprinting, intra/interchromosomal interactions, nuclear compartmentalization, and alternative splicing [56, 57]. In some cases, distal fragments bound by CTCF have been found to mediate long-range interactions by loop formation and could modulate transcription at distant sites [58]. We examined the percentage of all potential CTCF binding sites within molecularly mapped CFSs and found that this ranges between 2.76 and 3.20 % in different cell lines (Table 4). Relative to the total length of fragile segments, this corresponds to an 18 % (10–25 %)-fold increase in the number of potential CTCF binding sites (Fig. 2a). Although it is impossible to know whether CTCF binding at these sites exerts a meaningful effect, its presence seems in accordance with the observation that many CFSs are gene-rich. Even more, CFSs rearrangements could influence gene expression further away on the same chromosome. Data from ENCODE [10] with respect to molecularly mapped common fragile sites (CFSs) from Table 3 CTCF binding sites, studied in 89 cell lines, are shown respectively as absolute counts, with respect to the total number of CTCF binding sites in the whole genome and as a frequency per kb. Average H3k27 acetylation scores from ChIP-seq analysis of the K562 cell line have been calculated for each fragile site. Average conservation score between human and 99 vertebrates was obtained from the UCSC browser (http://genome.ucsc.edu/cgi-bin/hgTrackUi?db=hg19&g=cons100way). We compiled a list of molecularly mapped CFSs by performing a systematic search of the literature for each one of the known cytogenetically mapped CFSs (n = 125). Twenty-six CFSs with a precise mapping were identified (see Table 3), and their coordinates were then mapped to the reference genome version GRC37. Whenever the precise location of BACs was unknown, it was verified by alignment with NCBI Megablast (default settings). The annotation of the reference human genome was obtained from UCSC (URL: hgdownload.cse.ucsc.edu/goldenPath/hg19/) in December 2013. The utility “bigWigSummary” was used to extract mean scores for molecularly mapped fragile regions and randomly selected non-fragile control regions

Fragile sites and histone modifications

Epigenetic modifications, such as histone methylation and acetylation, can also take place within FS. It appears that H3K9/14 hypoacetylation is a global feature of CFSs in a lymphoblastoid cell line [9] and could impede replication progression. Several enzymes modify key histone residues with relatively high specificity and regulate, indirectly, transcription, repair, and replication. Indeed, histone modifications vary significantly between cell types and are well correlated with transcription levels in the ENCODE data (Figure 2 in [10]), especially for H3K79me2, H3K9ac, H3K4me3, and H3K27ac. Histone 3 lysine 27 acetylation (H3K27ac) co-localizes with active enhancers [59] and regions with open chromatin structure and could play a role in protecting against replication–transcription collisions and R-loop formation. Intriguingly, H3K27ac varies between cell types and CFSs (Fig. 2b, c). The average ChIP-seq acetylation signal and the number of signal peaks (data not shown) within cytogenetic CFSs are slightly lower than the corresponding chromosome mean in K562 cells, but higher in HUVEC cells and equivocal for other cell types. Such variability may explain the plasticity of CFSs and their differential expression between individuals, cell types, and culture conditions. Histone 3 lysine 4 trimethylation is also associated with active promoters [60] and correlates well with transcription in the ENCODE data [10]. It appears that H3K4me3 may also contribute to the DNA damage response and repair of DSBs in yeast cells [61], mediating cellular responses to genotoxic stresses, and interacting with the human tumor suppressor ING1, which is required for DNA repair and apoptotic activities [62]. A recent study of ERFSs [9] has shown that replication stall, as identified by anti-replication protein A ChIP, preferentially co-localizes with H3K4me3 (see supplemental figure 1 in [16]). Although cytogenetically defined CFSs as a whole do not show a large deviation from the mean, some sites in particular, like FRA3B and FRA16D, seem to be on average poor in H3K4me3 while others, like FRA2E, FRA3C, and FRA7D seem to be on average rich in H3K4me3 (Fig. 2c). When the cell lines employed were grouped according to their origin (cancerous versus embryonic versus normal epithelial versus normal mesenchymal), a pattern regarding the density of the H3K27ac and H3K4me3 within CFSs relative to non-fragile sites could be discerned (Fig. 2b). Cancer and embryonic cells (K562, GM12878, H1-hESC) displayed lower signals of H3K27ac and H3K4me3 relative to the non-fragile regions, whereas a significantly different distribution was noticed in the other cell line groups (p < 0.001, ANOVA). Despite the small number of cell lines examined, a possible functional link between histone modifications and the other elements (genes, non-coding RNAs, regulatory sequences) positioned within the CFS cannot be excluded (Fig. 1). This potential interplay may be even more complex during carcinogenesis. Oncogenes may distress this functional cross-talk by altering the epigenome of a particular region. As an example, oncogenic Cdc6 was shown to act as “molecular switch” at certain tumor-suppressor loci by regulating CTCF binding. The latter led to suppression of the genes encoded and simultaneous firing of adjacent dormant origins. If such a scenario takes place within CFS that are rich in CTCF sites (Fig. 2a), and depending on the cellular context, the density and timing of firing origins can be altered affecting replication dynamics [56, 57]. Our current understanding of CFSs is traditionally based on a static mapping, often cytogenetic and imprecise, which cannot fully capture the interaction of non-coding DNA, regulatory elements, and histone modifications with vulnerability to RS. Even though the current CFS mapping successfully predicts response to extrinsic stress and OIRS, a more accurate model of fragility will eventually have to integrate experimental data at the nucleotide resolution with other non-coding elements. This is an intriguing area for further study.

Fragile sites in carcinogenesis

Extending the concept of CFSs in carcinogenesis has been a subject of active research since the discovery of an association between cancer breakpoints and FSs [63]. Overall, it appears that CFSs are generally sensitive to innate RS occurring naturally in various tumors and cell lines [64]. Multiple clusters of homozygous deletions, usually small, have been detected over known CFSs in an exhaustive survey of cancer genomes but their expression profile is variable [65]. For example, FRA2F, FRA3B, FRA4F, FRA5H, and FRA16D were most affected while others, like FRA2B and FRA4B, were least affected. Recurrent alterations have been identified in FRA3B and FRA16D in several cancer types, leading to further investigation of the FHIT and WWOX genes, respectively, in mouse models (see K. Huebner and R. Aqeilan chapters in this issue). Fragile sites FRA10C and FRA10G may be involved in the formation of the oncogenic RET/PTC rearrangement in papillary thyroid carcinoma [15]. Specifically, in RET/PTC1, the FRA10G-localized RET is rearranged with the FRA10C-localized tumor suppressor gene CCDC6, while in RET/PTC3 it is rearranged with NCO4 that is located in FRA10G. In a similar manner, the MYC oncogene is flanked by CFSs FRA8C and FRA8D, that may facilitate adjacent integration of HPV18 [66] or MYC amplification [67]. Viral DNA integration in the genome of a host cell can lead to cancer development and CFSs provide preferential hotspots for this [68]. Particularly, HPV16 E6 and E7 oncogenic products have been shown to induce replication stress and DSBs in the host cell. This occurs preferentially at CFSs allowing viral genome integration at these sites [68]. Despite the abundance of CFS breaks in cancer, it would be inappropriate to assume that alterations of genes residing within CFSs always confer a clonal advantage in cancer development [65] without evidence of selection or at least convincing causative models. Clearly, breakage probability (passenger alterations) as a consequence of fragility and clonal selection (driver alterations) in cancer development are two separate phenomena that should not be confused. Nevertheless, the impact of CFS instability in cancer should not be easily dismissed or oversimplified. CFSs breakpoints have been detected in preneoplastic lesions in human and mouse models [4, 19] well before the emergence of the malignant phenotype. Briefly, exposure of xenografted normal human skin to growth factors preferentially induces CFS instability. Similarly, hyperplastic mouse urothelium from HRAS transgenic mice showed numerous copy number alterations in fragile areas. CFS instability is an early manifestation and can be attributed to experimentally controlled, oncogene-induced stress in these studies, in a way that more closely resembles carcinogenesis than APH-induced stress. Therefore, it could be argued that CFS alterations are frequent in cancer, as described above [65], not just because of a higher breakage probability but also because of an earlier involvement, even before the complete deregulation of the cellular machinery. Furthermore, any double-strand break can have dire consequences, such as the initiation of a breakage-fusion-bridge cycle, especially when subtelomeric and peri-centromeric CFSs are disrupted simultaneously [69]. Through this mechanism, CFS breaks can amplify oncogenes, delete tumor suppressors or, most importantly, initiate persistent chromosomal instability. Massive accumulation of localized chromosomal rearrangements in a single time-point, termed chromothripsis (literally: shattering of the chromosome) and chromoanasynthesis, has recently been identified in several cancer types [70]. Indirect evidence suggests that CFSs may have an important role in this process [71] by stalling the RF, favoring RF collapse and, in extreme cases, chromosome pulverization leading to clustering of chromosomal breaks [13]. Indeed, chromosome fragmentation distal to the CFS has been observed under the microscope in some cases [71] and could be a triggering factor for chromothripsis. On the other hand, multi-step, recurrent CFS alterations could be difficult to discriminate from single-step rearrangements, rendering the identification of chromothripsis even more difficult. In that scenario, CFS stability and localization is an important parameter in the bioinformatic algorithms that are applied to define and model such cancer rearrangements. In addition, CFSs can contribute to the clustered shuttering of the chromosomes also during the process of premature chromatin condensation (PCC) [72, 73] in which interphase chromosomes or late replicating chromosome zones like CFS or extranuclear bodies micronuclei are ‘induced’ to condensation by various mitotic factors [74]. This reveals more possibilities for CFSs to act as contributors to GI through chromosome breakage. An interesting scenario suggested that the G2–M mammalian checkpoint can fail to delay mitotic onset as it may not be sensitive enough to detect a few remaining long-replicating forks, thus allowing chromatin condensation of late replicating CFS regions, resulting in multiple DNA breaks [26, 44].

CFSs as “functional” units: a new perception

Common fragile sites have long been considered vulnerable breakage sites in the genome in response to RS from extrinsic factors. Their fragility has also been associated with GI in cancer development. As we have previously shown, CFSs are preferentially affected from the earliest precancerous lesions, in response to OIRS [2, 4]. In the current work, we first performed a review on the heterogeneity and fragility mechanisms affecting these sites. Next, by applying bioinformatic tools and exploiting available information in various databases, like the KEGG, miRbase, and ENCODE, we show a prevalence of various cancer-related genes, miRs, binding elements, and histone modifications in CFSs (Figs. 1, 3). The presence of such a wide spectrum of coding and non-coding elements changes the view on CFSs content and their nature itself. Given that CFSs are altered from the earliest stages in cancer, their impact on cancer development may be more profound than simply participating in the emergence of GI. On one hand, cancer-related genes and miRs may be affected from such early precancerous stages, therefore possibly exerting a strong pressure for malignant progression (Fig. 3). On the other hand, this pressure is also reinforced by alterations and imbalances in the binding elements and histone patterns, respectively, in the CFSs. Furthermore, collectively, all these alterations may further affect in an “avalanche” mode not only the stability of the CFSs, but overall of the genome (Figs. 1, 3). Therefore, as the anti-tumor barriers are gradually overwhelmed, this avalanche effect may function in a positive feedback mode to promote cancer. An important question that emerges is why CFSs are not selected for elimination from the genome, but are rather conserved features in mammals? A tempting but speculative answer is that by locating a set of important coding and non-coding elements in regions that replicate late and/or with delay and thus are prone to instability, they may function as alarm sensors scattered throughout the genome in various chromosomes, to signify detrimental effects from the RS on the cell. As long as the mammalian checkpoints and repair mechanisms are not compromised, cells can monitor and protect their genome and functional integrity through such a dynamic interaction. Nevertheless, this imposes the risk that if the checkpoints and the anti-tumor barriers gradually fail, tumor promotion ensues (Fig. 3). As we were able to examine only a small subset of binding elements and histone modifications from the ENCODE and the miRbase is constantly expanding, in the future more in-depth studies are required to obtain a comprehensive picture of CFSs and on their role in cancer. Overall, CFSs may not be merely structural domains vulnerable only to breakage but highly organized “functional” units that may have deeper biological consequences for the cell when affected.
Fig. 3

Model proposing that CFS apart from contributing to GI exert wider biological effects during cancer development. CFSs are preferentially affected from the earliest precancerous lesions, in response to OIRS, conferring to GI. A wide spectrum of coding and non-coding elements are present within CFSs. Cancer-related genes and miRs may be affected from such early precancerous stages, therefore possibly exerting a strong pressure for malignant progression. This pressure is also reinforced by alterations and imbalances in the binding elements and histone patterns, respectively, in the CFSs. Furthermore, collectively, all of these alterations may further affect in an “avalanche” mode not only the stability of the CFSs, but overall of the genome. As the anti-tumor barriers are gradually overwhelmed, this avalanche effect may function in a positive feedback mode to promote cancer

Model proposing that CFS apart from contributing to GI exert wider biological effects during cancer development. CFSs are preferentially affected from the earliest precancerous lesions, in response to OIRS, conferring to GI. A wide spectrum of coding and non-coding elements are present within CFSs. Cancer-related genes and miRs may be affected from such early precancerous stages, therefore possibly exerting a strong pressure for malignant progression. This pressure is also reinforced by alterations and imbalances in the binding elements and histone patterns, respectively, in the CFSs. Furthermore, collectively, all of these alterations may further affect in an “avalanche” mode not only the stability of the CFSs, but overall of the genome. As the anti-tumor barriers are gradually overwhelmed, this avalanche effect may function in a positive feedback mode to promote cancer
  164 in total

1.  SMAD5 gene expression, rearrangements, copy number, and amplification at fragile site FRA5C in human hepatocellular carcinoma.

Authors:  Drazen B Zimonjic; Marian E Durkin; Catherine L Keck-Waggoner; Sang-Won Park; Snorri S Thorgeirsson; Nicholas C Popescu
Journal:  Neoplasia       Date:  2003 Sep-Oct       Impact factor: 5.715

2.  Collisions between replication and transcription complexes cause common fragile site instability at the longest human genes.

Authors:  Anne Helmrich; Monica Ballarino; Laszlo Tora
Journal:  Mol Cell       Date:  2011-12-23       Impact factor: 17.970

3.  Common fragile site FRA11G and rare fragile site FRA11B at 11q23.3 encompass distinct genomic regions.

Authors:  Anne Fechter; Isabel Buettel; Elisabeth Kuehnel; Larissa Savelyeva; Manfred Schwab
Journal:  Genes Chromosomes Cancer       Date:  2007-01       Impact factor: 5.006

4.  Molecular characterization of the human common fragile site FRA1H.

Authors:  Angela Curatolo; Zaira M Limongi; Franca Pelliccia; Angela Rocchi
Journal:  Genes Chromosomes Cancer       Date:  2007-05       Impact factor: 5.006

5.  Fragile sites are preferential targets for integrations of MLV vectors in gene therapy.

Authors:  A C Bester; M Schwartz; M Schmidt; A Garrigue; S Hacein-Bey-Abina; M Cavazzana-Calvo; N Ben-Porat; C Von Kalle; A Fischer; B Kerem
Journal:  Gene Ther       Date:  2006-03-02       Impact factor: 5.250

6.  FRA7G extends over a broad region: coincidence of human endogenous retroviral sequences (HERV-H) and small polydispersed circular DNAs (spcDNA) and fragile sites.

Authors:  H Huang; J Qian; J Proffit; K Wilber; R Jenkins; D I Smith
Journal:  Oncogene       Date:  1998-05-07       Impact factor: 9.867

7.  Fragile sites at 16q22 are not at the breakpoint of the chromosomal rearrangement in AMMoL.

Authors:  R N Simmers; G R Sutherland; A West; R I Richards
Journal:  Science       Date:  1987-04-03       Impact factor: 47.728

8.  Interplay between ATM and ATR in the regulation of common fragile site stability.

Authors:  E Ozeri-Galai; M Schwartz; A Rahat; B Kerem
Journal:  Oncogene       Date:  2007-10-15       Impact factor: 9.867

9.  Oncogene-induced replication stress preferentially targets common fragile sites in preneoplastic lesions. A genome-wide study.

Authors:  P K Tsantoulis; A Kotsinas; P P Sfikakis; K Evangelou; M Sideridou; B Levy; L Mo; C Kittas; X-R Wu; A G Papavassiliou; V G Gorgoulis
Journal:  Oncogene       Date:  2007-12-17       Impact factor: 9.867

Review 10.  Common fragile sites: genomic hotspots of DNA damage and carcinogenesis.

Authors:  Ke Ma; Li Qiu; Kristin Mrasek; Jun Zhang; Thomas Liehr; Luciana Gonçalves Quintana; Zheng Li
Journal:  Int J Mol Sci       Date:  2012-09-20       Impact factor: 6.208

View more
  28 in total

1.  Enhanced Expression of FRA16B using AT-Rich DNA Binding Chemicals in a Woman with Secondary Amenorrhoea.

Authors:  Gunasekaran Bhavani; S Sivaprakash; Chandra R Samuel; Sathiyavedu Thyagarajan Santhiya
Journal:  J Clin Diagn Res       Date:  2017-06-01

Review 2.  Fragile sites in cancer: more than meets the eye.

Authors:  Thomas W Glover; Thomas E Wilson; Martin F Arlt
Journal:  Nat Rev Cancer       Date:  2017-07-25       Impact factor: 60.716

Review 3.  Recent advances in the study of HPV-associated carcinogenesis.

Authors:  Liyan Jin; Zhi-Xiang Xu
Journal:  Virol Sin       Date:  2015-04-20       Impact factor: 4.327

Review 4.  Stress-induced DNA damage biomarkers: applications and limitations.

Authors:  Zacharenia Nikitaki; Christine E Hellweg; Alexandros G Georgakilas; Jean-Luc Ravanat
Journal:  Front Chem       Date:  2015-06-02       Impact factor: 5.221

5.  WWOX guards genome stability by activating ATM.

Authors:  Idit Hazan; Mohammad Abu-Odeh; Thomas G Hofmann; Rami I Aqeilan
Journal:  Mol Cell Oncol       Date:  2015-02-23

6.  Persistence of Breakage in Specific Chromosome Bands 6 Years after Acute Exposure to Oil.

Authors:  Alexandra Francés; Kristin Hildur; Joan Albert Barberà; Gema Rodríguez-Trigo; Jan-Paul Zock; Jesús Giraldo; Gemma Monyarch; Emma Rodriguez-Rodriguez; Fernanda de Castro Reis; Ana Souto; Federico P Gómez; Francisco Pozo-Rodríguez; Cristina Templado; Carme Fuster
Journal:  PLoS One       Date:  2016-08-01       Impact factor: 3.240

7.  Current questions and controversies in chromosome fragile site research: does WWOX, the gene product of common fragile site FRA16D, have a passive or active role in cancer?

Authors:  I Hazan; R I Aqeilan
Journal:  Cell Death Discov       Date:  2015-10-19

Review 8.  Tumor Suppressor Genes within Common Fragile Sites Are Active Players in the DNA Damage Response.

Authors:  Idit Hazan; Thomas G Hofmann; Rami I Aqeilan
Journal:  PLoS Genet       Date:  2016-12-15       Impact factor: 5.917

9.  Immune-Related Nine-MicroRNA Signature for Predicting the Prognosis of Gastric Cancer.

Authors:  Jingxuan Xu; Jian Wen; Shuangquan Li; Xian Shen; Tao You; Yingpeng Huang; Chongyong Xu; Yaping Zhao
Journal:  Front Genet       Date:  2021-07-05       Impact factor: 4.599

10.  WWOX modulates the ATR-mediated DNA damage checkpoint response.

Authors:  Mohammad Abu-Odeh; Nyla A Hereema; Rami I Aqeilan
Journal:  Oncotarget       Date:  2016-01-26
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.