| Literature DB >> 32204553 |
Klizia Maccaroni1, Elisa Balzano1, Federica Mirimao1, Simona Giunta2, Franca Pelliccia1.
Abstract
Common fragile sites (CFSs) are particularly vulnerable regions of the genome that become visible as breaks, gaps, or constrictions on metaphase chromosomes when cells are under replicative stress. Impairment in DNA replication, late replication timing, enrichment of A/T nucleotides that tend to form secondary structures, the paucity of active or inducible replication origins, the generation of R-loops, and the collision between replication and transcription machineries on particularly long genes are some of the reported characteristics of CFSs that may contribute to their tissue-specific fragility. Here, we validated the induction of two CFSs previously found in the human fetal lung fibroblast line, Medical Research Council cell strain 5 (MRC-5), in another cell line derived from the same fetal tissue, Institute for Medical Research-90 cells (IMR-90). After induction of CFSs through aphidicolin, we confirmed the expression of the CFS 1p31.1 on chromosome 1 and CFS 3q13.3 on chromosome 3 in both fetal lines. Interestingly, these sites were found to not be fragile in lymphocytes, suggesting a role for epigenetic or transcriptional programs for this tissue specificity. Both these sites contained late-replicating genes NEGR1 (neuronal growth regulator 1) at 1p31.1 and LSAMP (limbic system-associated membrane protein) at 3q13.3, which are much longer, 0.880 and 1.4 Mb, respectively, than the average gene length. Given the established connection between long genes and CFS, we compiled information from the literature on all previously identified CFSs expressed in fibroblasts and lymphocytes in response to aphidicolin, including the size of the genes contained in each fragile region. Our comprehensive analysis confirmed that the genes found within CFSs are longer than the average human gene; interestingly, the two longest genes in the human genome are found within CFSs: Contactin Associated Protein 2 gene (CNTNAP2) in a lymphocytes' CFS, and Duchenne muscular dystrophy gene (DMD) in a CFS expressed in both lymphocytes and fibroblasts. This indicates that the presence of very long genes is a unifying feature of all CFSs. We also obtained replication profiles of the 1p31.1 and 3q13.3 sites under both perturbed and unperturbed conditions using a combination of fluorescent in situ hybridization (FISH) and immunofluorescence against bromodeoxyuridine (BrdU) on interphase nuclei. Our analysis of the replication dynamics of these CFSs showed that, compared to lymphocytes where these regions are non-fragile, fibroblasts display incomplete replication of the fragile alleles, even in the absence of exogenous replication stress. Our data point to the existence of intrinsic features, in addition to the presence of long genes, which affect DNA replication of the CFSs in fibroblasts, thus promoting chromosomal instability in a tissue-specific manner.Entities:
Keywords: DNA replication timing; chromosomal instability; common fragile sites; fluorescent in situ hybridization; gene length; human fetal fibroblasts; long genes; replicative stress; tissue specificity
Mesh:
Substances:
Year: 2020 PMID: 32204553 PMCID: PMC7140878 DOI: 10.3390/genes11030326
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Long genes are found in every CFS in lymphocytes and fibroblast cell lines. Genomic locations of all CFSs experimentally characterized in fibroblasts were defined and all genes mapping to the CFS were identified and catalogued, including CFS name/identifier, mean frequency of expression in percentage, total length of the fragile region, total length of the band as seen in cytogenetic analyses, names of genes present within the CFS, and their size (in Mb). The reference for the molecular and cytogenetic characterization of the specific CFS was also included.
| CFS (APH) | Mean Frequency (in %) | CFS Length | CFS Localization in Chromosome Band(s) | Gene(s) | Length | Human Average Gene Length | References |
|---|---|---|---|---|---|---|---|
| 0.054 Mb | [ | ||||||
| FRA1B | ≈5%–9% | ≈8 Mb | 1p32.2 | 1. | 1. 1,256 Mb | [ | |
|
|
|
|
|
|
| [ | |
| FRA1E | ≈3%–6% | ≈3.0 Mbs | 1p21.2-p21.3 | 1. | 1. 0,843 Mbs | [ | |
| FRA1K | ≈2%–3% | ≈4.0 Mb | 1q25.3-q31.1 | 1. | 1. 0,457 Mb | [ | |
| FRA1H | ≈4%–7% | ≈11 Mb | 1q41-42.1 | 1. | 1. 0,801 Mb | [ | |
| FRA2q14.1 | 3.2% | ≈2.0 Mb | 2q14.1 | 1,4 Mb | [ | ||
|
|
|
|
|
|
| [ | |
| FRA2F | ≈3%–4% | ≈3.0 Mb | 2q22.1-2q22.2 | 1. | 1. 1,9 Mb | [ | |
|
|
|
|
|
|
| [ | |
| FRA2S | ≈4%–6% | > 4.0 Mb | 2q22.3–q23.3 | 1. | 1. 0,496 Mb | [ | |
| FRA2G | ≈10%–15% | > 2.4 Mb | 2q24.3–q31 | 1. | 1. 0,578 Mb | [ | |
| FRA2H | ≈10%–12% | > 7.0 Mb | 2q32.1-q32.2 | 1. | 1. 0,387 Mb | [ | |
| FRA3B | ≈18%–30% | ≈4.3 Mb | 3p14.2 | 1. | 1. 1,5 Mb | [ | |
|
|
|
|
|
|
| [ | |
|
|
|
|
|
|
| [ | |
|
|
|
|
|
|
| [ | |
|
|
|
|
|
|
| [ | |
| FRA4F | ≈3%–4% | ≈7.0 Mb | 4q22 (4q22.3) | 1. | 1. 0,385 Mb | [ | |
| FRA6H | ≈2%–11% | ≈5.0 Mb | 6p21.1-p21.2 | 1. | 1. 0,098 Mb | [ | |
| FRA6F | ≈1%–2% | 1.2 Mb | 6q21 | 1. | 1. 0,185 Mb | [ | |
| FRA6E | ≈12%–24% | ≈3.6 Mb | 6q26-q27 | 1. | 1. 0,142 Mb | [ | |
| FRA7B | ≈7%–12% | ≈12.2 Mb | 7p21.3-p22.3 | 1. | 1. 0,471 Mb | [ | |
|
|
|
|
|
|
| [ | |
| FRA7E | ≈3%–6% | ≈4.5 Mb | 7q21.11 | 1. | 1. 0,180 Mb | [ | |
| FRA7K | ≈4%–8% | ≈0.800 Mb | 7q31.1 | 0,899 Mb | [ | ||
|
|
|
|
|
|
| [ | |
| FRA7G | ≈2%–3% | ≈5.0 Mb | 7q31.2 | 1. | 1. 0,480 Mb | [ | |
| FRA7H | ≈3%–6% | > 2.0 Mb | 7q32.3 | 1. | 1. 0,163 Mb | [ | |
| FRA7I | ≈2%–4% | > 2,0 Mb | 7q35-q36.1 | 1. | 1. 0,384 Mb | [ | |
| FRA8C | ≈4%–7% | ≈4 Mb | 8q24.13-q21 | 1. | 1. 0,863 Mb (Ensembl) | [ | |
| FRA9G | ≈1%–3% | ≈0.400 Mb | 9p22.2 | 0,375 Mb | [ | ||
| FRA9E | ≈2%–5% | ≈10 Mb | 9q32.1-q33.1 | 1. | 1. 0,532 Mb | [ | |
| FRA10D | ≈4%–6% | ≈1.8 Mb | 10q21.3-q22.1 | 1,784 Mb | [ | ||
| FRA11E | ≈3%–5% | > 3 Mb | 11p13 | 1. | 1. 0,506 Mb | [ | |
| FRA11H | ≈2%–3% | ≈8 Mb | 11q13.2-q13.4 | 0,757 Mb | [ | ||
| FRA11F | ≈4%–5% | ≈7.5 Mb | 11q14.1-q43 | 1. | 1. 2,173 Mb | [ | |
| FRA11G | ≈2%–3% | ≈4.5 Mb | 11q23.3 | 1. | 1. 0,317 Mb | [ | |
|
|
|
|
|
|
| [ | |
| FRA13A | ≈4%–8% | > 1.0 Mb | 13q13.2-q13.3 | 1. 0,730 Mb | [ | ||
| FRA13E | ≈2%–5% | > 3.3 Mb | 13q21-q22 | 1. | 1. 0,234 Mb | [ | |
|
|
|
|
|
|
| [ | |
| FRA15A | ≈2%–3% | ≈1.0 Mb | 15q22.2 | 0,741 Mb | [ | ||
| FRA16D | ≈15%–25% | >1.0 Mb | 16q23.2 | 1. | 1. 1,113 Mb | [ | |
|
|
|
|
|
|
| [ | |
| FRA18C | ≈3%–4% | ≈2.0 Mb | 18q22.1-q22.2 | 1. | 1. 0,499 Mb | [ | |
| FRA22B | ≈3%–5% | ≈1.8 Mb | 22q12.2-q12-3 | 1. | 1. 0.551 Mb | [ | |
| FRAXB | ≈8%–14% | > 1.0 Mb | Xp22.3 | 1. | 1. 0,379 Mb (NCBI) | [ | |
| FRAXp21.2-p21.1 | ≈2%–3% | ≈5 Mb | Xp21.2-p21.1 | 1. | 1. 1,369 Mb | [ | |
|
|
|
|
|
|
| [ | |
|
| Annotation release: 109 release date: 2019-06-14 |
| |||||
Fibroblast CFSs: blue; lymphocyte CFSs: black.
Figure 1Cytogenetic localization and expression of common fragile sites in fibroblast cell lines. (A) IMR-90 metaphase stained with Giemsa. The white arrow shows the break on chromosomes 3 (3q13.3). (B) The same sample stained with CMA3 for more precise localization by banding. (C) The histogram showing the percentage of breaks expressed in both MRC-5 and IMR-90 cell lines as an average of three experiments; the breaks with frequency lower than 3% were not considered, given their uncommon fragility.
Figure 2Genomic annotation of 1p31.1 and 3q13.3 fragile regions. (A) On the left, break on the short arm of chromosome 1 (1p31.1) was identified with Giemsa staining; in the middle, the same chromosome is visualized with 4′,6′-diamidino-2-phenylindole hydrochloride (DAPI). Distal and proximal probes flanking the common fragile sites (CFSs) are localized, in the same chromosome, through subsequent fluorescent in situ hybridization (FISH) staining (panel on the right). (B) On the left, identification of a fragile region on chromosome 3 (3q13.3) using Giemsa staining; in the middle, the same region is visualized with DAPI staining and by FISH of distal and proximal probes flanking the CFS (panel of the right). The arrows show the fragile site location.
Figure 3Genes contained within CFSs were found to be longer than the average human gene. Genomic locations of all CFSs experimentally characterized in fibroblasts were defined and all genes mapping to the CFS regions were identified. The gene length is plotted. The plotted average human gene length (first bar) was derived by dividing the sum of the length of all annotated human genes divided by the total gene number. The red line indicates a previously reported > 300 kb cut off for long genes associated with CFSs [13].
Figure 4Bromodeoxyuridine (BrdU) replication labelling on interphase nuclei from early, mid, and late S-phases. Example of nuclei stained with DAPI, FISH (red), and BrdU as indicated, and merge (right) of early (A), mid (B), and late S-phase stages (C), as plotted in Figure 5 and Figure 6.
Figure 5Replication timing analysis of 1p31.1 fragile region on fibroblasts and lymphocytes. The histograms show the percentage of replicated alleles in MRC-5 and IMR-90 cell lines under normal and stressful condition (addition of aphidicolin (APH)). The replication timing of lymphocytes was used as a control. Percentage of replicated distal (A) and proximal (B) probes are shown. Mean of three individual experiments is plotted, with error bars representing the standard error of the mean (SEM). p-values were calculated on the basis of differences between each condition in fibroblasts with the corresponding treatment in lymphocytes. ns: non-significant; * for p ≤ 0.05, ** for p ≤ 0.01, and *** for p ≤ 0.001.
Figure 6Replication timing analysis of 3q13.3 fragile region on both fibroblast cell lines. The histograms show the replication timing in MRC-5 and IMR-90 cell lines in normal and stressful conditions. The replication timing of lymphocytes was used as control for both fibroblast cell lines. Percentage of replicated distal (A) and proximal (B) probes are shown. The mean of three individual experiments is plotted, with error bars representing the SEM. p-values were calculated on the basis of differences between each condition in fibroblasts with the corresponding treatment in lymphocytes. ns: non-significant; * for p ≤ 0.05, ** for p ≤ 0.01, and *** for p ≤ 0.001.