| Literature DB >> 29095815 |
Zirui Dong1,2,3, Huilin Wang1,3,4, Haixiao Chen2,5, Hui Jiang2,5, Jianying Yuan2,5, Zhenjun Yang2,5, Wen-Jing Wang2,5, Fengping Xu2,5,6, Xiaosen Guo2,5, Ye Cao1,3, Zhenzhen Zhu2,5, Chunyu Geng2,5, Wan Chee Cheung1, Yvonne K Kwok1,3, Huanming Yang2,5, Tak Yeung Leung1,3,7, Cynthia C Morton8,9,10,11,12, Sau Wai Cheung13,14, Kwong Wai Choy15,16,17.
Abstract
PURPOSE: Recent studies demonstrate that whole-genome sequencing enables detection of cryptic rearrangements in apparently balanced chromosomal rearrangements (also known as balanced chromosomal abnormalities, BCAs) previously identified by conventional cytogenetic methods. We aimed to assess our analytical tool for detecting BCAs in the 1000 Genomes Project without knowing which bands were affected.Entities:
Mesh:
Year: 2017 PMID: 29095815 PMCID: PMC5932280 DOI: 10.1038/gim.2017.170
Source DB: PubMed Journal: Genet Med ISSN: 1098-3600 Impact factor: 8.822
Figure 1Spectrum of inter- and intra-chromosomal balanced rearrangements and cytogenetic validations
Distributions of the average insert sizes and read-pair amounts of 2,504 samples from the 1000 Genomes Project are shown in (A) and (B), respectively. Insert size and read-pair amounts were calculated based on non-chimeric and uniquely mapped read-pairs. (C) Spectrum of BCAs. Balanced translocations are indicated with red lines and the corresponding sample IDs are shown in red font in each affected chromosome in the outmost circle. Inversions are indicated in blue lines and sample IDs are shown in blue font. Chromosomal nucleotide positions and bands are shown according to the UCSC Genome Viewer Table Browser. In figures (D), (E) and (F), validation of balanced translocations and inversion (G) by G-banded chromosome analysis are shown. Ideograms of the balanced rearrangements are shown on the left, while the karyogram images are to the right with the corresponding ideogram of the derivative chromosomes for reference. Breakpoint regions are indicated with red arrows. Sample name and the International System for Human Cytogenomic Nomenclature (ISCN) description are shown below each.
Balanced translocations detected in the 1000 Genomes Project
| Sample ID | Karyotype | Continental | Population | der | Breakpoint A | Breakpoint B | Micro- | chr | Deletion involving breakpoints | Gene | Genes in TAD | ||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| chr | Coordinate | chr | Coordinate | Size | Start | End | |||||||||
| HG02260 | 46,XX, t(9;14)(q34.2;q31.1) | AMR | PEL | 9 | 9 | 137230907 | 14 | 79839174 | – | 9 | 14 | 137230908 | 137230922 | ||
| 14 | 14 | 79839173 | 9 | 137230923 | – | 14 | 0 | NA | NA | ||||||
| HG03729 | 46,XY, t(3;17)(q24;p13.3) | SAN | ITU | 3 | 3 | 143817430 | 17 | 2910366 | – | 3 | 5,219 | 143817431 | 143822650 | ||
| 17 | 3 | 143822651 | 17 | 2914751 | TT | 17 | 4,383 | 2910367 | 2914750 | ||||||
| NA18612 | 46,XY, t(16;17)(q23.1;q24.2) | ASN | CHB | 16 | 16 | 75336134 | 17 | 64953079 | C | 16 | 3 | 75336135 | 75336137 | ||
| 17 | 17 | 64953078 | 16 | 75336138 | A | 17 | 0 | NA | NA | ||||||
| NA20764 | 46,XX, t(2;19)(p11.2;p13.3) | EUR | TSI | 2 | 2 | 86491099 | 19 | 424310 | – | 2 | 1 | 86491100 | 86491100 | ||
| 19 | 19 | 424308 | 2 | 86491101 | TG | 19 | 1 | 424309 | 424309 | ||||||
Continental group: AMR, SAN, ASN and EUR refer to American, East Asian, South Asian and European, respectively
Population: PEL, ITU, CHB and TSI refer to Peruvian in Lima (Peru), Indian Telugu in the UK, Han Chinese in Beijing (China) and Toscani in Italy, respectively.
der: derivative chromosome
TAD: topological associated domain; domain information from the human IMR90 fibroblast cell line (hg19)
Inversions detected in the 1000 Genomes Project
| Sample ID | Karyotype | Continental group | Population | Coordinate | Microhomology | Deletion (bp) | Gene disruption | |||
|---|---|---|---|---|---|---|---|---|---|---|
| Break A | Break B | Break A | Break B | Break A | Break B | |||||
| NA20759 | 46,XY,inv(2) (p14p12) | EUR | TSI | 67974196 | 81338340 | G | 1 | 0 | – | – |
| 67974194 | 81338341 | – | ||||||||
| HG04152 | 46,XY,inv(1) (q32.1q32.1) | SAN | BEB | 199531693 | 203428169 | – | 65 | 0 | – | – |
| 199531758 | 203428141 | – | ||||||||
| NA18959 | 46,XY,inv(3) (q26.1q26.1) | ASN | JPT | 166243083 | 166459774 | AT | 4 | 0 | – | – |
| 166243087 | 166459775 | G | ||||||||
| NA21133 | 46,XY,inv(X) (p22.2p22.2) | SAN | GIH | 15629309 | 15687230 | T | 0 | 0 | – | – |
| 15629310 | 15687231 | – | ||||||||
Continental group: EUR, SAN and ASN refer to European, South Asian and East Asian, respectively
Population: TSI, BEB, JPT and GIH refer to Toscani in Italy, Bengali in Bangladesh, Japanese in Tokyo (Japan) and Gujarati Indian in Houston (TX, USA) respectively.
Breakpoint = breakpoint
Figure 2A subtle translocation t(16;17)(q23.1;q24.2) (NA18612) and the aberrant splicing of intron 6 of CFDP1 (NM_006324)
(A) Validation from chromosome analysis. Ideograms of the derivative balanced translocation chromosomes are shown with the corresponding G-banded chromosome pairs. Breakpoint regions are indicated with red arrows. (B) PCR validation of the junction of DNA sequences from the two derivative chromosomes indicated by red arrows while absent in the negative control. (C) Validation from FISH. BAC probes are shown with the targeted bands (16q24.3 in SpectrumOrange, 17p13.3 in SpectrumRed and 17q25.1 in SpectrumGreen, respectively). Derivative chromosomes and normal chromosomes are designated with arrows. (D) In the der(17), the genomic location of anti-sense gene CFDP1 (NM_006324) is shown with the breakpoint mapping in intron 6 (red dotted line). RNA-seq read-pairs align to the region (expressed as coverage) proximal to the breakpoint in seq[GRCh37/hg19] 16q23.1(75,336,134_75,336,138). It includes the intergenic region in 17q24.2 and the partial intron 6 of CFDP1 (NM_006324) in 16q23.1 (two grey dotted lines), indicating the aberrant splicing of intron 6. (E) Transcript coverage was plotted with the paired-end aligned reads (RNA-seq). The coverage of each coordinate is divided by the average coverage in this transcript, and subsequently normalized coverage with the average coverage from the other three EBV-B cell lines from the 1000 Genomes Project. Black arrow indicates low expression in exon 7 that is the absence of exon 7 in the disrupted transcript.
Figure 3Gene disruption, cryptic deletions and potential disruption of interaction between promoter and enhancer by the breakpoints of balanced translocations
Figures (A), (B) and (C) NRXN3 disruption in 46,XX,t(9;14)(q34.2;q31.1) (HG02260). (A) and (B) Genomic locations of NRXN3 and RXRA are shown with breakpoints indicated by red dotted lines. (C)NRXN3 and RXRA expression for the four cases from the 1000 Genomes Project and for 13 reported EBV-B normal control cell lines (the GTEx project). Gene expression for NRXN3 and RXRA in HG02260 are indicated with red arrows. Figures (D) and (E) cryptic deletions involved at the breakpoints in translocation 46,XY,t(3;17)(q24;p13.3) (HG03729). Two cryptic deletions of seq[GRCh37/hg19] 3q24(143,817,430_143,822,651)×1 and seq[GRCh37/hg19] 17p13.3(2,910,366_2,914,751)×1 were detected by read-depth difference algorithm and were further confirmed by quantitative PCR. The deleted regions are shown in a yellow background with a red arrow while the normal copy-ratio (diploid) is shown in a blue background with a blue arrow. Two independent pairs of primers (Supplementary Table 2) were used to perform qPCR in quintuplicate for validation of each deletion. The bars in cyan show the relative quantification of HG03729, while the bars in blue indicate the negative control. Figures (F) and (G) potential disruption of interaction between promoter and enhancer from rearrangement in 46,XY,t(16;17)(q23.1;q24.2) (NA18612) in H1-hESC. (F) Genes and the ChIP-seq data from the ENCODE Project are shown in terms of the genomic location. Each cell line with the ChIP-seq data (i.e., H3K4Me3 and H3K27Ac)[33] is labeled with a red arrow. Breakpoint in seq[GRCh37/hg19] 17q24.2(64,953,078_64,953,079) is shown by a green vertical line, while the candidate promoters and enhancers are indicated with orange and blue arrows, respectively. The region of potential enhancer in H1-hESC is highlighted in DNase I Hypersensitivity Clusters[34] in a blue rectangle (DHS region). The figure below is zoomed in on the potential enhancer region in H1-hESC. Enrichment of H3K4Me1 and absence of H3K4Me3 support a potential active enhancer in this region[33,39], while enrichment of DNA-binding sequence motifs also indicates the candidate region of the interaction for regulatory elements[33]. (G) Gene expression level (Read Per Kilobase Million) of the four cases and 13 EBV-B normal control samples (GTEx project)[24].