| Literature DB >> 32010185 |
Liang Hu1,2,3,4, Fan Liang5, Dehua Cheng1,4, Zhiyuan Zhang5, Guoliang Yu5, Jianjun Zha5, Yang Wang5, Qi Xia5, Daoli Yuan5, Yueqiu Tan1,2,3,4, Depeng Wang5, Yu Liang5, Ge Lin1,2,3,4.
Abstract
Genomic structural variants, including translocations, inversions, insertions, deletions, and duplications, are challenging to be reliably detected by traditional genomic technologies. In particular, balanced translocations and inversions can neither be identified by microarrays since they do not alter chromosome copy numbers, nor by short-read sequencing because of the unmappability of short reads against repetitive genomic regions. The precise localization of breakpoints is vital for exploring genetic causes in patients with balanced translocations or inversions. Long-read sequencing techniques may detect these structural variants in a more direct, efficient, and accurate manner. Here, we performed whole-genome, long-read sequencing using the Oxford Nanopore GridION sequencer to detect breakpoints in six balanced chromosome translocation carriers and one inversion carrier. The results showed that all the breakpoints were consistent with the karyotype results with only ~10× coverage. Polymerase chain reaction (PCR) and Sanger sequencing confirmed 8 out of 14 breakpoints; however, other breakpoint loci were slightly missed since they were either in highly repetitive regions or pericentromeric regions. Some of the breakpoints interrupted normal gene structure, and in other cases, micro-deletions/insertions were found just next to the breakpoints. We also detected haplotypes around the breakpoint regions. Our results suggest that long-read, whole-genome sequencing is an ideal strategy for precisely localizing translocation breakpoints and providing haplotype information, which is essential for medical genetics and preimplantation genetic testing.Entities:
Keywords: Oxford Nanopore Technologies; balanced translocation; long-read sequencing; preimplantation genetic testing; structural variants
Year: 2020 PMID: 32010185 PMCID: PMC6972507 DOI: 10.3389/fgene.2019.01313
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
The list of subjects analyzed in the current study and the details on the inferred breakpoints.
| Sample | Karyotype | Depth (X) | No. of mapped sequencing reads | No. of mapped sequencing bases | Coverage rate (%) | No. of spanning breakpoints reads | Breakpoint position (GRCh37) | Disrupted gene (breakpoint) |
|---|---|---|---|---|---|---|---|---|
| DM17A2236 | 46,XY,t(6;8)(q25;q22) | 11.32 | 2,262,314 | 32,111,789,470 | 91.85 | 11 | 6:167281717 8:113696089 | Intergenic region |
| DM17A2237 | 46,XX,t(18;21)(q11;q11) | 10.31 | 2,316,017 | 29,746,593,714 | 93.44 | 11 | 18:28685658 21:29073597 |
|
| DM17A2246 | 46,XX,t(8;22)(q24;q11) | 9.87 | 1,931,784 | 28,742,307,402 | 94.34 | 6 | 8:125495366 22:20326956~20327048 |
|
| DM17A2247 | 46,XY,t(11;22)(q23;q11) | 9.98 | 2,024,838 | 29,361,507,192 | 95.30 | 5 | 11:116683166 22:20326993 | Intergenic region Intergenic region |
| DM17A2248 | 46,XX,inv(11)(q11q21) | 10.94 | 2,498,061 | 32,758,847,457 | 96.96 | 10 | 11:58265643 11:100448937 | Intergenic region Intergenic region |
| DM17A2249 | 46,XY,t(2;18)(p13;q23) | 10.26 | 1,790,385 | 29,628,601,968 | 93.52 | 11 | 2:80320441 18:66637011 |
|
| DM17A2250 | 46,XX,t(3;9)(p13;p13) | 13.54 | 3,150,533 | 39,494,541,253 | 94.43 | 7 | 3:90504855 9:44216447 | Centromere region Intergenic region |
Figure 1Quality-control analysis of the long-read sequencing data obtained using the Oxford Nanopore platform. (A) The median identity of the sequencing data against the reference genome was approximately 85% for all samples. (B) For all samples, the mean lengths were 12.3–16.3 kilobases, and the read N50 values were 15.3–20.5 kilobases. (C) The overall strategy for breakpoint analysis.
Figure 2Balanced translocation by sequencing and karyotyping for subject DM17A2237. (A) Read mapping of the breakpoints for the balanced translocations. DNA fragments were compared to the reference human genome (GRCh37/hg19), and the breakpoints were shown in IGV. Twenty reads adjacent to the breakpoint were obtained. (B) Karyotype of carrier DM17A2237. Karyotype analysis was determined from G-banding analysis, following a standard protocol. The karyotype result revealed an approximate region where the breakpoint occurred. (C) PCR analysis and Sanger sequencing for validating the breakpoints. An ethidium bromide-stained agarose gel was showing the presence of two new bands created by the rearrangement of chromosomal segments at breakpoints (BP1 and BP2). BP, breakpoint; C, control; M, marker. Primer information is available in .
Figure 3Long-read sequencing enabled haplotype detection around the translocation breakpoints in sample DM17A2237. Using the breakpoints as anchoring markers, we obtained 2-megabase sequences on either side of the breakpoints. Through SNP calling and the MarginPhase tool, we phased the haplotypes around the breakpoints in chr18 (A) and chr21 (B). Reads around breakpoints were shown in IGV (bottom panel) and regions in red box were enlarged (top panel). Capital letters represent accurate sequencing information, whereas lowercase letters represent fuzzy base information.