| Literature DB >> 31506324 |
Yi Dai1, Pidong Li2, Zhiqiang Wang3, Fan Liang2, Fan Yang2, Li Fang4,5, Yu Huang6, Shangzhi Huang7, Jiapeng Zhou2, Depeng Wang2, Liying Cui1,8, Kai Wang9,5.
Abstract
PURPOSE: Facioscapulohumeral muscular dystrophy (FSHD) is a common adult muscular dystrophy. Over 95% of FSHD cases are associated with contraction of the D4Z4 tandem repeat (~3.3 kb per unit) at 4q35 with a specific genomic configuration (haplotype) called 4qA. Molecular diagnosis of FSHD typically requires pulsed-field gel electrophoresis with Southern blotting. We aim to develop novel genomic and computational methods for characterising D4Z4 repeat numbers in FSHD.Entities:
Keywords: D4Z4; FSHD; facioscapulohumeral muscular dystrophy; macrosatellite; single-molecule optical mapping
Year: 2019 PMID: 31506324 PMCID: PMC7029236 DOI: 10.1136/jmedgenet-2019-106078
Source DB: PubMed Journal: J Med Genet ISSN: 0022-2593 Impact factor: 6.318
A list of patients and control subjects assayed by the Bionano Saphyr platform in the current study. More detailed description of results can be found in online supplementary table 1
| ID | Sex | Age (years) | Onset (years) | Family history | CK (U/L) | EMG | Southern blot (4q35) | Optical mapping (Nb.BssSI enzyme, 4q35) | Optical mapping (Nt.BspQI enzyme, 4q35) | |||||||
| Length (kb) | Units | Allele | Units | Allele | Read count | Units | Allele | Read count | ||||||||
| Patient cohort 1 | P01 | F | 37 | 28 | + | 104 | Mild myopathic change | ~20 | ~4 | 4qA | 4 | 4qA | 6 | 4.3±0.3 | 4qA | 38 |
| ~38 | >10 | 4qB | 22 | 4qB | 5 | 22.4±0.2 | 4qB | 23 | ||||||||
| P02 | F | 18 | 12 | + | 379 | Myopathic change | ~18 | ~3 | 4qA | 3 | 4qA | 25 | – | – | – | |
| ~98 | ~28 | 4qA | 28 | 4qA | 10 | |||||||||||
| P03 | F | 43 | 30 | + | 110 | Mild myopathic change | ~18 | ~3 | 4qA | 3 | 4qA | 27 | – | – | – | |
| ~38 | >10 | 4qA | 11 | 4qA | 34 | |||||||||||
| P04 | M | 27 | 11 | – | 1406 | Myopathic change | ~16 | ~3 | 4qA | 3 | 4qA | 18 | – | – | – | |
| ~61 | ~17 | 4qB | 19 | 4qB | 42 | |||||||||||
| ~76 | ~21 | 4qA | 23 | 4qA | 30 | |||||||||||
| P05 | F | 41 | 31 | + | 95 | Normal | ~12 | ~2 | 4qA | 2 | 4qA | 12 | – | – | – | |
| ~38 | ~10 | 4qA | 11 | 4qA | 18 | |||||||||||
| >38 | >10 | 4qA | 17 | 4qA | 60 | |||||||||||
| Patient cohort 2 | P06 | F | 14 | 6 | – | 406 | Myopathic change | – | – | – | 2 | 4qA | 3 | 2.2±0.3 | 4qA | 32 |
| 15 | 4qB | 4 | 15.0±0.2 | 4qB | 36 | |||||||||||
| 27 | 4qA | 3 | 27.2±0.4 | 4qA | 5 | |||||||||||
| P07 | M | 23 | 18 | – | 871 | Myopathic change | – | – | – | 4 | 4qA | 6 | – | – | – | |
| – | – | – | 20 | 4qA | 11 | |||||||||||
| P08 | M | 18 | 13 | – | 538 | Myopathic change | ~21.5 | ~5 | 4qA | 5 | 4qA | 6 | – | – | – | |
| >63.5 | >17 | 4qA | 25 | 4qA | 9 | |||||||||||
| P09 | F | 33 | 19 | – | 269 | Mild myopathic change | ~21.5 | ~5 | 4qA | 5 | 4qA | 33 | – | – | – | |
| >63.5 | >17 | 4qA | 18 | 4qA | 26 | |||||||||||
| P10 | F | 39 | 28 | – | 176 | Myopathic change | ~15 | ~3 | 4qA | 3 | 4qA | 17 | – | – | – | |
| >63.5 | >17 | 4qA | 28 | 4qA | 8 | |||||||||||
| P11 | M | 20 | 14 | + | 379 | Myopathic change | ~15 | ~3 | 4qA | 3 | 4qA | 16 | – | – | – | |
| >63.5 | >17 | 4qA | 32 | 4qA | 12 | |||||||||||
| P12 | F | 15 | 10 | – | 514 | No data | ~12 | ~2 | 4qA | 2 | 4qA | 10 | – | – | – | |
| ~48.5 | 15 | 4qA | 16 | 4qA | 18 | |||||||||||
| P13 | F | 53 | 32 | + | 341 | Myopathic change | ~18.5 | ~4 | 4qA | 4 | 4qA | 18 | – | – | – | |
| >63.5 | >17 | 4qB | 18 | 4qB | 18 | |||||||||||
| Control | C01 | F | – | – | – | – | – | – | – | – | 19 | 4qB | 8 | 18.8±0.3 | 4qB | 22 |
| 47 | 4qA | 10 | 46.5±0.5 | 4qA | 13 | |||||||||||
| C02 | M | – | – | – | – | – | – | – | – | 18 | 4qA | 8 | 18.6±0.3 | 4qA | 7 | |
| 20 | 4qA | 9 | 20.5±0.1 | 4qA | 8 | |||||||||||
| C03 | M | – | – | – | – | – | – | – | – | 13 | 4qB | 4 | 12.7±0.5 | 4qB | 20 | |
| 22 | 4qB | 4 | 22.2±0.4 | 4qB | 22 | |||||||||||
More detailed description of results can be found in online supplementary table 1.
CK, creatine kinase; EMG, electromyography; F, female; M, male.
Figure 2Illustration of the region with label dissimilarity between 10q26, 4q35, KQ983257.1 and KQ983258.1 (blue box) adjacent to the region of similarity (green box), based on in silico analysis on GRCh37, GRCh38, KQ983257.1 and KQ983258.1. Red vertical bars represent labels of enzyme recognition sites. By using fragments that spans the region of dissimilarity, we can confidently separate fragments originating from 10q26, 4q35 or those that are undetermined (uninformative). The panel A and B represent labels generated by the Nb.BssSI and Nt.BspQI enzymes, respectively. Although the reference genome GRCh38 contains two labels (repeat unit #2 and #5) within the D4Z4 repeat region for the Nt.BspQI enzyme (red box), we rarely observe them in real data, possibly due to the inclusion of a very rare allele in the GRCh38 or due to errors in genome assembly.
Figure 1An overview of the genomic architecture of segmental duplications at the chromosome 4q35 region and the 10q26 region. In the GRCh38 reference genome, 4q35 incorrectly shows two D4Z4 arrays (8 units 4qB type and 1.5 unit 4qA type) with a 50 kb gap between AC225782.3 (4qB) and AC215524.3 (4qA) (panel A). 10q26 incorrectly shows two D4Z4 arrays (each with 7 D4Z4 repeat units) with a 50 kb gap between AL845259.22 (10qA) and AL7323751.8 (10qA) (panel B). Two new patch scaffold sequences were recently added in CRCh38 patch 7 with 4qA configuration (KQ983257.1) without gap and with 4qA-L configuration (KQ983258.1) without gap, and we additionally illustrated them (panel C and D). The segmental duplication (green boxes) in 4q35 has high sequence identity with the corresponding region in 10q26, KQ983257.1 and KQ983258.1, while the distal D4Z4 array separated by the gap in 10q26 is marked with an orange box. The incorrect assembly gap is not present in 4q35 (panel E) or 10q26 (panel F) in the GRCh37 genome assembly.
Figure 3Molecular diagnosis of facioscapulohumeral muscular dystrophy by Southern blot on samples from cohort 1. The results for cohort 2 is available in online supplementary figure 1. E/H and p13E-11: double digested with EcoRI/HindIII and then labelled with probe p13E-11, and all the 4q and 10q segments are illustrated. E/B and p13E-11: double digested with EcoRI/BlnI and then labelled with probe p13E-11, and the 10q segments are digested so only 4q segments are illustrated. H and 4qA: digested with HindIII and then labelled with probe 4qA, and the 4qA alleles are illustrated. H and 4qB: digested with HindIII and then labelled with probe 4qB, and the 4qB alleles are illustrated. The asterisk ‘*’ denotes pathogenic allele with <10 repeat units and with a 4qA configuration. The plus sign ‘+’ denotes somatic mosaic allele.
Figure 4Determination of somatic mosaicism by Nb.BssSI (panel A/B/C) and Nt.BspQI (panel D/E/F) enzymes on a patient with 2, 15 and 27 repeats (ID: P06). For both enzymes, our computational pipeline accurately identified the presence of somatic contraction, and determined that the contraction occurs on the parental allele carrying the 4qA configuration. For repeat quantification using the Nt.BspQI enzyme, mean±SD is annotated in the figure. Vertical bars represent labels of enzyme recognition sites.
Analysis of Bionano Irys genome mapping data sets of three families
| ID | Ethnicity | Relationship | Irys optical mapping (Nb.BssSI enzyme) | Irys optical mapping (Nt.BspQI enzyme) | |||||
| Units | Allele | Read count | Units | Allele | Read count | ||||
| Family 1 | HG00512 | Southern Han Chinese | Father | 19 | 4qA | 3 | 19.8±0.5 | 4qB | 7 |
| – | – | 40.4±0.3 | 4qA | 3 | |||||
| HG00513 | Southern Han Chinese | Mother | 11 | 4qA | 6 | 10.4±0.2 | 4qA | 8 | |
| – | – | 33.1±0.3 | 4qA | 11 | |||||
| HG00514 | Southern Han Chinese | Daughter | 11 | 4qA | 6 | 10.6±0.2 | 4qA | 9 | |
| 19 | 4qA | 5 | 19.6±0.3 | 4qA | 10 | ||||
| Family 2 | HG00731 | Puerto Rican | Father | 20 | 4qB | 5 | 19.9±0.5 | 4qB | 7 |
| 40 | 4qA | 3 | 40.5±0.3 | 4qA | 3 | ||||
| HG00732 | Puerto Rican | Mother | 17 | 4qB | 6 | 17.2±0.1 | 4qB | 33 | |
| 32 | 4qB | 4 | 32.1±0.3 | 4qB | 18 | ||||
| HG00733 | Puerto Rican | Daughter | 17 | 4qB | 7 | 17.3±0.3 | 4qB | 12 | |
| 40 | 4qA | 2 | 40.1±0.4 | 4qA | 7 | ||||
| Family 3 | GM19238 | Yorùbá | Mother | 25 | 4qA | 2 | 25.6±0.3 | 4qA | 10 |
| – | – | – | 49.1±0.7 | 4qA | 13 | ||||
| GM19239 | Yorùbá | Father | 9 | 4qB | 5 | 9.2±0.1 | 4qB | 14 | |
| 36 | 4qA | 2 | 36.6±0.2 | 4qA | 13 | ||||
| GM19240 | Yorùbá | Daughter | 25 | 4qA | 3 | 25.9±0.3 | 4qA | 5 | |
| 36 | 4qA | 4 | 36.2±0.4 | 4qA | 7 | ||||