| Literature DB >> 31912665 |
Guy Helman1,2, Bryan R Lajoie3, Joanna Crawford2, Asako Takanohashi4, Marzena Walkiewicz1, Egor Dolzhenko3, Andrew M Gross3, Vladimir G Gainullin3, Stephen J Bent5, Emma M Jenkinson6, Sacha Ferdinandusse7, Hans R Waterham7, Imen Dorboz8, Enrico Bertini9,10, Noriko Miyake11, Nicole I Wolf12, Truus E M Abbink12, Susan M Kirwin13, Christina M Tan13, Grace M Hobson13, Long Guo14, Shiro Ikegawa14, Amy Pizzino4, Johanna L Schmidt4, Genevieve Bernard15,16,17, Raphael Schiffmann18, Marjo S van der Knaap12,19, Cas Simons1,2, Ryan J Taft3, Adeline Vanderver4,20.
Abstract
Genetic white matter disorders have heterogeneous etiologies and overlapping clinical presentations. We performed a study of the diagnostic efficacy of genome sequencing in 41 unsolved cases with prior exome sequencing, resolving an additional 14 from an historical cohort (n = 191). Reanalysis in the context of novel disease-associated genes and improved variant curation and annotation resolved 64% of cases. The remaining diagnoses were directly attributable to genome sequencing, including cases with small and large copy number variants (CNVs) and variants in deep intronic and technically difficult regions. Genome sequencing, in combination with other methodologies, achieved a diagnostic yield of 85% in this retrospective cohort.Entities:
Mesh:
Year: 2020 PMID: 31912665 PMCID: PMC6952322 DOI: 10.1002/acn3.50957
Source DB: PubMed Journal: Ann Clin Transl Neurol ISSN: 2328-9503 Impact factor: 4.511
Figure 1Myelin Disorders Bioregistry Project Unsolved Cohort and Challenging Variants in Known Leukodystrophy Genes. (A) The initial “unsolved” cohort consisted of 191 families. 101 were diagnosed by standard of care (SoC) methodologies, 71 of the remaining families underwent exome sequencing, while 19 were excluded due to lack of DNA or inability to access the family trio for NGS testing. The genome sequencing cohort consisted of 41 families, 14 of which were solved. (B) Variants in GJC2 (OMIM:608803) were identified in LD_0617 and are causative of Pelizaeus‐Merzbacher‐like disease (OMIM:608804). The 13bp paternally inherited duplication, c.916_928dupGCCTCCGCCCCCG, p.Ala310fs in GJC2, was previously undetected by exome sequencing due to a lack of coverage in this region.1 (C) The duplication lies in the middle of a 110 nucleotide block of a ~ 95% GC‐rich region and was identified in the affected individual and father on genome sequencing, where it is indicated by a green rectangle. (D) Sanger chromatogram confirming the presence of the c.916_928dupGCCTCCGCCCCCG variant in the LD_0617 proband. The duplicated sequence is indicated in red with the second copy in LD_0617 bold and underlined.
Clinically significant single‐nucleotide, indel, and copy number variants.
| Family ID | Gender | Current Age | Age at Initial MRI | Gene | Zygosity | Variant (GRCh37) & Transcript ID | cDNA | Protein | rsID | gnomAD AF | Max AF |
SIFT/ Mutation Taster | Variant Classification | ACMG Criteria |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Novel Gene ( | ||||||||||||||
| LD_0246 | M | 8 Y | 11 M |
|
Het.,
|
17‐73774722‐G‐C ENST00000254810 | c.365C> G | Pro122Arg | N/A | 0.0000 | 0.0000 | N/A/ DC | Pathogenic | PS2, PM2, PP3 |
| LD_0500 | M | 6 Y | 23 M |
|
Hemi.,
|
X‐129274569‐G‐A ENST00000287295 | c.720C> T | Asp240Asp | N/A | 0.0000 | 0.0000 | N/A/ N/A | Pathogenic | PS2, PS3, PM2 |
| LD_0527 | M | 8 Y | UNK |
| Hom. |
10‐134598648‐C‐TA ENST00000368592 | c.606delGinsTA | Lys202fs | N/A | 0.0000 | 0.0000 | N/A/ N/A | Likely pathogenic | PVS1, PM2 |
| LD_0807 | F | 17 Y | 12 Y |
| Het. (m) |
17‐8076848‐A‐C ENST00000363593 | n.58 T> G | N/A | N/A | 0.0000 | 0.0000 | N/A/ N/A | VUS | PM2, PM3, PP4, BP6 |
| Het. (p) |
17‐8076762‐G‐A ENST00000363593 | n. *9C> T | N/A | N/A | 0.001907 | 0.001907 | N/A/ N/A | VUS | PM2, PM3, PP4, BP6 | |||||
| Intronic ( | ||||||||||||||
| LD_0393 | F (3) & M (1) | N/A | N/A |
| Hom. |
5‐149440309‐CT‐ ENST00000286301 | c.1969 + 115_1969+116delAG | Pro658Serfs*24 | N/A | 0.0000 | 0.0000 | N/A/ N/A | Likely pathogenic | PS3, PM2, PP1 |
| MT‐DNA ( | ||||||||||||||
| LD_0725 | F | 21 Y | 1 Y |
| Het. (m) |
MT‐3243‐A‐G ENST00000386347 | n.14A> G | N/A | rs199474657 | N/A | N/A | N/A/ N/A | Pathogenic | PS1, PS3, PP3, PP4 |
| Challenging Variants ( | ||||||||||||||
| LD_0315 | M | 7 Y | 19 M |
| Het. (p) |
1‐173797450‐T‐CC ENST00000361951 | c.228‐21delTinsCC | N/A | rs528772984 | 0.000021 | 0.002 | N/A/ N/A | Likely pathogenic | PS3, PM1 |
| Het. (m) |
1‐173797537‐G‐T ENST00000361951 | c.294G>T | Glu98Asp | N/A | 0.0000 | 0.0000 | T/ DC | Likely pathogenic | PS3, PM2, PP2, PP3 | |||||
| LD_0617 | F | 7 Y | 2 Y |
| Het. (p) |
1‐228346388‐C‐GCCTCCGCCCCCG ENST00000366714 | c.916_928dupGCCTCCGCCCCCG | Ala310fs | N/A | 0.0000 | 0.0000 | N/A/ N/A | Pathogenic | PVS1, PM2, PP4 |
| Het. (m) |
1‐ 228345662‐A‐G ENST00000366714 | c.203A> G | Tyr68Cys | rs1031720654 | 0.0000 | 0.0000 | D/ DC | Likely pathogenic | PM2, PM3, PP3, PP4 | |||||
| Improved Pipeline/ Separate study novel gene ( | ||||||||||||||
| LD_0162 | M & M | Deceased | 3 Y |
| Hom. |
11‐86017416‐G‐C ENST0000027843 | c.160G> C | Val54Leu | rs202003795 | 0.0001299 | 0.0002 | D/ DC | Likely pathogenic | PS1, PM1, PP1, PP4, PP5 |
| LD_0579 | M | 5 Y | 5 M |
| Hemi. (m) |
X‐153135278‐A‐T ENST00000264161 | c.1103T> A | Ile368Asn | N/A | 0.0000 | 0.0000 | D/ DC | Likely pathogenic | PM1, PM2, PP2, PP3 |
| LD_0808 | M | 6 Y | 18 M |
| Het. (p) |
4‐123855300‐G‐A ENST00000274008 | c.554G> A | Gly185Glu | rs753587518 | 0.0000253 | 0.0002 | D/ DC | Likely pathogenic | PM2,PM3, PP2, PP3, PP4 |
| Het. (m) |
4‐123949348‐G‐C ENST00000274008 | c.1877G> C | Trp626Ser | rs375343753 | 0.0000122 | 0.0000 | D/ DC | Likely pathogenic | PM2, PM5, PP3, PP4, PP5 | |||||
| Clinically Resolved | ||||||||||||||
| LD_0587 | M | 9 Y | 9 M |
| Hemi. (m) |
X‐53226186‐C‐A ENST00000375401 | c.2663G> T | Arg888Leu | rs375850872 | 0.0000144 | 0.0001 | D/ P | VUS | PM2, PP3 |
Key: M – Month; Y – Years; UNK – Unknown; MT– mitochondrial; Het. – heterozygous; Hemi. – Hemizygous; Hom. ‐ Homozygous; N/A – Not available; AF – allele frequency.
Total allele frequency in Genome Aggregation Database (gnomAD) dataset.
Maximum reported allele frequency for any single subpopulation represented in ExAC, 1000 Genomes, or Exome Variant Server datasets.
DC – Disease‐Causing; DA – Disease‐Causing Automatic; D – Damaging; T – Tolerated
ACMG criteria upon which classification is based
Variant shown in Guo et al. 2019 to result in inclusion of a novel 99nt pseudo‐exon, with an in‐frame stop codon p.(P658Sfs*24) resulting in nonsense‐mediated decay.
Variant shown to result in DARS2 Exon 3 skipping. See Appendix S1.