| Literature DB >> 33739559 |
Laura Blasco-Pérez1,2, Ida Paramonov1,2, Jordi Leno1,2, Sara Bernal3,4, Laura Alias3,4, Pablo Fuentes-Prior5, Ivon Cuscó1,2,4, Eduardo F Tizzano1,2.
Abstract
Spinal muscular atrophy (SMA) is caused by bi-allelic loss or pathogenic variants in the SMN1 gene. SMN2, the highly homologous copy of SMN1, is considered the major phenotypic modifier of the disease. Determination of SMN2 copy number is essential to establish robust genotype-phenotype correlations and predict disease evolution, to stratify patients for clinical trials, as well as to define those eligible for treatment. Discordant genotype-phenotype correlations are not uncommon in SMA, some of which are due to intragenic SMN2 variants that may influence the amount of complete SMN transcripts and, therefore, of full-length SMN protein. Detection of these variants is crucial to predict SMA phenotypes in the present scenario of therapeutic advances and with the perspective of SMA neonatal screening and early diagnosis to start treatments. Here, we present a novel, affordable, and versatile method for complete sequencing of the SMN2 gene based on long-range polymerase chain reaction and next-generation sequencing. The method was validated by analyzing samples from 53 SMA patients who lack SMN1, allowing to characterize paralogous, rare variants, and single-nucleotide polymorphisms of SMN2 as well as SMN2-SMN1 hybrid genes. The method identifies partial deletions and can be adapted to determine rare pathogenic variants in patients with at least one SMN1 copy.Entities:
Keywords: SMN2 copies; next-generation sequencing; paralogous variants; phenotype-genotype correlations; spinal muscular atrophy
Mesh:
Substances:
Year: 2021 PMID: 33739559 PMCID: PMC8252042 DOI: 10.1002/humu.24200
Source DB: PubMed Journal: Hum Mutat ISSN: 1059-7794 Impact factor: 4.878
The 22 candidate positions for paralogous sequence variants (PSVs) between SMN1 and SMN2 are shown
|
|
| Gene location | Ref | Ref | Categorization | Conversion nomenclature | Source of information |
|---|---|---|---|---|---|---|---|
| 70231509 | 69356085 | Intron 1 | G | A | SNP | c.82‐3157A>G | BLAT |
| 70240028 | 69364605 | Intron 4 | G | A | SNP | c.628‐457A>G | BLAT |
| 70242435 | 69367010 | Intron 6 | T | C | SNP | c.834+432C>T | BLAT |
| 70242978 | 69367553 | Intron 6 | A | G | SNV | c.834+975G>A | BLAT |
| 70244142 | 69368717 | Intron 6 | A | G | SNP | c.834+2139G>A | Monani et al. /BLAT |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 70247401 | 69371981 | Intron 6 | C | A | SNV | c.835‐367A>C | Monani et al. /BLAT |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Note: These positions were obtained from a previous bibliographic compilation (Monani et al., 1999) and were complemented with a BLAT between the two genes, as deposited in the reference genome Hg19. Repetitive regions (polyA, polyT, and polyGT) were discarded. The candidate positions were genotyped in the patients studied in this study (n = 53) and in samples with at least one SMN1 (n = 3) to check for consistency. From the 22 candidates, six were discarded as PSVs, four of them are listed as SNPs instead, and the remaining two as rare SNV. Therefore, a total of 16 nucleotides (in bold in the Table) differentiate SMN1 and SMN2 genes, 10 of which had been previously described and validated, 5 had been described but not validated, and 1 is considered here as a PSV for the first time.
Abbreviations: BLAT, BLAST‐like alignment tool; SNP, single‐nucleotide polymorphism; SNV, single‐nucleotide variant.
These positions show high variability between samples and were therefore classified as SNPs (including position c.835‐367C/A previously validated by Monani et al., 1999).
In these positions, the same nucleotide has always been found in both SMN1 and SMN2. Thus, G>A and A>C exchanges appears to be very rare SNV found in the reference genome.
These 11 positions were previously classified as bona fide PSVs, after being tested in a control population of 15 individuals (Monani et al., 1999).
Figure 1Structure of the SMN2–SMN1 hybrid detected in patients SMA4 and SMA39. PSVs were genotyped in all patients. In SMA4 and SMA39, six SMN1 PSVs located in intron 6 (Chr5: 69370451‐69370895) were detected, which indicates the presence of hybrid genes. The AB ratio indicates that in SMA04, two of the three copies are SMN2–SMN1 hybrids (SMN1 PSVs in 66%) while in SMA39, only one of the three copies is an SMN2–SMN1 hybrid (SMN1 PSVs in 33%). PSV, paralogous sequence variant
Figure 2Utility of AB ratios to calculate the number of the copies in which variants are present. Patient SMA52 has two SMN2 copies (determined by MLPA) and the variant NM_017411.4:c.859G>C (p.(Gly287Arg)) was detected with a frequency of 56%, in agreement with the AB ratio expected for the variant in one over two alleles. Patient SMA21 has three SMN2 copies (determined by MLPA), and the variant c.835‐44A>G (NC_000005.9: g.69372304A>G) was detected with a frequency of 36%, in agreement with the AB ratio expected for the variant in one over three alleles. MLPA, multiplex ligation‐dependent probe amplification
Figure 3Detection of the pathogenic variant NM_017411.4:c.815A>G in samples SMA55 and SMA55F. Patient SMA55 has one SMN1 copy (with the variant c.815A>G) and one SMN2 copy, while his father (SMA55F) has two SMN1 (one copy with the variant c.815A>G) and two SMN2 copies (determined by MLPA and Sanger). The pathogenic variant c.815A>G was detected in SMA55 and SMA55F through NGS with a frequency of 52% and 25%, respectively. The SMN1‐specific PCR performed confirms that the pathogenic variant c.815A>G is present in SMN1 since we observed the variant in hemizygous status in SMA55 and in heterozygous status in SMA55F. MLPA, multiplex ligation‐dependent probe amplification; NGS, next‐generation sequencing; PCR, polymerase chain reaction
Figure 4Description of the utility of AB ratios to determine the presence of two partial SMN genes (SMN1/2Δ7/8). The patient (SMA55) has one SMN1 (with a pathogenic variant in exon 6*), one SMN2, and two partial SMNs from promoter to exon 6 (determined by MLPA). In the 5′ region (promoter‐ex6), the patient has a total of four SMN copies; consequently, we detected different SNPs with an allelic frequency of 25%, 50%, 75%, and 100%. By contrast, in the 3′ region (in6‐ex8), the patient has two SMN copies, and only variants with an allelic frequency of 50% are detected. Note that PSVs are located in the 3′ region; therefore, it is not possible to determine whether these partial genes are derived from SMN1 or SMN2. MLPA, multiplex ligation‐dependent probe amplification; PSV, paralogous sequence variant; SNP, single‐nucleotide polymorphisms