| Literature DB >> 32066871 |
Xiao Chen1, Alba Sanchis-Juan2,3, Courtney E French4, Andrew J Connell5, Isabelle Delon6, Zoya Kingsbury7, Aditi Chawla1, Aaron L Halpern1, Ryan J Taft1, David R Bentley7, Matthew E R Butchbach5,8,9,10, F Lucy Raymond3,4, Michael A Eberle11.
Abstract
PURPOSE: Spinal muscular atrophy (SMA), caused by loss of the SMN1 gene, is a leading cause of early childhood death. Due to the near identical sequences of SMN1 and SMN2, analysis of this region is challenging. Population-wide SMA screening to quantify the SMN1 copy number (CN) is recommended by the American College of Medical Genetics and Genomics.Entities:
Keywords: bioinformatics; carrier screening; copy-number analysis; genome sequencing (GS); spinal muscular atrophy (SMA)
Mesh:
Substances:
Year: 2020 PMID: 32066871 PMCID: PMC7200598 DOI: 10.1038/s41436-020-0754-0
Source DB: PubMed Journal: Genet Med ISSN: 1098-3600 Impact factor: 8.822
Fig. 1Common copy-number variants (CNVs) affecting the SMN1/2 loci.
a Depth profiles across the SMN1/SMN2 regions. Samples with a total SMN1 + SMN2 copy number of 2, 3, 4, and 5 (derived from average read depth in the two genes) are shown as green, blue, black, and orange dots, respectively. Depth from 50 samples are summed up for each CN category. Each dot represents normalized depth values in a 100-bp window. Read counts are calculated in each 100-bp window, summing up reads from both SMN1 and SMN2, and normalized to the depth of wild-type samples (CN = 4). The SMN exons are represented as purple boxes. The two x-axes show coordinates (hg19) in SMN1 (bottom) and SMN2 (upper). b Depth profiles aggregated from 50 samples carrying a deletion of exons 7 and 8 are shown as red dots. Read depths are calculated in the same way as in (a).
Fig. 2Scatterplot of total SMN (SMN1, SMN2, and SMN2∆7–8) copy number (CN) (x-axis, called by read depth in exons 1–6) and intact SMN copy number (y-axis, called by read depth in exons 7–8).
Fig. 3Distribution of SMN1/SMN2/SMN2∆7–8 copy numbers in the population.
a Percentage of samples showing copy number (CN) call agreement with c.840C>T across 16 SMN1–SMN2 base difference sites in African and non-African populations. Coordinates of these 16 sites are given in Table S1 and site 13* is the c.840C>T splice variant site. The black horizontal line denotes 85% concordance. b Histogram of the distribution of SMN1, SMN2, and SMN2∆7–8 copy numbers across five populations in 1kGP and the National Institute for Health Research (NIHR) BioResource cohort. c SMN1 CN vs. total SMN2 CN (intact SMN2 + SMN2∆7–8). d Two trios with an SMA proband detected by the caller and orthogonally confirmed in the Next Generation Children project cohort. CNs of SMN1, SMN2, and SMN2∆7–8 are phased and labeled for each member of the trios.
Validation against samples with known SMN1/SMN2 copy numbers (CNs).
| CN by orthogonal method | Total | Concordant | Discordant | Agreement | |
|---|---|---|---|---|---|
| 0 | 64 | 64 | 0 | 100.0% | |
| 1 | 45 | 44 | 1 | 97.8% | |
| 2 | 897 | 897 | 0 | 100.0% | |
| 3 | 174 | 174 | 0 | 100.0% | |
| 4 | 43 | 43 | 0 | 100.0% | |
| 6 | 1 | 0 | 1 | 0.0% | |
| Total | 1224 | 1222 | 2 | 99.8% | |
| 0 | 117 | 117 | 0 | 100.0% | |
| 1 | 466 | 465 | 1 | 99.8% | |
| 2 | 541 | 539 | 2 | 99.6% | |
| 3 | 60 | 60 | 0 | 100.0% | |
| 4 | 9 | 8 | 1 | 88.9% | |
| Total | 1193 | 1189 | 4 | 99.7% | |
| 0 | 1089 | 1089 | 0 | 100.0% | |
| 1 | 80 | 80 | 0 | 100.0% | |
| 2 | 4 | 4 | 0 | 100.0% | |
| Total | 1173 | 1173 | 0 | 100.0% |
SMN1 copy number (CN) and c.*3+80T>G frequency by population.
| Ethnicity | Total | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Count | c.*3+80T>G+ | Count | c.*3+80T>G+ | Count | c.*3+80T>G+ | Count | c.*3+80T>G+ | ||
| African | 902 | 4 | 0 (0.0%) | 404 | 134 (33.2%) | 373 | 315 (84.5%) | 121 | 112 (92.6%) |
| European | 9648 | 212 | 0 (0.0%) | 8899 | 4 (0.04%) | 524 | 22 (4.2%) | 13 | 2 (15.38%) |
| South Asian | 1199 | 20 | 0 (0.0%) | 965 | 1 (0.1%) | 195 | 5 (2.56%) | 19 | 1 (5.26%) |
| East Asian | 593 | 8 | 0 (0.0%) | 552 | 1 (0.18%) | 33 | 1 (3.03%) | 0 | 0 (NA) |
| Admixed American | 341 | 7 | 0 (0.0%) | 296 | 7 (2.36%) | 36 | 9 (25.0%) | 2 | 1 (50.0%) |