| Literature DB >> 34440387 |
Wenjing Lai1, Xin Feng1, Ming Yue1, Prudence W H Cheung2, Vanessa N T Choi1, You-Qiang Song1, Keith D K Luk2, Jason Pui Yin Cheung2, Bo Gao1.
Abstract
Congenital scoliosis (CS) is a lateral curvature of the spine resulting from congenital vertebral malformations (CVMs) and affects 0.5-1/1000 live births. The copy number variant (CNV) at chromosome 16p11.2 has been implicated in CVMs and recent studies identified a compound heterozygosity of 16p11.2 microdeletion and TBX6 variant/haplotype causing CS in multiple cohorts, which explains about 5-10% of the affected cases. Here, we studied the genetic etiology of CS by analyzing CNVs in a cohort of 67 patients with congenital hemivertebrae and 125 family controls. We employed both candidate gene and family-based approaches to filter CNVs called from whole exome sequencing data. This identified 12 CNVs in four scoliosis-associated genes (TBX6, NOTCH2, DSCAM, and SNTG1) as well as eight recessive and 64 novel rare CNVs in 15 additional genes. Some candidates, such as DHX40, NBPF20, RASA2, and MYSM1, have been found to be associated with syndromes with scoliosis or implicated in bone/spine development. In particular, the MYSM1 mutant mouse showed spinal deformities. Our findings suggest that, in addition to the 16p11.2 microdeletion, other CNVs are potentially important in predisposing to CS.Entities:
Keywords: CNV; congenital scoliosis; congenital vertebral malformation; copy number variant
Mesh:
Year: 2021 PMID: 34440387 PMCID: PMC8391542 DOI: 10.3390/genes12081213
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1The workflow of CNV analysis. This strategy detected CNVs in several candidate genes and identified recessive and novel rare CNVs enriched in patients with CS.
CNVs found with candidate gene approach (N.D., not determined; N.A., not applied).
| Gene | Patient | Type | Chr | Start | End | Size (bp) | Bayes Factor | Reads Ratio (Observed/ Expected) | Exons Annotation (hg19) (Gene_exon) | Inheritance Pattern | Highest Frequency in DGV (Sample Size >100) | gnomAD_ |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
| CS033 | deletion | 1 | 120,611,949 | 120,612,020 | 71 | 4.62 | 0.657 | NOTCH2_1 | N.D. | 0.0037 | 0.00037 |
| CS043 | deletion | 1 | 120,539,621 | 120,612,020 | 72,399 | 8.46 | 0.722 | NOTCH2_1-4 | De novo | 0.0037 | 0.00037 | |
|
| CS018 | deletion | 21 | 41,452,080 | 41,452,267 | 187 | 4.65 | 0.429 | DSCAM_25 | De novo | 0.00049 | N.A. |
| CS036 | deletion | 21 | 41,452,080 | 41,452,267 | 187 | 6.02 | 0.415 | DSCAM_25 | De novo | 0.00049 | N.A. | |
| CS050 | deletion | 21 | 41,452,080 | 41,452,267 | 187 | 5.8 | 0.468 | DSCAM_25 | Paternal | 0.00049 | N.A. | |
| CS053 | deletion | 21 | 41,452,080 | 41,452,267 | 187 | 5.45 | 0.494 | DSCAM_25 | N.D. | 0.00049 | N.A. | |
| CS064 | deletion | 21 | 41,452,080 | 41,452,267 | 187 | 6.1 | 0.463 | DSCAM_25 | N.D. | 0.00049 | N.A. | |
|
| CS048 | deletion | 8 | 51,503,440 | 51,571,223 | 67,783 | 6.37 | 0.43 | SNTG1_13-15 | De novo | 0.0002 | 0.000046 |
|
| CS059 | deletion | 16 | 29,674,601 | 30,199,897 | 525,296 | 644 | 0.555 | SPN_2,AC009133.19_2-3,QPRT_1-4,C16orf54_2,ZG16_2-4,KIF22_1-13,MAZ_1-5,PRRT2_2-3,PAGR1_1-3,CTD-2574D22.6_1-2,MVP_2-15,CDIPT_6-2,SEZ6L2_16-1,ASPHD1_1-3,KCTD13_6-1,TMEM219_1-4,TAOK2_2-16,HIRIP3_7-1,INO80E_1-7,DOC2A_11-2,C16orf92_2-3,FAM57B_5-1,ALDOA_8-16,PPP4C_2-9, | N.D. | 0.0005 | 0.0001462 |
| CS071 | deletion | 16 | 29,495,011 | 30,218,221 | 723,210 | 754 | 0.572 | NPIPL3_3-1,SPN_2,AC009133.19_2-3,QPRT_1-4,C16orf54_2,ZG16_3-4,KIF22_2-12,MAZ_1-5,PRRT2_2-3,PAGR1_1-3,CTD-2574D22.6_1-2,MVP_2-15,CDIPT_6-2,SEZ6L2_16-1,ASPHD1_1-3,KCTD13_6-1,TMEM219_1-4,TAOK2_2-16,HIRIP3_7-2,INO80E_1-7,DOC2A_11-2,C16orf92_2-3,FAM57B_5-1,ALDOA_8-16,PPP4C_2-9, | De novo | 0.0005 | 0.0001462 | |
| CS078 | deletion | 16 | 29,498,516 | 30,199,897 | 701,381 | 690 | 0.578 | NPIPL3_1,SPN_2,AC009133.19_2-3,QPRT_1-4,C16orf54_2,ZG16_3-4,KIF22_2-12,MAZ_1-5,PRRT2_2-3,PAGR1_1-3,CTD-2574D22.6_1-2, MVP_2-15,CDIPT_6-2,SEZ6L2_16-1,ASPHD1_1-3,KCTD13_6-1,TMEM219_1-4,TAOK2_2-16,HIRIP3_7-2,INO80E_1-7,DOC2A_11-2,C16orf92_2-3,FAM57B_5-1,ALDOA_8-16,PPP4C_2-9, | N.D. | 0.0005 | 0.0001462 | |
| CS081 | deletion | 16 | 29,498,516 | 30,199,897 | 701,381 | 645 | 0.558 | NPIPL3_1,SPN_2,AC009133.19_2-3,QPRT_1-4,C16orf54_2,ZG16_3-4,KIF22_2-12,MAZ_1-5,PRRT2_2-3,PAGR1_1-3,CTD-2574D22.6_1-2, MVP_2-15,CDIPT_6-2,SEZ6L2_16-1,ASPHD1_1-3,KCTD13_6-1,TMEM219_1-4,TAOK2_2-16,HIRIP3_7-2,INO80E_1-7,DOC2A_11-2,C16orf92_2-3,FAM57B_5-1,ALDOA_8-16,PPP4C_2-9, | N.D. | 0.0005 | 0.0001462 |
Recessive CNVs unique in patients with CS (N.D., not determined; N.A., not applied).
| Gene | Patient | Type | Chr | Start | End | Size (bp) | Bayes Factor | Reads Ratio | Exons Annotation (hg19) (Gene_exon) | Inheritance Pattern | Highest Frequency in DGV (Sample Size > 100) | gnomAD_ |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
| CS047 | deletion | 1 | 148,261,458 | 148,262,366 | 908 | 5.27 | 0.04 | NBPF20_98-99 | De novo | 0 | 0 |
|
| CS048 | deletion | 9 | 35,061 | 35,519 | 458 | 6.51 | 0 | FAM138C_1-2 | * De novo | 0.0074 | N.A. |
|
| CS004 | deletion | 17 | 57,656,834 | 57,657,240 | 406 | 5.38 | 0 | DHX40_9-10 | N.D. | 0.0000922 | 0.0025 |
| CS035 | 4.91 | N.D. | ||||||||||
| CS043 | 6 | * De novo | ||||||||||
| CS050 | 7.23 | * De novo | ||||||||||
| CS053 | 7.02 | N.D. | ||||||||||
| CS057 | 6.29 | De novo |
* The CNV is also not present in the healthy siblings.
Novel CNVs enriched in patients with CS (N.A., not applied).
| Gene | Type | Chr | Size (bp) | Count in 67 Patients | Count in 125 Controls | Highest Frequency in DGV (Sample Size > 100) | gnomad_ | gnomad_East Asia_Structural_Variants Frequency (Heterozygous Loss) |
|---|---|---|---|---|---|---|---|---|
|
| deletion | 1 | 30,080–383,938 | 4 | 0 | 0.009556907 | 0.0000461 | 0.000 |
|
| deletion | 2 | 544–11,914 | 4 | 0 | 0.002257336 | 0.0000461 | 0.000 |
|
| deletion | 3 | 279–55,232 | 4 | 0 | 0.000798722 | 0.0000462 | 0.000 |
|
| deletion | 16 | 6724–71,803 | 4 | 0 | N.A.b | 0 | 0.000 |
|
| deletion | 16 | 525,296–723,210 | 4 | 0 | 0.0005 | 0.0001462 | 0.000 |
|
| deletion | 17 | 107–2354 | 10 a | 1 | 0.0000922 | 0.0025 | 0.008152 |
|
| deletion | 2 | 11,694–24,032 | 5 | 1 | 0.001303781 | 0.0000479 | 0.000 |
|
| deletion | 1 | 190–10,053 | 4 | 1 | 0.0000922 | 0.0000461 | 0.0004139 |
|
| deletion | 3 | 2892–100,149 | 4 | 1 | 0.00086881 | 0 | 0.000 |
|
| deletion | 8 | 203–12,325 | 4 | 1 | 0.001145475 | 0 | 0.000 |
|
| deletion | 15 | 52,610–323,156 | 4 | 1 | 0.00518807 | 0.0000922 | 0.000 |
|
| deletion | 16 | 7697–99,254 | 4 | 1 | 0.001198083 | 0.0000481 | 0.000 |
|
| deletion | 20 | 730–33,608 | 4 | 1 | 0.00064226 | 0 | 0.000 |
|
| duplication | 17 | 560–13,580 | 4 | 1 | 0.0009219 | 0 | 0.000 |
a DHX40 has 6 homozygous (listed in Table 2) and 4 heterozygous CNVs. b No CNV with sample size more than 100 is found within the NAE1 locus. Note: Detailed information of these novel CNVs is shown in Table S1.