Literature DB >> 25504735

Exome sequencing identifies a rare HSPG2 variant associated with familial idiopathic scoliosis.

Erin E Baschal1, Cambria I Wethey1, Kandice Swindle1, Robin M Baschal2, Katherine Gowan3, Nelson L S Tang4, David M Alvarado5, Gabe E Haller5, Matthew B Dobbs6, Matthew R G Taylor7, Christina A Gurnett8, Kenneth L Jones3, Nancy H Miller9.   

Abstract

Idiopathic scoliosis occurs in 3% of individuals and has an unknown etiology. The objective of this study was to identify rare variants that contribute to the etiology of idiopathic scoliosis by using exome sequencing in a multigenerational family with idiopathic scoliosis. Exome sequencing was completed for three members of this multigenerational family with idiopathic scoliosis, resulting in the identification of a variant in the HSPG2 gene as a potential contributor to the phenotype. The HSPG2 gene was sequenced in a separate cohort of 100 unrelated individuals affected with idiopathic scoliosis and also was examined in an independent idiopathic scoliosis population. The exome sequencing and subsequent bioinformatics filtering resulted in 16 potentially damaging and rare coding variants. One of these variants, p.Asn786Ser, is located in the HSPG2 gene. The variant p.Asn786Ser also is overrepresented in a larger cohort of idiopathic scoliosis cases compared with a control population (P = 0.024). Furthermore, we identified additional rare HSPG2 variants that are predicted to be damaging in two independent cohorts of individuals with idiopathic scoliosis. The HSPG2 gene encodes for a ubiquitous multifunctional protein within the extracellular matrix in which loss of function mutation are known to result in a musculoskeletal phenotype in both mouse and humans. Based on these results, we conclude that rare variants in the HSPG2 gene potentially contribute to the idiopathic scoliosis phenotype in a subset of patients with idiopathic scoliosis. Further studies must be completed to confirm the effect of the HSPG2 gene on the idiopathic scoliosis phenotype.
Copyright © 2015 Baschal et al.

Entities:  

Keywords:  HSPG2; exome sequencing; idiopathic scoliosis; perlecan

Mesh:

Substances:

Year:  2014        PMID: 25504735      PMCID: PMC4321025          DOI: 10.1534/g3.114.015669

Source DB:  PubMed          Journal:  G3 (Bethesda)        ISSN: 2160-1836            Impact factor:   3.154


Idiopathic scoliosis (IS) [MIM: %181800] is a common inherited disorder of the immature skeleton, affecting 3% of the pediatric population, with girls characteristically more severely affected than boys (Asher and Burton 2006). Clinically, IS is defined as a structural lateral spinal curvature of ≥10° with a rotatory component, documented by radiographic analysis and occurring in otherwise-normal children. Current treatment modalities are limited to observation, physical therapy, bracing, and surgery. Significant curves in adolescence can lead to severe curve progression in adulthood. In an effort to prevent severe progression and deformity as an adult, these adolescents may elect to undergo an operative spinal instrumentation and fusion, a costly procedure with life-long implications. These methods focus on the physical symptoms and cosmetic deformity but fail to address the need for preventative options. The familial nature of IS is well established, but the mode of inheritance is not clear (Cowell ; Riseborough and Wynne-Davies 1973; Bonaiti ; Czeizel ; Tang ). It is unlikely that IS families are homogeneous with respect to their genetic architecture but represent a spectrum of inheritance patterns ranging from simple Mendelian to complex polygenic. Concordance rates have been reported between 0.73 and 0.92 for monozygotic twins and between 0.36 and 0.63 for dizygotic twins (Kesling and Reinker 1997; Inoue ; Andersen ). Despite clear evidence for the importance of genetics in the susceptibility to IS, limited progress has been made in the identification of genes that are causative or may influence disease susceptibility, variation, or progression. The current literature includes multiple studies that have used linkage and association analyses. Genome-wide linkage analyses have effectively localized candidate regions; however, progress has been burdened by heterogeneity both within and between study populations and by insufficient statistical power (Wise ; Chan ; Salehi ; Justice ; Miller , 2006; Alden ; Ocaka ; Gurnett ; Raggio ; Clough ; Marosy ; Edery ). Association studies, including genome-wide association studies (GWAS), have implicated select genes, including CHD7, LBX1, and GPR126 (Gao ; Sharma ; Takahashi ; Kou ). A recent study has completed exome sequencing for an IS cohort and identified rare variants in the fibrillin genes, FBN1 and FBN2, which were associated with curve severity (Buchan ). In the current study, we completed exome sequencing for a multigenerational family with familial IS, with the goal of identifying rare genetic variants that contribute to the IS phenotype.

Material and Methods

Subjects

Denver IS population:

For the Denver IS population, written informed consent was obtained from study subjects who were enrolled in accordance with protocols approved by the Johns Hopkins School of Medicine Institutional Review Board and the University of Colorado Denver Institutional Review Board (Colorado Multiple Institutional Review Board, Study Approval #06-1161 and 07-0417). A diagnosis of IS required clinical/family history and physical examination consistent with a spinal curvature in the coronal plane, a standing anteroposterior spinal radiograph showing ≥10° curvature by the Cobb method with pedicle rotation, and no congenital deformity or other existing genetic disorder (Shands and Eisberg 1955; Kane 1977; Armstrong ). Individuals were classified as having familial IS if they had at least one additional family member affected with IS (Justice ; Miller ). The majority of individuals in our study are likely to exhibit an autosomal dominant mode of inheritance (Miller , 2012). We collected blood samples from all participants and extracted genomic DNA according to standard phenol-chloroform purification protocols (Sambrook ; Moore and Dowhan 2002) or using the QIAGEN Gentra Puregene Blood Kit.

St. Louis IS population:

As described by Buchan , IS patients in the St. Louis dataset were recruited from St. Louis Children’s Hospital and St. Louis Shriners Hospital for Children. All patients had scoliosis of unknown etiology with spinal curves measuring ≥10°. Patients with developmental delay, multiple congenital anomalies, or known underlying medical disorders (e.g., Ehlers-Danlos syndrome, Marfan syndrome) were excluded. We selected 140 unrelated IS cases of European ancestry with severe deformity (spinal curves measuring ≥40°) for the exome sequencing screen. DNA was collected from affected probands after we obtained informed consent.

Control populations:

Comparisons with control populations used either the 1000 Genomes population (European ancestry, N = 379) (Abecasis ) or the National Heart, Lung, and Blood Institute (NHLBI) Exome Sequencing Project [(ESP) European ancestry subset (EA-ESP), version ESP6500SI-V2, N = 4300].

Exome sequencing

Denver exome sequencing:

A multigenerational family with European ancestry was selected for exome sequencing, with DNA samples available on five affected and three unaffected individuals (two of the unaffected individuals are marry-in relatives). This family was selected based on the number of affected family members with severe curves and the relative pedigree-distance between affected family members. Exome capture was performed on 1 µg of genomic DNA from three affected individuals (IV:1, IV:6, and II:4) in this family using the Illumina TruSeq Exome Capture kit. Samples were sequenced with a 2x100bp run at the Illumina HiSeq 2000 at the University of Colorado Denver Genomics and Microarray Core Facility.

St. Louis exome sequencing:

For the St. Louis IS dataset, exon enrichment was performed using the SureSelect Human All Exon 38Mb and 50Mb kits (Agilent Technologies, Santa Clara, CA) or the TruSeq Exome Enrichment kit (Illumina, San Diego, CA) at the Washington University Genome Technology Access Center, as described previously (Buchan ).

Bioinformatics filtering

Exome sequencing reads were mapped to the reference human genome sequence (hg19) with large-scale alignment software (GSNAP) (Wu and Nacu 2010). Sequence calls for variants (single nucleotide variants (SNVs), insertions/deletions) were performed using the Broad Institute’s Genome Analysis Toolkit (GATK) (McKenna ; Depristo ). The resulting variants were filtered in a systematic way. The program ANNOVAR was used to filter the variants by cross referencing various genetic variation databases (e.g., 1000 genomes database, NHLBI Exome Sequencing Project 6500 dataset, etc), allowing us to extract information about variant frequencies (if previously reported) and location within genes (Wang ). Variants were restricted to those that 1) had no more than a 1% frequency in the phase 1 1000 genomes data (Abecasis ), and 2) altered the coding sequence (nonsense, splice-site, missense and insertion/deletion). The nonsense and missense variants were subsequently cross-referenced to the dbNSFP database to determine whether the resulting changes to the protein are predicted to be damaging (see Supporting Information, File S1) (Liu ). In addition, we required at least 5X sequence coverage at any given position to call a genotype. For the multigenerational family, we required that all three individuals share the variant if they had a genotype at that position. For the St. Louis dataset, next-generation sequencing reads were aligned to hg19 human reference sequence and variants were called and annotated, as described by Buchan .

Sanger sequencing

The variants that remained after the filtering process were confirmed by Sanger sequencing. The variants identified through exome sequencing were tested in the original three affected individuals who underwent exome sequencing and five additional relatives (two affected and three unaffected). As discussed in the Results section, one variant (chromosome 6, 168376925, in HGC6.3) had a high frequency in the EA-ESP population and therefore was not Sanger sequenced. Primers were designed using the program Primer3 (Koressaar and Remm 2007; Untergasser ). Primer sequences can be found in Table S1. Sanger sequencing was completed at either the University of Colorado Denver Barbara Davis Center for Childhood Diabetes or at Beckman Coulter Genomics. Details of PCR conditions and Sanger sequencing protocols are located in File S1. Sanger sequences were analyzed using CodonCode software (CodonCode Corporation, Centerville, MA), and figures were generated using Sequencher (GeneCodes Corporation, Ann Arbor, MI).

HSPG2 sequencing

Exons from the HSPG2 gene (97 exons, 14369 base pair mRNA, 4391 amino acids) were sequenced in 100 unrelated individuals with IS. All individuals had a curvature of >10° with a range of 18−119°), and were predominantly of European ancestry. To decrease sequencing costs, sequencing was completed using 25 pools of four individuals each. An Illumina MiSeq 2x250bp run was performed at the University of Colorado Denver DNA Diagnostic Laboratory. The HSPG2 libraries were enriched using long-range polymerase chain reaction (PCR), resulting in approximately 2.5 to 10 kb fragments with a Takara long range PCR kit (Clontech). PCR conditions followed manufacturer’s recommendations. Primer sequences can be found in Table S1. For high GC regions, betaine was used in the PCR. A GENios FL (Tecan) microplate reader using SYBR green was used to quantify the libraries. The library was generated using a Nextera XT DNA sample kit (FC-121-1031; Illumina) and Nextera Index kit (FC-121-1011; Illumina). Bioinformatics filtering was completed using the same protocol as described earlier. The filtering process resulted in 14 potentially damaging and rare HSPG2 variants in these 100 individuals. The 14 variants were Sanger sequenced in each of the four individuals within the pool to confirm the variant and identify the individual within the pool that carried the variant. The Sanger sequencing protocol was the same as described for the multigenerational family. We were unable to confirm an HSPG2 variant at position 22263709 (p.Met1Thr) due to problems with the PCR-amplification process. This variant was removed from all further analyses, resulting in a total of 13 rare variants in the HSPG2 gene. The median sequencing coverage across the HSPG2 gene was 172X, with a range of 45X to 349X. Exons 27−42 had lower coverage in multiple pools, where the median coverage was 68X across these 16 exons. Due to these concerns with sequence quality, we chose to Sanger sequence all 100 IS individuals for all 13 variants to confirm the variant frequency in our population.

Statistical comparisons

The Fisher’s exact test (two-sided) was used to calculate the P-values for comparisons between the IS population(s) and the 1000 Genomes European population or the EA-ESP, with α = 0.05. Mutalyzer 2.0.beta-31 was used to generate and check all nucleotide and protein positions (Wildeman ), and positions were verified using MutationTaster and the public version of the Human Gene Mutation Database (HGMD) (Schwarz ; Stenson ).

Results

Exome sequencing identifies an HSPG2 variant in an IS family

We studied a multigenerational family affected with familial IS and of European ancestry (Figure 1A). Affected individuals were diagnosed with familial IS based on radiographic findings of a spinal curvature of ≥10°. The proband, individual III:2, was diagnosed at 13 years of age with a right thoracic curve and a left lumbar curve of 60° and 40°, respectively, and underwent posterior spinal instrumentation and fusion. The transmission pattern in this family suggested a Mendelian mode of inheritance, for which exome sequencing is ideally suited. Exome sequencing, followed by variant detection and filtering, was completed for three individuals in this family. Sequencing coverage was 64× overall, with IV:1, IV:6, II:4 at 97X, 82X, and 22X respectively. Filtering resulted in a list of 16 variants that were at a frequency of <1% in the 1000 Genomes data, were predicted to be damaging (according to the criteria defined in the Materials and Methods section and File S1), and were insertions/deletions, nonsense, missense, or splice-site changes. In addition, to be considered for further study, variants were required to be present in all three affected individuals in the family.
Figure 1

(A) Pedigree for the multigenerational family with IS. Individuals IV:1, IV:6, and II:4 underwent exome sequencing. Sanger sequencing was completed for those three individuals in addition to the affected individuals III:2 and III:6 and unaffected individuals III:1, III:7, and IV:2. Lumbar and thoracic curves are abbreviated with L and T, respectively. Double curves are represented by a slash (right thoracic/left lumbar), and triple curves are represented with three slashes (left high thoracic/right thoracic/left lumbar). CT or TT is the individual genotype at HSPG2 position p.Asn786Ser. A “?” represents individuals that have not been examined, so their scoliosis status is unknown. (B) Chromatograms for Variant p.Asn786Ser. Chromatograms for p.Asn786Ser are shown for two individuals from the multigenerational family (III:6, affected, and III:7, unaffected). The variant p.Asn786Ser, in the HSPG2 gene, is heterozygous in the affected individuals within the family, but not in the unaffected individuals (with the exception of IV:2, as discussed in the section Results).

(A) Pedigree for the multigenerational family with IS. Individuals IV:1, IV:6, and II:4 underwent exome sequencing. Sanger sequencing was completed for those three individuals in addition to the affected individuals III:2 and III:6 and unaffected individuals III:1, III:7, and IV:2. Lumbar and thoracic curves are abbreviated with L and T, respectively. Double curves are represented by a slash (right thoracic/left lumbar), and triple curves are represented with three slashes (left high thoracic/right thoracic/left lumbar). CT or TT is the individual genotype at HSPG2 position p.Asn786Ser. A “?” represents individuals that have not been examined, so their scoliosis status is unknown. (B) Chromatograms for Variant p.Asn786Ser. Chromatograms for p.Asn786Ser are shown for two individuals from the multigenerational family (III:6, affected, and III:7, unaffected). The variant p.Asn786Ser, in the HSPG2 gene, is heterozygous in the affected individuals within the family, but not in the unaffected individuals (with the exception of IV:2, as discussed in the section Results). After the filtering process, 16 potentially damaging and rare variants were identified in this family (Table 1 and Table S2). One variant was removed due to an excessive frequency in the EA-ESP dataset (in HGC6.3, 23% frequency in EA-ESP). For each of the remaining 15 variants, Sanger sequencing was completed for the individuals who originally underwent exome sequencing and for additional relatives with DNA samples [two affected and three unaffected (two of the unaffected individuals were marry-in relatives)]. Sanger sequencing was unable to replicate the original result for 1 of the 15 variants (OPRD1) in the original three exome sequenced individuals (Table S3).
Table 1

Rare variants (MAF 0.01) identified in our multigenerational family with IS

ChrPositionGeners #RefVarHGVS Protein PositionVariant Type1000 Genomes FrequencyEA-ESP Frequency
Chr122205601HSPG2rs143736974TCp.N786SMissense0.00180.009
Chr123233346EPHB2rs28936395GAp.D679NMissense0.00230.0071
Chr128060600FAM76Ars147811051GAp.R121HMissense0.00081
Chr129189481OPRD1crs370180812GTp.A269SMissense0.00017
Chr145524416ZSWIM5rs149382804GAp.R373WMissense0.000460.0014
Chr151873967EPS15rs61753384GAp.S438LMissense0.00640.022
Chr477662160SHROOM3rs368255851ATp.D945VMissense0.00012
Chr643483718YIPF3GCATGp.D65delNonframeshift deletion0.0095
Chr6168376925HGC6.3drs35001101GGTp.T137Hfs*7Frameshift insertion0.24
Chr7139281560HIPK2TTCTGCCGp.Q873_T874insRQNonframeshift insertion
Chr7149473614SSPOrs117048984AGp.Y77CMissense0.04
Chr7150934827CHPF2rs144589067GAp.R460QMissense0.00180.0039
Chr8132051970ADCY8rs75246765CAp.A204SMissense0.0060.016
Chr8133883671TGrs114322847CTp.P118LMissense0.00870.022
Chr10112570222RBM20TTGAGN/ASplicing0.0027
ChrX132092485HS6ST2rs181526961GAp.S49LMissense0.0060.0097

MAF, minor allele frequency; IS, idiopathic scoliosis; Chr, chromosome; Ref, reference allele; Var, variant allele; EA-ESP, European American Exome Sequencing Project.

The original MAF filter used the 1000 genomes frequency.

Additional details for each variant are provided in Table S2.

Variant in the OPRD1 gene was not present in all of the affected individuals in the family and was removed from further analyses.

Variant in the HGC6.3 gene was removed from further analyses due to the high frequency in the EA-ESP population.

MAF, minor allele frequency; IS, idiopathic scoliosis; Chr, chromosome; Ref, reference allele; Var, variant allele; EA-ESP, European American Exome Sequencing Project. The original MAF filter used the 1000 genomes frequency. Additional details for each variant are provided in Table S2. Variant in the OPRD1 gene was not present in all of the affected individuals in the family and was removed from further analyses. Variant in the HGC6.3 gene was removed from further analyses due to the high frequency in the EA-ESP population. After filtering and Sanger sequencing, there were 14 variants of interest (Table 1). Therefore, we surveyed the literature for mouse and human phenotypic associations. Previously reported mouse phenotypes for loss-of-function mutations in 13 of the 14 genes produced either phenotypes not clearly associated with musculoskeletal disease or had no evident phenotype. Those 13 genes also have no reported association with a human musculoskeletal phenotype. The single remaining variant is NM_005529.5:p.Asn786Ser, a heterozygous missense variant in the HSPG2 gene (MIM *142461) on chromosome 1p36. All of the affected individuals in this family are heterozygous for this variant (CT genotype) (Figure 1B). Of the three unaffected individuals, two are homozygous for the reference allele (TT genotype) and one is heterozygous (IV:2, CT genotype). The presence of the CT genotype in an unaffected male may be due to incomplete penetrance of the IS phenotype. HSPG2 encodes for heparan sulfate proteoglycan 2, or perlecan. Perlecan is a key component of basement membranes and is strongly associated with musculoskeletal development, both in mouse and in human (Costell ; Nicole ; Arikawa-Hirasawa ; Stum ; Rodgers ), making it an excellent candidate gene for IS.

Sequencing of the HSPG2 gene in 100 individuals with IS

Seeking to further our knowledge of the importance of the HSPG2 gene in IS, we designed an HSPG2 sequencing project. Exons from the HSPG2 gene (97 exons, 14,369 base pair mRNA, 4391 amino acids) were sequenced in 100 unrelated individuals with IS (Denver dataset, curvature range 18−119°) with a custom targeted gene approach on an Illumina MiSeq platform. After variant filtering using the bioinformatic pipeline described previously, we found 14 potentially damaging and rare HSPG2 variants. Again, variants were defined as potentially damaging according to our criteria defined in the Materials and Methods section and File S1, and rare if the frequency was <1% in the 1000 Genomes database. We were unable to confirm one variant with Sanger sequencing, and it was consequently removed from further analyses (see the section Materials and Methods). Each variant had a frequency of <1% in the 1000 Genomes database, and 3 of the 13 identified variants were unique (not previously identified in the 1000 genomes data or in the NHLBI ESP; Table 2 and Table S4). All 13 variants were tested by Sanger sequencing in all 100 individuals to assess variant frequencies in our IS population. Including one individual from our multigenerational family (IV:1), we had a total of 101 individuals with 20 occurrences of HSPG2 variants. One individual had two HSPG2 variants.
Table 2

HSPG2 rare variants (MAF <0.01) Identified in individuals with IS

Positionrs #RefVarHGVS PositioncVariant Type1000 Genomes FrequencyEA-ESP FrequencyDenver (N = 101)St. Louis (N = 140)IS Allele Frequency
22149967rs145687082CTp.V4340MMissense0.001410.0021
22150639rs141280063CTp.E4292KMissense0.0004710.0021
22150854CTp.G4254SMissense10.0021
22154901rs146179360GAp.R4086WMissense0.001210.0021
22155345rs140139732GAp.R4074CMissense0.0014110.0041
22155455rs143015575CAp.R4037LMissense0.0008210.0021
22155960CTp.G3970RMissense10.0021
22159039rs78280998CTp.R3719QMissense10.0021
22161394rs143543800CTp.V3500MMissense0.00180.004110.0021
22168855rs114851469GAp.R2977WMissense0.00320.01250.010
22169325rs41266007CTp.G2950RMissense0.00780.02340.015
22182115rs2229474GAp.R1919CMissense0.00410.0087120.0062
22182333rs140954748GAp.A1883VMissense0.001210.0021
22186669CTN/ASplicing10.0021
22188289rs142433309GAp.T1639MMissense0.00090.00362 (1 has AA)10.0083
22188328rs41311989CTp.R1626HMissense0.0060.0011110.0041
22202483rs62642528GAp.P1019LMissense0.014210.0062
22205601rs143736974TCp.N786SMissense0.00180.009460.021
22206959AGp.Y698HMissense10.0021
22206977rs143669458CTp.V692MMissense0.000920.003310.0021
22211941rs75467696GTp.T361NMissense0.00230.006610.0021

MAF, minor allele frequency; IS, idiopathic scoliosis; Ref, reference allele; Var, variant allele EA-ESP, European American Exome Sequencing Project.

The original MAF filter used the 1000 genomes frequency.

Additional details for each variant are provided in Table S4.

HSPG2 HGVS positions use NM_005529.

MAF, minor allele frequency; IS, idiopathic scoliosis; Ref, reference allele; Var, variant allele EA-ESP, European American Exome Sequencing Project. The original MAF filter used the 1000 genomes frequency. Additional details for each variant are provided in Table S4. HSPG2 HGVS positions use NM_005529. One of the 13 variants was the same variant previously identified from the exome sequencing of our multigenerational IS family (p.Asn786Ser). This specific variant was heterozygous in 3 of the 100 additional sequenced individuals. In the 1000 Genomes data, this variant was only found in four individuals, all male (three from the European subset and one from the Puerto Rican subset). This difference is significant, with the CT genotype present in 4 of 101 IS individuals compared with 3 of 379 1000 Genomes European individuals [P = 0.039, odds ratio (OR) 5.2, 95% confidence interval (95% CI) 0.96−30]. However, the difference is not significant when we compare with the EA-ESP dataset (77/4300 individuals, P = 0.114, OR 2.3, 95% CI 0.69−6.6). This is potentially related to the small number of individuals sequenced in our study (N = 101 individuals) compared with the EA-ESP database (N = 4300). In summary, in the Denver IS dataset, we identified 13 potentially damaging and rare variants in the HSPG2 gene, with 20 occurrences in 101 individuals with IS. This finding provides further support that HSPG2 may have an important role in IS etiology.

Further investigation of the HSPG2 finding in an independent IS population

Next, we investigated HSPG2 damaging variants in the exome sequencing data from an independent group of 140 IS individuals (St. Louis dataset). We identified 7 of the original 13 Denver HSPG2 variants in this St. Louis dataset, including p.Asn786Ser. The CT genotype for this variant is present in six individuals in the St. Louis dataset (compared with 77/4300 EA-ESP, P = 0.046, OR = 2.5, 95% CI 0.95-6.0). Furthermore, an additional 11 potentially damaging variants in HSPG2 were discovered that were not previously identified in the Denver IS group, for a total of 18 HSPG2 variants in the St. Louis dataset (Table 2 and Table S4). We tested all 18 variants by Sanger sequencing and found that three low-quality variants did not confirm, resulting in 15 verified HSPG2 variants from the St. Louis dataset. One individual had two HSPG2 variants. Collectively, in the combined Denver-St. Louis IS population, the data for the p.Asn786Ser variant are compelling. There are 10 IS individuals with a CT genotype of 241 individuals total. We compared this with 77 of 4300 in the EA-ESP population and found that the CT genotype is significantly enriched in the combined IS population (P = 0.024, OR 2.4, 95% CI 1.1−4.8). Overall, 21 potentially damaging and rare variants in HSPG2 were identified in the combined Denver-St. Louis IS population, with 48 occurrences in 241 individuals. The variants are distributed across the HSPG2 gene (Figure 2).
Figure 2

Rare HSPG2 variants identified in IS patients. Structure of the HSPG2 protein showing the amino acid positions of the rare coding variants identified in the combined IS cohort of 241 individuals. Amino acid positions are based on transcript NM_005529.5. Variant p.Asn786Ser is the original variant identified in this study (combined Denver−St. Louis dataset, P = 0.024, odds ratio 2.4). Also note that variant 22186669 (c.5014+1G > A) is not shown on the diagram because it is a splicing variant and does not have an amino acid position. IS, idiopathic scoliosis.

Rare HSPG2 variants identified in IS patients. Structure of the HSPG2 protein showing the amino acid positions of the rare coding variants identified in the combined IS cohort of 241 individuals. Amino acid positions are based on transcript NM_005529.5. Variant p.Asn786Ser is the original variant identified in this study (combined Denver−St. Louis dataset, P = 0.024, odds ratio 2.4). Also note that variant 22186669 (c.5014+1G > A) is not shown on the diagram because it is a splicing variant and does not have an amino acid position. IS, idiopathic scoliosis.

Discussion

We performed exome sequencing for a multigenerational family with familial IS and identified a potentially damaging and rare variant in the HSPG2 gene. Our previous linkage results were reviewed for potential correlation with the location of newly identified variants (Miller , 2006; Alden ; Clough ; Marosy ). When this particular family was analyzed individually, marginal linkage signals were noted at nine loci (Miller, unpublished data, 2014). One of these loci, located on chromosome 1 (19−48.3 Mb), contains the HSPG2 gene. These ancillary linkage data provide additional evidence that a variant within HSPG2 potentially contributes to the IS phenotype in this family. Upon further sequencing of the HSPG2 gene in an additional 100 individuals with IS, we identified 13 potentially damaging and rare variants in HSPG2, with 20 occurrences in 101 individuals. Finally, using exome sequencing data from a second IS population, we found 15 variants in HSPG2. Combined, we found 21 rare variants with 48 occurrences in 241 individuals with IS. Specifically, p.Asn786Ser was present in 10 individuals in our dataset and is enriched in these two IS populations when compared with the EA-ESP dataset (P = 0.024, OR 2.4). HSPG2 encodes for heparan sulfate proteoglycan 2, or perlecan. Perlecan is a modular proteoglycan consisting of five structural core domains to which long chains of glycosaminoglycans (GAGs: heparan sulfate or chondroitin sulfate) are attached (Figure 2) (Murdoch ). Domain I, the N-terminal domain, is unique to perlecan and has three attachment sites for heparan sulfate/chondroitin sulfate chains. Domain II is homologous to low-density lipoprotein (LDL) receptor sites. Domain III consists of multiple modules homologous to laminin alpha chains and laminin epidermal growth factor-like (EGF-like) repeats and is divided into three subdomains (III-1, III-2, III-3) (Schulze ). Fibroblast growth factor-7 (FGF-7), platelet-derived growth factor (PDGF), and von Willebrand A-domain related protein (WARP) are known to bind subdomains III-1, III-2, and III-2, respectively (Gohring ; Mongiat ; Allen ). The largest domain, domain IV, contains multiple immunoglobulin G−like (IgG-like) repeats similar to neural cell adhesion molecules. Domain V, at the C-terminus of the protein, has homology to the globular domain of laminin and contains a GAG attachment and EGF-like repeats. This domain interacts with cell-surface integrins (α2β1), forming additional complexes linking the extracellular matrix (ECM) with the cell, and is responsible for perlecan self-assembly (Bix ). Perlecan stabilizes the ECM and is localized to basement membranes, vascular structures, cartilage, and osteogenic tissues (Costell ). This protein is abundantly expressed by skeletal myofibers and is incorporated into the basal lamina of both the synaptic and extrasynaptic regions within the neuromuscular junction (NMJ) (Singhal and Martin 2011). Perlecan binds multiple proteins and signaling molecules {including FBN1 [previously implicated in IS (Buchan )], FGF-7, latent transforming growth factor-beta binding protein-2 (LTBP-2)}, indicating it has a role as an ECM signaling scaffold with an ability to participate in cellular proliferation, differentiation, and migration (Arikawa-Hirasawa ; Farach-Carson and Carson 2007; Hayes ; Hayes ). Perlecan is strongly associated with musculoskeletal development because it is essential for endothelial growth, regeneration, and both avascular cartilage and skeletal development, making HSPG2 an excellent gene candidate for IS. Classically, pathogenic mutations in the FBN1 gene result in Marfan syndrome, a multisystem disorder that includes the musculoskeletal abnormality of scoliosis. Early investigations noted FBN1 abnormalities in the paraspinal tissues and fibroblasts of select individuals with severe IS (Miller ). Most recently, Buchan identified rare variants in the FBN1 gene as risk factors for severe spinal deformity in individuals with IS (Buchan ). Collectively, these findings and the known colocalization of perlecan and FBN1 within paraspinal tissues make HSPG2 an excellent candidate gene for IS pathogenesis. Mutations within the HSPG2 gene have been associated with two human diseases transmitted in an autosomal recessive pattern of inheritance, Schwartz-Jampel syndrome, type 1 (SJS1) [MIM: #255800] and dyssegmental dysplasia Silverman-Handmaker type (DDSH) [MIM: #224410]. Individuals with SJS1 are characterized by a skeletal dysplasia, including kyphoscoliosis, joint contractures, and myotonia resulting in a fixed facial expression and prolonged muscle contractions (Nicole ; Stum ). SJS1 mutations occur throughout the HSPG2 gene and are not concentrated in a specific exon or domain (Table S5) (Nicole ; Arikawa-Hirasawa ; Stum ; Rodgers ). In SJS1, the mutated proteins have varying degrees of functionality. Animal models have shown site-specific mutations of the gene, which result in down-regulation of perlecan at the transcriptional level (Rodgers ). In contrast, DDSH is a more severe phenotype than SJS1 and is considered lethal (Arikawa-Hirasawa ). The causative mutational events for DDSH are functional null mutations, resulting in a truncated perlecan protein core that is not secreted (Table S5) (Arikawa-Hirasawa ). This is similar to the Hspg2 homozygous knockout mouse, which exhibits significant kyphoscoliosis and skeletal defects (Costell ). Thus, homozygous HSPG2 mutations in humans and mice display a more severe phenotype with an earlier onset compared with the adolescent-onset FIS phenotype in individuals with heterozygous HSPG2 variants. This finding suggests that haploinsufficiency may result in a later onset and progressive disease, as observed in FIS. Similar to SJS1 mutational events, the variants we identified in IS individuals are spread out across the HSPG2 gene. Our original variant (p.Asn786Ser) is located in domain III-1, which is known to bind FGF-7 (Figure 2). One IS individual has a damaging variant (22202483, p.Pro1019Leu) at a base position also identified in SJS1 individuals (Stum ). However, this variant has an allele frequency of 1.3% in the EA-ESP and thus is unlikely to be solely causative for SJS1, based on a predicted homozygote frequency of 0.018%. We completed a test of variant burden across the entire HSPG2 gene, comparing the occurrence of potentially damaging and rare variants in individuals with IS to the EA-ESP control population. We used a Z test for population proportions to compare the number of variant occurrences in the two populations. Using the overall combined dataset, we found there were 48 occurrences in 241 individuals with IS compared with 720 occurrences in 4300 individuals in the EA-ESP dataset (P = 0.20). If we remove the variants in perlecan domain IV from both datasets, because domain IV is highly repetitive and the functions are more likely to be redundant within the gene, there are 31 occurrences in 241 individuals with IS compared with 385 occurrences in 4300 individuals in the EA-ESP dataset (P = 0.040). This marginally significant result increases our confidence that variants in HSPG2 contribute to the IS phenotype, but sequencing in additional individuals with IS will be needed to draw strong conclusions. IS, defined as a lateral spinal curvature of at least 10°, is present in 3% of the general population and can lie undetected within many individuals. Thus, a variant that contributes to the IS phenotype could be expected to be present in a control population at a reasonably high frequency. In addition, IS is characterized by incomplete penetrance, so not all individuals who carry a variant are expected to have clinical disease, as seen with individual IV:2, who is unaffected but is heterozygous at the variant of interest (p.Asn786Ser). Finally, we do not expect that one gene is causative for IS in all individuals. We believe that HSPG2 may contribute to the IS phenotype in a subset of individuals with IS but that other genes may be required to modify the disease and result in clinical/identifiable disease (i.e., another variant in a different gene may also be required to have a larger/progressive curvature). Future studies should include both sequencing the HSPG2 gene in larger IS cohorts and performing functional studies of the identified variants to provide confirmation that the variants contribute to the IS phenotype. In summary, IS is a complex disease. Genetic findings have been hampered by incomplete penetrance, subclinical disease in control populations, unknown modes of inheritance, and genetic heterogeneity. Additionally, IS may be a polygenic disease, even within families. By demonstrating the presence of the p.Asn786Ser variant in all affected individuals in a multigenerational family, with incomplete penetrance in one unaffected individual, and enrichment of this variant in two independent cohorts of IS patients, we now have evidence that rare variants in the HSPG2 gene potentially contribute to IS susceptibility.
  62 in total

1.  A genome-wide association study identifies common variants near LBX1 associated with adolescent idiopathic scoliosis.

Authors:  Yohei Takahashi; Ikuyo Kou; Atsushi Takahashi; Todd A Johnson; Katsuki Kono; Noriaki Kawakami; Koki Uno; Manabu Ito; Shohei Minami; Haruhisa Yanagida; Hiroshi Taneichi; Taichi Tsuji; Teppei Suzuki; Hideki Sudo; Toshiaki Kotani; Kota Watanabe; Kazuhiro Chiba; Naoya Hosono; Naoyuki Kamatani; Tatsuhiko Tsunoda; Yoshiaki Toyama; Michiaki Kubo; Morio Matsumoto; Shiro Ikegawa
Journal:  Nat Genet       Date:  2011-10-23       Impact factor: 38.330

2.  Mapping of the binding of platelet-derived growth factor to distinct domains of the basement membrane proteins BM-40 and perlecan and distinction from the BM-40 collagen-binding epitope.

Authors:  W Göhring; T Sasaki; C H Heldin; R Timpl
Journal:  Eur J Biochem       Date:  1998-07-01

Review 3.  Role of extracellular matrix proteins and their receptors in the development of the vertebrate neuromuscular junction.

Authors:  Neha Singhal; Paul T Martin
Journal:  Dev Neurobiol       Date:  2011-11       Impact factor: 3.964

4.  Assignment of a locus for autosomal dominant idiopathic scoliosis (IS) to human chromosome 17p11.

Authors:  Leila Baghernajad Salehi; Massimo Mangino; Salvatore De Serio; Domenico De Cicco; Francesca Capon; Sabrina Semprini; Antonio Pizzuti; Giuseppe Novelli; Bruno Dallapiccola
Journal:  Hum Genet       Date:  2002-08-21       Impact factor: 4.132

5.  Genetic analysis of structural elastic fiber and collagen genes in familial adolescent idiopathic scoliosis.

Authors:  N H Miller; B Mims; A Child; D M Milewicz; P Sponseller; S H Blanton
Journal:  J Orthop Res       Date:  1996-11       Impact factor: 3.494

6.  Males with familial idiopathic scoliosis: a distinct phenotypic subgroup.

Authors:  Mark Clough; Cristina M Justice; Beth Marosy; Nancy H Miller
Journal:  Spine (Phila Pa 1976)       Date:  2010-01-15       Impact factor: 3.468

7.  Adolescent idiopathic scoliosis in twins: a population-based survey.

Authors:  Mikkel O Andersen; Karsten Thomsen; Kirsten O Kyvik
Journal:  Spine (Phila Pa 1976)       Date:  2007-04-15       Impact factor: 3.468

8.  Perlecan maintains the integrity of cartilage and some basement membranes.

Authors:  M Costell; E Gustafsson; A Aszódi; M Mörgelin; W Bloch; E Hunziker; K Addicks; R Timpl; R Fässler
Journal:  J Cell Biol       Date:  1999-11-29       Impact factor: 10.539

9.  An integrated map of genetic variation from 1,092 human genomes.

Authors:  Goncalo R Abecasis; Adam Auton; Lisa D Brooks; Mark A DePristo; Richard M Durbin; Robert E Handsaker; Hyun Min Kang; Gabor T Marth; Gil A McVean
Journal:  Nature       Date:  2012-11-01       Impact factor: 49.962

10.  Endorepellin causes endothelial cell disassembly of actin cytoskeleton and focal adhesions through alpha2beta1 integrin.

Authors:  Gregory Bix; Jian Fu; Eva M Gonzalez; Laura Macro; Amy Barker; Shelly Campbell; Mary M Zutter; Samuel A Santoro; Jiyeun K Kim; Magnus Höök; Charles C Reed; Renato V Iozzo
Journal:  J Cell Biol       Date:  2004-07-05       Impact factor: 10.539

View more
  22 in total

Review 1.  Development of a straight vertebrate body axis.

Authors:  Michel Bagnat; Ryan S Gray
Journal:  Development       Date:  2020-10-06       Impact factor: 6.868

Review 2.  The cartilage matrisome in adolescent idiopathic scoliosis.

Authors:  Carol A Wise; Diane Sepich; Aki Ushiki; Anas M Khanshour; Yared H Kidane; Nadja Makki; Christina A Gurnett; Ryan S Gray; Jonathan J Rios; Nadav Ahituv; Lila Solnica-Krezel
Journal:  Bone Res       Date:  2020-03-09       Impact factor: 13.567

3.  The axonemal dynein heavy chain 10 gene is essential for monocilia motility and spine alignment in zebrafish.

Authors:  Yunjia Wang; Benjamin R Troutwine; Hongqi Zhang; Ryan S Gray
Journal:  Dev Biol       Date:  2021-12-13       Impact factor: 3.582

4.  Identification of FAT3 as a new candidate gene for adolescent idiopathic scoliosis.

Authors:  Dina Nada; Cédric Julien; Simon Papillon-Cavanagh; Jacek Majewski; Mohamed Elbakry; Wesam Elremaly; Mark E Samuels; Alain Moreau
Journal:  Sci Rep       Date:  2022-07-19       Impact factor: 4.996

5.  Identification of rare paired box 3 variant in strabismus by whole exome sequencing.

Authors:  Hui-Min Gong; Jing Wang; Jing Xu; Zhan-Yu Zhou; Jing-Wen Li; Shu-Fang Chen
Journal:  Int J Ophthalmol       Date:  2017-08-18       Impact factor: 1.779

Review 6.  Genetics and pathogenesis of idiopathic scoliosis.

Authors:  A Grauers; E Einarsdottir; P Gerdhem
Journal:  Scoliosis Spinal Disord       Date:  2016-11-28

7.  Lack of association between AKAP2 and the susceptibility of adolescent idiopathic scoliosis in the Chinese population.

Authors:  Leilei Xu; Chao Xia; Weiguo Zhu; Zhenhua Feng; Xiaodong Qin; Weixiang Sun; Yong Qiu; Zezhang Zhu
Journal:  BMC Musculoskelet Disord       Date:  2017-08-24       Impact factor: 2.362

8.  Mutations in KIF7 implicated in idiopathic scoliosis in humans and axial curvatures in zebrafish.

Authors:  Elizabeth A Terhune; Melissa T Cuevas; Anna M Monley; Cambria I Wethey; Xiaomi Chen; Maria V Cattell; Melisa N Bayrak; Morgan R Bland; Brittan Sutphin; George Devon Trahan; Matthew R G Taylor; Lee A Niswander; Kenneth L Jones; Erin E Baschal; Lilian Antunes; Matthew Dobbs; Christina Gurnett; Bruce Appel; Ryan Gray; Nancy Hadley Miller
Journal:  Hum Mutat       Date:  2021-02-07       Impact factor: 4.878

Review 9.  Etiological Theories of Adolescent Idiopathic Scoliosis: Past and Present.

Authors:  Maja Fadzan; Josette Bettany-Saltikov
Journal:  Open Orthop J       Date:  2017-12-29

10.  CELSR2 is a candidate susceptibility gene in idiopathic scoliosis.

Authors:  Elisabet Einarsdottir; Anna Grauers; Jingwen Wang; Hong Jiao; Stefan A Escher; Aina Danielsson; Ane Simony; Mikkel Andersen; Steen Bach Christensen; Kristina Åkesson; Ikuyo Kou; Anas M Khanshour; Acke Ohlin; Carol Wise; Shiro Ikegawa; Juha Kere; Paul Gerdhem
Journal:  PLoS One       Date:  2017-12-14       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.