| Literature DB >> 29930198 |
Erin E Baschal1, Elizabeth A Terhune2, Cambria I Wethey2, Robin M Baschal2,3, Kandice D Robinson2, Melissa T Cuevas2, Shreyash Pradhan2, Brittan S Sutphin2, Matthew R G Taylor4, Katherine Gowan5, Chad G Pearson6, Lee A Niswander5,7, Kenneth L Jones5, Nancy H Miller2,3.
Abstract
Idiopathic scoliosis (IS) is a structural lateral spinal curvature of ≥10° that affects up to 3% of otherwise healthy children and can lead to life-long problems in severe cases. It is well-established that IS is a genetic disorder. Previous studies have identified genes that may contribute to the IS phenotype, but the overall genetic etiology of IS is not well understood. We used exome sequencing to study five multigenerational families with IS. Bioinformatic analyses identified unique and low frequency variants (minor allele frequency ≤5%) that were present in all sequenced members of the family. Across the five families, we identified a total of 270 variants with predicted functional consequences in 246 genes, and found that eight genes were shared by two families. We performed GO term enrichment analyses, with the hypothesis that certain functional annotations or pathways would be enriched in the 246 genes identified in our IS families. Using three complementary programs to complete these analyses, we identified enriched categories that include stereocilia and other actin-based cellular projections, cilia and other microtubule-based cellular projections, and the extracellular matrix (ECM). Our results suggest that there are multiple paths to IS and provide a foundation for future studies of IS pathogenesis.Entities:
Keywords: actin cytoskeleton; cilia; exome sequencing; extracellular matrix; idiopathic scoliosis; microtubules
Mesh:
Year: 2018 PMID: 29930198 PMCID: PMC6071588 DOI: 10.1534/g3.118.200290
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Clinical information for subjects who underwent exome sequencing. Five families underwent exome sequencing, with three to four individuals sequenced in each family. Gender, curve measurement (Cobb angle), relationship to proband, degree relationship to proband, and sequencing coverage across the exome are presented for each individual. Double curves are represented by a slash. Pedigrees are provided in Supplemental File 1
| Family | Individual | Gender | Curve (Degrees) | Relationship to Proband | Degree Relationship to Proband | Coverage |
|---|---|---|---|---|---|---|
| A | III-4 | F | 52/81 | proband | proband | 91X |
| A | III-5 | F | 76/65 | sister | 1st degree | 51X |
| A | III-1 | M | 25 | half cousin (paternal) | 4th degree | 51X |
| A | II-4 | F | unknown | aunt (paternal) | 2nd degree | 68X |
| B | IV-3 | F | 34/28 | proband | proband | 36X |
| B | IV-5 | M | 56 | cousin (maternal) | 3rd degree | 77X |
| B | II-2 | F | 0 | grandmother (maternal) | 2nd degree | 63X |
| C | IV-1 | F | 36/20 | proband | proband | 49X |
| C | III-6 | F | 45 | first cousin once removed (maternal) | 4th degree | 52X |
| C | IV-2 | M | 35/70 | second cousin (maternal) | 5th degree | 38X |
| D | III-1 | F | 32/47 | proband | proband | 52X |
| D | III-5 | F | 78/42 | cousin (paternal) | 3rd degree | 44X |
| D | II-5 | F | 12/8 | aunt (paternal) | 2nd degree | 49X |
| E | IV-1 | F | 50/60 | proband | proband | 55X |
| E | III-3 | F | 50 | aunt (paternal) | 2nd degree | 40X |
| E | II-3 | F | 22/35 | great aunt (paternal) | 3rd degree | 38X |
Functional annotation clusters identified by DAVID. Functional annotation clustering analysis was completed as described in the Methods. In brief, the GOTERM_CC_All annotation category was used, with an input of the 246 genes identified in our dataset. Five clusters were identified, and higher enrichment scores represent clusters that are more enriched. Detailed results, including underlying genes for each GO term, are presented in Supplemental File 4
| Cluster | Enrichment Score | GO Terms |
|---|---|---|
| Cluster 1 | 2.47084939 | stereocilium, stereocilium bundle, actin-based cell projection, cluster of actin-based cell projections, brush border, nonmotile primary cilium, primary cilium, microvillus |
| Cluster 2 | 0.549760389 | extracellular matrix component, complex of collagen trimers, proteinaceous extracellular matrix |
| Cluster 3 | 1.470776063 | cell projection, cilium, cell projection part, ciliary part |
| Cluster 4 | 1.268556732 | intrinsic component of plasma membrane, integral component of plasma membrane, plasma membrane part |
| Cluster 5 | 1.058855585 | synapse, neuron part |
Figure 1BiNGO visualization of Cellular Component GO terms. BiNGO was used to determine and visualize GO terms that are significantly overrepresented in our dataset, as described in detail in the Methods. Blue represents more significant GO terms, teal is P = 0.05, and white are non-significant first degree relatives. The size of the node is proportional to the number of genes in the dataset that are annotated to that GO term. The data used to generate this figure are presented in Supplemental File 6.
BiNGO GO term annotations and gene lists. BiNGO was used to determine and visualize GO terms that are significantly overrepresented in our dataset, as described in detail in the Methods and Results. Nine overall groups of clustered GO terms were manually annotated. This table lists the number of significant GO terms included in the overall group, the number of genes annotated to those significant GO terms, and the corresponding gene names. In addition, three significant GO terms were not clustered with the nine overall groups: non-membrane-bounded organelle, intracellular non-membrane-bounded organelle, and organelle part. The non-redundant list of genes annotated to those three GO terms are included, as are the genes from our five families that were not annotated by BiNGO
| Overall Group | # Significant GO Terms | # Genes | Genes Annotated to the Overall Group |
|---|---|---|---|
| Muscle | 2 | 1 | |
| ECM/ Collagen | 6 | 8 | |
| Microtubule Cytoskeleton | 6 | 12 | |
| Cilium | 4 | 5 | |
| Sterocilium | 3 | 4 | |
| Neuron/ Synapse | 7 | 15 | |
| Plasma Membrane | 4 | 31 | |
| Mitochondrial Membrane | 1 | 1 | |
| Nucleus | 2 | 1 | |
| Three Significant GO Terms That Are Not in the Nine Overall Groups | 3 | 47 | |
| Not Annotated by BiNGO | N/A | 49 |
Location of proteins in the cilium. Literature searches were used to identify the location within the cilium of the cilia-related genes/proteins identified in our study. Genes were classified as encoding cilia proteins (“yes”) if the gene was listed in at least one cilia gene database, or had published evidence localizing the protein product to the cilium (see Methods). Genes were classified as potential cilia genes (“potential”) if published evidence suggested the protein interacted with the cilium, but was not confirmed to be related to cilia function. Cilia annotations are presented for all variants in Supplemental File 2 and for cilia-localized variants in Supplemental File 5
| Location | Number of Variants (“Yes”) | Number of Variants (“Potential”) |
|---|---|---|
| Axoneme | 7 | 0 |
| Basal body/centriole | 2 | 0 |
| Base of cilia | 6 | 0 |
| Other/unknown | 2 | 7 |
| Total | 17 | 7 |