Literature DB >> 32703248

Evaluation of genetic susceptibility of common variants in SOX9 in patients with congenital talipes equinovarus in the Han Chinese population.

Jian Li1, Zhi Wang2, Dongxu Feng3, Wei Wang1, Weilou Feng4.   

Abstract

BACKGROUND: Congenital talipes equinovarus (CTEV) is a common birth defect that causes severe deformities of one or both feet. Genetics have been proven to play a key role in the risk of CTEV. Our study aimed to evaluate the genetic susceptibility of common variants in the SOX9 gene to CTEV in a Han Chinese population.
METHODS: In this study, we recruited 2,205 study participants, including 692 CTEV patients and 1513 healthy controls. A total of seven selected single-nucleotide polymorphisms (SNPs) within the SOX9 gene were genotyped, and environmental variables, including maternal smoking and alcoholic drinking habits, were assessed. In addition, bioinformatics analyses were performed to explore the potential biological functions of the associated SNPs.
RESULTS: The SNP rs73354570 was identified to be significantly associated with the risk of CTEV (OR = 1.53, P = 2.11 × 10-5), and the C allele was associated with an increased risk of CTEV. A dose-dependent pattern could be observed in genotypic analyses. The OR for individuals with AC genotypes was 1.37 (95% CI 1.09-1.71), and the OR for individuals with CC homozygotes was 1.47 (95% CI 1.18-1.82). Further analyses identified that rs73354570 is located within a region of multiple binding proteins, including CEBPB and POLR2A, which suggested that this SNP was also part of genetic motifs that are found within several cell types.
CONCLUSION: Our results provide evidence supporting the important role of the SOX9 gene in the contribution to the risk of CTEV.

Entities:  

Keywords:  Case-control study; Congenital talipes equinovarus; Genetic association; SOX9 gene; Single-nucleotide polymorphisms

Mesh:

Substances:

Year:  2020        PMID: 32703248      PMCID: PMC7376870          DOI: 10.1186/s13018-020-01802-7

Source DB:  PubMed          Journal:  J Orthop Surg Res        ISSN: 1749-799X            Impact factor:   2.359


Background

Congenital talipes equinovarus (CTEV), or clubfoot, is a common birth defect that occurs in 1 in every 1000 live births [1]. As a severe birth defect, patients with this disorder have difficulty walking because of severe deformities of one or both feet [1]. The etiology of CTEV is still unclear, but genetics have been proven to play a key role in the risk of CTEV [2-5]. In a family-based study, Gurnett et al. showed that clubfoot segregates in an autosomal dominant fashion with incomplete penetrance [6]. Previous genetic studies have identified several genes that are related to clubfoot, including PITX1, TBX4, and MYH3. The only previous genome-wide association study (GWAS) of clubfoot identified a significant single-nucleotide polymorphism (SNP) between NCOR2 and ZNF664, and there was evidence suggesting that FOXN3, SORCS1, and MMP7/TMEM123 were also associated with clubfoot [7]. SOX9 encodes a transcription factor that can recognize and bind to the sequence of CCTTGAG. Early studies have shown that the products of SOX9 can bind to COL9A1 and regulate its transcription [8]. COL9A1 encodes an α chain of type IX collagen. Early studies have shown that mice with a knockout of the Col9a1 gene do not produce any collagen IX polypeptides, which indicates that COL9A1 is essential for the synthesis of type IX collagen. Recent studies have shown that variations in collagen IX-related genes can cause alterations in bone marrow hyperplasia and might therefore be related to articular cartilage-related diseases [8, 9]. Zhou et al. investigated the association between common SNPs of COL9A1 and clubfoot in a small sample of individuals with Chinese Han ancestry, and a potentially related SNP was identified [10]. Based on these multiple lines of evidence connecting SOX9 and COL9A1, we hypothesize that SOX9 might be involved in the regulation of articular cartilage development. In a recent study, Wang et al. detected significant differences in the mRNA and protein expression levels of SOX9 between idiopathic CTEV muscle samples and controls [11]. Nevertheless, no evidence has been identified to connect the genetic polymorphisms of SOX9 and the risk of CTEV in the human population. In this study, we aimed to investigate the potential genetic association between genetic polymorphisms of SOX9 and the risk of CTEV based on thousands of study participants with Chinese Han ancestry. In addition to single marker-based association analyses, we also examined the effects of some environmental factors, including maternal smoking and alcoholic drinking habits. Gene-by-environment interactions were also explored.

Methods

Study participants

We recruited 692 CTEV patients and 1513 unrelated healthy controls from Honghui Hospital of Xi’an Jiaotong University and Xi’an Children Hospital from January 2012 to April 2017. All participants included in the study were randomly chosen genetically unrelated Han Chinese individuals. All patient diagnoses were confirmed by X-ray examinations. All controls without foot deformities were matched with patients by age and gender. The demographic and clinical characteristics of all participants were obtained from questionnaires and medical records. This study was performed in accordance with the ethical guidelines of the Helsinki Declaration of 1975 (revised in 2008) and was approved through the Ethics Committee of Honghui Hospital of Xi’an Jiaotong University. Written informed consent was obtained from all participants.

SNP selection and genotyping

Tagged SNPs located within gene regions with minor allele frequency (MAF) > 0.01 in SOX9 in 1000 Chinese Han genomes were chosen for genotyping. Algorithm Tagger integrated in Haploview [12] was used for SNP tagging, and the r2 criterion used for tagging was 0.8 for both gene regions. The r2 criterion was used for tagging. A total of 7 candidate SNPs were selected for genotyping in this study. Genomic DNA was isolated from peripheral blood leukocytes according to the manufacturer’s protocol (Genomic DNA kit, Axygen Scientific, Inc., CA, USA). SNP genotyping was performed using the high-throughput Sequenom MassARRAY platform with iPLEX GOLD chemistry (Sequenom, San Diego, CA, USA) based on the manufacturer’s protocols. The results were processed using Sequenom Typer 4.0 software, and genotype data were generated from the samples [13]. The case and control sample results were blinded for quality control during genotyping processes [14], and 5% of samples were randomly processed with a concordance of 100%.

Statistical methods

Genetic association analyses were performed at both genotypic and allelic levels using Plink v1.9 [15]. A linkage disequilibrium (LD) plot was made using Haploview [12]. In addition to ordinary genetic association analyses, we conducted gene-by-environment interaction analyses by logistic models. For gene-by-environment analyses, two environmental factors, i.e., maternal smoking and maternal alcoholic drinking, were included. The smoking and drinking status were recorded as their habit in general but not status during their pregnancy. Each of these seven selected SNPs was examined by pairing with each of the two environmental factors. Both maternal smoking and alcoholic drinking were coded as 0, 1, and 2, indicating never, occasionally, and often, respectively. Age and gender were included in the logistic models described above. Bonferroni corrections were applied to address multiple comparisons. In addition to statistical methods, we conducted bioinformatics analyses to explore the potential biological functions of our targeted SNPs. RegulomeDB, which annotates SNPs using ENCODE data, was used to investigate the potential regulatory roles of significant SNPs [16]. We also examined the potential eQTL patterns of those significant SNPs using GTEx [17]. Moreover, we investigated the gene-gene network of SOX9 using the STRING database, which is a database of known and predicted protein-protein interactions.

Results

Demographic and clinical characteristics of our study participants

Of the patients included in this study, 8.1% had a family history of CTEV compared with 1.4% of the controls. There was a significant difference in this variable between the study groups (P < 0.001, Table 1). However, no significant differences were found in age, gender, maternal smoking, or maternal drinking between groups (Table 1).
Table 1

Demographic characteristics of CTEV patients and healthy controls

CharacteristicsSubjects (N = 2205)P value
PatientsControls
Number6921513
Age (months), mean ± SD46.72 ± 32.3046.91 ± 33.340.99
Gender (male/female)445/247970/5430.96
Family history (yes/no)56/63621/1492< 0.001
Maternal smoking (yes/occasionally/never)44/91/55781/170/12620.25
Maternal drinking (yes/occasionally/never)55/208/429117/436/9600.80
Talipes equinovarus (single/both)411/281––

SD standard deviation, NA not applicable. Chi-square tests were performed for qualitative variables, and t tests were conducted for quantitative variables

Demographic characteristics of CTEV patients and healthy controls SD standard deviation, NA not applicable. Chi-square tests were performed for qualitative variables, and t tests were conducted for quantitative variables

Genetic associations signal

In our study, all seven SNPs selected were in Hardy-Weinberg equilibrium. Basic information, including MAF and the results of Hardy-Weinberg equilibrium tests for these 7 SNPs, is included in Table 2. Weak LD could be identified among these 7 SNPs (Fig. 1). Rs73354570 of SOX9 was the only SNP identified to be significantly associated (odds ratio = 1.53, P = 2.11 × 10−5) with the disease status of CTEV (Table 3). The C allele was associated with an increased risk of CTEV. A dose-dependent pattern could be observed in genotypic analyses. The ORs for individuals with AC genotypes and CC homozygotes were 1.37 (with 95% confidence interval 1.09–1.71) and 1.47 (with 95% confidence interval 1.18–1.82), respectively. The only interaction signal that achieved nominal significance was identified between SNP rs73354570 and maternal alcoholic drinking habit (P = 0.02). However, this signal was no longer significant if we correct for multiple comparisons (threshold of P value is 0.007). The full results are summarized in Table 4.
Table 2

Genetic information of the selected SNPs

CHRSNPPOSALLELEFUNCLOCIMAFHWE
17rs14422194172121913C/TNear-gene-5SOX90.030.62
17rs3512655072122558A/GIntronSOX90.490.92
17rs222998972122794C/TCoding-synonSOX90.030.41
17rs14581371372124722A/GUntranslated-3SOX90.021.00
17rs104267372125198A/GUntranslated-3SOX90.230.94
17rs7499934172125313C/TUntranslated-3SOX90.131.00
17rs7335457072125316A/CUntranslated-3SOX90.110.23

CHR: chromosome; POS: position; FUNC: function; MAF: minor allele frequency;HWE: P values of Hardy-Weinberg Equilibrium tests performed in controls.

Fig. 1

Linkage disequilibrium plot made by the genotyped SNPs. Values of r2 were showed in each cell

Table 3

Genetic association between genetic polymorphisms of SOX9 and CTEV

CHRSNPPOSGenotypic analysesAllelic Analyses
GenotypesCasesControlsOR[95% CI]*χ2P valueAllelesCasesControlsOR[95% CI]χ2P value
TT00
17rs14422194172121913CT40791.13[0.75–1.65]T40791.11[0.75–1.63]
CC6521434ref0.61C13442947ref0.280.60
AA1653561.03[0.80–1.33]
17rs3512655072122558AG3467531.03[0.83–1.27]A67614651.02[0.90–1.16]
GG181404ref0.080.96G7081561ref0.070.79
TT00
17rs222998972122794CT431000.94[0.65–1.35]T431000.94[0.65–1.35]
CC6491413ref0.78C13412926ref0.120.73
AA00
17rs14581371372124722AG26451.27[0.78–2.08]A26451.27[0.78–2.06]
GG6661468ref0.36G13582981ref0.920.34
AA40811.07[0.71–1.60]
17rs104267372125198AG2485361.03[0.85–1.24]A3286981.04[0.89–1.20]
GG404896ref0.240.89G10562328ref0.210.64
CC9240.82[0.38–1.77]
17rs7499934172125313CT1553391.00[0.80–1.24]C1733870.97[0.80–1.18]
TT5281150ref0.260.88T12112639ref0.070.79
CC1791.47[1.18–1.82]
17rs7335457072125316AC1552661.37[1.09–1.71]C1892841.53[1.26–1.86]
AA5281238ref22.391.38 × 105A11952742ref18.092.11 × 10-5

CHR chromosome, POS position

*Fisher exact tests were performed when sparse cells were presented. Threshold of P values was 0.05/7 ≈ 0.007. Significant results were highlighted in bold

Table 4

Results of gene by environmental interactions between selected SNPs and smoking and alcoholic drinking

CHRSNPPOSAlleleSmokingDrinking
Z.interP.interZ.interP.inter
17rs104267372125198A/G0.490.63− 1.060.29
17rs14422194172121913C/T− 0.340.74− 1.850.06
17rs14581371372124722A/G− 0.510.610.490.62
17rs222998972122794C/T1.220.22− 1.420.16
17rs3512655072122558A/G0.800.421.800.07
17rs7335457072125316A/C− 0.630.532.340.02
17rs7499934172125313C/T− 0.810.421.890.06

CHR chromosome, POS position, Z.inter Z-statistic of the interaction term, P.inter P values of the interaction term

Genetic information of the selected SNPs CHR: chromosome; POS: position; FUNC: function; MAF: minor allele frequency;HWE: P values of Hardy-Weinberg Equilibrium tests performed in controls. Linkage disequilibrium plot made by the genotyped SNPs. Values of r2 were showed in each cell Genetic association between genetic polymorphisms of SOX9 and CTEV CHR chromosome, POS position *Fisher exact tests were performed when sparse cells were presented. Threshold of P values was 0.05/7 ≈ 0.007. Significant results were highlighted in bold Results of gene by environmental interactions between selected SNPs and smoking and alcoholic drinking CHR chromosome, POS position, Z.inter Z-statistic of the interaction term, P.inter P values of the interaction term

Bioinformatics analyses

We explored the potential functional significance of SNP rs73354570 in RegulomeDB and GTEx. The RegulomeDB score for rs73354570 was 2b. The scoring system of RegulomeDB ranges from 1 to 6, and a lower score often indicates that the SNP has a more significant role in biological functions. A further examination of this SNP identified that rs73354570 is located within a region of multiple binding proteins, including CEBPB and POLR2A. This SNP was also part of genetic motifs found within several cell types. We did not identify significant eQTL signals from GTEx for SNP rs73354570 on SOX9 (Supplemental Table S1). The gene-gene network of SOX9 is shown in Fig. 2. According to the STRING database, the protein product of SOX9 interacts experimentally with several proteins encoded by the PRKG2, COL2A1, RUNX2, AMH, NR5A1, FOXL2, and CTNNB1 genes. In addition, SOX9 was also predicted to be connected with other proteins encoded by genes, including COL10A1, ACAN, and SHH.
Fig. 2

Gene-gene interaction network constructed based on protein-protein interaction (PPI) data. Blue and pink lines mean experimentally determined interactions, and the rest color lines mean the predicted interactions

Gene-gene interaction network constructed based on protein-protein interaction (PPI) data. Blue and pink lines mean experimentally determined interactions, and the rest color lines mean the predicted interactions

Discussion

In this study, we identified an SNP, i.e., rs73354570, that was significantly associated with the disease status of CTEV in a large Chinese Han population-based sample. To the best of our knowledge, our study is the first to report a significant association between SOX9 and CTEV. SOX9 was not reported to be significant in the first and only previous GWAS that focused on clubfoot, which was performed by Zhang et al. [7]. A GWAS with a stringent P value threshold (5 × 10−8) might miss some true positive hits. Our candidate gene-based study design could avoid this limitation by testing fewer markers of preselected genes. To date, the genes SOX9 and SOX9-AS1 (gene encoding anti-sense RNA1 of SOX9) have been identified to be associated with several traits and disorders, including angiotensin-converting enzyme inhibitor intolerance [18], height [19], liver enzyme levels (gamma-glutamyl transferase) [20], lung function [21], nose morphology [22], and thyroid hormone levels [23]. However, no GWAS has established the connection between SOX9 and CTEV-related disorders. Our bioinformatics analyses have identified that SNP rs73354570 might have significant biological functions in the gene expression of SOX9. This SNP would likely affect the protein binding of its surrounding areas and thus might affect the gene expression of SOX9. However, this information extracted from RegulomeDB could not be replicated by GTEx eQTL data. No significant differences could be found in the gene expression of SOX9 in different genotype groups of SNP rs73354570 in multiple human tissues. Thus, the eQTL data extracted from publicly available databases should be interpreted carefully. Gene expression levels could be very different in target tissues of CTEV patients compared to samples collected in GTEx. Therefore, we believe that it is still too early to reach conclusions on the functional significance of the identified SNPs, although there is a good chance that rs73354570 is a surrogate of another underlying marker or a set of markers. No significant signals have been reported in candidate-based association studies. Wang et al. reported that SOX9 overexpression plays a potential role in idiopathic congenital talipes equinovarus [11]. In the present study, we identified the C allele of rs73354570 to be significantly associated with an increased risk of CTEV. In the future, an eQTL study for rs73354570 on the gene expression of SOX9 based on the target tissue of CTEV patients should be conducted to evaluate the allelic effects on gene expression. With the rapid development of sequencing technology, numerous susceptibility loci contributing to complex diseases have been reported, such as schizophrenia [24-26]. Considering that analyses of only some SNPs are not sufficient to draw conclusions [27-31], we conducted gene-by-environment interaction analyses to investigate potential interactions between our selected SNPs and two environmental factors along with genetic association analyses. However, no significant interaction signals were identified. Nevertheless, we believe that it is not necessary to overinterpret this negative result. Since smoking and alcoholic drinking have been proven to be significantly associated with multiple birth defects [32, 33], it is probable that multiple environmental factors could combine with genetic factors to play a role in the process of CTEV onset. Notably, the most significant interaction signal was obtained from the significant hit of association analysis (rs73354570). Although this signal failed to achieve genome-wide significance, it is still worth investigating this interaction signal further in the future because it might partly explain its single marker-based association signal. A potential limitation of the present study is that smoking and drinking status were recorded as habits in general but not status during pregnancy. Additional studies with appropriately measured environmental factors should be conducted to further investigate the combined effects of both environmental factors and genetic factors on CTEV. Our study has several limitations. The most important factor is the lack of replication. Future studies are needed to replicate our findings regarding SOX9 in Chinese Han and other populations. Moreover, we did not perform any procedures to adjust for population stratification in the study. However, we tried our best to at least partially control this potential confounding factor because geographic location is a good indicator for genetic matching in the Han Chinese population [34, 35]. Another limitation is that we only included SNPs located within gene regions. However, several important regulatory regions are located at up- or downstream regions that are not within gene regions, and a significant portion of the GWAS panel markers cannot be mapped to any gene regions (but are several kb away from the targeted genes). It is very difficult to claim that seven preselected SNPs could represent most of the genetic information of SOX9. In addition, as a candidate gene-based study focusing on the effects of common SNPs, we did not examine any rare or low-frequency variants. However, a recent study showed that low-frequency and rare variants might play an important role in the onset and development of CTEV [36]. Based on Chinese family samples with CTEV, Zhang et al. identified two pathogenic variations from mediator complex subunit 13L (MED13L) and transforming growth factor-β receptor 2 (TGFBR2). Therefore, a sequencing-based study might provide more information about the genetic etiology of CTEV. In this study, we identified potential links between genetic polymorphisms of SOX9 and the risk of CTEV. Our study could improve our understanding of the genetic architecture of CTEV and provide a basis for novel intervention plans. Replication studies involving Chinese Han and other populations are still needed in the future. Additional file 1: Table S1. eQTL signals for SNP rs73354570 on gene SOX9.
  36 in total

Review 1.  Genetics and epidemiology of idiopathic congenital talipes equinovarus.

Authors:  Simon Barker; David Chesney; Zosia Miedzybrodzka; Nicola Maffulli
Journal:  J Pediatr Orthop       Date:  2003 Mar-Apr       Impact factor: 2.324

2.  Haploview: analysis and visualization of LD and haplotype maps.

Authors:  J C Barrett; B Fry; J Maller; M J Daly
Journal:  Bioinformatics       Date:  2004-08-05       Impact factor: 6.937

3.  Leveraging Polygenic Functional Enrichment to Improve GWAS Power.

Authors:  Gleb Kichaev; Gaurav Bhatia; Po-Ru Loh; Steven Gazal; Kathryn Burch; Malika K Freund; Armin Schoech; Bogdan Pasaniuc; Alkes L Price
Journal:  Am J Hum Genet       Date:  2018-12-27       Impact factor: 11.025

4.  Evaluation of the relationships of the WBP1L gene with schizophrenia and the general psychopathology scale based on a case-control study.

Authors:  Fanglin Guan; Tong Ni; Wei Han; Huali Lin; Bo Zhang; Gang Chen; Li Zhu; Dan Liu; Tianxiao Zhang
Journal:  Am J Med Genet B Neuropsychiatr Genet       Date:  2019-12-16       Impact factor: 3.568

5.  Meta-analysis of genome-wide association studies on the intolerance of angiotensin-converting enzyme inhibitors.

Authors:  Seyed H Mahmoudpour; Abirami Veluchamy; Moneeza K Siddiqui; Folkert W Asselbergs; Patrick C Souverein; Catherine E de Keyser; Albert Hofman; Chim C Lang; Alexander S F Doney; Bruno H Stricker; Anthonius de Boer; Anke H Maitland-van der Zee; Colin N A Palmer
Journal:  Pharmacogenet Genomics       Date:  2017-03       Impact factor: 2.089

6.  Two-stage additional evidence support association of common variants in the HDAC3 with the increasing risk of schizophrenia susceptibility.

Authors:  Xiaodi Jia; Tianxiao Zhang; Lu Li; Dongke Fu; Huali Lin; Gang Chen; Xinshe Liu; Fanglin Guan
Journal:  Am J Med Genet B Neuropsychiatr Genet       Date:  2016-08-30       Impact factor: 3.568

Review 7.  Update on medicines for smoking cessation.

Authors:  Mike McDonough
Journal:  Aust Prescr       Date:  2015-08-03

8.  Second-generation PLINK: rising to the challenge of larger and richer datasets.

Authors:  Christopher C Chang; Carson C Chow; Laurent Cam Tellier; Shashaank Vattikuti; Shaun M Purcell; James J Lee
Journal:  Gigascience       Date:  2015-02-25       Impact factor: 6.524

9.  Whole-Genome Sequencing Coupled to Imputation Discovers Genetic Signals for Anthropometric Traits.

Authors:  Ioanna Tachmazidou; Dániel Süveges; Josine L Min; Graham R S Ritchie; Julia Steinberg; Klaudia Walter; Valentina Iotchkova; Jeremy Schwartzentruber; Jie Huang; Yasin Memari; Shane McCarthy; Andrew A Crawford; Cristina Bombieri; Massimiliano Cocca; Aliki-Eleni Farmaki; Tom R Gaunt; Pekka Jousilahti; Marjolein N Kooijman; Benjamin Lehne; Giovanni Malerba; Satu Männistö; Angela Matchan; Carolina Medina-Gomez; Sarah J Metrustry; Abhishek Nag; Ioanna Ntalla; Lavinia Paternoster; Nigel W Rayner; Cinzia Sala; William R Scott; Hashem A Shihab; Lorraine Southam; Beate St Pourcain; Michela Traglia; Katerina Trajanoska; Gialuigi Zaza; Weihua Zhang; María S Artigas; Narinder Bansal; Marianne Benn; Zhongsheng Chen; Petr Danecek; Wei-Yu Lin; Adam Locke; Jian'an Luan; Alisa K Manning; Antonella Mulas; Carlo Sidore; Anne Tybjaerg-Hansen; Anette Varbo; Magdalena Zoledziewska; Chris Finan; Konstantinos Hatzikotoulas; Audrey E Hendricks; John P Kemp; Alireza Moayyeri; Kalliope Panoutsopoulou; Michal Szpak; Scott G Wilson; Michael Boehnke; Francesco Cucca; Emanuele Di Angelantonio; Claudia Langenberg; Cecilia Lindgren; Mark I McCarthy; Andrew P Morris; Børge G Nordestgaard; Robert A Scott; Martin D Tobin; Nicholas J Wareham; Paul Burton; John C Chambers; George Davey Smith; George Dedoussis; Janine F Felix; Oscar H Franco; Giovanni Gambaro; Paolo Gasparini; Christopher J Hammond; Albert Hofman; Vincent W V Jaddoe; Marcus Kleber; Jaspal S Kooner; Markus Perola; Caroline Relton; Susan M Ring; Fernando Rivadeneira; Veikko Salomaa; Timothy D Spector; Oliver Stegle; Daniela Toniolo; André G Uitterlinden; Inês Barroso; Celia M T Greenwood; John R B Perry; Brian R Walker; Adam S Butterworth; Yali Xue; Richard Durbin; Kerrin S Small; Nicole Soranzo; Nicholas J Timpson; Eleftheria Zeggini
Journal:  Am J Hum Genet       Date:  2017-05-25       Impact factor: 11.025

Review 10.  Assessment of cognitive dysfunction in traumatic brain injury patients: a review.

Authors:  Huiyan Sun; Chengliang Luo; Xiping Chen; Luyang Tao
Journal:  Forensic Sci Res       Date:  2017-11-14
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.