Literature DB >> 22208904

Candidate gene analysis of spontaneous preterm delivery: new insights from re-analysis of a case-control study using case-parent triads and control-mother dyads.

Solveig Myking1, Ronny Myhre, Håkon K Gjessing, Nils-Halvdan Morken, Verena Sengpiel, Scott M Williams, Kelli K Ryckman, Per Magnus, Bo Jacobsson.   

Abstract

BACKGROUND: Spontaneous preterm delivery (PTD) has a multifactorial etiology with evidence of a genetic contribution to its pathogenesis. A number of candidate gene case-control studies have been performed on spontaneous PTD, but the results have been inconsistent, and do not fully assess the role of how two genotypes can impact outcome. To elucidate this latter point we re-analyzed data from a previously published case-control candidate gene study, using a case-parent triad design and a hybrid design combining case-parent triads and control-mother dyads. These methods offer a robust approach to genetic association studies for PTD compared to traditional case-control designs.
METHODS: The study participants were obtained from the Norwegian Mother and Child Cohort Study (MoBa). A total of 196 case triads and 211 control dyads were selected for the analysis. A case-parent triad design as well as a hybrid design was used to analyze 1,326 SNPs from 159 candidate genes. We compared our results to those from a previous case-control study on the same samples. Haplotypes were analyzed using a sliding window of three SNPs and a pathway analysis was performed to gain biological insight into the pathophysiology of preterm delivery.
RESULTS: The most consistent significant fetal gene across all analyses was COL5A2. The functionally similar COL5A1 was significant when combining fetal and maternal genotypes. PON1 was significant with analytical approaches for single locus association of fetal genes alone, but was possibly confounded by maternal effects. Focal adhesion (hsa04510), Cell Communication (hsa01430) and ECM receptor interaction (hsa04512) were the most constant significant pathways.
CONCLUSION: This study suggests a fetal association of COL5A2 and a combined fetal-maternal association of COL5A1 with spontaneous PTD. In addition, the pathway analysis implied interactions of genes affecting cell communication and extracellular matrix.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 22208904      PMCID: PMC3260094          DOI: 10.1186/1471-2350-12-174

Source DB:  PubMed          Journal:  BMC Med Genet        ISSN: 1471-2350            Impact factor:   2.103


Background

Preterm delivery (PTD) is defined as delivery occurring before 37 weeks of gestation [1]. In Scandinavian countries PTD rates vary from 5.8% to 6.4% [2]. Children born preterm are at increased risk of neonatal and infant mortality and morbidity. Globally, 28% of neonatal deaths are estimated to be directly attributable to PTD [3]. PTD can be divided into two main groups according to clinical presentation: those with spontaneous onset with either preterm labor (PTL) or preterm prelabor rupture of membranes (pPROM) and those who are delivered due to maternal or fetal complications (e.g. preeclampsia, small for gestational age) [4]. Spontaneous PTD is a common complex condition with no single environmental or genetic factor being completely responsible for its pathogenesis. Known risk factors include infection, inflammation, previous PTD, cigarette smoking, gestational bleeding and low socioeconomic status [5]. Four different pathophysiological pathways have been proposed leading to spontaneous PTD through a common terminal pathway resulting in release of uterotonins and proteases that causes cervical ripening, uterus contractions and membrane rupture [6]. These four pathways are: 1) activation of maternal or fetal hypothalamic pituitary-adrenal (HPA) axis, 2) local or systemic inflammation and infection, 3) decidual hemorrhage and 4) pathological distention of the uterus [6]. Immunological factors, such as abnormal allograft reaction and allergy, have also been hypothesized as possible mechanisms for spontaneous PTD [7]. How each of these putative causal pathways function has been difficult to elucidate. Epidemiological evidence indicates that genetic factors play a significant role in the etiology of spontaneous PTD [8-11]. A number of candidate gene studies, almost exclusively using case-control design, have identified some genes that associate with PTD [12-17]. However, the results have rarely been replicated. Of importance for this phenotype are the possible effects of two genomes, maternal and fetal, and previous studies have implicated one or the other although epidemiological data supports the predominance of the maternal genome. In addition, interactions between maternal and fetal genomes may affect PTD risk. There has also been uncertainty about the role of the paternal genome [8,11,18-20]. In the present study we re-analyzed data from a candidate gene case-control study for spontaneous PTD [21] using a case-parent triad design, which includes information from the paternal genome, and a hybrid design combining case-parent triads and control-mother dyads. Few studies have used either of these designs for PTD and none have done so in combination. These approaches provide several advantages over case-control designs in terms of minimizing potential population stratification (case-parent triad design) and their ability to increase study power (hybrid design) [22,23]. In our study we included the analysis of haplotypes. Haplotypes are in some cases preferable to SNPs, because haplotypes can sometimes capture un-genotyped functional SNPs better than single SNP analyses [24]. We considered fetal and maternal effects separately and in combination. Finally, we examined the distribution of associating variants based on the KEGG pathways in which they exist, to see if particular pathways are over-represented in our associations, thereby providing more biological insight that would not be possible by focusing solely on single genes or SNPs.

Methods

Participants

In a recent case-control candidate genetic association study, fetal and maternal samples from the Norwegian Mother and Child Cohort Study (MoBa) were genotyped at 1,430 SNPs in 140 genes to association with spontaneous PTD [21]. In the current study the same data was used from case and control mother-infant dyads with the addition of paternal samples from case pregnancies. The Norwegian Mother and Child Cohort Study (MoBa) is a pregnancy cohort consisting of more than 107 000 pregnancies recruited from 1999-2008 [25]. The majority of all pregnant women in Norway were invited to participate through a postal invitation in connection with routine ultrasound examination at 17-18 weeks of gestation http://www.fhi.no/morogbarn. The participation rate was around 44% and a written informed consent was obtained from each participant. The MoBa study collected biological specimens from mother, father and offspring and data from questionnaires given to the mother and father. The study is linked to the Medical Birth Registry of Norway (MBRN). MBRN receives medical records from every birth that takes place in Norway after gestational week 16 (after 2002 data is from week 12) [26], and all records from this registry are included in the MoBa study database. In our analyses we used samples derived from Version 2 of the MoBa cohort that included 53,711 pregnancies. Blood samples were collected from the mother and father at the ultrasound screening appointment at the 17th-18th week of gestation [27]. A new blood sample from the mother and a cord blood sample from the child were drawn at delivery. The majority of samples were received at the MoBa Biobank the day after collection and DNA was extracted on the day of receipt as previously described [27]. Selection of cases and controls has been previously described [21]. Briefly, cases were defined as live, singleton spontaneous PTD between 154 and 258 days of gestation (220/7-366/7 weeks) in women aged 20 to 34 years. No exclusion criteria were made for the fathers. Extracted DNA had to be available from the Biobank for both the mother and child for the family to be included. Extracted DNA also had to be available for the case fathers, but not for the controls. Controls were selected according to the same criteria as cases, except for gestational age that was between 273 and 286 days (390/7and 406/7 weeks). Two hundred fifteen control dyads were randomly selected from the eligible dyads. Cases and controls were not matched on any variables. In Version 2 of the MoBa database we identified 203 case-parent triads eligible for the study. Among the case-parent triads, 9 of the fathers did not have available DNA and only the case-mother dyads were used.

Candidate genes, SNP selection and genotyping

Selection of candidate genes was based on previous associations of maternal and fetal genes with spontaneous PTD and are described elsewhere [21]. A total of 1,536 SNPs were selected from 143 candidate genes, but ambiguous placement using the SNPper database http://snpper.chip.org assigned them to 167 genes; the analyses were done using this annotation. Genotyping was performed on the Illumina GoldenGate Assay system http://www.illumina.com/technology/goldengate_genotyping_assay.ilmn.

Data pre-processing

Call-rate, deviations from Hardy-Weinberg equilibrium (HWE) and Mendelian inconsistencies were determined with PLINK http://pngu.mgh.harvard.edu/purcell/plink/[28]. Minor allele frequency (MAF) calculations and additional analyses were performed using HAPLIN http://www.uib.no/smis/gjessing/genetics/software/haplin/[22,29,30]. Of the selected SNPs, 1443 SNPs were successfully genotyped with call-rates greater than 90%. Of these a total of 31 SNPs on the X-chromosome, 18 SNPs that deviated from HWE (p < 0.01) in controls and 68 SNPs with a minor allele frequency of < 5% were excluded from analyses, leaving 1,326 SNPs within 159 genes (Additional file 1 Table S1). Pedigrees assessed for Mendelian inconsistencies were removed if more than 1% of the SNPs showed evidence of such; two case triads and three control dyads were removed based on this criterion. In addition, families were excluded if the mother or the offspring had low call-rates (< 95%). If the father had low call-rate, data from his DNA was excluded from analysis, but the rest of the family remained in the study. The final sample size consisted of 407 fetal samples (196 cases, 211 controls), 407 maternal samples (196 cases, 211 controls) and 186 paternal samples (cases only).

Data analysis

The single locus associations and the haplotype analyses were performed using HAPLIN software. Haplin can analyze case-parent triad data, case-control data and hybrid designs combining data from both case triads and control triads. It uses a full likelihood model, and estimates both population frequencies and relative risks relating to each haplotype [29]. The case-parent triad design has advantages and disadvantages relative to the case-control design [31]. For example, population-based case-control designs may be affected by population stratification, while family-based designs are robust to this [31]. Case-parent triad analyses and hybrid analyses also make it possible to better evaluate the balance of maternal and fetal effects. This is a substantial advantage for phenotypes that have their origins in fetal life and therefore can be influenced by both maternal genetics and the intra-uterine environment [32]. Simply comparing case mothers with control mothers or case children with control children does not account for differential effects of maternal and fetal genotypes. Triad analysis assumes mating symmetry in the population at large, and estimates the effects of maternal and fetal genes simultaneously. However, case-parent triad analyses have slightly less power than case-control studies and cannot estimate exposure effects [31]. The hybrid design combines case-parent triads and control-mother dyads in a joint likelihood model, and thus has a higher power than the case-parent triad and the case-control designs used separately [23,33]. Hybrid analyses may, however, still be vulnerable to the effects of population stratification, though less so than the case-control design [23]. Therefore, we have re-analyzed data from a previous study to assess if: 1) we find evidence for association in the same genes as previously reports, and 2) if new genes can be detected using this family based analysis plan. SNPs and haplotypes were analyzed using the case-parent triad design and a hybrid design combining case-parent triads and control-mother dyads. The analyses were done both by looking at the effect of the fetal genes alone and by combining the effects of fetal and maternal alleles to avoid confounding by maternal genes [34]. In the combined estimation model, separate relative risks for fetal and maternal effects are estimated simultaneously in a joint model, adjusted for each other. The combined p-value refers to a likelihood ratio test comparing a full model including fetal to maternal effects with a null model with no effects whatsoever. In addition, we performed Wald tests to assess whether a second genome contributed significantly to PTD relative to only a single genetic contribution. In addition to calculating p-values for individual SNPs, haplotypes were analyzed using overlapping sliding-windows of three SNPs. Haplotype significance and effect sizes were calculated relative to the most frequent haplotype. A multiplicative gene-dose model was assumed. To control for multiple testing within a gene, a single overall p-value was computed for each gene, using a score test procedure in Haplin [35]. To assess the effect of multiple testing as a whole, QQ-plots were used to plot the observed p-values against p-values expected purely by chance, i.e., p-values drawn from a uniform distribution. Pathway analyses were performed using R http://www.r-project.org/[36]. The pathway analysis aimed at identifying pathways whose genes taken together are more associated with disease than random candidate genes from our study. That is, the criterion for significance of a pathway is that it has more genes associating with PTD than the "background" effects from our candidate genes as opposed to an a priori statistical distribution. This is a more conservative than the null hypothesis of no effects of any of the included genes. Pathways were analyzed using results of case-parent triads and a hybrid design using both case-parent triads and control-mother dyads. The analyses were done using fetal SNPs alone as well as using a combined estimate of fetal and maternal SNPs. Adjusted p-values for genes were matched to respective KEGG pathways using the KEGG_2_snp_b129 annotation http://www.genome.jp/kegg/[37]. Combined pathway-specific p-values were then obtained using a Fisher combination of p-values. That is, the combined p-value for a pathway is computed from a Chi-squared distribution with 2 k degrees of freedom, using -2(log(p1) +... +log(pk)) as the test statistic, where k is the number of genes in the pathway and pi is the p-value for gene in the pathway. The Fisher combination of p-values assumes independence between genes within the same pathway, which may not strictly be the case. We performed 10,000 simulations where the test statistic for a pathway was compared to the simulated test statistics obtained from drawing genes randomly from our study, each time selecting the same number of genes as found in the specific pathway. The resulting simulated pathway p-values were practically identical to the Fisher chi-squared values. In total 212 pathways were assessed (Additional file 2 Table S2).

Ethics approval

Approval for this study was obtained from the Regional Committee for Medical Research and Ethics (S-06075) and the Norwegian Data Inspectorate (05/016784).

Results

As expected from the case definition, cases and controls differed with respect to gestational age and birth weight (Table 1). In addition, there were significantly more primiparous women and women with a previous PTD in the case group than in the control group (Table 1). No other demographic differences existed between cases and controls.
Table 1

Clinical and demographical characteristics

Cases (n = 196)Controls (n = 211)p
Maternal age (years)29 [20-34]30 [21-34]0.135
Gestational age (days)253 [172-258]280 [273-286]< 0.001
Primiparity128 (65.3%)98 (46.4%)< 0.001
Smoking wk 1-17 of pregnancy (%)64 (32.7%)55 (26.1%)0.124
Pre-pregnancy BMI23.03 [14.84-41.26]23.23 [17.30-38.10]0.561
Birth weight (g)2815 [747-4000]3645 [2610-4970]< 0.001
Gender infant, male51.5%42.7%0.073
Paternal age (years)31 [20-46]31 [23-48]0.561
Previous PTD18 (9.2%)3 (1.4%)< 0.001
Gestational bleeding, 2nd trimester28 (15.8%)21 (10.9%)0.161

1 Medians are reported with the range in brackets.

2 Cases are defined as preterm delivery < 37 weeks of gestation, while controls are defined as term delivery from 39+0 to 40+6 weeks of gestation.

3 P-values are calculated by Mann-Whitney U test for continuous variables and chi-square test for categorical variables.

4 Information about pre-pregnancy BMI and smoking in week 1-17 of pregnancy is self reported and collected from MoBa Questionnaire 1.

Clinical and demographical characteristics 1 Medians are reported with the range in brackets. 2 Cases are defined as preterm delivery < 37 weeks of gestation, while controls are defined as term delivery from 39+0 to 40+6 weeks of gestation. 3 P-values are calculated by Mann-Whitney U test for continuous variables and chi-square test for categorical variables. 4 Information about pre-pregnancy BMI and smoking in week 1-17 of pregnancy is self reported and collected from MoBa Questionnaire 1.

Analysis of fetal genes

Significant associations were observed in the fetal analyses. The most significant gene in the case-parent triad approach was COL5A2 (collagen V alpha-2) with p = 0.006 in the single locus analysis and p = 0.002 in the haplotype analysis (Table 2 Figure 1). This gene was also significant in the hybrid analysis (Table 2 Figure 1). Within this gene several SNPs showed evidence of association (Table 3 and 4), as were several haplotypes (Table 5 and 6).
Table 2

Significant fetal genes

Case-parent triad designHybrid design
Single locusHaplotypeSingle locusHaplotype
COL5A20.0060.0020.0340.013
PLG0.0100.0140.3360.530
IGFBP30.011-0.091-
PON10.0220.0530.0050.021
G0S20.028-0.007-
AKAP50.032-0.036-
PTCRA0.0610.0420.0980.112
IL1A0.0640.0900.0140.018
SMCR80.084-0.040-
SLC23A10.0910.0320.3360.091
TCN20.130-0.036-
TFPI0.1310.3170.0010.005
IL40.1500.1860.0470.045
DEFA30.1520.0470.1460.070
CRH0.1570.2690.0720.031
IL10RB0.2750.1930.1560.006
NAT10.2850.0320.5460.118
MMP80.3190.2780.2160.046
CD140.3280.0490.6160.309
TREM10.3870.2240.1890.039
TNFRSF1B0.7700.0450.4580.161
IL4R0.8140.2770.0790.020

1 Correction for multiple testing has been performed within each gene.

Figure 1

Fetal results. QQ-plots showing the most significant fetal genes for the different designs. A) Single locus, case-parent triad. B) Haplotype, case-parent triad. C) Single locus, hybrid. D) Haplotype, hybrid. The observed p-values for each gene are plotted against the expected p-values. If there is no effect of the genes, the p-values will be positioned on the straight line. The grey area along the straight line shows the point-wise confidence interval for the expected p-values. Correction for multiple testing was performed within each gene. There have been done no adjustments for covariates in the hybrid analysis.

Table 3

Significant SNPs in COL5A2 (collagen V alpha-2) and COL5A1 (collagen V alpha-1)

FetalMaternalCombined
genesnpalleleMAFRR95% CIpRR95% CIpoverall p
COL5A2rs3923384G0.0342.291.12, 4.540.0210.830.43, 1.590.5680.048
rs6434322A0.0342.291.12, 4.540.0210.830.43, 1.590.5680.048
rs10165260G0.0342.291.12, 4.540.0210.830.43, 1.590.5680.048
rs7420331G0.1540.470.30, 0.730.0011.090.70, 1.690.7100.003
COL5A1rs4842161C0.4821.320.99, 1.750.0600.750.57, 1.000.0550.024
rs3124932A0.4331.531.14, 2.030.0050.780.58, 1.040.0840.003
rs12005720G0.1370.830.56, 1.220.3431.611.08, 2.400.0180.034
rs3128621A0.4401.491.12, 1.990.0080.700.53, 0.930.0150.001
rs4842167G0.3881.511.13, 2.030.0070.700.52, 0.930.0160.001
rs3811161G0.5090.700.52, 0.930.0121.320.99, 1.750.0600.007
rs3811152G0.0920.590.36, 0.950.0321.721.05, 2.790.0310.008
rs10745387A0.4051.361.02, 1.820.0370.750.56, 1.000.0550.016

Case-parent triad analysis

1 Results are not corrected for multiple testing.

2 A multiplicative model is assumed.

Table 4

Significant SNPs in COL5A2 (collagen V alpha-2) and COL5A1 (collagen V alpha-1)

FetalMaternalCombined
genesnpalleleMAFRR95% CIpRR95% CIpoverall p
COL5A2rs3923384G0.0421.951.14,3.310.0150.760.42, 1.350.3470.055
rs6434322A0.0421.941.14,3.310.0130.760.42, 1.350.3420.055
rs10165260G0.0421.951.14,3.310.0150.760.42, 1.350.3470.055
rs7420331G0.1290.530.35,0.810.0041.280.87, 1.850.1990.008
COL5A1rs3124311C0.5100.770.60, 0.990.0451.060.83, 1.370.6240.130
rs4842157G0.3640.750.57, 0.990.0391.070.81, 1.390.6310.115
rs4842161C0.4581.401.10, 1.800.0080.810.63, 1.030.0850.017
rs3124932A0.4331.521.19, 1.960.00040.780.61, 1.000.0540.002
rs12005720G0.1610.740.53, 1.040.0901.401.01, 1.940.0430.063
rs3128621A0.4411.481.16, 1.900.0010.700.54, 0.900.0060.001
rs4842167G0.4241.371.07, 1.760.0140.640.49, 0.820.0010.001
rs3811161G0.4650.780.61, 1.000.0551.491.16, 1.900.0030.004
rs3811152G0.1030.550.35, 0.860.0091.541.04, 2.270.0330.009
rs10745387A0.4341.250.98, 1,610.0730.700.54, 0.900.0050.011

Hybrid analysis

1 Results are not corrected for multiple testing.

2 A multiplicative model is assumed.

Table 5

Fetal and maternal haplotypes in COL5A2

Fetal haplotypesMaternal haplotypesCombined
MarkersHaplotypefreqRR95% CIpRR95% CIpOverall p
rs6760780-rs3923384-rs6434317T-A-C0.8101.00--1.00--0.006
T-A-A0.1160.550.33, 0.920.0210.970.59, 1.600.920
A-A-A0.0350.330.13, 0.850.0201.500.64, 3.500.351
A-G-A10.0342.111.05, 4.220.0350.840.44, 1.600.594
rs3923384-rs6434317-rs6434322A-C-G0.8101.00--1.00--0.003
A-A-G10.1540.490.31, 0.770.0021.080.70, 1.670.724
G-A-A0.0342.091.03, 4.190.0420.830.44, 1.600.588
rs6434317-rs6434322-rs10165260C-G-A0.8101.00--1.00--0.003
A-G-A10.1540.490.31, 0.760.0021.070.69, 1.650.747
A-A-G0.0342.091.03, 4.180.0400.830.43, 1.600.584
rs6434322-rs10165260-rs7420331G-A-A0.8101.00--1.00--0.003
G-A-G0.1540.490.31, 0.770.0011.080.70, 1.670.726
A-G-A0.0352.091.03, 4.190.0410.830.44, 1.610.592
rs10165260-rs7420331-rs13024858A-A-A0.7621.00--1.00--0.010
A-G-A0.1490.500.31, 0.790.0031.000.64, 1.580.993
A-A-C0.0471.200.58, 2.440.6310.570.27, 1.200.138
G-A-A0.0281.740.72, 4,240.2110.500.21, 1.190.122
G-A-C0.0082.370.71, 7.540.1511.700.56, 5.110.343
rs7420331-rs13024858-rs6752781A-A-A0.7941.00--1.00--0.016
C-A-A0.1510.480.30, 0.770.0021.030.65, 1.600.913
A-C-G0.0531.430.77, 2.630.2590.810.44, 1.480.492

Case-parent triad analysis

1 Haplotype deviates from Hardy-Weinberg equilibrium.

2 Calculation of haplotypes has been performed in Haplin with a sliding window of three SNPs.

3 Haplotypes are not adjusted for multiple testing.

Table 6

Fetal and maternal haplotypes in COL5A2

Fetal haplotypesMaternal haplotypesCombined
markershaplotypefreqRR95% CIpRR95% CIpOverall p
rs6760780-rs3923384-rs6434317T-A-C0.8301.00--1.00--0.013
T-A-A0.0910.650.41, 1.050.0771.200.77, 1.860.419
A-G-A0.0411.821.06, 3.080.0300.780.44, 1.410.413
A-A-A0.0370.330.14, 0.810.0141.480.76, 2.880.244
rs3923384-rs6434317-rs6434322A-C-G0.8281.00--1.00--0.006
A-A-G0.1290.540.36, 0.820.0041.260.87, 1.840.241
G-A-A0.0421.811.06, 3.100.0290.780.43, 1.390.393
rs6434317-rs6434322-rs10165260C-G-A0.8291.00--1.00--0.006
A-G-A0.1290.550.36, 0.830.0061.270.88, 1.840.210
A-A-G0.0421.811.06, 3.100.0260.770.43, 1.400.385
rs6434322-rs10165260-rs7420331G-A-A0.8291.00--1.00--0.006
G-A-G0.1290.540.36, 0.820.0041.260.87, 1.830.232
A-G-A0.0421.801.06, 3.110.0260.780.43, 1.390.392
rs10165260-rs7420331-rs13024858A-A-A0.7921.00--1.00--0.041
A-G-A0.1090.580.37, 0.910.0181.330.90, 1.970.159
A-A-C0.0351.460.78, 2.760.2420.700.35, 1.440.329
G-A-A0.0222.231.12, 4.570.0230.630.28, 1.400.262
A-G-C0.0200.380.10, 1.550.1750.720.23, 2.290.582
G-A-C0.0191.440.65, 3.200.3620.950.42, 2.230.901
rs7420331-rs13024858-rs6752781A-A-A0.8171.00--1.00--0.043
G-A-A0.1090.550.35, 0.860.0091.370.92, 2.050.116
A-C-G0.0531.410.86, 2.380.1870.830.48, 1.420.516
G-C-G0.0200.470.13, 1.810.2770.700.22, 2.220.542

Hybrid analysis

1 Calculation of haplotypes has been performed in Haplin with a sliding window of three SNPs.

2 Haplotypes are not adjusted for multiple testing.

Significant fetal genes 1 Correction for multiple testing has been performed within each gene. Fetal results. QQ-plots showing the most significant fetal genes for the different designs. A) Single locus, case-parent triad. B) Haplotype, case-parent triad. C) Single locus, hybrid. D) Haplotype, hybrid. The observed p-values for each gene are plotted against the expected p-values. If there is no effect of the genes, the p-values will be positioned on the straight line. The grey area along the straight line shows the point-wise confidence interval for the expected p-values. Correction for multiple testing was performed within each gene. There have been done no adjustments for covariates in the hybrid analysis. Significant SNPs in COL5A2 (collagen V alpha-2) and COL5A1 (collagen V alpha-1) Case-parent triad analysis 1 Results are not corrected for multiple testing. 2 A multiplicative model is assumed. Significant SNPs in COL5A2 (collagen V alpha-2) and COL5A1 (collagen V alpha-1) Hybrid analysis 1 Results are not corrected for multiple testing. 2 A multiplicative model is assumed. Fetal and maternal haplotypes in COL5A2 Case-parent triad analysis 1 Haplotype deviates from Hardy-Weinberg equilibrium. 2 Calculation of haplotypes has been performed in Haplin with a sliding window of three SNPs. 3 Haplotypes are not adjusted for multiple testing. Fetal and maternal haplotypes in COL5A2 Hybrid analysis 1 Calculation of haplotypes has been performed in Haplin with a sliding window of three SNPs. 2 Haplotypes are not adjusted for multiple testing. The most significant single locus association was with the G allele at rs7420331. This SNP had a p-value of 0.001 with a relative risk (RR) of 0.47 (confidence interval, (CI): 0.30, 0.73) in the case-parent triad analysis and a p-value of 0.004 and a RR of 0.53 (CI: 0.35, 0.85) in the hybrid analysis, indicating that the G allele protects against spontaneous PTD. The single locus association, rs7420331, also had a significant uncorrected genotypic result in the previous case-control study (p = 0.01) [21]. The other three SNPs in COL5A2 had a p-value of 0.021 and an RR of 2.29 (CI: 1.12, 4.54) in the case-parent triad analysis. In the hybrid analysis the p-value was 0.015 with an RR of 1.95 (CI: 1.14, 3.31). This indicates that these SNPs associate with increased risk of spontaneous PTD, but since they are in strong linkage disequilibrium with each other they cannot be considered independently and most likely tag a single causal variant. In the hybrid analysis the most significant gene was TFPI (tissue factor pathway inhibitor), which also was the most significant fetal gene in the previous case-control study on the same samples [21]. However, this gene was not significant in the case-parent triad analysis, except for one SNP at rs6434222. PON1 (paraoxonase 1) was significant using all analytical approaches, except for the haplotype analysis in the case-parent triad design where it was borderline significant (p = 0.053) (Table 2). Moreover, this gene was found to be significant in fetal samples in the previous published case-control analysis [21]. The most significant SNP in this gene was rs854552 for all three approaches, with the G allele conferring a protective effect against PTD (p = 0.001 in the case-parent triad analysis and p = 0.0003 in the hybrid analysis). The three haplotypes in COL5A2 that were the most significant were equivalent in their association with PTD: G-A-G at rs6434322-rs10165260-rs7420331 (p = 0.001) with an RR of 0.49 (CI: 0.31, 0.77), A-A-G at rs3923384-rs6434317-rs6434322 (RR = 0.49, CI: 0.31, 0.77) and A-G-A at rs6434317-rs6434322-rs10165260 (RR = 0.49, CI: 0.31, 0.77) (Table 7). This indicates that they tag the same associating variant(s).
Table 7

Significant genes when maternal and fetal alleles are combined

Case-parent triad designHybrid design
Single locusHaplotypeSingle locusHaplotype
IGFBP30.010-0.216-
GSTP10.014-0.016-
COL5A10.0180.1040.0140.077
COL5A20.0220.0160.0590.031
MTHFD10.0230.0510.0480.116
PLG0.0250.0490.1210.144
MMP80.0340.0730.0410.008
IL10RA0.0500.0280.0830.044
SLC23A10.0660.0070.5490.215
PON10.0820.1310.0250.120
G0S20.085-0.022-
IL1A0.1870.2760.0480.085
TFPI0.3040.5990.0040.039
SERPINH10.3540.0880.2160.016
TREM10.4590.2680.1060.041
SLC6A40.5250.3460.0320.045
HSPA60.5830.3100.0500.182
IL10RB0.6180.5840.4060.031

1 Correction for multiple testing has been performed within each gene.

Significant genes when maternal and fetal alleles are combined 1 Correction for multiple testing has been performed within each gene.

Combined analysis of fetal and maternal genes

When including maternal effects COL5A2 remained significant in all analyses except the hybrid single locus approach, which was borderline (p = 0.059) (Table 7 Figure 2). In addition a related gene, COL5A1 (collagen V alpha-1), was significant in the single locus analysis both in the case-parent triad approach and in the hybrid approach. When looking at maternal and fetal SNPs in COL5A2 separately (Table 3 and 4), it is evident that the SNPs were significant only for the fetal genotypes, but not the maternal. The combined effect of the fetal and the maternal genotypes is less significant than the fetal gene, indicating that the association with this gene is driven by the fetal genome. The same was true for the haplotypes (Table 5 and 6). COL5A1 on the other hand, showed significance for both fetal and maternal genotypes, but the overall p-value for the gene did not reach significance when considering fetal SNPs alone. When looking at the significant fetal and maternal SNPs in COL5A1 separately and combined it becomes clear that the combined effect for several of the fetal and maternal SNPs are stronger than when considered separately. This implies a combined effect of fetal and maternal alleles. In the case-control analysis [21] one fetal SNP in COL5A2 (rs7420331) and one in COL5A1 were significant in the uncorrected analysis. In the maternal samples, five SNPs were significant in COL5A1. The most significant gene in the hybrid single locus analysis was TFPI (tissue factor plasminogen inhibitor) (Table 2, 5 and 6). In the haplotype analysis the most significant genes were SLC23A1 (Solute carrier family 23 member 1) for the case-parent triad approach and MMP8 (matrix metalloproteinase 8) for the hybrid approach (Table 2).
Figure 2

Combined results for fetal and maternal genes. QQ-plots showing the most significant results for the different designs when the effects of maternal and fetal genes are combined. A) Single locus, case-parent triad. B) Haplotype, case-parent triad. C) Single locus, hybrid. D) Haplotype, hybrid.

Combined results for fetal and maternal genes. QQ-plots showing the most significant results for the different designs when the effects of maternal and fetal genes are combined. A) Single locus, case-parent triad. B) Haplotype, case-parent triad. C) Single locus, hybrid. D) Haplotype, hybrid.

Pathway analysis

We identified several pathways as significantly associating with spontaneous PTD. The most significant fetal pathways were Focal Adhesion (hsa04510), p53 signaling (hsa04115), Cell Communication (hsa01430), and ECM (extracellular matrix) receptor interaction (hsa04512) (Table 8). Looking at the combined effect of maternal and fetal SNPs the most significant pathways were Glutathione metabolism (hsa00480) and Prostate Cancer (hsa05215) (Table 9). Cell Communication, ECM-receptor interaction and Focal Adhesion remained significant when including maternal effects.
Table 8

Significant fetal pathways in spontaneous preterm delivery

Case-parent triadHybrid
KEGG pathwayPathway AnnotationSingle locusHaplotypeSingle locusHaplotype
hsa04510Focal Adhesion0.0170.0190.1030.016
hsa04115p53 signaling pathway0.0220.0100.0540.087
hsa01430Cell Communication0.0240.0200.1070.012
hsa04512ECM-receptor interaction0.0240.0200.1070.012
hsa00361Gamma-Hexachlorocyclohexane0.0590.1430.0160.030
hsa00363Bisphenol A degradation0.0590.1430.0160.030
hsa04330Notch signaling pathway0.0610.0420.0980.112
hsa04210Apoptosis0.1140.2110.0270.127
hsa00680Methane metabolism0.1940.2610.0220.022
hsa04640Hematopoietic cell lineage0.4380.3170.0830.040
hsa04060Cytokine-cytokine receptor interaction0.5000.2100.0800.033
hsa04630Jak-STAT signaling pathway0.5220.1970.0430.013
Table 9

Significant pathways when the effects of maternal and fetal genes are combined

Case-parent triadHybrid
KEGG pathwayPathway AnnotationSingle locusHaplotypeSingle locusHaplotype
hsa00480Gluthatione metabolism0.0140.0140.0160.017
hsa05215Prostate Cancer0.0170.0230.0130.026
hsa00630Glyoxylate and dicarboxylate metabolism0.0230.0510.0480.116
hsa01430Cell Communication0.0250.0810.0290.010
hsa04512ECM receptor interaction0.0250.0810.0290.010
hsa04510Focal Adhesion0.0300.1160.0610.025
hsa00361Gamma Hexachlorocyclohexane0.0360.0840.0440.145
hsa00363Bisphenol A degradation0.0360.0840.0440.145
Significant fetal pathways in spontaneous preterm delivery Significant pathways when the effects of maternal and fetal genes are combined

Discussion

In the present study we presented a re-analysis of previously published data that further elucidated the relative roles of maternal and fetal genomes on spontaneous PTD. In the previous study that used overlapping data, the most significantly associated genes were COL1A2 and PTGER3 in the maternal and TFPI and PON1 in the fetal analyses. We confirmed in our analyses an association with PON1. However, TFPI, which was found in the previous study, was only significant in our hybrid analysis. It is likely that the original finding was due to population stratification, a factor minimized by the family based analyses we used. Using our approach, the most consistent significant gene across all analyses was COL5A2, which is involved in the production of type V collagen. The previous analysis only provided minimal evidence for the association with this gene [21]. COL5A1, which also contributes to the production of type V collagen, was also found to be significant in the single locus analysis when maternal effects were included, and several SNPs were significant when examining maternal and fetal alleles separately (Table 5 and 6). Type V collagen plays a critical role in early fibril initiation and in the determination of fibril structure and matrix organization [38]. Defects in type V collagen due to mutations in COL5A1 and COL5A2 are the cause of the classical type (types I and II) of the heritable connective tissue disorder Ehler-Danlos syndrome that confers an increased risk for PTD if the fetus is affected, especially from pPROM [39-41]. It is therefore reasonable to hypothesize that variations in these genes might be involved in the pathophysiology leading to PTD. However, the results must be interpreted with care, as the QQ-plots shows that the observed p-values do not deviate from what would be expected by chance. Few other studies have tested the association between COL5A2 and the risk of spontaneous PTD. A recent study by Romero et al found an association between rs189683203 in fetal DNA and the risk of pPROM with an unadjusted p-value of 0.021 (odds ratio, (OR) = 1.42, CI: 1.06, 1.92) [17]. In another study on the same study population the authors found an association between rs6750027 in maternal DNA and the risk of PTL with an unadjusted p-value of 0.043 (OR = 1.32, CI: 1.01, 1.74) [13]. Another study by Velez et al found significant associations in three fetal SNPs and one maternal SNP in COL5A2 [12]. PON1 was significantly associated with PTD in both the triad analysis and the hybrid analysis of the fetal genes alone, but not in the combined analysis of maternal and fetal genes. This is most likely because the combined effects of maternal and fetal genes may not reach significance due to reduced power in this type of analysis. None of the maternal SNPs showed an association with spontaneous PTD, while several of the fetal SNPs did. PON1 was found to be significantly associated with PTD in the previously published case-control analysis on the same fetal samples as well, and possible mechanisms of how this gene might contribute to preterm delivery are discussed there [21]. TFPI which was the strongest fetal association in the case-control study [21], showed at best weak association in the triad-analysis, but was the most significant gene in the hybrid design. The most significant SNP in the case-control study, rs6434222 in TFPI, also had a significant unadjusted p-value in the triad analysis (p = 0.02). Overall, the results from the case-parent-triad analysis and the previously performed case-control analysis only overlapped in a few genes. The results from the hybrid analysis lay somewhere in-between the results from the case-parent triad and the case-control study. These differences may indicate that within our study population there was population stratification that could have led to spurious results in the case-control analysis. This is minimized using triads. Although the hybrid analysis has more power than the triad analysis, it may still be affected by population stratification, but to a lesser degree than a case-control design. The case-parent triad analysis is therefore the most reliable in terms of reducing the problem of stratification, and we present these as our most compelling results. Our study also identified several pathways as associating with PTD. Significant results were found in the Focal Adhesion, Cell Communication and ECM-receptor interaction pathways, all of which include COL5A2 and COL5A1, but none of the other associated genes in our study. The ECM-receptor interaction pathway is involved in tissue and organ morphogenesis associated with the bleeding disorders Bernard-Soulier syndrome and Glanzmann thrombasthenia. The Focal adhesion pathway is involved in cell matrix adhesion and also associated with the bleeding disorder Glanzmann thrombasthenia. For the Notch signaling (hsa04330), Gluthatione Metabolism (hsa00480) and Glyoxylate and dicarboxylate metabolism (hsa00630) pathways only one gene was available for inclusion and the results from these pathways must be interpreted with care. Because this was a candidate gene study, the number of included genes and SNPs in each pathway was limited. Nevertheless, those pathways that provide strong evidence of association can probably be taken as truly being involved in spontaneous PTD. The major strength of this study was that we used the case-parent triad design and the hybrid design and compared these results to those of the traditional case-control design. Few other candidate gene studies on PTD have been performed using the case-triad design, which offers protection against bias due to population stratification. Additionally, we performed a hybrid analysis, which has increased statistical power over both the case-triad and the case-control designs. These designs also provide separate estimates of fetal and maternal alleles, as well as an overall p-value estimating the combined effect of maternal and fetal alleles. In this way, confounding through maternal alleles, which can affect the intrauterine environment and thus the phenotype of the fetus, can be avoided. Our study was limited in that the small sample size was small compared to modern GWAS level analyses and in that it was based on a limited number of candidate genes. Also, no covariates were included in the hybrid analysis and we were not able to separate spontaneous PTD into pPROM and PTL at the time of analysis. Another weakness of this study is that we did not have an external replication sample to corroborate our findings. The findings should thus be regarded as exploratory, although the prior plausibility of the genes provides increased confidence in our results.

Conclusion

The results from this study suggest that fetal SNPs and haplotypes in COL5A2 and a combined effect of fetal and maternal SNPs in COL5A1 are associated with spontaneous PTD. The significant pathways, Focal adhesion, Cell communication and ECM receptor interaction all included COL5A2 and COL5A1.

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

All authors planned the study and contributed with interpretation of results and writing of the paper. SM, RM, HKG, SMW, KKR and BJ designed the study and SM, RM, NHM and BJ selected cases and controls. SM, RM and HKG analyzed the data. All authors have read and approved the final manuscript.

Pre-publication history

The pre-publication history for this paper can be accessed here: http://www.biomedcentral.com/1471-2350/12/174/prepub

Additional file 1

Table S1 Single-nucleotide polymorphisms and genes examined. Table showing which single-nucleotide polymorphisms and genes that have been examined in the study. Click here for file

Additional file 2

Table S2 KEGG pathways examined. Table showing which KEGG pathways that have been examined in the study. Click here for file
  38 in total

1.  KEGG: kyoto encyclopedia of genes and genomes.

Authors:  M Kanehisa; S Goto
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

Review 2.  The Medical Birth Registry of Norway. Epidemiological research and surveillance throughout 30 years.

Authors:  L M Irgens
Journal:  Acta Obstet Gynecol Scand       Date:  2000-06       Impact factor: 3.636

3.  Genetic influences on premature parturition in an Australian twin sample.

Authors:  S A Treloar; G A Macones; L E Mitchell; N G Martin
Journal:  Twin Res       Date:  2000-06

4.  Identification of fetal and maternal single nucleotide polymorphisms in candidate genes that predispose to spontaneous preterm labor with intact membranes.

Authors:  Roberto Romero; Digna R Velez Edwards; Juan Pedro Kusanovic; Sonia S Hassan; Shali Mazaki-Tovi; Edi Vaisbuch; Chong Jai Kim; Tinnakorn Chaiworapongsa; Brad D Pearce; Lara A Friel; Jacquelaine Bartlett; Madan Kumar Anant; Benjamin A Salisbury; Gerald F Vovis; Min Seob Lee; Ricardo Gomez; Ernesto Behnke; Enrique Oyarzun; Gerard Tromp; Scott M Williams; Ramkumar Menon
Journal:  Am J Obstet Gynecol       Date:  2010-05       Impact factor: 8.661

5.  Maternal effects for preterm birth: a genetic epidemiologic study of 630,000 families.

Authors:  Anna C Svensson; Sven Sandin; Sven Cnattingius; Marie Reilly; Yudi Pawitan; Christina M Hultman; Paul Lichtenstein
Journal:  Am J Epidemiol       Date:  2009-10-23       Impact factor: 4.897

6.  A genetic association study of maternal and fetal candidate genes that predispose to preterm prelabor rupture of membranes (PROM).

Authors:  Roberto Romero; Lara A Friel; Digna R Velez Edwards; Juan Pedro Kusanovic; Sonia S Hassan; Shali Mazaki-Tovi; Edi Vaisbuch; Chong Jai Kim; Offer Erez; Tinnakorn Chaiworapongsa; Brad D Pearce; Jacquelaine Bartlett; Benjamin A Salisbury; Madan Kumar Anant; Gerald F Vovis; Min Seob Lee; Ricardo Gomez; Ernesto Behnke; Enrique Oyarzun; Gerard Tromp; Scott M Williams; Ramkumar Menon
Journal:  Am J Obstet Gynecol       Date:  2010-07-31       Impact factor: 8.661

7.  Maternal and fetal genetic associations of PTGER3 and PON1 with preterm birth.

Authors:  Kelli K Ryckman; Nils-Halvdan Morken; Marquitta J White; Digna R Velez; Ramkumar Menon; Stephen J Fortunato; Per Magnus; Scott M Williams; Bo Jacobsson
Journal:  PLoS One       Date:  2010-02-03       Impact factor: 3.240

8.  Maternal contributions to preterm delivery.

Authors:  Heather A Boyd; Gry Poulsen; Jan Wohlfahrt; Jeffrey C Murray; Bjarke Feenstra; Mads Melbye
Journal:  Am J Epidemiol       Date:  2009-10-23       Impact factor: 4.897

9.  Spontaneous preterm birth in African Americans is associated with infection and inflammatory response gene variants.

Authors:  Digna R Velez; Stephen Fortunato; Poul Thorsen; Salvatore J Lombardi; Scott M Williams; Ramkumar Menon
Journal:  Am J Obstet Gynecol       Date:  2008-11-18       Impact factor: 8.661

Review 10.  Connective tissue and related disorders and preterm birth: clues to genes contributing to prematurity.

Authors:  E A Anum; L D Hill; A Pandya; J F Strauss
Journal:  Placenta       Date:  2009-01-18       Impact factor: 3.481

View more
  10 in total

Review 1.  Emerging role of toll-like receptors signaling and its regulators in preterm birth: a narrative review.

Authors:  Ge Huang; Dan Yao; Xiaoli Yan; Mingyu Zheng; Ping Yan; Xiaoxia Chen; Dan Wang
Journal:  Arch Gynecol Obstet       Date:  2022-08-02       Impact factor: 2.493

Review 2.  Gene-environment interactions related to maternal exposure to environmental and lifestyle-related chemicals during pregnancy and the resulting adverse fetal growth: a review.

Authors:  Sumitaka Kobayashi; Fumihiro Sata; Reiko Kishi
Journal:  Environ Health Prev Med       Date:  2022       Impact factor: 4.395

3.  Localization of a major susceptibility locus influencing preterm birth.

Authors:  G Chittoor; V S Farook; S Puppala; S P Fowler; J Schneider; T D Dyer; S A Cole; J L Lynch; J E Curran; L Almasy; J W Maccluer; A G Comuzzie; D E Hale; R S Ramamurthy; D J Dudley; E K Moses; R Arya; D M Lehman; C P Jenkinson; B S Bradshaw; R A Defronzo; J Blangero; R Duggirala
Journal:  Mol Hum Reprod       Date:  2013-05-20       Impact factor: 4.025

4.  Association of polymorphisms in natural killer cell-related genes with preterm birth.

Authors:  Quaker E Harmon; Stephanie M Engel; Andrew F Olshan; Thomas Moran; Alison M Stuebe; Jingchun Luo; Michael C Wu; Christy L Avery
Journal:  Am J Epidemiol       Date:  2013-08-27       Impact factor: 4.897

Review 5.  Spontaneous preterm birth: advances toward the discovery of genetic predisposition.

Authors:  Jerome F Strauss; Roberto Romero; Nardhy Gomez-Lopez; Hannah Haymond-Thornburg; Bhavi P Modi; Maria E Teves; Laurel N Pearson; Timothy P York; Harvey A Schenkein
Journal:  Am J Obstet Gynecol       Date:  2017-12-14       Impact factor: 8.661

6.  Polymorphisms in NR5A2, gene encoding liver receptor homolog-1 are associated with preterm birth.

Authors:  Dinushan C Kaluarachchi; Allison M Momany; Tamara D Busch; Lucas G Gimenez; Cesar Saleme; Viviana Cosentino; Kaare Christensen; John M Dagle; Kelli K Ryckman; Jeffrey C Murray
Journal:  Pediatr Res       Date:  2016-01-13       Impact factor: 3.756

Review 7.  Spontaneous preterm birth and single nucleotide gene polymorphisms: a recent update.

Authors:  Ishfaq A Sheikh; Ejaz Ahmad; Mohammad S Jamal; Mohd Rehan; Mourad Assidi; Iftikhar A Tayubi; Samera F AlBasri; Osama S Bajouh; Rola F Turki; Adel M Abuzenadah; Ghazi A Damanhouri; Mohd A Beg; Mohammed Al-Qahtani
Journal:  BMC Genomics       Date:  2016-10-17       Impact factor: 3.969

8.  Rare mutations and potentially damaging missense variants in genes encoding fibrillar collagens and proteins involved in their production are candidates for risk for preterm premature rupture of membranes.

Authors:  Bhavi P Modi; Maria E Teves; Laurel N Pearson; Hardik I Parikh; Piya Chaemsaithong; Nihar U Sheth; Timothy P York; Roberto Romero; Jerome F Strauss
Journal:  PLoS One       Date:  2017-03-27       Impact factor: 3.240

9.  A potential novel spontaneous preterm birth gene, AR, identified by linkage and association analysis of X chromosomal markers.

Authors:  Minna K Karjalainen; Johanna M Huusko; Johanna Ulvila; Jenni Sotkasiira; Aino Luukkonen; Kari Teramo; Jevon Plunkett; Verneri Anttila; Aarno Palotie; Ritva Haataja; Louis J Muglia; Mikko Hallman
Journal:  PLoS One       Date:  2012-12-05       Impact factor: 3.240

10.  Genetic landscape of preterm birth due to cervical insufficiency: Comprehensive gene analysis and patient next-generation sequencing data interpretation.

Authors:  Ludmila Volozonoka; Dmitrijs Rots; Inga Kempa; Anna Kornete; Dace Rezeberga; Linda Gailite; Anna Miskova
Journal:  PLoS One       Date:  2020-03-26       Impact factor: 3.240

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.