| Literature DB >> 28877031 |
Ge Zhang1, Bjarke Feenstra1, Jonas Bacelis1, Xueping Liu1, Lisa M Muglia1, Julius Juodakis1, Daniel E Miller1, Nadia Litterman1, Pan-Pan Jiang1, Laura Russell1, David A Hinds1, Youna Hu1, Matthew T Weirauch1, Xiaoting Chen1, Arun R Chavan1, Günter P Wagner1, Mihaela Pavličev1, Mauris C Nnamani1, Jamie Maziarz1, Minna K Karjalainen1, Mika Rämet1, Verena Sengpiel1, Frank Geller1, Heather A Boyd1, Aarno Palotie1, Allison Momany1, Bruce Bedell1, Kelli K Ryckman1, Johanna M Huusko1, Carmy R Forney1, Leah C Kottyan1, Mikko Hallman1, Kari Teramo1, Ellen A Nohr1, George Davey Smith1, Mads Melbye1, Bo Jacobsson1, Louis J Muglia1.
Abstract
BACKGROUND: Despite evidence that genetic factors contribute to the duration of gestation and the risk of preterm birth, robust associations with genetic variants have not been identified. We used large data sets that included the gestational duration to determine possible genetic associations.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28877031 PMCID: PMC5561422 DOI: 10.1056/NEJMoa1612665
Source DB: PubMed Journal: N Engl J Med ISSN: 0028-4793 Impact factor: 91.245
Figure 1. Manhattan plots of discovery stage genomewide-associated results.Top: gestational length as quantitative trait; bottom: preterm birth as dichotomous trait. Regions reached genome wide significance (P < 5×10-8) and suggestive significance (P < 1×10-6) were highlighted in red and orange respectively. The six replicated loci were highlighted in bold.
For each locus, the most significant SNP in discovery stage (index SNP) and the most significant SNP in replication stage are shown.
| No | Char | Genes | SNP Information | Discovery Phase | Replication Phase | Joint analysis | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| rs | pos | A/B | Freq | Eff |
| Rank |
| Freq | Eff |
| Directions |
| |||
|
| |||||||||||||||
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |||
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |||
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |||
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |||
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |||
| 6 | 2 |
| rs4853012 | 74361290 | G/A | 0.141 | -0.920 | 1.1E-07 | 1 | 0.145 | -0.355 | 0.42 | -++ | 1.6E-07 | |
|
|
|
|
|
|
|
|
|
|
|
|
|
| |||
| 7 | 9 |
| rs717267 | 16408826 | G/A | 0.399 | -0.637 | 1.7E-07 | 1 | 0.423 | -0.12 | 0.70 | +-+ | 5.3E-07 | |
|
|
|
|
|
|
|
|
|
|
|
|
|
| |||
| 8 | 1 |
| rs4658267 | 92240753 | C/A | 0.319 | 0.679 | 1.9E-07 | 1 | 0.319 | 0.0562 | 0.87 | -++ | 8.7E-07 | |
|
|
|
|
|
|
|
|
|
|
|
|
|
| |||
| 9 | 9 |
| rs182704 |
|
|
|
|
|
|
|
|
|
|
| |
| 10 | 6 |
| rs2532929 | 30897774 | A/G | 0.397 | -0.622 | 4.2E-07 | 1 | 0.402 | -0.0486 | 0.88 | +-+ | 1.8E-06 | |
|
|
|
|
|
|
|
|
|
|
|
|
|
| |||
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
| 12 | 10 |
| rs2253165 | 28337017 | A/G | 0.440 | -0.594 | 9.3E-07 | 1 | 0.429 | 0.454 | 0.15 | +-- | 4.5E-05 | |
|
|
|
|
|
|
|
|
|
|
|
|
|
| |||
|
| |||||||||||||||
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |||
| 2 | 3 |
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
| |||
| 3 | 17 |
| rs7217780 | 15191024 | T/C | 0.336 | 1.15 | 3.5E-07 | 1 | 0.341 | 1.09 | 0.025 | +++ | 4.9E-08 | |
|
|
|
|
|
|
|
|
|
|
|
|
|
| |||
| 4 | 19 |
| rs11466328 | 41851042 |
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
| |||
For each suggestive locus (P < 1×10-6, discovery stage), the SNP showing the strongest association in the replication stage is shown below the index SNP (the most significant SNP in discovery stage). Only SNPs with P < 1×10-6 (discovery stage) and their close proxies (r 2 > 0.8) were tested for replication. Replicated regions are highlighted in bold.
For each region, the gene closest to the index SNP was shown.
SNP positions were based on GRCh37/hg19. Alleles were given based on positive strand of reference genome. Allele B is used as the reference allele for frequency and effect.
For gestational length, effect is unstandardized regression coefficient, which shows the estimated changes in gestational days per allele (B). For preterm birth, effect is the estimated odds ratio of the reference allele (B).
Discovery stage P-values were adjusted by inflation factors. The replication stage P-values were calculated from the inflation adjusted effect sizes and standard error of the three Nordic studies using fixed-effect meta-analysis. Joint-analysis P-values were calculated from 23andMe and combined Nodic studies using the inverse variance method.
Directions represent whether the effects observed in the three Nordic studies (FIN/MoBa/DNBC) are same (+) or different (-) from the effects estimated from the 23andMe discovery cohort.
For each locus, the rank (based on the P-value in discovery stage) of the most significant SNP in replication stage (show in italic) together with the r 2 with the index SNP was provided. The r 2 was estimated from haplotype data of the Phase 1 1000 Genomes EUR samples.
Figure 2. ESR1 binding at the a. The rs3820282 T allele creates a stronger ESR1 binding site. The ESR1 binding motif ‘sequence logo’ (taken from the CisBP web server) illustrates the DNA binding preferences of ESR1. Tall nucleotides above the X-axis indicate DNA bases preferred by ESR1. Bases below the X-axis are disfavored. The sequence located in the WNT4 promoter is shown below, with the T allele for rs3820282 shown at the bottom. Note that the T allele changes the sequence from C (most disfavored) to T (most preferred). b. rs3820282 overlaps ATAC-seq and H3K4me3 signals in decidual stromal cells at the WNT4 locus. The red vertical line indicates the position of rs3820282. The location of the WNT4 gene is depicted at the bottom. Tall blocks indicate exons, medium height blocks indicate UTRs, and thin lines indicate introns. Arrows within introns indicate the direction of transcription. c. Experimental validation of allele-dependent binding of ESR1 to rs3820282 by electrophoretic mobility shift assay (EMSA). Fluorescently-labeled rs3820282 probe with either the C or T allele was incubated with nuclear extracts of decidual stromal endometrial cells in the presence or absence of purified ESR1 and/or antibody against ESR1. Lane pairs indicate C and T alleles. Preferential binding of ESR1 to the T allele is observed through an increased signal intensity of the shifted band. Upper and lower arrows indicate the locations of supershifted and shifted bands, respectively. Left to right - Lanes 1+2: negative control lanes containing only oligos; Lanes 3+4: increased binding of purified ESR1 to the T allele. Lanes 5+6: limited binding in the presence of nuclear extract only, due to low expression of ESR1 in these cells; Lanes 7+8: substantial allelic binding is detected with the addition of purified ESR1 to the nuclear extract; Lanes 9+10: Supershift using an ESR1 antibody