| Literature DB >> 30001343 |
Johanna M Huusko1,2, Minna K Karjalainen1, Britney E Graham3, Ge Zhang2, Emily G Farrow4, Neil A Miller4, Bo Jacobsson5, Haley R Eidem6, Jeffrey C Murray7, Bruce Bedell7, Patrick Breheny8, Noah W Brown9, Frans L Bødker10, Nadia K Litterman11, Pan-Pan Jiang11, Laura Russell11, David A Hinds11, Youna Hu11, Antonis Rokas6, Kari Teramo12, Kaare Christensen10, Scott M Williams3, Mika Rämet1, Stephen F Kingsmore13, Kelli K Ryckman9, Mikko Hallman1, Louis J Muglia2.
Abstract
Preterm birth is a leading cause of morbidity and mortality in infants. Genetic and environmental factors play a role in the susceptibility to preterm birth, but despite many investigations, the genetic basis for preterm birth remain largely unknown. Our objective was to identify rare, possibly damaging, nucleotide variants in mothers from families with recurrent spontaneous preterm births (SPTB). DNA samples from 17 Finnish mothers who delivered at least one infant preterm were subjected to whole exome sequencing. All mothers were of northern Finnish origin and were from seven multiplex families. Additional replication samples of European origin consisted of 93 Danish sister pairs (and two sister triads), all with a history of a preterm delivery. Rare exonic variants (frequency <1%) were analyzed to identify genes and pathways likely to affect SPTB susceptibility. We identified rare, possibly damaging, variants in genes that were common to multiple affected individuals. The glucocorticoid receptor signaling pathway was the most significant (p<1.7e-8) with genes containing these variants in a subgroup of ten Finnish mothers, each having had 2-4 SPTBs. This pathway was replicated among the Danish sister pairs. A gene in this pathway, heat shock protein family A (Hsp70) member 1 like (HSPA1L), contains two likely damaging missense alleles that were found in four different Finnish families. One of the variants (rs34620296) had a higher frequency in cases compared to controls (0.0025 vs. 0.0010, p = 0.002) in a large preterm birth genome-wide association study (GWAS) consisting of mothers of general European ancestry. Sister pairs in replication samples also shared rare, likely damaging HSPA1L variants. Furthermore, in silico analysis predicted an additional phosphorylation site generated by rs34620296 that could potentially affect chaperone activity or HSPA1L protein stability. Finally, in vitro functional experiment showed a link between HSPA1L activity and decidualization. In conclusion, rare, likely damaging, variants in HSPA1L were observed in multiple families with recurrent SPTB.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30001343 PMCID: PMC6042692 DOI: 10.1371/journal.pgen.1007394
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Fig 1Overview of the whole exome sequencing study workflow.
Abbreviations used QC; quality control, MAF; minor allele frequency, RD; read depth, GQ; genotype quality.
Most significant pathway results for Finnish mothers with recurrent SPTB (n = 10).
| Pathway name | Pathway p-value | No. of genes | No. of variants | Genes within the pathway identified by WES |
|---|---|---|---|---|
| Glucocorticoid receptor signaling | 1.673E-8 | 13 | 36 | |
| Estrogen receptor signaling | 5.067E-7 | 8 | 27 | |
| AMPK signaling | 9.804E-7 | 10 | 28 |
1P-value is calculated using a Fisher’s Exact Test based on the distinct sets of genes at the currently-selected filter cascade step, and those that are known to be associated with a given pathway. P-values <0.05 indicate that there are significantly more genes within the pathway than expected by chance.
Pathway results for Danish sister pairs (n = 93); showing only genes related to the Discovery findings.
| Rank | Pathway name | No. of Families | Average p-value | Median p-value | Gene (no. of families) |
|---|---|---|---|---|---|
| 7 | Estrogen receptor signaling | 79 | 0.00073 | 0.00004 | |
| 10 | Glucocorticoid receptor signaling | 75 | 0.00124 | 0.00034 |
HSPA1L variants in the large GWAS data and their association to SPTB.
| Rs# | Chr | Position | Alleles | Protein change | MAF in EA | GWAS | GWAS |
|---|---|---|---|---|---|---|---|
| rs34620296 | 6 | 31778948 | C/T | Ala268Thr | 0.0014 | 0.0025/0.0010 | 0.0022 |
| rs150472288 | 6 | 31779089 | C/T | Val221Ile | 0.0001 | Not present | - |
| rs482145 | 6 | 31778314 | A/G | Thr479Met | 0.0000 | 0.0000/0.0002 | NS |
| rs139193421 | 6 | 31779723 | T/C | Ile9Met | 0.0002 | Not present | - |
1position in GRCh37.p13.
2Frequency for European American (EA) population according to ESP6500: NHLBI GO Exome Sequencing Project (ESP), Exome Variant Server (http://evs.gs.washington.edu/EVS/).
3GWAS set including over 40,000 mothers. Association to SPTB; mother investigated as affected i.e. giving birth preterm.
Functional predictions for rare missense HSPA1L variants.
| CADD Score | RegulomeDB | HaploReg v4.1 | ||||
|---|---|---|---|---|---|---|
| SIFT | PolyPhen2 | Chromatin State: (Example tissue/cell line) | Tissue/cell line: (Active chromatin state | DNAse | ||
| rs34620296 | 29.7 | Damaging | Probably damaging | Active TSS | HeLa-S3 (a, d, e, f); Primary T cell subsets | NA |
| rs150472288 | 26.4 | Damaging | Probably Damaging | Strong transcription (e.g. T cell subtypes | Primary T cell subsets | Ovary |
| rs482145 | 28.1 | Damaging | Probably damaging | Active TSS (Foreskin Fibroblast Primary Cells) Strong transcription (e.g. T cell subtypes | Foreskin Fibroblast Primary Cells (a, b, c, d, e); Primary T cell subsets5 from peripheral blood (c, d, e) | NA |
| rs139193421 | NA | Damaging | Possibly damaging | NA | Strong transcription enhancers, Genic enhancers, Enhancers and histone marks (c, d, e) in Primary T cell subsets | Psoas Muscle |
1CADD Score >20 = variant is amongst top 1% of deleterious variants in human genome
2In silico functional effect predictions: SIFT and PolyPhen-2 annotated by Varseq
3a:Active TSS (TssA), active transcription start site, b:Poised Promoter (PromP), c:H3K4me1_Enhancer, d:H3K4me3_Promoter, e:H3K27ac_Enhancer, f:H3K9ac_Promoter
4HeLa-S3 = HeLa-S3 Cervical Carcinoma Cell Line
5Primary T cell subsets included e.g. effector/memory, regulatory, helper (memory and naïve), CD8+ cells.
Fig 2Effects of HSPA1L variants on protein sequence and structure.
(A) HSPA1L is a 641-amino acid protein that consist of two major functional domains; an N-terminal nucleotide-binding domain and a C-terminal substrate-binding domain, that are connected with interdomain linker. Locations of four HSPA1L variants from whole exome sequencing (Discovery and Replication findings) are shown in the protein sequence. Purple diamonds represent ATP nucleotide-binding sites at positions 14−17, 204−206, 270−277 and 341−344. Green circles denote the additional phosphorylation site generated by Ala268Thr and the existing one T267-p. (B) In silico comparison of higher order assembly of reference and modified (Ala268Thr) HSPA1L protein models containing an ADP molecule. Overlayed reference and Ala268Thr molecules are presented as gold and light blue rounded ribbon structures, respectively. The interacting ADP molecule is shown as a stick model. (C) Closeup view of the intermolecular contact interface of HSPA1L bound to the ADP molecule. Key interacting residues (amino acids) are shown and their corresponding side chains are presented as stick molecules; nitrogen and oxygen atoms are indicated in blue and red, respectively. All interacting residues; THR16, TYR17, GLU270, LYS273, ARG274, SER277 (not shown in figure) and ASP368, that bind to the ADP ligand showed small changes in the chemical bond lengths. Only changes of ≥0.002Å are shown in the figure (+/- lenght of Ala268Thr structure in relation to reference structure).
Fig 3HSPA1L and GR protein levels in decidualized human endometrial stromal fibroblasts.
Cultured ESFs were transfected with WT or Ala268Thr HSPA1L-pcDNA3.1 constructs or with empty pcDNA3.1 vector (control). Cells were treated with decidualization media supplemented with 100nM dexamethasone (glucocorticoids) for 72h. Both cytosolic and nuclear protein were extracted, and HSPA1L and GR protein levels were measured by Western blot. Band intensity of HSPA1L or GR was normalized to band intensity of the corresponding β-actin. Cytosolic (A) and nuclear (B) HSPA1L levels as well as cytosolic (C) and nuclear (D) GR levels are shown for control (empty vector), WT and Ala268Thr sample groups. Each experiment was performed as triplicates in three different passages (n = 9 each group, except n = 8 for nuclear control group) and bars represent mean + SEM. Significant p-value <0.05 is presented with an asterisk. Cytosolic GR levels were significantly higher (p = 0.04) in the WT compared to the Ala268Thr group as well as in the WT compared to the control group (p = 0.04).