| Literature DB >> 30807572 |
Jiafen Gong1, Fan Wang1,2, Bowei Xiao1, Naim Panjwani1, Fan Lin1, Katherine Keenan3, Julie Avolio4, Mohsen Esmaeili1, Lin Zhang1,2, Gengming He1,5, David Soave6,7, Scott Mastromatteo1, Zeynep Baskurt1, Sangook Kim1,5, Wanda K O'Neal8, Deepika Polineni8,9, Scott M Blackman10, Harriet Corvol11,12, Garry R Cutting10,13, Mitchell Drumm14,15, Michael R Knowles8, Johanna M Rommens1,16, Lei Sun2,5, Lisa J Strug1,2,5,17.
Abstract
Cystic Fibrosis (CF) exhibits morbidity in several organs, including progressive lung disease in all patients and intestinal obstruction at birth (meconium ileus) in ~15%. Individuals with the same causal CFTR mutations show variable disease presentation which is partly attributed to modifier genes. With >6,500 participants from the International CF Gene Modifier Consortium, genome-wide association investigation identified a new modifier locus for meconium ileus encompassing ATP12A on chromosome 13 (min p = 3.83x10(-10)); replicated loci encompassing SLC6A14 on chromosome X and SLC26A9 on chromosome 1, (min p<2.2x10(-16), 2.81x10(-11), respectively); and replicated a suggestive locus on chromosome 7 near PRSS1 (min p = 2.55x10(-7)). PRSS1 is exclusively expressed in the exocrine pancreas and was previously associated with non-CF pancreatitis with functional characterization demonstrating impact on PRSS1 gene expression. We thus asked whether the other meconium ileus modifier loci impact gene expression and in which organ. We developed and applied a colocalization framework called the Simple Sum (SS) that integrates regulatory and genetic association information, and also contrasts colocalization evidence across tissues or genes. The associated modifier loci colocalized with expression quantitative trait loci (eQTLs) for ATP12A (p = 3.35x10(-8)), SLC6A14 (p = 1.12x10(-10)) and SLC26A9 (p = 4.48x10(-5)) in the pancreas, even though meconium ileus manifests in the intestine. The meconium ileus susceptibility locus on chromosome X appeared shifted in location from a previously identified locus for CF lung disease severity. Using the SS we integrated the lung disease association locus with eQTLs from nasal epithelia of 63 CF participants and demonstrated evidence of colocalization with airway-specific regulation of SLC6A14 (p = 2.3x10(-4)). Cystic Fibrosis is realizing the promise of personalized medicine, and identification of the contributing organ and understanding of tissue specificity for a gene modifier is essential for the next phase of personalizing therapeutic strategies.Entities:
Mesh:
Substances:
Year: 2019 PMID: 30807572 PMCID: PMC6407791 DOI: 10.1371/journal.pgen.1008007
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Characteristics of CF participants included in the genome-wide study of meconium ileus from the International CF Gene Modifier Consortium (GMC).
The four consortium sites are Canadian CF Gene Modifier Study (CGMS), UNC/Case Western Reserve Modifier Study (UNC/Case), Johns Hopkins University CF Twin and Sibling Study (JHU), and French Gene Modifier Study (FGMS). Additional sample details stratified by genotyping platform are provided in S1 Table.
| Consortium Site | Participants | Female % | meconium ileus | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Phe508del/ | Phe508del/ | Yes | No | %F | |||||
| Phe508del | other | ||||||||
| Phase I-NA | Phase II-NA | Mega-all | % | % | |||||
| CGMS | 1519 | 338 | 1857 | 46.3 | 61.1 | 31.3 | 310 | 1547 | 53.2 |
| UNC/Case | 1303 | 602 | 1905 | 45.7 | 78.4 | 19.1 | 326 | 1579 | 46.6 |
| JHU | 1042 | 700 | 1742 | 47.4 | 56.2 | 37.9 | 320 | 1422 | 47.2 |
| FGMS | - | - | 1266 | 48.7 | 58.1 | 35.8 | 159 | 1107 | 54.7 |
| Total | 3864 | 1640 | 6770 | 46.8 | 64.2 | 30.4 | 1115 | 5655 | 49.8 |
aAll patients from the four study sites had two severe CFTR mutations associated with pancreatic insufficiency, and all patients clustered within 6 S.D. of the Hapmap (v3) [29] Utah residents with Northern and Western European ancestry from the CEPH collection (CEU) and Toscani in Italia (TSI) in a principal component analysis (PCA).
b%F indicates percentage of female participants.
cp-value tests whether the proportion of females differ from 0.5
Fig 1Manhattan plot for the genome-wide association study with meconium ileus in the GMC sample of 6,770 individuals with CF.
All variants with MAF>1% are included in the analysis. The solid horizontal line corresponds to the genome-wide significance threshold of p = 5x10-8 [30], and the dotted horizontal line is the false discovery rate control threshold of q = 0.05 (equivalent to p<7.67x10-7) [31]. Accuracy of p-value calculation was up to 2.2x10-16 by the geeglm function [100] in R. Variants rs4077468 and rs7512462 identified in previous CF gene modifier studies also noted.
Fig 2Overlay of the p-values of individual SNPs from the meconium ileus GWAS and GTEx eQTL association and the enhancer and promoter states from the REMC at the three genome-wide significant loci.
All the p-values are on the–log10 scale with the left y-axis representing the GWAS (with the filled diamond for the SNP with the lowest p-value and colored dots indicating the LD with this SNP) and the right y-axis representing the GTEx eQTL results (version 7; with colored lines indicated for different tissues) for SLC6A14 (A), SLC26A9 (B), and ATP12A (C). Note that eQTLs for SLC6A14 in the pancreas and ATP12A in the transverse colon were not mapped by GTEx v7, but were calculated and included here—refer to Material and Methods, GTEx Data section for details. To display the eQTL data, each physical region shown was divided into 25 equal windows and lines were drawn for each tissue by connecting the minimum GTEx eQTL p-value within each window for that gene. The absence of a line indicates insufficient gene expression for that particular tissue (see GTEx Data section for details on expression threshold). In the lower panels, the 5-mark 15-state observed chromHMM enhancer (orange) and promoter (red) states within DNase hypersensitive sites are shown (Material and Methods) from adult pancreas (Panc), fetal small intestine (FIS) and adult small intestine (SI).
Top ranked variants by p-value from each genome-wide significant locus in the GMC sample of 6,770 individuals with CF.
The first three rows correspond to loci with SNPs that reached genome-wide significance and the bottom three rows correspond to loci with SNPs that exceeded false discovery rate (FDR) control threshold of q = 0.05 (equivalent to p<7.67x10-7). The top SNP in the SLC6A14 region remains the same at rs3788766 as in [12], while the top SNP at the SLC26A9 locus is now rs7549173. However all significant loci reported at the SLC26A9 locus in [12] are in linkage disequilibrium with rs7549173 (S3 Table), including rs7512462 which has been used as an instrumental variable in a Mendelian Randomization analysis of CF-related diabetes and shown to associate with exocrine pancreas disease [15] and CFTR-directed therapeutics response [2]. The variants in the ATP12A and PRSS1 regions in the previous study [12] are provided in S3 Table. The chromosome 5 and 20 variants are new findings and were not previously identified in [12].
| Chr | BP | SNPs | Nearest gene | Risk Allele | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| OR | P | OR | P | OR | P | OR | P | OR | P | |||||
| X | 115566839 | rs3788766 | A | 1.44 | <2.2x10-16 | 1.40 | 2.21x10−11 | 1.53 | 4.15x10−14 | 1.33 | 2.98x10−4 | 1.42 | 1.50x10−3 | |
| 1 | 205906897 | rs7549173 | C | 1.37 | 2.81x10−11 | 1.41 | 3.37x10−9 | 1.40 | 8.88x10−8 | 1.37 | 5.80x10−4 | 1.24 | 0.076 | |
| 13 | 25282819 | rs61948108 | T | 1.55 | 3.83x10-10 | 1.70 | 5.69x10−10 | 1.54 | 1.42x10−6 | 1.48 | 4.62x10−3 | 1.78 | 2.50x10−3 | |
| 7 | 142455538 | rs3757377 | T | 1.29 | 2.55x10-7 | 1.31 | 5.55x10−6 | 1.34 | 3.88x10−6 | 1.28 | 8.43x10−3 | 1.12 | 0.4 | |
| 5 | 33471002 | rs139816984 | AAAAAAAAT | 1.35 | 3.01x10-7 | 1.30 | 1.93x10−4 | 1.33 | 1.85x10−4 | 1.32 | 0.012 | 1.58 | 6.80x10−3 | |
| 20 | 48835331 | rs2094716 | A | 1.30 | 4.92x10-7 | 1.28 | 8.90x10−5 | 1.26 | 1.10x10−3 | 1.29 | 9.30x10−3 | 1.48 | 3.80x10−3 | |
1Mega-all: all GMC CF participants (n = 6,770)
2Mega-DFDF: individuals homozygous for the Phe508del mutation (n = 4,343)
3Phase I-NA: the sample (n = 3,864) analyzed previously in [12], although the current analysis differs by inclusion of additional variants available through genome-wide imputation with the hybrid reference panel
4Phase II-NA: continued North American recruitment (n = 1,640)
5FGMS: individuals from the French CF Gene Modifier Study (n = 1,266)
Fig 3SLC6A14 meconium ileus and lung association and gene expression profiles point to cis-regulation that is distinct in pancreas and lung.
(A) Overlay of p-values (on the–log10 scale) from the meconium ileus GWAS (red/yellow palette of colored dots; this study), from the previous lung function GWAS (green/blue palette of colored dots; [41]) and from the GTEx eQTL for association between SNPs and SLC6A14 expression (lines, derived as for Fig 2) for tissues of interest. (B) and (C) Slc6a14 gene expression levels measured by RT-PCR of RNA from C57BL6/J mice (n = 4) from pancreas (B) and lung (C); expression is shown relative to Gusb at gestational (E) or post-natal (P) days indicated.
Results of Simple Sum colocalization and contrasting colocalization analyses for the three meconium ileus genome-wide significant loci, and colocalization posterior probabilities from COLOC and eCAVIAR.
The eQTL evidence is the GTEx p-value based on the -log10(eQTL p) scale for a specified gene and tissue. The meconium ileus-SNP association evidence is the -distributed Wald test statistic obtained in this study. Analytical (and permutation-based) Simple Sum (SS) colocalization p-values correspond to each individual tissue and evaluates if the eQTLs and meconium ileus-associated variants colocalize. Simple Sum contrasting (SSC) colocalization p-value evaluates if the eQTLs in the pancreas colocalize with meconium ileus-associated variants more than eQTLs in another tissue; NAs are listed for the pancreas since we do not contrast pancreas with itself. All p-values are one-sided because only positive association implies eQTL-association colocalization (i.e. eQTL peaks coincide with association peaks). COLOC [43] PP represents the posterior probability of eQTL-association colocalization calculated for each individual tissue. eCAVIAR [44] calculates SNP-level colocalization posterior probability (CLPP) for each variant in a locus for a given gene and in a given tissue. The regional colocalization probability (RCP) can be defined by summation of the SNP-level CLPP in a given LD block; RCP is recommended in [49] and the results are provided in the ‘eCAVIAR RCP’ column. eCAVIAR maximum regional CLPP provides similar qualitative results as eCAVIAR RCP thus are not shown here.
| Gene | Tissue | SS Analytical | SS Permutation | SSC | COLOC | eCAVIAR |
|---|---|---|---|---|---|---|
| SLC6A14 | Pancreas | 1.12x10-10 | 0 | NA | 0.9966 | 0.4916 |
| Esophagus | 1 | 1 | 9.97x10-9 | 0.0191 | 0.0035 | |
| Transverse Colon | 9.49x10-9 | 0 | 7.65x10-9 | 0.3755 | 0.0213 | |
| Stomach | 0.983 | 0.983 | 7.64x10-8 | 0.0265 | 0.0035 | |
| Lung | 1 | 1 | 7.68x10-9 | 0.014 | 0.0039 | |
| SLC26A9 | Pancreas | 4.48x10-5 | 2.00x10-5 | NA | 0.6926 | 0.0172 |
| Esophagus | 0.0301 | 0.0303 | 5.47x10-5 | 0.0064 | 0.002 | |
| Stomach | 1 | 1 | 5.73x10-6 | 0.0275 | 0.0046 | |
| Lung | 0.641 | 0.642 | 4.05x10-7 | 0.0241 | 0.006 | |
| ATP12A | Pancreas | 3.35x10-8 | < 10−5 | NA | 0.285 | 0.0188 |
| Transverse Colon | 1 | 1 | 1.67x10-9 | 0.06 | 0.0017 | |
| Esophagus | 1 | 1 | 8.85x10-9 | 0.0474 | 0.0026 | |
| Lung | 1 | 1 | 1.86x10-8 | 0.0372 | 0.0028 |