| Literature DB >> 35396391 |
Jiafen Gong1, Gengming He1,2, Cheng Wang1, Claire Bartlett3, Naim Panjwani1, Scott Mastromatteo1, Fan Lin1, Katherine Keenan1,3, Julie Avolio3, Anat Halevy1, Michelle Shaw3, Mohsen Esmaeili1, Guillaume Côté-Maurais4, Damien Adam4,5, Stéphanie Bégin4, Candice Bjornson6, Mark Chilvers7, Joe Reisman8, April Price9, Michael Parkins10, Richard van Wylick11, Yves Berthiaume5, Lara Bilodeau12, Dimas Mateos-Corral13, Daniel Hughes13, Mary J Smith14, Nancy Morrison15, Janna Brusky16, Elizabeth Tullis17, Anne L Stephenson17, Bradley S Quon18, Pearce Wilcox18, Winnie M Leung19, Melinda Solomon20,21, Lei Sun2,22, Emmanuelle Brochiero4,5, Theo J Moraes3,20, Tanja Gonska3,23, Felix Ratjen3,21, Johanna M Rommens1,24, Lisa J Strug25,26,27,28,29.
Abstract
Over 400 variants in the cystic fibrosis (CF) transmembrane conductance regulator (CFTR) are CF-causing. CFTR modulators target variants to improve lung function, but marked variability in response exists and current therapies do not address all CF-causing variants highlighting unmet needs. Alternative epithelial ion channel/transporters such as SLC26A9 could compensate for CFTR dysfunction, providing therapeutic targets that may benefit all individuals with CF. We investigate the relationship between rs7512462, a marker of SLC26A9 activity, and lung function pre- and post-treatment with CFTR modulators in Canadian and US CF cohorts, in the general population, and in those with chronic obstructive pulmonary disease (COPD). Rs7512462 CC genotype is associated with greater lung function in CF individuals with minimal function variants (for which there are currently no approved therapies; p = 0.008); and for gating (p = 0.033) and p.Phe508del/ p.Phe508del (p = 0.006) genotypes upon treatment with CFTR modulators. In parallel, human nasal epithelia with CC and p.Phe508del/p.Phe508del after Ussing chamber analysis of a combination of approved and experimental modulator treatments show greater CFTR function (p = 0.0022). Beyond CF, rs7512462 is associated with peak expiratory flow in a meta-analysis of the UK Biobank and Spirometa Consortium (p = 2.74 × 10-44) and provides p = 0.0891 in an analysis of COPD case-control status in the UK Biobank defined by spirometry. These findings support SLC26A9 as a therapeutic target to improve lung function for all people with CF and in individuals with other obstructive lung diseases.Entities:
Year: 2022 PMID: 35396391 PMCID: PMC8993824 DOI: 10.1038/s41525-022-00299-9
Source DB: PubMed Journal: NPJ Genom Med ISSN: 2056-7944 Impact factor: 8.617
Fig. 1Assessment of the association of rs7512462 in the Canadian CF Gene Modifier Study (CGMS) population broken down according to particular sub-study groups.
Sample sizes indicated in parentheses represent the overlap with those published in Strug et al.[31]. MF minimal function variants; HNE human nasal epithelial. *This analysis is combined with the US PROSPECT cohort.
Characteristics of participants included for response to CFTR modulators from the CGMS and US PROSPECT studies.
| Participants on CFTR modulator | CGMS participants | US PROSPECT | |||
|---|---|---|---|---|---|
| Participant characteristics | Characteristic values | CGMS IVA (In Strug et al.[ | CGMS IVA (new, | CGMS LUM/IVA ( | LUM/IVA ( |
| rs7512462 | TT/TC/CC | 11/10/1 | 12/9/2 | 41/48/15 | 30/50/11 |
| age | mean (range) | 26.5 (6.1–58.3) | 31.3 (14.0–55.3) | 25.7 (10.5–55) | 20.9 (6–57) |
| age < = 12 | 5 | 0 | 2 | 23 | |
| age (12,20] | 4 | 3 | 38 | 27 | |
| age (20,30] | 3 | 11 | 38 | 24 | |
| age > 30 | 10 | 9 | 26 | 17 | |
| sex | female (%) | 16 (72.7%) | 13 (56.3%) | 56 (53.8%) | 52 (57.1%) |
| FEV1pp baseline | mean (range) | 66.6 (30.7–90.4) | 58.9 (30.9–89.6) | 59.6 (30.8–95.5) | 75.7 (31.7–95.8) |
| Number of post-treatment FEV1pp in 400 days | median (range) | 4 (1–6) | 4 (1–20) | 6 (1–33) | 4 (2–4) |
aone sample with FEV1pp baseline outside the inclusion criteria and another who received a lung transplant were removed.
The lung function measure used for the CFTR modulator study is forced expiratory volume in one second percent predicted (FEV1pp). A subset of n = 22 individuals from CGMS on IVA were included for analysis in Strug et al.[31] while n = 23 are newly-recruited into the CGMS.
Fig. 2Participant lung function or lung treatment response categorized by rs7512462 genotype.
a Boxplot includes an overlay with a stripchart of lung function from CGMS participants prior to modulator treatment, measured as Saknorm across different CFTR mutation groups. Saknorm[5,35] is calculated with FEV1 measurements taken prior to modulator treatment, if applicable. b Boxplot includes an overlay with a stripchart for treatment response in patients on IVA (n = 45), LUM/IVA in PROSPECT cohort (n = 91) or LUM/IVA in combined PROSPECT and CGMS samples (n = 195). Following Strug et al.[31], the treatment response for IVA is defined as the difference between the mean post-treatment FEV1pp within 15 to 400 days and FEV1pp baseline. Response for LUM/IVA is defined as the difference in FEV1pp between the first post treatment within 5 to 7 months to the baseline[65] (Methods). The center line, upper and lower box limits of boxplots correspond to the median, the third quartile (Q3) and the first quartiles (Q1), respectively; whiskers reflect the 1.5x interquartile range (IQR), i.e., bottom whisker is Q1-1.5xIQR and upper whisker is Q3 + 1.5 x IQR. Each dot represents an individual measurement.
Association between Saknorm and rs7512462 using additive and recessive codings in the CGMS prior to treatment.
| Models | Covariate | Effect size | S.E. | |
|---|---|---|---|---|
| Additive | rs7512462_C | 0.072 | 0.030 | 0.017 |
| CFTR-Gating/Other | 0.210 | 0.094 | 0.025 | |
| CFTR-MF/MF | –0.099 | 0.099 | 0.317 | |
| Saknorm new reference equation | –0.244 | 0.050 | 1.41 × 10−6 | |
| Recessive | rs7512462_CC | 0.130 | 0.057 | 0.022 |
| CFTR-Gating/Other | 0.208 | 0.093 | 0.026 | |
| CFTR-MF/MF | –0.101 | 0.099 | 0.307 | |
| Saknorm new reference equation | –0.244 | 0.050 | 1.32 × 10−6 |
Saknorm is a continuous FEV1- based CF-specific percentile that is normalized and accounts for age, sex and height and is survival adjusted[5,35]. Saknorm[5,35] is calculated with FEV1 measurements taken prior to modulator treatment, if applicable. The combined samples include individuals with gating mutations (n = 89), individuals homozygous for Phe508del (n = 1266) or those with two minimal function (MF) mutations (n = 63). All analyses use the robust covariance matrix estimates by the R package ‘rms’[66]. In addition to rs7512462, both models include a CFTR mutation group indicator, with reference group as homozygous Phe508del. Saknorm new reference equation is an indicator for the reference equation used to calculate Saknorm (Methods) depending on the year the lung function measurement was taken. Individuals with a gating variant, individuals who are homozygous Phe508del or those who carry two MF alleles are included.
Association of rs7512462 with CFTR modulator response.
| Studies | Covariates | Effect Size | S.E. | |
|---|---|---|---|---|
| IVA(CGMS) | Age at baseline | –0.077 | 0.095 | 0.419 |
| FEV1pp at baseline | –0.025 | 0.073 | 0.731 | |
| rs7512462_CC | 9.905 | 4.477 | 0.033 | |
| Early CGMS Cohort | 5.336 | 2.606 | 0.047 | |
| LUM/IVA (US PROSPECT) | Age at baseline | –0.188 | 0.074 | 0.014 |
| FEV1pp at baseline | –0.123 | 0.053 | 0.024 | |
| rs7512462_CC | 8.518 | 2.657 | 0.002 | |
| LUM/IVA (US PROSPECT + CGMS) | Age at baseline | –0.119 | 0.064 | 0.066 |
| FEV1pp at baseline | –0.069 | 0.037 | 0.066 | |
| rs7512462_CC | 5.217 | 1.871 | 0.006 | |
| PROSPECT Cohort | 0.865 | 1.401 | 0.540 |
CFTR modulator response is the difference between the FEV1pp after the modulator response and the value at last visit before the treatment initiation within 3 months as defined in Methods. Increased Δ FEV1pp indicates better treatment response. Sample sizes in each CFTR modulator treatment group are n = 45 for IVA (CGMS), n = 91 for LUM/IVA (US PROSPECT) and n = 195 for the combined LUM/IVA analysis of the US PROSPECT and CGMS participants (US PROSPECT + CGMS). Early CGMS Cohort for IVA-CGMS reflects an indicator for those participants that were included in an earlier published study[31]. LUM/IVA studies were also adjusted for population structure by principal components (PCs; n = 7 PCs for US PROSPECT and n = 4 for US PROSPECT + CGMS). PROSPECT Cohort is an indicator of whether the individual was a participant in the US PROSPECT study or the CGMS. All results use robust variance estimates.
Fig. 3Boxplot overlayed with stripchart for CFTR function by rs7512462 genotype in HNE cultured to P2.
The CFTR function is defined as the difference in ΔIeq-forskolin from treatment with VX-770+VX-809 (left; n = 36) or VX-770+VX-809+amplifier (right; n = 30) to DMSO vehicle in CFTR-mediated current in cultured HNE from participants homozygous for Phe508del. More negative measurements reflect greater CFTR function. The centerline, upper and lower box limits of boxplots correspond to the median, the third quartile (Q3) and the first quartiles (Q1), respectively; whiskers reflect the 1.5x interquartile range (IQR), i.e., bottom whisker is Q1-1.5xIQR and upper whisker is Q3 + 1.5 x IQR. Each dot represents an individual measurement.
Association of rs7512462 with change in function to modulator treatment.
| Studies | Covariates | Effect Size | S.E. | |
|---|---|---|---|---|
| CFTR-mediated current (VX-770+VX-809) | rs7512462_CC | −0.406 | 0.417 | 0.3375 |
| Standard media | −1.969 | 0.456 | 0.0001 | |
| CFTR-mediated current (VX-770+VX-809+ amplifier) | rs7512462_CC | −1.934 | 0.572 | 0.0022 |
| Standard media | −2.170 | 0.697 | 0.0044 |
Forskolin-stimulated currents mediated by CFTR (measured as change in current after application of forskolin, ΔIeq-forskolin) is used as the treatment response; more negative values correspond to more CFTR activity. Functional study includes HNEs from individuals homozygous for Phe508del with VX-770+VX-809 (n = 36) and VX-770+VX-809+ amplifier (n = 30; a subset of the 36 samples tested with 770+VX-809) measured as difference in CFTR-mediated current after applying forskolin regressed on rs7512462 with a recessive coding. All results use robust variance estimates.
Fig. 4CFTR and SLC26A9 single-cell expression from the Human Lung Cell Atlas[40].
a Lung cell expression profiles from three individuals were clustered by nearest-neighbor and visualized using uniform manifold approximation and projection (UMAP). Normalized and log-scaled RNA counts for CFTR and SLC26A9 are plotted for each cell. Tissue compartment as defined by the Human Lung Cell Atlas is shown; expression of both CTFR and SLC26A9 is primarily restricted to epithelial cells, b UMAP clustering of epithelial lung cells only; cell types were annotated in the Human Lung Cell Atlas. c Breakdown of CFTR and SLC26A9 expression by cell type. Cells were grouped as having CFTR read count > 1 (red), SLC26A9 > 1 (blue) or co-expressing both (purple). Cell total varies and is annotated for each cell type.
Co-expression between SLC26A9 and CFTR using the Human Lung Cell Atlas data[40].
| Cell Type | Number of cells | Zero-inflated negative binomial model | Spearman’s correlation | |||
|---|---|---|---|---|---|---|
| Effect size | S.E. | Rho | ||||
| Club | 895 | −0.00007 | 0.00125 | 0.9564 | 0.05173 | 0.12196 |
| AT2a | 760 | 0.00035 | 0.00013 | 0.0097 | 0.13215 | 0.00026 |
| Basal | 240 | −11.6529 | 94.6542 | 0.9020 | −0.02365 | 0.71544 |
| AT1a | 421 | −8.41119 | 152.863 | 0.9561 | −0.01178 | 0.80962 |
| AT2-sa | 201 | 0.00143 | 0.00122 | 0.2433 | 0.28686 | 0.00004 |
| Differentiating Basal | 43 | −0.02385 | 0.01931 | 0.2168 | 0.07235 | 0.64478 |
| AT2 and AT2-s | 961 | 0.00041 | 0.00014 | 0.0036 | 0.16567 | <0.0001 |
| AT1, AT2 and AT2-s | 1382 | 0.00048 | 0.00015 | 0.0018 | 0.32261 | <0.0001 |
The effect size estimate from zero-inflated negative binomial model indicates log-ratio relations between CFTR and SLC26A9 expression. Values greater than zero correspond to positive co-expression. Spearman’s correlation estimates (rho) were calculated using log-transformed read counts per million measures for CFTR and SLC26A9. The Spearman’s correlation and the zero-inflated negative binomial model analysis were performed by R function cor.test() and the zeroinfl() function in the package pscl[90], respectively. All p-values indicate two-sided statistical test results.
aAT2 Alveolar Epithelial Type 2; AT1 Alveolar Epithelial Type 1; AT2-s Signaling Alveolar Epithelial Type 2
The 10 phenotypes with smallest rs7512462 association p-value obtained from the GWAS ATLAS[44].
| PMID | Domain | Trait | Effect Size | Colocalization | ||
|---|---|---|---|---|---|---|
| 30804560 | Respiratory | PEF (UKBB | 0.038 | 4.33 × 1046 | 321,047 | 2.24 x 10−9 |
| 30804560 | Respiratory | PEF (Meta of UKBB and Spirometa) | 0.037 | 2.74 × 10−44 | 345,265 | 2.82 × 10−8 |
| 30807572 | Gastrointestinal | Meconium ileus in CF | −0.289 | 1.86 × 10−9 | 6770 | NA |
| 28436984 | Reproduction | Age at menarche | −0.021 | 3.93 × 10−6 | 252,514 | 3.63 × 10−8 |
| 30804560 | Respiratory | FEV1/FVC ratio (Meta of UKBB and Spirometa) | 0.01 | 4.11 × 10−5 | 400,102 | 6.03 × 10−9 |
| 30804560 | Respiratory | FEV1/FVC ratio (UKBB) | 0.01 | 1.44 × 10−4 | 321,047 | 2.24 × 10−8 |
| 23754948 | Metabolic | Waist circumference (male) | 0.03 | 2.2 × 10−4 | 36,231 | 7.08 × 10−9 |
| 25231870 | Reproduction | Age at menarche | −0.023 | 3.2 × 10−4 | 132,989 | 7.41 × 10−9 |
| 19430480 | Endocrine | Type 1 Diabetes | NA | 7.47 × 10−4 | 7982 | NA |
| 23754948 | Metabolic | Waist-hip ratio (male) | 0.028 | 7.7 × 10−4 | 34,629 | 1.78 × 10−8 |
The colocalization p-value assesses evidence for common genetic contributions between the corresponding study phenotype and the meconium ileus CF GWAS[1] at chr1:205,895,000-205,921,000 locus. The effect allele in all studies is C, and the other allele is T, except for type 1 diabetes, where the effect size is not reported for the SNP in the original summary statistics from the listed publication (PMID). PheWAS significance level is p < 1.05 × 10−5 after adjusting for 4756 GWAS in the database (alpha 0.05). UKBB refers to an analysis using the UK Biobank. ‘Colocalization P-value’ represents the p-values from the colocalization analysis calculated here with the CF GWAS summary statistics in Gong et al.[1] for the corresponding phenotype, using LocusFocus[46] calculated on the chr1: 205,895,000-205,925,000 region in hg19. NA in this column reflects a lack of information available to carry out the colocalization analysis.
Fig. 5Lung function measures colocalize with CF meconium ileus GWAS at the SLC26A9 locus.
Visualization of colocalization using LocusFocus[46]. Peak expiratory flow (PEF) GWAS summary statistics (n = 345,265) from Shrine et al.[33] are shown as circles with corresponding left y-axis against lines that trace (1) GTEx v7 adult lung eQTLs for SLC26A9; (2) meconium ileus GWAS[1] summary statistics; (3) FEV1pp GWAS summary statistics from UKBB participants (N = 263,461; 22,071 of which have COPD as per GOLD level 2-4 definitions); (4) FEV1/FVC ratio GWAS summary statistics from Shrine et al.[33] (highest effective sample size is 396,686, a function of sample size (N = 400,102) times imputation quality); and (5) Chronic obstructive pulmonary disease (COPD) case-control association summary statistics (22,071 cases and 241,390 controls). The choice of y-axis scales and the primary dataset is for visualization purposes only and does not influence colocalization test result conclusions. There is a known gap on the human reference GRCh37 at chr1:205,922,708-206,072,707, which corresponds to the white gap in the figure as no variants exist. Lines traverse the lowest p-values per window as a visual guide (window size = 6.67 Kbp) with corresponding right y-axis. A total of 85 SNPs in this region were used to test for colocalization using the Simple Sum method[1,45]. The Simple Sum colocalization test[1,45] is implemented in the grey region, which was selected to match the observed peak at chr1:205,899,000-205,925,000. The color of the circles reflects the pairwise linkage disequilibrium (LD) with the rs7512462 lead SNP (purple) using the European subset of the 1000 Genomes Project, with varying strength of LD depicted with a different colour as denoted in the legend.