| Literature DB >> 31092817 |
John A Lees1,2, Bart Ferwerda3, Philip H C Kremer3, Nicole E Wheeler2,4, Mercedes Valls Serón3, Nicholas J Croucher5, Rebecca A Gladstone2, Hester J Bootsma6, Nynke Y Rots6, Alienke J Wijmega-Monsuur6, Elisabeth A M Sanders6,7, Krzysztof Trzciński7, Anne L Wyllie7,8, Aeilko H Zwinderman9, Leonard H van den Berg10, Wouter van Rheenen10, Jan H Veldink10, Zitta B Harboe11, Lene F Lundbo12, Lisette C P G M de Groot13, Natasja M van Schoor14, Nathalie van der Velde15,16, Lars H Ängquist17, Thorkild I A Sørensen18,19, Ellen A Nohr20, Alexander J Mentzer21, Tara C Mills21, Julian C Knight21, Mignon du Plessis22, Susan Nzenze22, Jeffrey N Weiser1, Julian Parkhill2, Shabir Madhi23, Thomas Benfield12, Anne von Gottberg22,23, Arie van der Ende24,25, Matthijs C Brouwer3, Jeffrey C Barrett2,26, Stephen D Bentley27, Diederik van de Beek28.
Abstract
Streptococcus pneumoniae is a common nasopharyngeal colonizer, but can also cause life-threatening invasive diseases such as empyema, bacteremia and meningitis. Genetic variation of host and pathogen is known to play a role in invasive pneumococcal disease, though to what extent is unknown. In a genome-wide association study of human and pathogen we show that human variation explains almost half of variation in susceptibility to pneumococcal meningitis and one-third of variation in severity, identifying variants in CCDC33 associated with susceptibility. Pneumococcal genetic variation explains a large amount of invasive potential (70%), but has no effect on severity. Serotype alone is insufficient to explain invasiveness, suggesting other pneumococcal factors are involved in progression to invasive disease. We identify pneumococcal genes involved in invasiveness including pspC and zmpD, and perform a human-bacteria interaction analysis. These genes are potential candidates for the development of more broadly-acting pneumococcal vaccines.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31092817 PMCID: PMC6520353 DOI: 10.1038/s41467-019-09976-3
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Glossary of terms
| Term | Meaning |
|---|---|
| CSF | Cerebrospinal fluid |
| IPD | Invasive pneumococcal disease |
| GWAS | Genome-wide association study |
| hGWAS | Host genome-wide association study |
| pGWAS | Pathogen genome-wide association study |
|
| Heritability (variation in phenotype explained by genetic variation) |
| OR | Odds ratio |
| LD | Linkage disequilibrium |
| MAF | Minor allele frequency (of the least common allele observed in the population) |
| AF | Allele frequency (presence of gene or versus reference allele) |
| LoF | Loss of function (frameshift or nonsense mutation in protein) |
| SFS | Site frequency spectrum (histogram of minor allele frequencies) |
| LMM | Linear mixed model (association model used in genome-wide association study) |
| eQTL | Expression quantitative trait loci (genetic association with transcript variation) |
| LD | Linkage disequilibrium |
Fig. 1Overview of cohorts sequenced and associations performed. Left, host data; right, bacterial data; the centre represents samples with both host and pathogen sequence data. Samples in green are those collected from our MeninGene cohort that form the centre of this work. Owing to unbalanced case–control ratios, we show the effective sample size, specific numbers of cases and controls of human genotypes in Supplementary table 2
Fig. 2Burden of rare variation between invasive and carriage isolates, based on mapping and calling short variants against a single reference genome. Loss-of-function (LoF) are frameshift or nonsense mutations. a The site frequency spectrum (SFS) stratified by niche and by predicted consequence. Frequency has been normalized with respect to the number of samples in each population. b Histogram of Tajima’s D for all coding sequences in the genome, stratified by niche. c Boxplot of the number of rare variants per sample, stratified by niche and predicted consequence. Damaging mutations are LoF mutations and missense mutations predicted damaging by PROVEAN. Centre line is the median, box spans lower to upper quartiles. Whiskers show the outlier range, defined as being >1.5× the interquartile range above or below the lower and upper quartiles
Fig. 3Phylogenetic tree of all samples included in the pathogen genome-wide association study. Rings show metadata about samples, from inside to outside: phenotype (carriage or invasive); cohort (Netherlands or South Africa); common serotypes; patient age on a continuous scale from younger (white) to older (blue). Scale bar: 0.01 substitutions per site. An interactive version is available at https://microreact.org/project/Spn_GWAS/9eb0bd5d (project link https://microreact.org/project/Spn_GWAS)
Signals of bacterial association in a pooled analysis
| Gene ID | Gene name | Gene population frequency | Method | OR/effect size | Function | |
|---|---|---|---|---|---|---|
|
|
|
|
|
|
| |
|
|
|
|
|
|
| |
|
|
|
|
|
|
| |
|
|
|
|
|
|
| |
| SPN23F10590 |
| 0.53 | Burden of rare LoF in invasive | 1.42 | <1 × 10–10 | Unknown; paralogous to IgA1 protease ( |
| SPN23F09820 |
| 0.36 | Missense burden in invasive | 1.30 | 3.4 × 10–49 | Bacteriocin precursor |
| SPN23F05670 |
| 1.00 | Missense burden in invasive | 1.27 | 4.3 × 10–47 | Histidine triad family protein (nucleotide phosphatase) |
| SPN23F04740 |
| 1.00 | Missense burden in invasive | 1.15 | 1.4 × 10–8 | ABC transporter ATPase |
| SPN23F11460 |
| — | Missense burden in invasive | 1.13 | 1.2 × 10–6 | Endonuclease |
Combining The Netherlands (meningitis only) and South African (all IPD cases) cohorts into a single dataset and performing a pGWAS with cohort as a covariate. Table shows genes significant (after applying a Bonferroni correction for the number of tests and association methods p < 0.05) in a pooled analysis of both cohorts with any of the association approaches, ordered by p-value. Odds ratios are with respect to carriage samples. The four genes in bold at top of the table are immunogenic and have previous evidence for association with virulence. For Tajima’s D, the effect size is the difference between D values, and for k-mers and LoF burden tests it is the odds ratio. For some p values, the calculation only allows an upper bound to be produced. The locus tag in the ATCC 700669 reference is listed, along with the common gene name if available
OR odds ratio
Meta-analysis of signals of bacterial association
| Gene ID | Gene name | OR (NL) | OR (SA) | Combined | ||
|---|---|---|---|---|---|---|
| SPN23F05680 |
| 1.39 | 2.8 × 10–19 | 1.13 | 9.6 × 10–8 | 5.4 × 10–21 |
| SPN23F22240 |
| 1.00 | 8.1 × 10–1 | 1.12 | 4.7 × 10–12 | 5.3 × 10–11 |
| SPN23F08080 |
| 0.82 | 1.9 × 10–4 | 1.09 | 4.2 × 10–6 | 7.5 × 10–2 |
| SPN23F17820 |
| −2.27 (invasive) | <1 × 10–6 | −2.61 (invasive) | 4.5 × 10–4 | <1 × 10–6 |
| SPN23F10590 |
| 1.17 | 8.6 × 10–13 | 1.12 | 2.5 × 10–5 | 8.4 × 10–14 |
| SPN23F09820 |
| 1.37 | 5.0 × 10–23 | 1.30 | 1.9 × 10–33 | 3.4 × 10–54 |
| SPN23F05670 |
| 1.47 | 1.8 × 10–30 | 1.22 | 2.9 × 10–25 | 8.3 × 10–51 |
| SPN23F04740 |
| 1.20 | 1.8 × 10–7 | 1.11 | 9.7 × 10–4 | 1.8 × 10–8 |
| SPN23F11460 |
| 1.45 | 3.7 × 10–8 | 1.11 | 9.5 × 10–5 | 1.2 × 10–6 |
As in Table 2, showing behaviour of signals in The Netherlands (NL) and South African (SA) cohorts individually. The p value is from meta-analysis of the two cohorts using METAL to compute a combined z value, so is different from the p value in Table 2, which was computed by applying an LMM to all samples
OR odds ratio
Human SNP heritability (h2SNP) of meningitis in the MeninGene cohort
| Phenotype | Method | Heritability | Error | |
|---|---|---|---|---|
| Susceptibility (pneumococcal) | GCTA | 0.25 | 0.05 | 2.4 × 10–6 |
| LDAK | 0.29 | 0.07 | 3.9 × 10–6 | |
| Severity (any species) | GCTA | 0.29 | 0.11 | 2.8 × 10–5 |
| LDAK | 0.49 | 0.14 | 1.4 × 10–4 |
Heritability for susceptibility and severity of meningitis in Dutch adults. Heritabilities are shown on the liability scale (adjusted for population prevalence and case ascertainment ratio). We used two methods for each phenotype, GCTA and LDAK. The latter corrects for linkage disequilibrium when estimating the kinship between genotypes. All results showed significant (p < 0.05) evidence for a heritability above zero
Signals of human association in the MeninGene cohort
| Phenotype | Position (SNP) | Marker | Effect allele | MAF | OR | Annotation | |
|---|---|---|---|---|---|---|---|
| Susceptibility (pneumococcal meningitis only) | chr6:117624549 | (rs210967) | G | 0.46 | 0.77 | 8.8 × 10–7 | |
| chr18:48403560 | (rs2850542) | T | 0.43 | 0.65 | 7.6 × 10–8 | ||
| chr22:47506160 | (rs13057743) | G | 0.33 | 0.74 | 5.5 × 10–7 | ||
| Severity (any species) | chr1:64680775 | (rs12081070) | A | 0.43 | 1.62 | 2.0 × 10–8 | |
| chr4:182823804 | (rs2309554) | A | 0.33 | 1.58 | 4.1 × 10–7 | ||
| chr9:37382231 | (rs72739603) | A | 0.07 | 2.36 | 6.7 × 10–7 |
We report the lead SNP at each putatively associated locus with MAF > 5% and p < 1 × 10–6, and nearby annotated genes. p < 5 × 10–8 is the genome-wide significance threshold—only rs12081070 exceeds this. The suggestive signal in all meningitis cases at rs3870369 was also present when restricted to pneumococcal cases, albeit with a lower p value of 3.9 × 10–7
MAF minor allele frequency, OR odds ratio, SNP single-nucleotide polymorphism, TSS transcription start site