| Literature DB >> 34832643 |
Michael L McHenry1, Eddie M Wampande2, Moses L Joloba2, LaShaunda L Malone3, Harriet Mayanja-Kizza4, William S Bush1, W Henry Boom3, Scott M Williams1, Catherine M Stein1,2.
Abstract
Tuberculosis (TB) remains a major public health threat globally, especially in sub-Saharan Africa. Both human and Mycobacterium tuberculosis (MTBC) genetic variation affect TB outcomes, but few studies have examined if and how the two genomes interact to affect disease. We hypothesize that long-term coexistence between human genomes and MTBC lineages modulates disease to affect its severity. We examined this hypothesis in our TB household contact study in Kampala, Uganda, in which we identified three MTBC lineages, of which one, L4.6-Uganda, is clearly derived and hence recent. We quantified TB severity using the Bandim TBscore and examined the interaction between MTBC lineage and human single-nucleotide polymorphisms (SNPs) genome-wide, in two independent cohorts of TB cases (n = 149 and n = 127). We found a significant interaction between an SNP in PPIAP2 and the Uganda lineage (combined p = 4 × 10-8). PPIAP2 is a pseudogene that is highly expressed in immune cells. Pathway and eQTL analyses indicated potential roles between coevolving SNPs and cellular replication and metabolism as well as platelet aggregation and coagulation. This finding provides further evidence that host-pathogen interactions affect clinical presentation differently than host and pathogen genetic variation independently, and that human-MTBC coevolution is likely to explain patterns of disease severity.Entities:
Keywords: M. tuberculosis; M. tuberculosis–human coevolution; lineage-based GWAS; population genetics; tuberculosis severity
Year: 2021 PMID: 34832643 PMCID: PMC8617877 DOI: 10.3390/pathogens10111487
Source DB: PubMed Journal: Pathogens ISSN: 2076-0817
Study Population.
| Cohort 1 | Cohort 2 | Total |
| |
|---|---|---|---|---|
| Mean Age (SD) | 28.7 (8.1) | 28.7 (9.8) | 28.7 (9.0) | 0.99 |
| # Male (%) | 81 (54.4) | 73 (57.5) | 154 (55.8) | 0.69 |
| # HIV+ (%) | 31 (24.4) | 15 (10.1) | 46 (16.7) | 0.0025 |
| Mean TBscore (SD) | 6.2 (2.1) | 5.4 (2.2) | 5.8 (2.2) | 0.0032 |
| # L4.6/Ugandan (%) | 93 (62.4) | 75 (59.1) | 168 (60.9) | 0.66 |
| # L4 (%) | 15 (10.1) | 17 (13.4) | 32 (11.6) | |
| # L3 (%) | 39 (26.2) | 31 (24.4) | 70 (25.3) |
Differences in age and TBscore were analyzed using a Student’s t-test and differences in the percentage of males and HIV+ subjects were analyzed using Z-statistics. Differences in the distribution of lineages were analyzed using a Chi-squared test. For all tests, p < 0.05 was considered a significant difference.
Figure 1Boxplot of Severity by Lineage. Box plot shows the TBscore values for subjects who are infected with each of the 3 lineages present in our combined cohort. The midline of the box is the median. The box represents the inter-quartile range (25th to 75th percentiles) of the TBscore and the dots represent outliers.
Figure 2Manhattan Plot for Meta-Analytic p-values of Interaction Between SNP and L4.6/Ugandan Lineage. The Manhattan plot shows the inverse log(10) of the p-values for the association between interaction of each SNP and the L4.6/Ugandan lineage and TBscore on the y-axis, and the x-axis represents the physical location of each SNP on the chromosomes, which are in order from 1–22.
Figure 3Q–Q Plot for Meta-Analytic p-Values for Interaction Between SNP and L4.6 Sub-Lineage. The quantile–quantile (Q–Q) plot shows the inverse log(10) of the observed p-values on the y-axis relative to what is expected if there was no association on the x-axis. Deviations above the line indicate an association with the outcome. If the line deviates at the low quantiles, then this is considered evidence to suggest genome-wide inflation of the test statistics, which typically indicates unmeasured confounding.
GWAS-Significant Loci for Interaction between SNP and L4.6 Sub-Lineage.
| SNP | CHR:BP | Ref/Alt | Gene * | Location | MAF (LWK) | |
|---|---|---|---|---|---|---|
| rs114945555 | 21:20187488 | C/T |
| Intergenic | 7% | 4.0 × 10−8 |
* rs114945555 mapped to PPIAP22 in LocusZoom and FUMA GWAS web applications. Minor allele frequencies were ascertained from the 1000 G project using Ensembl v104.
rs114945555 × Lineage Interaction across Cohorts 1 and 2.
| Cohort 1 | Cohort 2 | Combined | ||||
|---|---|---|---|---|---|---|
| β |
| β |
| β |
| |
| rs11945555 s119 (β3) | 1.98 (0.28,3.68) | 0.024 | 2.94 (1.27,4.61) | 7 × 10−4 | 2.47 | 4.80 × 10−5 |
| L.4.6/Ugandan (β2) | 0.59 (−0.13,1.3) | 0.11 | 0.57 (−0.24,1.38) | 0.17 | 0.58 | 0.03 |
| Sex (β4) | 0.49 (−0.17,1.14) | 0.15 | 1.09 (0.32,1.85) | 0.0061 | 0.75 | 0.012 |
| HIV+ status (β1) | 0.60 (−0.48,1.69) | 0.28 | −0.14 (−1.02,0.74) | 0.75 | 0.16 | 0.66 |
| rs11945555 × L4.6/Ugandan (combination of CT/TT and L4.6 Ugandan) (β5) † | −3.77 (−5.79,−1.75) | 3.62 × 10−4 | −4.53 (−6.68,−2.38) | 6.79 × 10−5 | −4.13 | 4.00 × 10−8 |
Regression Model: Y = β0 + β1X1+ β2X2 + β3X3 + β4X4 + β5 (X2X3) + ε. X1 = HIV+ Status, X2 = L4.6/Ugandan lineage, X3 = rs114945555, X4 = Sex. † Model of inheritance (CT/TT vs. CC as referent).
Figure 4Interaction between rs114945555 and L4.6/Ugandan Lineage. There were also 6 SNP by lineage interactions with p < 1 × 10−6 (Table 4). Though none of these were below the GWAS threshold for significance, all were nominally significant (p < 0.05) and had consistent directions of effect (i.e., the beta value had the same sign) in both cohorts.
Figure 5LocusZoom Plot for Region Surrounding rs114945555. The LocusZoom plot shows the region surrounding rs1848553 on chromosome 5, using an LD panel and reference genome from the AFR super-population in the 1000 G project. Yellow and orange indicate higher levels of LD.
Figure 6PPIAP22 Expression in Immune Cells from DICE Database of Cell-specific Gene Expression.
Figure 7Differential Expression of PPIAP22 in Immune Cells from DICE Database of Cell-specific Gene Expression.
Gene Set Enrichment for SNPs by Lineage Interactions.
| GeneSet |
|
| Adjusted | Genes | Database | |
|---|---|---|---|---|---|---|
| Breast Cancer Amplicon | 329 | 24 | 3.5 × 10−23 | 1.15 × 10−19 |
| Chemical and Genetic Perturbations (MsigDB c2) |
| Neuroblastoma Copy Number Up | 181 | 9 | 8.92 × 10−8 | 0.000147 |
| |
| Nephrolithiasis | 8 | 3 | 4.33 × 10−6 | 7.87 × 10−3 |
| GWAS Catalog |
| Urolithiasis | 17 | 3 | 5.12 × 10−5 | 0.046 |
|
N indicates the total number of genes in the set while n shows the number of genes that SNPs mapped to in our GWAS data. Gene sets were queried using FUMA GWAS. The adjusted p is an FDR-corrected p-value based on the number of gene sets examined.
Figure 8STRING Network for PPIs of eQTL Response Genes. This figure is a STRING diagram showing protein–protein interactions among the genes identified in my analysis. Lines represent an experimentally determined protein–protein interaction and multiple lines between the same two proteins indicate multiple interactions that have been identified, but multiple interactions between the same two proteins are still considered to be one edge.
Significantly Enriched KEGG Pathways for eQTL Response Genes.
| KEGG Pathway | # Genes | # In Set | Strength | FDR | Matching Proteins |
|---|---|---|---|---|---|
| Renin secretion | 3 | 63 | 1.44 | 0.0166 |
|
| Gap junction | 3 | 87 | 1.3 | 0.0207 |
|
| Renin-angiotensin system | 2 | 23 | 1.7 | 0.0234 |
|
| Vascular smooth muscle contraction | 3 | 119 | 1.16 | 0.0251 |
|
| cGMP-PKG signaling pathway | 3 | 160 | 1.03 | 0.0459 |
|
| Purine metabolism | 3 | 173 | 1 | 0.0474 |
|
SNPs Above GWAS Threshold with p < 1 × 10−6 (Meta-analytic p-values and Betas).
| Chr | Bp | SNP |
| β | Gene | Location |
|---|---|---|---|---|---|---|
| 5 | 17775271 | rs369093426 | 9.82 × 10−7 | 3.2095 |
| Intron |
| 5 | 121258204 | rs76190408 | 8.14 × 10−7 | 3.0447 | None | Intergenic |
| 8 | 50958714 | rs203964 | 1.70 × 10−7 | 2.7085 |
| Intron |
| 8 | 141085471 | rs56990580 | 1.77 × 10−7 | −3.3244 |
| Intron |
| 13 | 985899842 | rs8000063 | 9.82 × 10−7 | 2.6203 | None | Intergenic |
| 21 | 20182990 | rs112560854 | 2.58 × 10−7 | −3.7805 | None | Intergenic |
Chr–chromosome number, Bp = basepair location. Gene and location were ascertained using Ensembl v.104.