| Literature DB >> 31311883 |
Aleeza C Gerstein1, Katrina M Jackson1, Tami R McDonald1, Yina Wang2, Benjamin D Lueck1, Sara Bohjanen1, Kyle D Smith1, Andrew Akampurira3, David B Meya3, Chaoyang Xue2, David R Boulware4, Kirsten Nielsen5.
Abstract
Patient outcomes during infection are due to a complex interplay between the quality of medical care, host immunity factors, and the infecting pathogen's characteristics. To probe the influence of pathogen genotype on human survival, immune response, and other parameters of disease, we examined Cryptococcus neoformans isolates collected during the Cryptococcal Optimal Antiretroviral Therapy (ART) Timing (COAT) Trial in Uganda. We measured human participants' survival, meningitis disease parameters, immunologic phenotypes, and pathogen in vitro growth characteristics. We compared those clinical data to whole-genome sequences from 38 C. neoformans isolates of the most frequently observed sequence type (ST), ST93, in our Ugandan participant population and to sequences from an additional 18 strains of 9 other sequence types representing the known genetic diversity within the Ugandan Cryptococcus clinical isolates. We focused our analyses on 652 polymorphisms that were variable among the ST93 genomes, were not in centromeres or extreme telomeres, and were predicted to have a fitness effect. Logistic regression and principal component analysis identified 40 candidate Cryptococcus genes and 3 hypothetical RNAs associated with human survival, immunologic response, or clinical parameters. We infected mice with 17 available KN99α gene deletion strains for these candidate genes and found that 35% (6/17) directly influenced murine survival. Four of the six gene deletions that impacted murine survival were novel. Such bedside-to-bench translational research identifies important candidate genes for future studies on virulence-associated traits in human Cryptococcus infections.IMPORTANCE Even with the best available care, mortality rates in cryptococcal meningitis range from 20% to 60%. Disease is often due to infection by the fungus Cryptococcus neoformans and involves a complex interaction between the human host and the fungal pathogen. Although previous studies have suggested genetic differences in the pathogen impact human disease, it has proven quite difficult to identify the specific C. neoformans genes that impact the outcome of the human infection. Here, we take advantage of a Ugandan patient cohort infected with closely related C. neoformans strains to examine the role of pathogen genetic variants on several human disease characteristics. Using a pathogen whole-genome sequencing approach, we showed that 40 C. neoformans genes are associated with human disease. Surprisingly, many of these genes are specific to Cryptococcus and have unknown functions. We also show deletion of some of these genes alters disease in a mouse model of infection, confirming their role in disease. These findings are particularly important because they are the first to identify C. neoformans genes associated with human cryptococcal meningitis and lay the foundation for future studies that may lead to new treatment strategies aimed at reducing patient mortality.Entities:
Keywords: CNS; GWAS; HIV; SNP; cryptococcosis; fungus; genome analysis; meningitis; pathogenesis; polymorphism; variant; virulence
Mesh:
Substances:
Year: 2019 PMID: 31311883 PMCID: PMC6635531 DOI: 10.1128/mBio.01440-19
Source DB: PubMed Journal: mBio Impact factor: 7.867
FIG 1Summary of variants identified among all strains. (A) The number of variants per gene with a long right tail. The inset panel presents the same data magnified to show genes with at least 50 variants for visualization purposes. (B) There was no correlation between gene length and the number of variants per base pair in each gene (P = 0.20).
FIG 2Phylogenetic analysis of all sequenced strains. (A) The majority of ST93 strains fall into two well-supported clades, magnified in panel B for ease of viewing as follows: ST93A (purple background) and ST93B (yellow background). Bootstrap values of >50 are indicated with the numeric bootstrap value. A red diamond at the end of the terminal branch indicates a strain isolated from a patient who died.
FIG 3Variants that were common to all ST93 genomes are dispersed among 2,715 genes. A small number of clustered genes had a large number of variants. Genes with more than 20 variants and named genes are indicated.
FIG 4Clade-specific variants. (A) Variants that are specific to ST93A (purple) and ST93B (yellow) clades are distributed across the genome. (B) Upstream variants represent the majority class found in all ST93 genomes (“allST93”) and among the variants that are specific to either clade. In contrast, ST93A variants were more likely to be nonsynonymous and less likely to be downstream than the allST93 or ST93B variants. The distributions of the ST93A and ST93B variant classes are significantly different (P = 0.009). (C) IL-2 cytokine levels in the CSF and in vitro phagocyte uptake levels differed between ST93A and ST93B strains (t test results: IL-2, P = 0.022; uptake, P = 0.011).
Survival and quantitative infection phenotypes measured from participants enrolled in the COAT trial
| Class | Phenotype variable | |
|---|---|---|
| Survival | 38 | Patient survival |
| Clinical parameters | 38 | CD4 T cell |
| 35 | CSF white cell | |
| 31 | CSF protein | |
| 35 | HIV load | |
| 37 | CSF clearance rate (EFA) | |
| 30 | CSF CrAg LFA titer | |
| Immune cytokines | 36 | IL-1β |
| 36 | IL-2 | |
| 36 | IL-4 | |
| 36 | IL-5 | |
| 36 | IL-6 | |
| 36 | IL-7 | |
| 36 | IL-8 | |
| 36 | IL-10 | |
| 36 | IL-12 | |
| 36 | IL-13 | |
| 36 | IL-17 | |
| 36 | G-CSF | |
| 36 | GM-CSF | |
| 36 | IFN-γ | |
| 36 | MCP-1 | |
| 36 | TNF-α | |
| 36 | MIP-1β | |
| 37 | Absolute growth at 30°C | |
| 37 | Fluconazole MIC | |
| 37 | Amphotericin B MIC | |
| 37 | Cell wall chitin | |
| 38 | Macrophage adherence | |
| 38 | Macrophage uptake | |
FIG 5Flow chart for the bioinformatic approaches used to identify C. neoformans genes associated with survival and quantitative infection phenotypes. Survival was analyzed with logistic regression. Two complementary approaches were used for quantitative analysis of the infection phenotypes: (i) logistic regression followed by cluster analysis and (ii) principal component analysis (PCA). The clinical parameters, immune cytokines, and in vitro characteristics analyzed are listed in Table 1.
Significant variants from the linear regression analysis
| Gene | Chr | Expression | Variant position(s) | Effect(s) | Phenotypes | |
|---|---|---|---|---|---|---|
| 4 | D | Up | ||||
| 6 | D | 988405; 988733; 988843; 988922; | Down; NS; NS; NS; | Chitin, SERT; IL-1β, IL-13, MCP-1, MIP-1β; MIP-1β; | ||
| 7 | E | 164473; 164887; 164926; 165027; | Up | HIV RNA; IL-2, TNF-α; IL-2, MIP-1β; MIP-1β; | ||
| 9 | E | 705343; | Up | IL-8, EFA; | ||
| 12 | D |
| Up | IL-1β, IL-13, | ||
| 14 | D | 910323; 910328; | Down | TNF-α; IL-1β, IL-13, TNF-α; | ||
| 14 | D | 910742; | Down | AMP; | ||
| 00014 | 1 | E | 47564; 47575; 47671 | NS | G-CSF; G-CSF; GM-CSF | |
| 00363 | 1 | E | 927896; 927901 | NS | IL-2; IL-2 | |
| 07950 | 1 | N | 975152; 975212; 975397 | Up | IL-8, HIV RNA; IL-4, IL-6, IL-8, GM-CSF, IFN-γ, | |
| 06704 | 2 | D | 270700 | Up | IL-2, protein | |
| 02798 | 3 | E | 750294 | Up | CD4, AMP | |
| 06876 | 5 | N | 7093 | Down | IFN-γ, MIP-1β, TNF-α | |
| 01371 | 5 | D | 475470 | Up | MCP-1, HIV RNA | |
| 01241 | 5 | D | 836479; 836697; 836899 | Up | IL-2; IL-4, IL-5, IL-7, IL-17, GM-CSF, TNF-α, | |
| 02475 | 6 | D | 221273; 221275; 221282 | Up | IL-7, growth; growth; growth | |
| 02177 | 6 | E | 990701 | Up | IL-1β, IL-6, IL-10 | |
| 02112 | 6 | E | 1160524; 1160528; 1160532 | Up | AMP; AMP; AMP | |
| 06525 | 7 | D | 11056; 14006 | NS; Up | IL-5, IL-10; IL-6, IL-8 | |
| 12610* | 7 | D | 49744 | Up | MCP-1, uptake | |
| 05746 | 7 | E | 752861 | UTR—3 | IL-17, GM-CSF, MCP-1, TNF-α | |
| 05913 | 7 | E | 1205599; 1205600 | Up | MIP-1β, adherence; IL-13, IL-17, MIP-1β, | |
| 05937 | 7 | D | 1263610 | Up | Uptake, SERT | |
| 07703 | 7 | D | 1341024 | NS | IL-6, IL-8 | |
| 06968 | 8 | E | 1383765 | Indel | IL-12, IL-17 | |
| 04100 | 9 | N | 5213; 7729; 8171 | Up | Adherence, FLC, SERT; growth; EFA, SERT | |
| 04102 | 9 | D | 10033 | Down | GM-CSF, EFA | |
| 04179 | 9 | D | 220963 | Up | EFA, SERT, protein | |
| 04535 | 9 | E | 1115286 | Up | IL-17, G-CSF, LFA | |
| 07837 | 10 | D | 13558; 15288; 15302 | Up; Down; Down | IL-2; WBCc; CD4 | |
| 04922 | 10 | D | 18908; 18915; 18933; 18941; | Up | IL-2; IL-2; IL-2; IL-2; adherence; adherence; | |
| 08006 | 11 | E | 804710; 804742 | Up | IL-4, IL-5, IL-6, MIP-1β, TNF-α, adherence, | |
| 01802 | 11 | D | 966644; 966669; 966700 | Up | WBC; IL-2; IL-7 | |
| 05987 | 12 | D | 14009; 14035; 14125;14197; | NS; NS; Indel; NS; | IL-2; IL-2; chitin; EFA, adherence; EFA, adherence; | |
| 06169 | 12 | E | 502808; 502888; 502890; 503049; | Down | IL-8; GM-CSF, growth; IL-6, IL-8, GM-CSF; GM-CSF, | |
| 06256 | 13 | N | 11118; 11130 | Up | IFN-γ, TNF-α; TNF-α | |
| 13108* | 13 | N | 128625; 128715; 128729 | Up | IL-13, G-CSF; IL-13, G-CSF; IL-13, G-CSF | |
| 06332 | 13 | D | 219021; 219311; 219312 | Up | Adherence; EFA; EFA | |
| 06422 | 13 | E | 436551; 436554 | Up | IL-2; IL-2 | |
| 06490 | 13 | D | 655915 | Indel | Protein, HIV RNA, CD4 | |
| 05450 | 14 | E | 342562 | NS | IL-6, IL-7, IL-12, IL-13, G-CSF, MIP-1β | |
| 05661 | 14 | D | 908850; 908994; 909011; 909638; | Up | IL-8, GM-CSF, IFN-γ, MCP-1, MIP-1β; uptake, | |
| 13204* | 14 | E | 924025; 924047; 924049; 924050 | Up | GM-CSF; IL-13; IL-13; IL-13 |
The gray block denotes genes with variants associated with survival; gene numbers and variant positions that are associated with survival are indicated in bold. Dark gray text indicates genes, variants, and phenotypes that were identified as lower confidence in the post hoc bootstrap analysis. Semicolons are used as separators of different variants. When only one effect is listed, it is common among all variants of the gene. Chr, chromosome.
Gene number corresponds to the CNAG number from the Cryptococcus neoformans H99 reference genome on FungiDB. Hypothetical RNAs are indicated with an asterisk (*).
Effect data designate location or type of variant as follows: Up, upstream of the coding region; Down, downstream of the coding region; NS, nonsynonymous change in the coding region; Indel, small insertion or deletion.
Class type designations are indicated as follows: a, the gene(s) has one variant significant for at least two phenotypes; b, there are multiple variants in the same gene with at least one significant phenotype each; ab, both criteria are fulfilled.
E indicates expression; D indicates differential expression between the VNI and VNII clinical strains in the CSF; N indicates no expression detected. Data are from reference 32 and were analyzed in FungiDB as percentile of expression compared to all other genes in the experiment.
Significant variants from PCA
| Gene | Chr | Position | Effect | PCA1 | PCA2 |
|---|---|---|---|---|---|
| CNAG_07950 | 1 | 975212 | Upstream | 0.047 | 0.141 |
| CNAG_01241 | 5 | 836697 | Upstream | 0.04 | 0.505 |
| 5 | 836899 | Upstream | 0.025 | 0.29 | |
| CNAG_02176 | 6 | 988733 | Stop gained | 0.047 | 0.749 |
| 6 | 989490 | NS | 0.834 | 0.03 | |
| 6 | 989960 | NS | 0.967 | 0.039 | |
| CNAG_07703 | 7 | 1341024 | NS | 0.031 | 0.289 |
| CNAG_07727 | 8 | 818838 | Upstream | 0.036 | 0.726 |
| CNAG_08006 | 11 | 804710 | 5′UTR | 0.048 | 0.312 |
| CNAG_05987 | 12 | 19741 | Upstream | 0.355 | 0.031 |
| CNAG_06169 | 12 | 503321 | 3′UTR | 0.048 | 0.795 |
| CNAG_05450 | 14 | 342562 | NS | 0.024 | 0.142 |
| CNAG_05661 | 14 | 908850 | Upstream | 0.042 | 0.928 |
| CNAG_05663 | 14 | 910328 | Downstream | 0.042 | 0.12 |
| CNAG_05662 | 14 | 911099 | Downstream | 0.045 | 0.143 |
| 14 | 911129 | Downstream | 0.048 | 0.046 |
FIG 6Comparing variant frequencies across the genome. Data represent relative frequencies of variants per gene for significant genes (red dots) compared to all sequenced variants across all genomes (black line) and all variants that were variable within ST93 genomes (gray line). The only genes shown here are those with at least one potential significant variant; hence, the gray and black lines do not reach 0.
Survival curve statistical results
| Gene knockout | χ2 statistic | |
|---|---|---|
| CNAG_00363 ( | 0.05 | 0.82 |
| CNAG_02176 | 9 | 0.0027 |
| CNAG_04373 | 3.07 | 0.08 |
| CNAG_04535 | 2.79 | 0.095 |
| CNAG_04922 | 9.97 | 0.0016 |
| CNAG_05662 ( | 6.22 | 0.013 |
| CNAG_05663 | 0.61 | 0.43 |
| CNAG_05913 | 0.07 | 0.79 |
| CNAG_05937 | 0.09 | 0.77 |
| CNAG_06169 | 0.13 | 0.72 |
| CNAG_06332 | 4.05 | 0.044 |
| CNAG_06490 | 1.02 | 0.31 |
| CNAG_06574 ( | 9 | 0.0027 |
| CNAG_06704 | 5.83 | 0.016 |
| CNAG_06876 | 0.05 | 0.82 |
| CNAG_06986 | 7 | 0.0082 |
| CNAG_07703 | 0.05 | 0.31 |
| CNAG_07837 | 1.8 | 0.18 |
FIG 7Deletion strain virulence in mice. Groups of five 6-to-8-week-old C57BL/6 mice were infected intranasally with 5 × 104 cells. Progression to severe morbidity was monitored for 35 days, and mice were sacrificed when endpoint criteria were reached. Strains were tested in two separate experiments (indicated as experiment 1 [E1] and E2, respectively). Statistical analysis of the survival curves are presented in Table 4.
FIG 8Analysis of ITR4 through in vivo virulence and in vitro growth and inositol uptake. (A) Groups of 10 6-to-8-week-old C57BL/6 mice were infected intranasally with 1 × 103 cells. Progression to severe morbidity was monitored for 44 days, and mice were sacrificed when endpoint criteria were reached. (B) Schematic diagram showing locations of the variants in the UgCl389 and UgCl462 clinical isolates relative to the ITR4 coding region. UgCl443 has the H99 reference allele. (C) Growth assay of C. neoformans wild-type strain KN99α, itr4Δ mutant, and clinical strains on medium with different inositol levels. Yeast cells were cultured in YPD medium. Equal cell concentrations were spotted as 10-fold serial dilutions onto YNB plates made with 1% glucose, 1% inositol, or 1% glucose and 1% inositol. Plates were incubated at 30°C or 37°C, and growth was examined after 4 days. The assay was repeated three times with similar results. (D) Inositol uptake analysis of C. neoformans strains. Yeast cells were mixed with 3H-labeled inositol and incubated at 30°C for 10 min in triplicate (repeated twice with similar patterns). Error bars indicate standard deviations of results from the three replicates. All strains presented were grown on the same plate, but some strains that were present on the plate have been removed for clarity. Each white line indicates a location where a strain was removed.