| Literature DB >> 28679543 |
Johanna Rhodes1, Christopher A Desjardins2, Sean M Sykes2, Mathew A Beale1,3,4, Mathieu Vanhove1, Sharadha Sakthikumar2, Yuan Chen5, Sharvari Gujja2, Sakina Saif2, Anuradha Chowdhary6, Daniel John Lawson7, Vinicius Ponzio8, Arnaldo Lopes Colombo8, Wieland Meyer9,10, David M Engelthaler11, Ferry Hagen12,13, Maria Teresa Illnait-Zaragozi14, Alexandre Alanio15, Jo-Marie Vreulink16, Joseph Heitman17, John R Perfect5, Anastasia P Litvintseva17, Tihana Bicanic3, Thomas S Harrison3, Matthew C Fisher18, Christina A Cuomo19.
Abstract
Cryptococcus neoformans var. grubii is the causative agent of cryptococcal meningitis, a significant source of mortality in immunocompromised individuals, typically human immunodeficiency virus/AIDS patients from developing countries. Despite the worldwide emergence of this ubiquitous infection, little is known about the global molecular epidemiology of this fungal pathogen. Here we sequence the genomes of 188 diverse isolates and characterize the major subdivisions, their relative diversity, and the level of genetic exchange between them. While most isolates of C. neoformans var. grubii belong to one of three major lineages (VNI, VNII, and VNB), some haploid isolates show hybrid ancestry including some that appear to have recently interbred, based on the detection of large blocks of each ancestry across each chromosome. Many isolates display evidence of aneuploidy, which was detected for all chromosomes. In diploid isolates of C. neoformans var. grubii (serotype AA) and of hybrids with C. neoformans var. neoformans (serotype AD) such aneuploidies have resulted in loss of heterozygosity, where a chromosomal region is represented by the genotype of only one parental isolate. Phylogenetic and population genomic analyses of isolates from Brazil reveal that the previously "African" VNB lineage occurs naturally in the South American environment. This suggests migration of the VNB lineage between Africa and South America prior to its diversification, supported by finding ancestral recombination events between isolates from different lineages and regions. The results provide evidence of substantial population structure, with all lineages showing multi-continental distributions; demonstrating the highly dispersive nature of this pathogen.Entities:
Keywords: Cryptococcus; genome sequence; hybridization; phylogeography; recombination; selection
Mesh:
Year: 2017 PMID: 28679543 PMCID: PMC5586382 DOI: 10.1534/genetics.117.203836
Source DB: PubMed Journal: Genetics ISSN: 0016-6731 Impact factor: 4.562
Properties of sequenced isolates
| Population | Isolates (no.) | |||||
|---|---|---|---|---|---|---|
| Haploid isolates | ||||||
| VNI | 111 | 109 | 2 | |||
| VNII | 23 | 23 | 0 | |||
| VNB | 25 | 21 | 4 | |||
| VNI–VNB | 5 | 1 | 4 | |||
| VNII–VNB | 2 | 2 | 0 | |||
| Diploid isolates | ||||||
| VNI–VNB | 1 | 1 | 0 | 0 | ||
| VNII–VNB | 2 | 2 | 0 | 0 | ||
| VNB–Cnn | 8 | 0 | 1 | 7 | ||
| VNB–Cg | 1 | 1 | 0 | 0 | ||
For each population, the total number of isolates analyzed and the mating type(s) of the isolates are given. Cnn, C. neoformans var. neoformans; Cg, C. gattii.
Figure 1Phylogenetic analysis supports three major lineages of C. neoformans var. grubii. Using a set of 876,121 SNPs across the 159 nonhybrid isolates, a phylogenetic tree was inferred using RAxML. The tree was rooted with VNII as the outgroup (Hagen ). The percentage of 1000 bootstrap isolates that support each node is shown for major nodes with at least 90% support. For each isolate, the geographic site of isolation is noted by colored boxes.
Figure 2Ancestry characterization of three major groups highlights hybrid isolates. (A) The fraction of ancestry (k = 3) estimated by Structure is shown within a column for each isolate. (B) PCA separates the three major lineages, with the hybrid isolates showing a mix of VNB ancestry with either VNI or VNII.
Figure 3Large blocks of ancestry suggest recent recombination between lineages. For each of the four isolates depicted (A, Bt125; B, Bt131; C, Ftc260-1, and D, CCTP51), the Structure-assigned ancestry for each site along each chromosome is depicted as a colored bar corresponding to VNI, VNII, and VNB ancestry. Locations of centromeres are marked with black bars.
Figure 4Chromosome ancestry and ploidy variation of AD hybrids. For three AD hybrid isolates (RCT14, IFNR21, and CCTP50), the contribution and copy number of A (green) and D (blue) ancestry chromosomal regions was measured by aligning all sequence reads to a combined AD reference (A: H99, left, and D: JEC21, right). The copy number of each chromosome is depicted, with either full or partial chromosomal regions shown; see Figure S4 for detailed coverage plots for all AD hybrid isolates.
Figure 5Lineage-specific gene clusters. Two large lineage-specific clusters were detected in the VNI genomes or VNII genomes; these are depicted using a representative genome from each lineage. (A) Insertion of CNAG_06649 to CNAG_06653 in H99 (blue, VNI). Syntenic genes in Bt85 (pink, VNB) and MW_RSA852 (green, VNII) are connected with gray bars. (B) Insertion of C358_04097 to C358_04102 in MW_RSA852.
Rapidly evolving genes in the three lineages of C. neoformans var. grubii
| Comparison | Locus | Gene | Annotation | |
|---|---|---|---|---|
| VNI | 0.0181 | CNAG_01841 | Transcription factor, deletion sensitive to organic peroxides ( | |
| 0.0155 | CNAG_03894 | Transcription factor, deletion with reduced virulence ( | ||
| 0.0095 | CNAG_03213 | UV damage endonuclease | ||
| 0.0092 | CNAG_02756 | Geranylgeranyltransferase-I, essential for virulence ( | ||
| 0.0090 | CNAG_06655 | GPI-anchor transamidase | ||
| 0.0089 | CNAG_01908 | Uroporphyrinogen-III synthase | ||
| 0.0085 | CNAG_03133 | UDP-glucose sterol transferase | ||
| 0.0084 | CNAG_03617 | Clampless protein 1 | ||
| 0.0076 | CNAG_05740 | Farnesyltransferase β-subunit, essential for virulence ( | ||
| 0.0068 | CNAG_03637 | Double-strand break repair factor and silencing regulator, deletion has reduced virulence ( | ||
| VNI | 0.0610 | CNAG_05836 | α-1,6-mannosyltransferase ( | |
| 0.0408 | CNAG_05838 | Rho GTPase activating protein, deletion has increased virulence ( | ||
| 0.0214 | CNAG_06031 | β-glucan synthase, involved in capsule and cell wall formation, deletion has decreased virulence ( | ||
| 0.0149 | CNAG_06814 | α cell type transcription factor, required for mating ( | ||
| 0.0142 | CNAG_01841 | See above | ||
| 0.0135 | CNAG_03229 | Transcription factor, deletion sensitive to organic peroxides ( | ||
| 0.0127 | CNAG_03398 | Zinc ion transporter | ||
| 0.0113 | CNAG_03133 | See above | ||
| 0.0110 | CNAG_03366 | Transcription factor, overexpression results in reduced virulence ( | ||
| 0.0104 | CNAG_01019 | Superoxide dismutase | ||
| VNB | 0.0617 | CNAG_05836 | See above | |
| 0.0402 | CNAG_05838 | See above | ||
| 0.0171 | CNAG_06031 | See above | ||
| 0.0128 | CNAG_03366 | See above | ||
| 0.0122 | CNAG_06814 | See above | ||
| 0.0114 | CNAG_03213 | See above | ||
| 0.0104 | CNAG_01019 | See above | ||
| 0.0104 | CNAG_03398 | See above | ||
| 0.0102 | CNAG_01841 | See above | ||
| 0.0102 | CNAG_02756 | See above |
Consensus sequences were built for each lineage, and dN and dS were calculated for each lineage pair. As dS was uniformly low throughout the data set due to limited genetic diversity, for each pair of lineages we identified the 10 genes with assigned names (Inglis ) with the highest dN, which measures both the mutation rate and selection.
Population genetic features of the lineages of C. neoformans var. grubii
| Populations | Isolates (no.) | Segregating sites | π | Tajima’s D |
|---|---|---|---|---|
| VNI | 111 | 190,716 | 0.00200 | −0.107179 |
| VNII | 23 | 337,990 | 0.00105 | −1.005950 |
| VNB | 25 | 613,991 | 0.00736 | −0.232596 |
The total number of isolates, number of segregating sites, π, and Tajima’s D are given for each population.
Pairwise population genetic statistics between the lineages of C. neoformans var. grubii
| Comparisons | Fixed | Shared | Private_A | Private_B | dXY | |
|---|---|---|---|---|---|---|
| VNB | 54,719 | 52,536 | 446,566 | 102,817 | IvB: 0.0119 | IvB: 0.595 |
| VNB | 118,329 | 68,211 | 405,406 | 78,444 | BvII: 0.0154 | BvII: 0.707 |
| VNI | 188,590 | 38,501 | 116,845 | 83,802 | IvII: 0.0152 | IvII: 0.874 |
The number of alleles fixed and shared between the populations, and alleles private to each population are given, along with divergence metrics dXY and FST.
Figure 6VNB alleles in population subdivisions and across geography. (A) Phylogeny of VNB lineage showing major subdivisions (VNBI and VNBII) and inferred ancestral geography (South America or Africa, depicted as continent shapes). (B) Classification of all 445,193 private VNB alleles (present in at least one VNB isolate and no VNI or VNII isolates) by subdivisions and geography. Most VNB alleles are specific for each VNB subdivision and for the geographic subdivisions within each group. More alleles are shared between geographic locations in the same subdivision (VNBI or VNBII) than are shared within geographic locations across subdivisions.
Figure 7Genome-sharing analysis of C. neoformans var. grubii using fineSTRUCTURE was performed on a SNP matrix using a representative of each clonal population within the VNI lineage. These genomes were reduced to a pairwise similarity matrix, which facilitates the identification of population structure based on haplotype sharing within regions of the genome. The x-axis represents the “donor” of genomic regions, while the y-axis represents the recipient of shared genomic regions. The scale bar represents the amount of genomic sharing, with black representing the largest amount of sharing of genetic material, and white representing the least amount of shared genetic material (no sharing). The geographic site of isolation is illustrated with colored boxes as in Figure 1, and lineages are also shown.
Figure 8Genome-sharing analysis of the VNI lineage using fineSTRUCTURE on a SNP matrix of 111 genomes. The x-axis represents the donor of genomic regions, while the y-axis represents the recipient of shared genomic regions. The scale bar represents the amount of genomic sharing, with blue representing the largest amount of sharing of genetic material, and yellow representing the least amount of shared genetic material (no sharing).