| Literature DB >> 27704050 |
José F Muñoz1, Rhys A Farrer2, Christopher A Desjardins2, Juan E Gallo3, Sean Sykes2, Sharadha Sakthikumar2, Elizabeth Misas4, Emily A Whiston5, Eduardo Bagagli6, Celia M A Soares7, Marcus de M Teixeira8, John W Taylor5, Oliver K Clay9, Juan G McEwen10, Christina A Cuomo2.
Abstract
The Paracoccidioides genus includes two species of thermally dimorphic fungi that cause paracoccidioidomycosis, a neglected health-threatening human systemic mycosis endemic to Latin America. To examine the genome evolution and the diversity of Paracoccidioides spp., we conducted whole-genome sequencing of 31 isolates representing the phylogenetic, geographic, and ecological breadth of the genus. These samples included clinical, environmental and laboratory reference strains of the S1, PS2, PS3, and PS4 lineages of P. brasiliensis and also isolates of Paracoccidioides lutzii species. We completed the first annotated genome assemblies for the PS3 and PS4 lineages and found that gene order was highly conserved across the major lineages, with only a few chromosomal rearrangements. Comparing whole-genome assemblies of the major lineages with single-nucleotide polymorphisms (SNPs) predicted from the remaining 26 isolates, we identified a deep split of the S1 lineage into two clades we named S1a and S1b. We found evidence for greater genetic exchange between the S1b lineage and all other lineages; this may reflect the broad geographic range of S1b, which is often sympatric with the remaining, largely geographically isolated lineages. In addition, we found evidence of positive selection for the GP43 and PGA1 antigen genes and genes coding for other secreted proteins and proteases and lineage-specific loss-of-function mutations in cell wall and protease genes; these together may contribute to virulence and host immune response variation among natural isolates of Paracoccidioides spp. These insights into the recent evolutionary events highlight important differences between the lineages that could impact the distribution, pathogenicity, and ecology of Paracoccidioides. IMPORTANCE Characterization of genetic differences between lineages of the dimorphic human-pathogenic fungus Paracoccidioides can identify changes linked to important phenotypes and guide the development of new diagnostics and treatments. In this article, we compared genomes of 31 diverse isolates representing the major lineages of Paracoccidioides spp. and completed the first annotated genome sequences for the PS3 and PS4 lineages. We analyzed the population structure and characterized the genetic diversity among the lineages of Paracoccidioides, including a deep split of S1 into two lineages (S1a and S1b), and differentiated S1b, associated with most clinical cases, as the more highly recombining and diverse lineage. In addition, we found patterns of positive selection in surface proteins and secreted enzymes among the lineages, suggesting diversifying mechanisms of pathogenicity and adaptation across this species complex. These genetic differences suggest associations with the geographic range, pathogenicity, and ecological niches of Paracoccidioides lineages.Entities:
Keywords: Paracoccidioides; evolution; genetic recombination; genome analysis; mycology; population genetics
Year: 2016 PMID: 27704050 PMCID: PMC5040785 DOI: 10.1128/mSphere.00213-16
Source DB: PubMed Journal: mSphere ISSN: 2379-5042 Impact factor: 4.389
Paracoccidioides species isolates selected for this study
| Isolate ID | Other name(s) | Origin | Source | Lineage | Provider | Reference |
|---|---|---|---|---|---|---|
| Pb18 | B17 | Sao Paulo, Brazil | Chronic PCM | S1b | R. Puccia | |
| PbCaz | Cazon; A1 | Chaco, Argentina | Acute PCM | S1b | R. Negroni | |
| Pb113 | Manaus-AM, Brazil | PCM | S1b | C. de Almeida Soares | ||
| PbBlo | Brazil | PCM | S1b | C. de Almeida Soares | This study | |
| MS1 | Mato Grosso do Sul, Brazil | PCM | S1a | M. Sueli Felipe | ||
| D03 | Piracicaba, SP, Brazil | PCM | S1a | E. Bagagli | This study | |
| MS2 | Mato Grosso do Sul, Brazil | PCM | S1a | M. Sueli Felipe | ||
| Pb1445 | A5 | Argentina | Chronic PCM | S1a | R. Negroni | |
| Pb337 | T15LN1; B10 | Brazil | S1a | E. Bagagli | ||
| Pb66 | Brazil | PCM | S1a | C. de Almeida Soares | ||
| PbBer | Bercelli; A3 | Argentina | PCM | S1a | R. Negroni | |
| D02 | Laranjal Paulista, SP, Brazil | PCM | S1a | E. Bagagli | This study | |
| T1F1 | B1 | Pratanea, SP, Brazil | S1a | E. Bagagli | ||
| T15N1 | Botucatu, Brazil | S1a | E. Bagagli | This study | ||
| T16B1 | Brazil | S1a | E. Bagagli | This study | ||
| Pb300 | V1 | Miranda, Venezuela | Soil | PS4 | M. B. Albornoz | |
| EPM83 | Bogotá, Colombia | Chronic PCM | PS3 | A. Restrepo | ||
| Pb339 | B18 | Sao Paulo, Brazil | PCM | PS3 | A. Restrepo | |
| Pb60855 | C4 | Antioquia, Colombia | Chronic PCM | PS3 | A. Restrepo | |
| PbBac | Colombia | PCM | PS3 | A. Restrepo | This study | |
| PbCab | P196; C6 | Caldas, Colombia | PS3 | A. Restrepo | ||
| PbCnh | Colombia | Chronic PCM | PS3 | A. Restrepo | This study | |
| PbJam | Colombia | Chronic PCM | PS3 | A. Restrepo | This study | |
| Pb02 | V2 | Caracas, Venezuela | Chronic PCM | PS2 | R. Puccia | |
| Pb03 | B26 | Sao Paulo, Brazil | Chronic PCM | PS2 | R. Puccia | |
| Pb262 | Uberlândia, MG, Brazil | Dog food | PS2 | Z. Pires de Camargo | ||
| T10B1 | B7 | Botucatu, Brazil | PS2 | E. Bagagli | ||
| Pb01 | Goiás, Brazil | PCM | R. Puccia | |||
| Pl1578 | Goiás, Brazil | PCM | C. de Almeida Soares | |||
| ED01 | Goiás, Brazil | PCM | C. de Almeida Soares | |||
| PlEE | EE | Mato Grosso, Brazil | PCM | M. Sueli Felipe |
Reference strain assembled and annotated genome.
PCM, paracoccidioidomycosis.
FIG 1 Phylogeny and recombination in Paracoccidioides. Two methods were used to examine strain relationships originating from across South America (A): using 614,570 SNPs, including a phylogenetic network constructed with SplitsTree4 (B), and a Bayesian calibrated phylogeny constructed with BEAST (C); bootstrap values from maximum likelihood phylogeny constructed with RAxML were included for major subdivisions. Both methods show evidence of five distinct lineages in P. brasiliensis: S1 (blue), which is divided into two groups S1a (dark blue) and S1b (light blue), PS2 (green), PS3 (red), and the recently described PS4 (purple). Also, this phylogeny supports the divergence between P. brasiliensis and P. lutzii (Pl [orange]) as a different species. In addition, the phylogenetic network of P. brasiliensis suggests patterns of recombination (red branches).
FIG 2 Classification of population-specific SNP alleles. The percentage of the distribution of SNP alleles is shown for each pairwise comparison among the six lineages of Paracoccidioides spp. SNP alleles were classified as fixed (blue), shared (red), private A (light gray [for each pairwise comparison specific to the first lineage in the x axis]), and private B (dark gray [specific to the second lineage]).
FIG 3 Genetic population structure of Paracoccidioides. (A) PCA of genetic variants in Paracoccidioides is shown as a two-dimensional plot for all isolates (n = 31) (left) or for all P. brasiliensis isolates (n = 26) (right). In each plot, circles indicate isolates and colors indicate the lineage. (B) Population structure of Paracoccidioides spp. (left) and P. brasiliensis (right) inferred from 476,589 and 339,966 SNPs, respectively, using the Structure software program with different k values of 2 and 4, respectively. An admixture model with correlated allele frequencies and site-by-site analysis was used. Each isolate is represented by a single vertical line broken into k-colored segments, with lengths proportional to each of the k inferred clusters. (C) Whole-genome plot for Pb300 (PS4) deducted from Structure site-by-site analysis showing the chromosomal ancestry.
FIG 4 Genome-wide nucleotide diversity (π [Pi]), Tajima’s D (TD), and population divergence analysis (F) in Paracoccidioides. (A) Genome-wide average of nucleotide diversity (π) and TD for sliding windows of 10-kb regions within the main Paracoccidioides lineages. (B) Average of genome-wide (10-kb windows) variation in (θ), Weir’s formulation of Wright’s fixation index (F), for pairwise comparisons in each lineage. (C) Distribution of the average nucleotide diversity (π), TD, and F for sliding windows of 10-kb regions for the S1a, S1b, PS3, and PS3 Paracoccidioides lineages.
FIG 5 Genes under positive selection in Paracoccidioides. Shown is a histogram of the dN/dS values comparing P. brasiliensis (Pb18) and P. lutzii (Pb01). Genes undergoing positive selection (dN/dS of >1) are in blue.
Selection of candidate virulence factors in Paracoccidioides isolates found in highly diverse regions and/or under selection
| Locus ID | Description of protein | π | TD | SNPs | ||
|---|---|---|---|---|---|---|
| PADG_02498 | 3-Hydroxyanthranilate 3,4-dioxygenase | + | − | − | − | − |
| PADG_00940 | Acetate kinase; | − | + | − | + | − |
| PADG_01835 | Aldehyde reductase; | − | − | − | + | + |
| PADG_07461 | α-1,3-Glucanase | + | + | + | + | + |
| PADG_03747 | Alternative oxidase; | − | − | − | + | − |
| PADG_02460 | Antigenic GPI-protein; secreted; antigen; | + | − | + | + | + |
| PADG_04167 | Aspartyl aminopeptidase; peptidase family M18 | + | − | − | − | − |
| PADG_06131 | BUD32 protein kinase; vesicles | − | − | − | + | − |
| PADG_02399 | Calcium binding protein; secreted; | − | − | − | + | + |
| PADG_00743 | Class II aldolase; | − | − | − | + | − |
| PADG_12370 | Endo-1,3(4)-β-glucanase; secreted; | − | + | − | + | − |
| PADG_05497 | GATA-binding protein; | − | − | − | + | − |
| PADG_07615 | Glucan 1,3-β-glucosidase; secreted; antigen; | − | + | + | + | − |
| PADG_05345 | High-affinity nickel transporter; | − | − | − | + | − |
| PADG_07274 | Hypothetical protein | − | − | − | + | − |
| PADG_06699 | Hypothetical protein; | − | + | − | + | − |
| PADG_01238 | Hypothetical protein; | − | − | − | + | − |
| PADG_01283 | Hypothetical protein; | − | − | − | + | − |
| PADG_03908 | Hypothetical protein; | − | − | − | + | − |
| PADG_07534 | Hypothetical protein; | − | − | − | + | − |
| PADG_11963 | Hypothetical protein; | − | − | − | + | + |
| PADG_02535 | Hypothetical protein; secreted; vesicles; | + | + | + | + | + |
| PADG_02542 | Hypothetical protein; vesicles | − | − | − | + | + |
| PADG_12450 | Hypothetical protein; vesicles | + | − | + | + | − |
| PADG_04741 | Hypothetical protein; vesicles | + | + | − | + | − |
| PADG_02521 | Hypothetical protein; vesicles | + | − | − | + | − |
| PADG_12101 | Hypothetical protein; vesicles | − | − | − | + | − |
| PADG_01494 | Mannan endo-1,6-α-mannosidase; secreted; | − | + | − | + | − |
| PADG_00948 | Oxidoreductase; | − | + | − | + | − |
| PADG_07460 | Predicted aminopeptidase; peptidase family M18; induced during macrophage interaction | + | + | + | − | + |
| PADG_05820 | Predicted aminopeptidase; peptidase family M24; induced during macrophage interaction | + | − | + | − | − |
| PADG_07369 | Predicted dehydrogenase | + | + | + | − | + |
| PADG_02527 | Predicted dehydrogenase | + | − | − | − | − |
| PADG_02562 | Predicted dehydrogenase | + | + | − | − | − |
| PADG_02492 | Predicted dehydrogenase | + | − | − | − | − |
| PADG_07365 | Predicted dehydrogenase | + | − | − | − | + |
| PADG_07411 | Predicted dehydrogenase | + | − | − | − | − |
| PADG_02575 | Predicted nonribosomal peptide synthetase | + | + | − | − | − |
| PADG_06309 | Predicted oxidoreductase | + | + | + | − | + |
| PADG_02592 | Predicted oxidoreductase | + | + | + | − | − |
| PADG_06322 | Predicted peroxidase | + | + | − | − | + |
| PADG_02507 | Predicted peroxidase; secreted | + | − | − | + | − |
| PADG_07053 | Predicted protease; peptidase family 16 | − | − | − | + | − |
| PADG_00686 | Predicted protease; peptidase family M28 | − | − | − | + | + |
| PADG_06314 | Predicted protease; peptidase family S10; secreted; induced during macrophage interaction | + | + | − | − | − |
| PADG_06167 | Predicted protease; peptidase family S24 | + | − | − | + | − |
| PADG_07422 | Predicted protease; peptidase family S8; secreted | − | + | − | + | + |
| PADG_07454 | Predicted scramblase; secreted | + | − | + | + | + |
| PADG_06308 | Predicted transporter | + | + | + | + | + |
| PADG_02081 | RING finger domain-containing protein; vesicles | − | + | − | + | − |
| PADG_08583 | Secreted protein | − | − | − | + | − |
| PADG_02569 | Secreted protein | + | + | − | − | − |
| PADG_00954 | Secreted protein immunoreactive protein; secreted | + | + | + | + | − |
| PADG_07830 | Secreted protein; | − | − | − | + | − |
| PADG_05055 | Secreted protein; | + | + | + | + | − |
| PADG_03277 | Secreted protein; vesicles; | − | − | − | + | − |
| PADG_02842 | Superoxide dismutase; secreted; | − | − | − | + | − |
| PADG_05504 | Thioredoxin | − | − | − | + | − |
| PADG_02515 | Transporter | + | − | − | + | − |
| PADG_05881 | Vacuolar amino acid transporter; vesicles | − | − | − | + | − |
| PADG_00941 | Xylulose-5-phosphate phosphoketolase; | − | + | − | + | − |