| Literature DB >> 30283677 |
Colin S Maxwell1, Victoria E Sepulveda2, David A Turissini1, William E Goldman2, Daniel R Matute1.
Abstract
Hybridization between species of pathogens has the potential to speed evolution of virulence by providing the raw material for adaptation through introgression or by assembling new combinations of virulence traits. Fungal diseases are a source high morbidity, and remain difficult to treat. Yet the frequency of hybridization between fungal species has rarely been explored, and the functional role of introgressed alleles remains largely unknown. Histoplasma mississippiense and H. ohiense are sympatric throughout their range in North America and have distinct virulence strategies, making them an ideal system to examine the role introgression may play in fungal pathogens. We identified introgressed tracts in the genomes of a sample of H. mississippiense and H. ohiense isolates. We found strong evidence in each species for recent admixture, but introgressed alleles were present at low frequencies, suggesting that they were deleterious. Consistent with this, coding and regulatory sequences were strongly depleted within introgressed regions, whereas intergenic regions were enriched, indicating that functional introgressed alleles were frequently deleterious in their new genomic context. Surprisingly, we found only two isolates with substantial admixture: the H. mississippiense and H. ohiense genomic reference strains, WU24 and G217B, respectively. Our results show that recent admixture has occurred, that it is frequently deleterious and that conclusions based on studies of the H. mississippiense and H. ohiense type strains should be revisited with more representative samples from the genus.Entities:
Keywords: Gene exchange; Hidden Markov Model (HMM); introgression; speciation
Year: 2018 PMID: 30283677 PMCID: PMC6121842 DOI: 10.1002/evl3.59
Source DB: PubMed Journal: Evol Lett ISSN: 2056-3744
Characteristics of the introgressed genetic material from H. ohiense into H. mississippiense described for each isolate
| Individual | Number of tracts | Cumulative length (bp) | Percentage of introgressed genome | Maximum haplotype length (bp) |
|---|---|---|---|---|
| 505 | 1 | 842 | 0.0022 | 842 |
| CI_19 | 0 | 0 | 0 | 0 |
| CI_22 | 0 | 0 | 0 | 0 |
| CI_24 | 0 | 0 | 0 | 0 |
| CI_42 | 1 | 842 | 0.0022 | 842 |
| CI_43 | 0 | 0 | 0 | 0 |
| CI_7 | 1 | 803 | 0.0021 | 803 |
| DOWNS | 1 | 803 | 0.0021 | 803 |
| UCLA_531 | 0 | 0 | 0 | 0 |
| WU24 | 151 | 6,325,384 | 16.688 | 187,896 |
Each row represents a single H. mississippiense isolate. Introgression is rare in most isolates with the clear exception of WU24, an outlier isolate with more than 150 putative introgressions. N = 10 individuals were analyzed.
Characteristics of the introgressed genetic material from H. mississippiense into H. ohiense described for each isolate
| Individual | Number of tracts | Cumulative length | Percentage of introgressed genome | Maximum haplotype length |
|---|---|---|---|---|
| 1986 | 3 | 33,891 | 0.0894 | 28,087 |
| CI_10 | 2 | 1125 | 0.003 | 582 |
| CI_17 | 3 | 13,138 | 0.0347 | 7625 |
| CI_18 | 1 | 543 | 0.0014 | 543 |
| CI_30 | 3 | 3380 | 0.0089 | 1762 |
| CI_35 | 1 | 995 | 0.0026 | 995 |
| CI_4 | 2 | 1125 | 0.003 | 582 |
| CI_6 | 1 | 2178 | 0.0057 | 2178 |
| CI_9 | 2 | 1542 | 0.0041 | 870 |
| G217B | 209 | 2,222,320 | 5.863 | 99,973 |
| G222B | 1 | 2615 | 0.0069 | 2615 |
Introgression is rare in most isolates with the clear exception of G217B, an outlier isolate with more than 200 putative introgressions. Each row represents a single H. ohiense isolate. N = 11 individuals were analyzed.
Figure 1Int‐HMM uses a Hidden Markov model to detect introgressed haplotypes. (A) The algorithm uses ancestry information and coverage on each site to infer tracts of introgression. “Probabilities” are the output probabilities returned by Int‐HMM for the two states at each site (light blue: recipient, red: donor). Int‐HMM has two more possible states, donor error state, and recipient error state, which we did not observed (See also Files S1 and S2). “Unfiltered” represents the raw tracks obtained by grouping contiguous blocks of SNPs with the same most probable state. White space in this panel indicates no information for ancestry. “Tracts” are the inferred tracts after applying the filters listed in “Identifying introgression tracts” (i.e., connecting small haplotypes). (B) An example of an introgression. The inferred introgressed haplotype from H. mississippiense into H. ohiense contains three genes (HCEG_06471, HCEG_06472, and HCEG_06473) present only in the isolate G217B (H. ohiense).
Figure 2Distributions of the size of introgressed haplotypes in two species of Histoplasma identified using Int‐HMM. The Int‐HMM algorithm was run on each individual separately and the distribution of haplotype sizes was computed for each of the two directions of introgression. (A) Size distribution of introgressed haplotypes from H. mississippiense into H. ohiense. (B) Size distribution of introgressed haplotypes from H. ohiense into H. mississippiense. In the two reciprocal directions, there are large introgressions (over 50 kb) but the majority of the introgressions are small.
Figure 3Genomic distributions of introgression tracts in two species of Histoplasma. We plotted the number of introgressions on each supercontig by counting the number of introgressed tracts in 500 kb windows. Each supercontig is represented by either a gray or white rectangle with the exception of supercontigs 2.11–2.16 that are small (collectively represent 1.4% of the genome) and were pooled together. Each panel shows a direction of introgression. Note the y‐axis differ. (A) Genome wide frequency of introgressions from H. mississippiense into H. ohiense. (B) Genome wide frequency of introgressions from H. ohiense into H. mississippiense.
Introgressions are more likely to occur in noncoding regions of the genome
| Direction | Sequence type | Length (kb) | Introgressed percentage | Genomic percentage | Enrichment |
|
|---|---|---|---|---|---|---|
|
| 10kb inter | 269.2 | 6.4 | 15.0 | 0.42 | 0.52 |
|
| 2kb upstream inter | 89.0 | 2.1 | 19.0 | 0.11 | <0.0001 |
|
| 3’ ‐UTR | 1.4 | 0.0 | 3.0 | 0.01 | <0.0001 |
|
| 5’ ‐UTR | 0.7 | 0.0 | 1.1 | 0.01 | <0.0001 |
|
| CDS | 26.1 | 0.6 | 33.2 | 0.02 | <0.0001 |
|
|
|
|
|
|
|
|
|
| intron | 3.5 | 0.1 | 7.2 | 0.01 | <0.0001 |
|
| 10kb inter | 232.7 | 16.4 | 15.0 | 1.09 | 0.45 |
|
| 2kb upstream inter | 112.0 | 7.9 | 19.0 | 0.42 | 0.24 |
|
| 3’ ‐UTR | 1.0 | 0.1 | 3.0 | 0.02 | <0.0001 |
|
| 5’ ‐UTR | 0.3 | 0.0 | 1.1 | 0.02 | <0.0001 |
|
| CDS | 25.6 | 1.8 | 33.2 | 0.05 | <0.0001 |
|
|
|
|
|
|
|
|
|
| intron | 4.9 | 0.3 | 7.2 | 0.05 | <0.0001 |
We studied what functional categories were more likely to be found in introgressions. We portioned the Histoplasma genome by sequence type in one of seven categories: 10 kb inter (intergenic sequence within 10 kb of a gene), 2 kb upstream inter (intergenic sequence 2 kb upstream of a gene), 3’‐UTR, 5’‐UTR, CDS (coding sequence), intergenic (intergenic sequence more than 10 kb from a gene), and intron. “Length (kb)” is the cumulative amount of introgressed DNA in each of the seven functional categories. “Introgressed percentage” is defined as the percentage of introgressions that contain each sequence type, “Genomic percentage” is the percentage of the genome in each of the seven categories, and “Enrichment” is defined as the ratio of introgressed percentage to genomic percentage. P‐values were calculated with permutation tests (see Methods for a full description). Overrepresented categories are bolded.
| State ( | |||
| d (Donor) | r (Recipient) | ||
| State (i‐1) | d (Donor) |
| |
| r (Recipient) | |||