| Literature DB >> 25287146 |
Mark Sistrom1, Benjamin Evans2, Robert Bjornson3, Wendy Gibson4, Oliver Balmer5, Pascal Mäser6, Serap Aksoy7, Adalgisa Caccone2.
Abstract
The Trypanosoma brucei complex contains a number of subspecies with exceptionally variable life histories, including zoonotic subspecies, which are causative agents of human African trypanosomiasis (HAT) in sub-Saharan Africa. Paradoxically, genomic variation between taxa is extremely low. We analyzed the whole-genome sequences of 39 isolates across the T. brucei complex from diverse hosts and regions, identifying 608,501 single nucleotide polymorphisms that represent 2.33% of the nuclear genome. We show that human pathogenicity occurs across a wide range of parasite genotypes, and taxonomic designation does not reflect genetic variation across the group, as previous studies have suggested based on a small number of genes. This genome-wide study allowed the identification of significant host and geographic location associations. Strong purifying selection was detected in genomic regions associated with cytoskeleton structure, and regulatory genes associated with antigenic variation, suggesting conservation of these regions in African trypanosomes. In agreement with expectations drawn from meiotic reciprocal recombination, differences in average linkage disequilibrium between chromosomes in T. brucei correlate positively with chromosome size. In addition to insights into the life history of a diverse group of eukaryotic parasites, the documentation of genomic variation across the T. brucei complex and its association with specific hosts and geographic localities will aid in the development of comprehensive monitoring tools crucial to the proposed elimination of HAT by 2020, and on a shorter term, for monitoring the feared merger between the two human infective parasites, T. brucei rhodesiense and T. b. gambiense, in northern Uganda.Entities:
Keywords: comparative genomics; genomics; next gen sequencing; population genomics; trypanosomatids
Mesh:
Year: 2014 PMID: 25287146 PMCID: PMC4224348 DOI: 10.1093/gbe/evu222
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
FResults from comparative genomic analyses. Chromosome 1 is shown in the upper right inset to highlight details on the overall figure. Chromosomes are numbered (large numbers, 1–10, at the outer edge of the circle) corresponding to the TREU927/4 genome annotation. Data tracks are listed from outermost to innermost in the inset, small black and gray numbers indicate positions in megabases. Track A: A summary of coding (black), noncoding (gray) regions, annotated repeat regions (yellow), and annotated VSG (red) genes. Track B: Coverage (orange) overlaid with repetition (purple) averaged in 10-kb windows. Scale lines represent 100× average coverage/repetition per window. Track C: SNP density summarized as average number of SNPs per kilobase in 10-kb windows. Scale lines represent ten SNPs per kilobase. Track D: Windows under high levels of selection as determined by calculations of Tajima’s D. Green bars indicate windows in which the Tajima’s D score was in the highest and lowest 5% of values above and below zero.
FCluster diagram of individual assignment of each isolate to the nine clusters inferred using Admixture (Alexander et al. 2009). Bars represent individuals and colors indicate each genetic cluster inferred with greater than 0.95 support. The height of each bar on the left indicates the probability with which each isolate was assigned to each cluster. Isolates are arranged by (A) subspecies, (B) date of isolation, (C) host species, and (D) country of isolation. Details for each strain are listed in supplementary appendix S1, Supplementary Material online.
Results of χ2 Tests to Determine Significant Correlations between Genomic Clustering and Subspecies Designation, Host Species, Date of Collection and Country of Collection
| Test Parameter | χ2 | df | |
|---|---|---|---|
| Subspecies | 9.38 | 10 | 0.5 |
| Country | 129.17 | 40 | <0.01 |
| Host | 124.87 | 40 | <0.01 |
| Date collected | 101.61 | 95 | 0.3 |
Note.—df, degrees of freedom,
*Significant values.
GO Terms Found to be Significantly Enriched in Genomic Windows under High Selection
| GO Term | Aspect | Genes | |
|---|---|---|---|
| Purifying selection | |||
| GO:0009975 cyclase activity | F | >0.01 | Tb927.5.320 Tb927.5.330 Tb927.7.6060 Tb927.10.2430 |
| Tb927.6.280 Tb927.7.6050 Tb927.4.3880 Tb927.6.300 | |||
| Tb927.9.14410 | |||
| GO:0004016 adenylate cyclase activity | F | >0.01 | Tb927.5.320 Tb927.5.330 Tb927.7.6060 Tb927.10.2430 |
| Tb927.6.280 Tb927.7.6050 Tb927.4.3880 Tb927.6.300 | |||
| GO:0016849 phosphorus-oxygen lyase activity | F | >0.01 | Tb927.5.320 Tb927.5.330 Tb927.7.6060 Tb927.10.2430 |
| Tb927.6.280 Tb927.7.6050 Tb927.4.3880 Tb927.6.300 | |||
| Diversifying selection | |||
| GO:0000226 microtubule cytoskeleton organization | P | >0.01 | Tb927.1.2380 Tb927.1.2360 Tb927.1.2350 Tb927.1.2370 |
| Tb927.1.2330 Tb927.1.2340 | |||
| GO:0005874 microtubule | C | >0.01 | Tb927.1.2380 Tb927.1.2360 Tb927.1.2350 Tb927.1.2370 |
| Tb927.1.2330 Tb927.1.2340 | |||
| GO:0005200 structural constituent of cytoskeleton | F | >0.01 | Tb927.1.2380 Tb927.1.2360 Tb927.1.2350 Tb927.1.2370 |
| Tb927.1.2330 Tb927.1.2340 | |||
Note.—Aspect refers to the ontology aspect of the enriched term; C, cellular component; F, molecular function; P, biological process.
FPlots of average linkage coefficient r2 scores against the distance between SNPs for each chromosome for distances between 1 and 1,000 bp (A) and between 1 and 100,000 bp (B). The average for value of r2 for each chromosome is plotted as a point, the fitted line represents a Loess nonparametric regression line of best fit to the average r2 for each value.
Results of Multiple Regression Analyses to Determine Significant Correlations between the Average r2 of Each Chromosome and Chromosome Length (Length), Number of Annotated Repeat Regions (Repeats), and Number of Annotated VSG Gene Regions (VSG)
| Parameter | Standard Error | ||
|---|---|---|---|
| Overall | 7.10E-02 | 2.50E-02 | 2.84 |
| Length | 2.20E-08 | 6.79E-09 | 3.25 |
| Repeats | 1.25E-05 | 1.70E-05 | 0.74 |
| VSG | −1.11E-04 | 6.04E-05 | −1.84 |
Note.—Scores are shown for all parameters combined (overall), in addition to the contribution of each.