| Literature DB >> 31708883 |
Javier Del Campo1,2, Thierry J Heger1,3, Raquel Rodríguez-Martínez4, Alexandra Z Worden5, Thomas A Richards4, Ramon Massana6, Patrick J Keeling1.
Abstract
Apicomplexans are a group of microbial eukaryotes that contain some of the most well-studied parasites, including the causing agents of toxoplasmosis and malaria, and emergent diseases like cryptosporidiosis or babesiosis. Decades of research have illuminated the pathogenic mechanisms, molecular biology, and genomics of model apicomplexans, but we know little about their diversity and distribution in natural environments. In this study we analyze the distribution of apicomplexans across a range of both host-associated and free-living environments. Using publicly available small subunit (SSU) rRNA gene databases, high-throughput environmental sequencing (HTES) surveys, and our own generated HTES data, we developed an apicomplexan reference database, which includes the largest apicomplexan SSU rRNA tree available to date and encompasses comprehensive sampling of this group and their closest relatives. This tree allowed us to identify and correct incongruences in the molecular identification of apicomplexan sequences. Analyzing the diversity and distribution of apicomplexans in HTES studies with this curated reference database also showed a widespread, and quantitatively important, presence of apicomplexans across a variety of free-living environments. These data allow us to describe a remarkable molecular diversity of this group compared with our current knowledge, especially when compared with that identified from described apicomplexan species. This is most striking in marine environments, where potentially the most diverse apicomplexans apparently exist, but have not yet been formally recognized. The new database will be useful for microbial ecology and epidemiological studies, and provide valuable reference for medical and veterinary diagnosis especially in cases of emerging, zoonotic, and cryptic infections.Entities:
Keywords: apicomplexans; classification; distribution; diversity; environmental sequencing; metabarcoding; phylogeny; reference database
Year: 2019 PMID: 31708883 PMCID: PMC6819320 DOI: 10.3389/fmicb.2019.02373
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
FIGURE 1(A) Apicomplexan reference tree inferred from a maximum-likelihood (RAxML) phylogenetic tree constructed (best tree of 1000) using 18S rRNA sequences. Black dots represent bootstrap values >50% (1000 bootstrap replicates). The tree has been collapsed in the main apicomplexan groups based on our taxonomic annotation (Supplementary Table S1). The tree showing all the OTUs (97% similarity clustering) is available as Supplementary Information S1. The first two columns next to the Apicomplexan groups’ names inform about the origin (colored circle) and the number of OTUs (number in the middle of the circle) retrieved from (B) GenBank and from (C) high-throughput environmental sequencing (HTES) studies. The HTES OTUs have been added to the reference tree using the Evolutionary Placement Algorithm in RAxML and the correspondent tree is available as Supplementary Information S2. The last column indicates the fold increase in numbers of OTUs after adding the HTES OTUs to the reference tree. In the figure “Isolates” stands for sequences retrieved from cultured or isolated organisms while “Environmental” stands for OTUs retrieved from culture and isolation independent molecular surveys. CHR1 corresponds to a group of Colpodellids constituted of exclusively environmental sequences.
FIGURE 2Apicomplexan host distribution heatmap based on the information associated to sequences available in GenBank. Detailed information on the taxonomic annotation of the apicomplexan sequences used in this figure is available in Supplementary Table S1.
FIGURE 3(A) World map showing all the HTES analyzed sites, in blue the sites where apicomplexan reads have been reported in red those in which no apicomplexan reads have been found. (B) Distribution (presence) of apicomplexans in the analyzed HTES datasets as a whole and clustered by different features. Size fraction (pico/nano: 0.8–20 μm, meso: 20–180 μm, micro: 180–2000 μm, total: no size fractioning, dissolved: dissolved DNA), temperature (tropical: >18°C, temperate: 10–18° C, polar: <10°C), salinity (euhaline: 30–40 PSU, polyhaline: 18–30 PSU, mesohaline: 5–18 PSU, limnetic: <0.5 PSU). (C) Blast similarity against GenBank of the HTES apicomplexan reads. In brown the results when comparing against sequences annotated with taxonomic affiliation and in orange the results when comparing against all the apicomplexan sequences including those environmental sequences that do not correspond to any identified apicomplexan species.
FIGURE 4Round tree showing the placement of the short reads using the Evolutionary Placement Algorithm in RAxML (EPA-RAxML) on the reference tree. The colors of the leaves and the inner crown surrounding the tree indicate the 18S rRNA region of the corresponding OTUs (V4 or V9). The outer crown indicates the number of reads per OTU.
FIGURE 5Apicomplexan environment distribution heatmap based on the information associated to the sequences retrieved from (A) GenBank and (B) relative amplicon abundances in HTES studies. The novel lineages obtained only by HTES studies are inserted on the tree shown in Figure 2, respecting the original topology. Detailed information on the taxonomic annotation of the apicomplexan sequences used in this figure is available in Supplementary Table S1.