| Literature DB >> 31236592 |
Nathan J Kenny1, Bruna Plese1,2, Ana Riesgo1, Valeria B Itskovich3.
Abstract
Freshwater sponges (Spongillida) are a unique lineage of demosponges that secondarily colonized lakes and rivers and are now found ubiquitously in these ecosystems. They developed specific adaptations to freshwater systems, including the ability to survive extreme thermal ranges, long-lasting dessication, anoxia, and resistance to a variety of pollutants. While spongillids have colonized all freshwater systems, the family Lubomirskiidae is endemic to Lake Baikal, and plays a range of key roles in this ecosystem. Our work compares the genomic content and microbiome of individuals of three species of the Lubomirskiidae, providing hypotheses for how molecular evolution has allowed them to adapt to their unique environments. We have sequenced deep (>92% of the metazoan 'Benchmarking Universal Single-Copy Orthologs' (BUSCO) set) transcriptomes from three species of Lubomirskiidae and a draft genome resource for Lubomirskia baikalensis. We note Baikal sponges contain unicellular algal and bacterial symbionts, as well as the dinoflagellate Gyrodinium. We investigated molecular evolution, gene duplication and novelty in freshwater sponges compared to marine lineages. Sixty one orthogroups have consilient evidence of positive selection. Transporters (e.g. zinc transporter-2), transcription factors (aristaless-related homeobox) and structural proteins (for example actin-3), alongside other genes, are under strong evolutionary pressure in freshwater, with duplication driving novelty across the Spongillida, but especially in the Lubomirskiidae. This addition to knowledge of freshwater sponge genetics provides a range of tools for understanding the molecular biology and, in the future, the ecology (for example, colonization and migration patterns) of these key species.Entities:
Keywords: Freshwater; Genome; Lake Baikal; Porifera; Sponges; Symbiosis; Transcriptome
Year: 2019 PMID: 31236592 PMCID: PMC6805232 DOI: 10.1093/molbev/msz151
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
. 1.Specimens used for RNA and gDNA extraction. (A) Lubomirskia baikalensis. (B) Lubomirskia abietina. (C) Baikalospongia bacillifera. Scale bar lengths 1 cm.
. 2.(A) Blobplot results, showing distribution of annotated contigs according to GC content (x axis) and coverage (y axis) for the assemblies presented here. Summary statistics are also provided at the top of each panel. Note the bimodal distribution of GC content in the genome, which may represent differences between the coding and noncoding elements of the genome as observed in other species. (B) Maximum likelihood tree showing relationship between Gyrodinium sp. 18S rRNA sequence found in our Lubomirskia baikalensis sample and those identified previously. Note monophyletic group of freshwater samples, and internally to that clade, a monophyletic group of freshwater sponge symbiote sequences, as marked with symbols/colors noted in legend. (C) Maximum likelihood phylogeny showing relationship of Choricystis sp. 28S-ITS1 sequence found in our Lubomirskia baikalensis sample compared with previously sequenced samples from GenBank. Choricystis samples shown at left (light green branches) with our sample shown in bold.
. 3.(A) Phylogenetic relationships of a variety of freshwater sponge species, inferred using Bayesian methods, based on alignment of nucleotide sequence from mitochondrial protein coding and rRNA genes. Numbers at bases of nodes indicate posterior probability, number at base of tree (under scale bar) indicates number of changes per site at unit length. (B) example of indels not used in tree, but nonetheless present in alignment. These indels (example from Nad1 provided) provide further data that can be used to support clades, but are not represented in the phylogeny shown (as any site without sequence data is excluded from that analysis). Sponges endemic to Lake Baikal are, on the basis of this evidence, firmly supported as a monophyletic clade, to the exclusion of other freshwater sponges.
. 6.Analysis of novelties present in freshwater sponge transcriptomes when compared with a range of other genomes and transcriptomes. (A) Phylogeny of freshwater sponges together with marine outgroups used in this analysis. Tree is rooted with Ircinia fasciculata. Mapped onto the phylogeny at bases of nodes are the number of duplications that were inferred to map to each node. (B) Contigs unassigned to orthogroups (and thus unique, and single copy, within individual species). (C) orthogroups present only in a single species (and thus unique to the species, but with duplication/alternative isoforms present). (D) Matrix of numbers of orthogroups (top) and orthologs (bottom) shared between species, colored as heat map, with green showing the highest number and red the lowest numbers for each comparison. These numbers are given for every species pair and can be read by finding the names of the species for comparison on the x and y labels, and moving to the site of overlap.
. 4.(A) Phylogenetic relationship of freshwater sponge species and outgroup taxa, inferred from a concatenated multiple gene alignment using partitioned GTR analysis in MrBayes (3,222 genes, 3,242,264 site alignment, before trimming). This tree was used as the basis for tests of selection on the freshwater lineage, the results of which are summarized in (B) orthogroups displaying significant results for tests of selection under several tests, shown in a Venn diagram indicating consilience of results. (C) Examples of genes and sites under selection, with sites marked according to key at bottom of figure. Domain locations are indicated below each gene as appropriate. Note that further site-level (MEME and BEB) results are available in supplementary file 5, Supplementary Material online.
Orthogroups (OG) with Consilient Signals of Molecular Evolution.
| Alignment Number | GenBank Hit | Mapped to GO Term(s) | Alignment Number | GenBank Hit | Mapped to GO Term(s) |
|---|---|---|---|---|---|
| OG0000044 | Fibrinogen C domain-containing protein 1-A-like | No | OG0001408 | Uncharacterized protein LOC109593314 | Yes |
| OG0000120 | Zinc transporter 2-like | Yes | OG0001488 | UPF0600 protein C5orf51 homolog | No |
| OG0000141 | Neurobeachin-like | No | OG0001657 | Protein ADP-ribosylarginine hydrolase-like | No |
| OG0000142 | Actin-3-like | No | OG0001662 | Elongation factor 1-alpha | Yes |
| OG0000150 | Proteinase T-like | Yes | OG0001704 | Apoptosis-inducing factor 3-like | No |
| OG0000161 | Autophagy-related protein 16-1-like | No | OG0001826 | WD and tetratricopeptide repeats protein 1-like isoform X1 | No |
| OG0000227 | Uncharacterized protein LOC109592610 isoform X2 | No | OG0002175 | Tyrosine-protein phosphatase nonreceptor type 11 isoform X2 | No |
| OG0000236 | Protein F37C4.5-like | No | OG0002186 | U3 small nucleolar ribonucleoprotein protein MPP10-like | Yes |
| OG0000300 | Dual oxidase 1-like | Yes | OG0002195 | Rho GTPase-activating protein 26-like isoform X2 | Yes |
| OG0000375 | SEC14-like protein 2 | Yes | OG0002335 | SH3 domain-containing kinase-binding protein 1-like isoform X2 | Yes |
| OG0000378 | Glutamyl aminopeptidase-like isoform X2 | Yes | OG0002484 | Radixin-like | Yes |
| OG0000422 | Annexin-B12-like, partial | No | OG0002622 | Tetratricopeptide repeat protein 14 | No |
| OG0000434 | long-chain-fatty-acid--CoA ligase ACSBG2-like | No | OG0002646 | Alpha-xylosidase-like | Yes |
| OG0000519 | Gamma-glutamyltranspeptidase 1-like | Yes | OG0002712 | Uncharacterized protein LOC109586818 | No |
| OG0000528 | DnaJ homolog subfamily A member 3, mitochondrial-like isoform X1 | Yes | OG0002718 | Pyruvate kinase PKM-like | Yes |
| OG0000631 | Cytochrome b-245 heavy chain-like | Yes | OG0002786 | Heparan-alpha-glucosaminide | No |
| OG0000658 | Disintegrin and metalloproteinase domain-containing protein 17 | Yes | OG0003043 | Uncharacterized protein LOC100635535 | No |
| OG0000696 | 2′-5′-Oligoadenylate synthase-like protein 2 | Yes | OG0003137 | Uncharacterized protein LOC105316736 isoform X2 | Yes |
| OG0000757 | Protein vav-1-like | Yes | OG0003269 | Acyl Co-a binding | No |
| OG0000774 | TGF-beta-activated kinase 1 and MAP3K7-binding protein 1-like | Yes | OG0003737 | Tetratricopeptide repeat protein 1-like isoform X1 | No |
| OG0000885 | Tyrosine-protein kinase Tec-like | Yes | OG0003883 | Flowering time control protein FY-like | No |
| OG0000894 | Poly(rC)-binding protein 2-like | No | OG0004236 | Dual oxidase maturation factor 1-like | Yes |
| OG0000984 | Ataxin-7-like protein 3 | No | OG0004507 | Stromal membrane-associated protein 1-like | Yes |
| OG0001031 | Integrin alpha-9-like | Yes | OG0004509 | Brefeldin A-inhibited guanine nucleotide-exchange protein 1 | Yes |
| OG0001052 | Glucosidase 2 subunit beta-like, partial | Yes | OG0005056 | Homocysteine-responsive endoplasmic reticulum-resident ubiquitin-like | Yes |
| OG0001078 | Protein DD3-3-like | No | OG0005907 | Protein tweety homolog 2-like | Yes |
| OG0001082 | Guanine nucleotide-binding protein G(q) subunit alpha-like | Yes | OG0005914 | UNC93-like protein MFSD11 | Yes |
| OG0001093 | Probable ATP-dependent RNA helicase DDX49, partial | Yes | OG0006570 | Aristaless-related homeobox protein-like | Yes |
| OG0001117 | Rho GTPase-activating protein 39-like isoform X1 | Yes | OG0007030 | 28-kDa heat- and acid-stable phosphoprotein-like | No |
| OG0001194 | Importin subunit alpha-6-like | Yes | OG0007208 | Pre-mRNA-processing factor 40 homolog B-like | No |
| OG0001217 | No BLAST/PFAM Annotation | No |
Note.—Thirty-five of these could be mapped to Gene Ontology (GO) terms, as described in text. Full details of sites and significance, along with mapped GO terms, can be found in supplementary file 5, Supplementary Material online.
. 5.(A) GO categories enriched (P value cutoff [FDR] 0.05) in the 61 genes identified as under positive selection by all tests performed, as displayed by REVIGO. These are displayed according to GO subcategory (MF [19 annotated genes], CC [20 genes], and BP [23 genes]), with area of colored squares corresponding to the proportion of GO terms represented. (B) ShinyGO network analysis of enriched GO terms for CC and BP subcategories (P value cutoff false discovery rate (FDR) 0.05) within the 61 genes with consilient evidence for positive selection listed in table 1. Nodes each represent an enriched GO term, with size corresponding to number of genes with this term. Lines connect related GO terms, and line thickness reflects percentage of genes that overlap for these categories. Please note in particular degree of overlap and connectedness of these categories.