| Literature DB >> 35557718 |
Rosalyn Lo1, Katherine E Dougan1, Yibi Chen1, Sarah Shah1, Debashish Bhattacharya2, Cheong Xin Chan1.
Abstract
Dinoflagellates of the family Symbiodiniaceae are predominantly essential symbionts of corals and other marine organisms. Recent research reveals extensive genome sequence divergence among Symbiodiniaceae taxa and high phylogenetic diversity hidden behind subtly different cell morphologies. Using an alignment-free phylogenetic approach based on sub-sequences of fixed length k (i.e. k-mers), we assessed the phylogenetic signal among whole-genome sequences from 16 Symbiodiniaceae taxa (including the genera of Symbiodinium, Breviolum, Cladocopium, Durusdinium and Fugacium) and two strains of Polarella glacialis as outgroup. Based on phylogenetic trees inferred from k-mers in distinct genomic regions (i.e. repeat-masked genome sequences, protein-coding sequences, introns and repeats) and in protein sequences, the phylogenetic signal associated with protein-coding DNA and the encoded amino acids is largely consistent with the Symbiodiniaceae phylogeny based on established markers, such as large subunit rRNA. The other genome sequences (introns and repeats) exhibit distinct phylogenetic signals, supporting the expected differential evolutionary pressure acting on these regions. Our analysis of conserved core k-mers revealed the prevalence of conserved k-mers (>95% core 23-mers among all 18 genomes) in annotated repeats and non-genic regions of the genomes. We observed 180 distinct repeat types that are significantly enriched in genomes of the symbiotic versus free-living Symbiodinium taxa, suggesting an enhanced activity of transposable elements linked to the symbiotic lifestyle. We provide evidence that representation of alignment-free phylogenies as dynamic networks enhances the ability to generate new hypotheses about genome evolution in Symbiodiniaceae. These results demonstrate the potential of alignment-free phylogenetic methods as a scalable approach for inferring comprehensive, unbiased whole-genome phylogenies of dinoflagellates and more broadly of microbial eukaryotes.Entities:
Keywords: Symbiodiniaceae; alignment-free phylogenetics; coral symbionts; genome evolution; k-mers; phylogeny; symbiosis
Year: 2022 PMID: 35557718 PMCID: PMC9087856 DOI: 10.3389/fpls.2022.815714
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 6.627
Genome data of the 18 dinoflagellate taxa (Order Suessiales) used in this study.
| Species | Isolate | Lifestyle | Scaffolds | Scaffold N50 length (Kb) | References |
|---|---|---|---|---|---|
|
| Mf1.05b.01 | Symbiotic | 21,898 | 125.23 |
|
|
| SCF055 | Symbiotic | 6,843 | 353.90 | Revised from |
| C15 | Symbiotic | 34,589 | 50.69 |
| |
| C92 | Symbiotic | 6,685 | 247.56 |
| |
|
| CCMP2556 | Symbiotic | 29,137 | 774.26 |
|
|
| SCF082 | Symbiotic | 44,682 | 398.48 |
|
|
| CCMP2456 | Symbiotic | 37,772 | 58.08 |
|
|
| 04-503SCI.03 | Symbiotic | 57,558 | 49.98 |
|
|
| CassKB8 | Symbiotic | 67,937 | 42.99 |
|
|
| CCMP2467 | Symbiotic | 94 | 9,963 |
|
|
| CCMP2592 | Symbiotic | 6,245 | 651.26 |
|
|
| Sh18 | Symbiotic | 16,175 | 132.49 |
|
|
| CCMP2469 | Opportunistic | 104,583 | 14.53 |
|
|
| CCMP2548 | Free-living | 2,855 | 610.50 |
|
|
| CCMP2461 | Free-living | 48,302 | 62.44 |
|
| CCMP2468 | Free-living? | 29,213 | 13,533 |
| |
|
| CCMP1383 | Free-living, psychrophilic | 33,494 | 170.30 |
|
|
| CCMP2088 | Free-living, psychrophilic | 37,768 | 129.21 |
|
All taxa are classified in family Symbiodiniaceae except the outgroup Polarella glacialis.
Figure 1Alignment-free phylogenetic trees of 18 dinoflagellate taxa inferred from: (A) whole-genome sequences (WGS), (B) repeat-masked whole-genome sequences (rmWGS), (C) protein-coding sequences (CDS), (D) the encoded protein sequences, (E) the annotated repeats and (F) the intronic regions. The corresponding k used for each inference is shown at the bottom of each tree. (G) The topology of reference tree inferred from multiple sequence alignment of LSU rRNA, showing ultrafast bootstrap support (from 1,000 sample replicates). All tree topologies are rooted with Polarella glacialis as the outgroup. Differing branching order between each alignment-free tree and the reference tree is indicated with the symbols α, β, γ, δ and ε. Phylogenetic networks of these alignment-free trees are available as Supplementary Data 2.
Figure 2Annotated genome features associated with core 23-mers identified for: (A) all 18 genomes of Suessiales, and (B) the seven genomes of symbiotic Symbiodinium. The Venn diagrams shown in (A) and (B) are not to scale. (C) Repeat classes identified as significantly more abundant and/or significantly more conserved in symbiotic Symbiodinium than in free-living members of this genus.
Figure 3Network of genome relatedness of 18 Suessiales inferred using the rmWGS data at different t-values: (A) t = 0.00, (B) t = 2.85, (C) t = 4.02, and (D) t = 10.0. The colour used to represent each genome is shown in the legend.