| Literature DB >> 34903227 |
Wenbing Chen1,2, Changling Zuo1, Chundi Wang3,4, Tengteng Zhang3,4, Liping Lyu3,4, Yu Qiao3,4, Fangqing Zhao5,6, Miao Miao7.
Abstract
BACKGROUND: Ciliated protists are a widely distributed, morphologically diverse, and genetically heterogeneous group of unicellular organisms, usually known for containing two types of nuclei: a transcribed polyploid macronucleus involved in gene expression and a silent diploid micronucleus responsible for transmission of genetic material during sexual reproduction and generation of the macronucleus. Although studies in a few species of culturable ciliated protists have revealed the highly dynamic nature of replicative and recombination events relating the micronucleus to the macronucleus, the broader understanding of the genomic diversity of ciliated protists, as well as their phylogenetic relationships and metabolic potential, has been hampered by the inability to culture numerous other species under laboratory conditions, as well as the presence of symbiotic bacteria and microalgae which provide a challenge for current sequencing technologies. Here, we optimized single-cell sequencing methods and associated data analyses, to effectively remove contamination by commensal bacteria, and generated high-quality genomes for a number of Euplotia species.Entities:
Keywords: Ciliate; Euplotia; Genetic diversity; Single-cell genomics
Mesh:
Year: 2021 PMID: 34903227 PMCID: PMC8670190 DOI: 10.1186/s12915-021-01202-1
Source DB: PubMed Journal: BMC Biol ISSN: 1741-7007 Impact factor: 7.431
Fig. 1Sample collection and single-cell genome sequencing of eight species in Euplotia. A The taxonomy and the number of sequenced genomes or transcriptomes in Spirotricha. The size of nodes is proportional to the counts of recorded species shown in black. The number of sequenced species is shown in red. B The morphology and sampling sites of the eight species in Euplotia. C Comparison of read mapping rate and sequencing coverage between the two whole-genome amplification methods. D Percentage of bacteria-derived reads in each cell. E Read mapping depth on the assembled genome using MALBAC and MDA. F The correlation between genome completeness and the cell number used in single-cell genome sequencing
Summary of Euplotia genome assembly and annotation
| Species | Filtered reads (Gb) | Genome size (bp) | N50 (bp) | GC content (%) | Genome completeness (%) | Gene number |
|---|---|---|---|---|---|---|
| 59.6 | 74,018,178 | 1270 | 42.31 | 71.58 | 15,982 | |
| 55.1 | 125,778,342 | 1838 | 36.04 | 90.91 | 25,327 | |
| 51.4 | 83,224,465 | 2696 | 36.76 | 96.97 | 14,298 | |
| / | 42,117,764 | 1929 | 32.02 | 95.79 | 9382 | |
| 39.6 | 83,441,685 | 1610 | 43.02 | 89.47 | 19,635 | |
| / | 84,760,489 | 2692 | 36.89 | 84.85 | 15,078 | |
| 66.7 | 84,107,277 | 934 | 43.86 | 77.89 | 21,295 | |
| 53.0 | 87,592,728 | 1887 | 37.72 | 91.58 | 15,830 | |
| / | 51,565,225 | 1761 | 36.91 | 95.79 | 11,481 | |
| 15.6 | 68,224,519 | 2316 | 45.86 | 78.79 | 19,733 | |
| 70.9 | 111,284,557 | 2543 | 36.43 | 87.88 | 20,152 |
Fig. 2.Comparative analysis of 24 ciliate genomes in Spirotricha. A The number of annotated genes matched to the protein nonredundant database. B Phylogenetic tree and heatmap of shared genes. The time tree on the left was constructed by BEAST, and the numbers on the nodes represent the bootstrap values estimated by the ML method. The heatmap on the right represents the count of shared genes among these species. C The k-means clustering of the homologous groups based on gene counts. Each column represents a homologous group, and each row represents a species. D Venn diagrams of shared homologous groups among representative species of Euplotia and Hypotrichia. E The distribution of protein identities among representative species of Euplotia and Hypotrichia
Fig. 3Extensive gene amplification events in Euplotia. A The counts of genes related to cilium kinetics in each species of Euplotia and Hypotrichia. B Significantly expanded/contracted gene families among all species of Euplotia and Hypotrichia. Each cell in the heatmap represents the normalized gene count of a Pfam family. C Gene duplication before and after speciation. The phylogenetic trees on the right side were constructed based on ABC transporters and Cytochrome P450 gene families using the ML method, respectively
Fig. 4Genomic diversity of genus Euplotes. A Synteny analysis of the seven genomes in Euplotes. Only the contigs containing syntenic regions are showed in the figure. B Syntenic regions of cyclic nucleotide phosphodiesterase, ion transport protein, and kinesin. C Phylogenetic tree of the 157 orthologs found in Euplotes using the ML method. D The distribution of protein identities among the four Euplotes groups. E Significantly expanded/contracted gene families among all species of Euplotia and Hypotrichia
Fig. 5Genomic characteristics of genus Uronychia. A Synteny analysis between Uronychia spp. and their relatives, E. parawoodruffi and O. trifallax. Only the contigs containing syntenic regions are showed in the figure. B The distribution of protein identities between Uronychia spp. and their relatives. C Gene Ontology enrichment analysis in Uronychia. BP, CC, and MF represent the biological process, cellular component, and molecular function categories, respectively. The size of the circles represents the count of enriched genes, and the color of the circles represents the enrichment or depletion of functional terms in Uronychia compared to other species in Euplotia. D A schematic summary of ciliary movement machinery in Uronychia. The color of each component represents the enrichment or depletion of functional terms of Uronychia compared to other species in Euplotia