| Literature DB >> 35562825 |
Jianbo Yuan1,2, Xiaojun Zhang1,2, Qi Kou1,2, Yamin Sun3, Chengzhang Liu1,2, Shihao Li1,2, Yang Yu1,2, Chengsong Zhang1,2, Songjun Jin1,2, Jianhai Xiang4,5, Xinzheng Li6,7, Fuhua Li8,9.
Abstract
BACKGROUND: The deep-sea may be regarded as a hostile living environment, due to low temperature, high hydrostatic pressure, and limited food and light. Isopods, a species-rich group of crustaceans, are widely distributed across different environments including the deep sea and as such are a useful model for studying adaptation, migration, and speciation. Similar to other deep-sea organisms, giant isopods have larger body size than their shallow water relatives and have large stomachs and fat bodies presumably to store organic reserves. In order to shed light on the genetic basis of these large crustaceans adapting to the oligotrophic environment of deep-sea, the high-quality genome of a deep-sea giant isopod Bathynomus jamesi was sequenced and assembled.Entities:
Keywords: Body size evolution; Deep-sea; Genome assembly; Giant isopod; Oligotrophic adaptation
Mesh:
Year: 2022 PMID: 35562825 PMCID: PMC9107163 DOI: 10.1186/s12915-022-01302-6
Source DB: PubMed Journal: BMC Biol ISSN: 1741-7007 Impact factor: 7.364
Fig. 1The distributions and phenotypes of isopods. A The distributions of various isopods from the land to deep-sea environments. B The morphology of the giant isopod, B. jamesi
Summary of genome assembly and characteristics of B. jamesi and other three crustaceans
| Species | ||||
|---|---|---|---|---|
| Genome size (bp) | 5,892,409,081 | 1,725,108,002 | 1,618,026,442 | 1,562,256,418 |
| Number of Contigs | 22,827 | 52,740 | 50,304 | 12,722 |
| Contig N50 (bp) | 587,279 | 38,359 | 57,650 | 26,045 |
| Contig N90 (bp) | 108,712 | 18,318 | 14,641 | 2,670 |
| Genome GC percent% | 37.28% | 29.15% | 35.68% | 46.39% |
| BUSCOs coverage (%) | 94.80% | 91.38% | 94.00% | 91.20% |
| Repeat percentage (%) | 85.32% | 69.54% | 49.39% | 45.30% |
| Gene number | 23,221 | 19,051 | 25,572 | 28,033 |
| Gene average length (bp) | 936 | 1259 | 1546 | 1078 |
| Exon number per gene | 4.18 | 4.93 | 5.94 | 3.26 |
| Exon average length (bp) | 223 | 181 | 260 | 330 |
| Intron average length (bp) | 3010 | 1872 | 1484 | 1602 |
Comparison of the repeats among four crustaceans
| Repeats | ||||
|---|---|---|---|---|
| Total length | 5.90 Gb | 1.73 Gb | 1.66 Gb | 1.56 Gb |
| Repeats | 85.32% | 69.54% | 49.39% | 35.57% |
| DNA | 35.99% | 7.08% | 9.33% | 2.30% |
| DNA/En-Spm | 3.28% | 0.00% | 6.39% | 0.82% |
| DNA/Maverick | 5.08% | 0.63% | 0.80% | 0.10% |
| DNA/Merlin | 0.37% | 0.28% | 0.00% | 0.01% |
| DNA/TcMar-Mariner | 0.87% | 0.21% | 0.06% | 0.00% |
| DNA/TcMar-Tc1 | 6.05% | 1.23% | 0.03% | 0.02% |
| DNA/hAT-Ac | 1.41% | 2.18% | 0.00% | 0.11% |
| DNA/hAT-Charlie | 1.04% | 0.11% | 1.00% | 0.09% |
| DNA/hAT-hATm | 5.77% | 0.81% | 0.00% | 0.00% |
| DNA/hAT-Tip100 | 2.67% | 0.36% | 0.00% | 0.00% |
| LINE | 19.36% | 20.24% | 2.82% | 9.72% |
| LINE/CR1 | 9.13% | 14.46% | 0.25% | 4.06% |
| LINE/Jockey | 1.06% | 0.63% | 0.06% | 0.05% |
| LINE/L2 | 1.80% | 0.62% | 0.35% | 0.36% |
| LINE/Penelope | 3.61% | 1.26% | 0.45% | 0.04% |
| LINE/RTE-BovB | 0.62% | 3.00% | 0.77% | 0.91% |
| SINE | 1.00% | 0.00% | 0.06% | 0.29% |
| LTR | 5.95% | 5.89% | 0.62% | 1.79% |
| LTR/ERV1 | 0.24% | 0.00% | 0.02% | 0.01% |
| LTR/Pao | 2.48% | 2.32% | 0.00% | 0.19% |
| LTR/Gypsy | 2.76% | 3.22% | 0.22% | 1.28% |
| Unknown | 21.97% | 14.87% | 3.42% | 10.39% |
| Satellite | 0.31% | 0.00% | 0.10% | 0.00% |
| Simple repeat | 0.65% | 18.08% | 23.93% | 6.90% |
| Low complexity | 0.01% | 3.57% | 9.49% | 2.04% |
Fig. 2The evolution of transposable elements (TEs) and genome size. A The relationship between the genome size and repeat content. The repeat contents and genome sizes of the sequenced crustacean genomes were summarized in the Additional file 1: Table S2. The TE content and the genome size was positively correlated with the Pearson correlation r = 0.68 and p-value = 0.00275. B Kimura distance-based copy divergence analyses of TEs in the two isopod genomes, B. jamesi and A. vulgare. The graphs represent genome coverage for each TE superfamily in the different genomes analyzed. Clustering was performed according to their Kimura distances (K-value from 0 to 50). C Phylogenetic tree of the CR1 LINEs from B. jamesi (yellow) and A. vulgare (dark gray). D Enrichment analyses of TE families within gene promoters. The closest TE was calculated for each gene, and the content of the closest TEs were calculated and compared with that of the whole genome
Fig. 3Comparative genomes analyses of B. jamesi and its relatives. A Phylogenetic tree and divergence times of B. jamesi and other arthropods. The number of significantly expanded (+, green) and contracted (−, red) gene families is designated on each branch. B Number of gene families shared among four Malacostraca species shown as a Venn diagram. C KEGG enrichment analysis of the expanded gene families of B. jamesi. The enrichment analysis was performed by using the toolkit from Omicshare (https://www.omicshare.com/). The enriched KEGG terms was referred to the Additional file 1: Fig. S7
Fig. 4Gene expansion in growth-related hormone signaling pathways. A Heatmap of the number of genes involved in thyroid and insulin hormone signaling pathways among six crustaceans. The included species are B. jamesi (Bjam), A. vulgare (Avul), A. amphitrite (Aamp), E. affinis (Eaff), L. vannamei (Lvan), and E. sinensis (Esin). The red star indicates the correspondent gene of B. jamesi is significantly more than that of the other five crustaceans, and the black star indicates correspondent gene of B. jamesi is significantly more than that of A. vulgare but similar to other crustaceans (p < 0.05). B KEGG enrichment of the positively selected genes in B. jamesi. C The strengthened thyroid and insulin hormone signaling pathways. Genes with red background indicates significantly expanded genes of B. jamesi in comparison to A. vulgare; Genes with orange background indicates these genes are present in B. jamesi but absent in A. vulgare
Fig. 5The differential gene expressions in six tissues of B. jamesi. A KEGG enrichment analysis of the highly expressed genes in stomach and intestine. The top 20 significantly enriched KEGG terms were displayed in the plot. B Heatmap of the number of genes involved in glycolysis, lipid metabolism and endocytosis among six crustaceans. The included species are B. jamesi (Bjam), A. vulgare (Avul), A. amphitrite (Aamp), E. affinis (Eaff), L. vannamei (Lvan), and E. sinensis (Esin). The red star indicates the correspondent gene of B. jamesi is significantly less than that of the other five crustaceans, and the black star indicates correspondent gene of B. jamesi is significantly more than that of the other five crustaceans (p < 0.05). C Expression level of the genes involved in the endocytosis of B. jamesi
Fig. 6Phylogenetic tree of the genes encoding AnxB9. The AnxB9 genes from various crustaceans were used for the tree construction, which labeled in various colors. A cluster of AnxB9 genes was specific expanded in the B. jamesi genome (gray background), and these genes were tandem duplicated in the genome. The circles with different colors indicate the genes located on different scaffolds