| Literature DB >> 33931033 |
Werner Pieter Veldsman1, Ka Yan Ma2, Jerome Ho Lam Hui2, Ting Fung Chan2, J Antonio Baeza3,4,5, Jing Qin6, Ka Hou Chu7.
Abstract
BACKGROUND: The complex life cycle of the coconut crab, Birgus latro, begins when an obligate terrestrial adult female visits the intertidal to hatch zoea larvae into the surf. After drifting for several weeks in the ocean, the post-larval glaucothoes settle in the shallow subtidal zone, undergo metamorphosis, and the early juveniles then subsequently make their way to land where they undergo further physiological changes that prevent them from ever entering the sea again. Here, we sequenced, assembled and analyzed the coconut crab genome to shed light on its adaptation to terrestrial life. For comparison, we also assembled the genomes of the long-tailed marine-living ornate spiny lobster, Panulirus ornatus, and the short-tailed marine-living red king crab, Paralithodes camtschaticus. Our selection of the latter two organisms furthermore allowed us to explore parallel evolution of the crab-like form in anomurans.Entities:
Keywords: Birgus latro; Nuclear genome; Panulirus ornatus; Paralithodes camtschaticus
Mesh:
Year: 2021 PMID: 33931033 PMCID: PMC8086120 DOI: 10.1186/s12864-021-07636-9
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Summary statistics on genome assembly, genome completeness and AT-content
| Organism | Estimated genome size (Gbp) | Assembly size (Gbp) | Contig N50 (bp) | Scaffold N50 (bp) | Scaffolds larger than 100 Kbp | Fragmented signature homologs (%) | Complete signature homologs (%) | AT-content of called bases (%) |
|---|---|---|---|---|---|---|---|---|
| 6.22 | 2.96 | 5342 | 6350 | 1054 | 23.2 | 63.5 | 57.56 | |
| 3.23 | 1.93 | 5451 | 8144 | 1787 | 15.5 | 77.6 | 57.36 | |
| 7.29 | 3.81 | 5815 | 7037 | 637 | 29.5 | 57.6 | 58.77 |
Percentage repetitive elements in the assembled genomes
| Repeat type | |||
|---|---|---|---|
| DNA elements | 4.36 | 3.70 | 1.04 |
| LINEs | 15.27 | 15.34 | 12.12 |
| Low complexity | 0.70 | 0.29 | 0.19 |
| LTR elements | 3.97 | 6.81 | 0.84 |
| Satellites | 0.06 | 0.03 | 0.03 |
| Simple repeats | 4.96 | 2.68 | 2.73 |
| SINEs | 0.21 | 0.80 | 0.13 |
| Small RNAs | 0.02 | 0.09 | 0.02 |
| Unclassified | 29.80 | 38.56 | 24.42 |
| 59.35 | 68.30 | 41.52 |
Fig. 1Phylogeny of the compared species. Interleave nodes on the tree are color coded with observed duplication events. All branches have 100% bootstrap support (separately determined with a maximum likelihood approach using 40 single copy orthologs) unless otherwise indicated with a star. This figure was drawn with ggtree version 2.2.3 [9] and Microsoft PowerPoint
Fig. 2Orthological relationships between the compared genomes. Shared orthologs are placed into four cardinal groups. a one-to-one orthology b one-to-many orthology c many-to-one orthology, and d many-to-many orthology. This figure was drawn with Circlize version 0.4.10 [10] and Microsoft PowerPoint
Fig. 3Nuclear expressed mitochondrial-targeting protein (mTP) interaction. Genes that contain mTP signals are shown in this interaction plot, which is similar in concept to a Venn-diagram. Degrees refer to the number of sets that have a given number of features in common. It is worth noting that the mTP-signal containing gene with the highest number of copies, kif22, is most proliferated in B. latro. This figure was drawn with UpSetR version 1.4.0 [11] and Microsoft PowerPoint
Fig. 4Proportional representation of alternative splicing profiles. The ratio of skipped exons to mutually exclusive exons are represented as percentage contribution with respect to their combined occurrence. Values within the bars indicate the number of alternatively spliced constructs. Each assayed tissue type is represented by an individual plot: a eyestalk, b gill, c hepatopancreas and d muscle. Identifiers starting with SRR are Sequence Read Archive (SRA) identifiers. This graph was drawn with Microsoft Excel
Fig. 5Comparison of coding sequences containing more than 25% SR-content. The genomes of the three species assembled in this study have the highest overall proportion of coding sequences with more than 25% serine/arginine (SR) content, while the two anomurans have a disproportionate number of high SR-content coding sequences without annotated gene symbols. This graph was drawn with Microsoft Excel
Fig. 6A gene structure ideogram of the high SR-content encoding b52 gene. The b52 gene contains three exons and an exceptionally high percentage of serine/arginine (SR) amino acids. This ideogram depicts the position of the gene in the Paralithodes camtschaticus genome. Besides from this gene’s product being exceptional high in SR-content, it is also of interest since it is present in all assembled genomes. This ideogram was drawn with Google Drawings