| Literature DB >> 35277961 |
James P Bernot1,2, Pavel Avdeyev1, Anton Zamyatin3, Niklas Dreyer4,5,6,7, Nikita Alexeev3, Marcos Pérez-Losada1,8,9, Keith A Crandall1,2,8.
Abstract
BACKGROUND: The barnacles are a group of >2,000 species that have fascinated biologists, including Darwin, for centuries. Their lifestyles are extremely diverse, from free-swimming larvae to sessile adults, and even root-like endoparasites. Barnacles also cause hundreds of millions of dollars of losses annually due to biofouling. However, genomic resources for crustaceans, and barnacles in particular, are lacking.Entities:
Keywords: Pollicipes; annotation; assembly; barnacle; crustacea; genome; larval evolution; phylogeny
Mesh:
Year: 2022 PMID: 35277961 PMCID: PMC8917513 DOI: 10.1093/gigascience/giac021
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Figure 1:Pollicipes pollicipes life cycle. Note the fundamental structural differences among the life history stages. (A) Nauplius stage 1. (B) Nauplius stage 6. (C) Cyprid, insert showing magnified view of the third antennal segment used for permanent attachment to the substratum surface. (D) Juvenile adult. (E) Mature adult, insert showing a dissected adult specimen with 6 cirri or “feeding legs,” the penis, and the gut. (F) P. pollicipes voucher and genome sequencing specimens. Asterisk indicates genome hologenophore specimen (USNM 1622609).
Figure 2:(A) Previous phylogenetic hypotheses on the position of Pollicipedomorpha taxa. (B) Phylogeny of the Cirripedia based on 5,734 protein-coding orthologs comprising 1,999,119 AA positions. The topology was identical across all analyses and all nodes received maximum support in all analyses (PP = 1, BS = 100%). Branch lengths for the partitioned ML analysis are shown. Illustrations from Darwin [8,18] except for Rhizocephala, which is from Haeckel [19].
Pollicipes pollicipes genome assembly statistics
| Statistic | Value |
|---|---|
| Total assembly length (Mb) | 770 |
| GC (%) | 52.3 |
| Largest scaffold (bp) | 64,043,775 |
| Scaffold N50 (bp) | 47,009,503 |
| Scaffold N75 (bp) | 37,696,644 |
| Scaffold L50 | 8 |
| Scaffold L75 | 12 |
| No. contigs | 1,254 |
| Contig N50 (bp) | 95,549 |
| Contig N75 (bp) | 22,233 |
| Contig N90 (bp) | 16,125 |
| BUSCO Arthropoda (%) | |
| Complete | 90.5 |
| Single | 69.4 |
| Duplicated | 21.1 |
| Fragmented | 3.3 |
| Missing | 6.2 |
| BUSCO Metazoa (%) | |
| Complete | 91.2 |
| Single | 67.8 |
| Duplicate | 23.4 |
| Fragmented | 4.4 |
| Missing | 4.4 |
Genome sequencing mean coverage based on raw data (prior to QA/QC) and genome size of 770 Mb
| Data type | Raw data (bp) | Coverage (×) |
| PacBio | 47,984,705,480 | 62 |
| Illumina WGS | 150,600,000,000 | 196 |
| HiC | 156,600,000,000 | 203 |
| CHi-C | 53,100,000,000 | 69 |
Comparison of barnacle genomes and chromosome-level crustacean genome assemblies
| Contiguity | Taxon | Species | Assembly |
| Scaffold N50 | Scaffold L50 | Arthropod BUSCO (%) | Reference |
|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
| Scaffold | Thecostraca |
| GCA_009805615.1 | 613 | 458,238 | 415 | 92.4 | [ |
| Scaffold | Thecostraca |
| GCA_014673585.1 | 482 | 56,726 | 1,896 | 56.4 | NCBI |
| Contig | Thecostraca |
| GCA_003709985.1 | 101 | 1,475 | 24,797 | 14.5 | NCBI |
| Chromosome | Branchiopoda |
| GCA_013167095.1 | 132 | 8,418,570 | 7 | 98.6 | NCBI |
| Chromosome | Branchiopoda |
| GCA_003990815.1 | 123 | 10,124,675 | 6 | 98.0 | [ |
| Chromosome | Copepoda |
| GCA_013387185.1 | 478 | 27,802,916 | 8 | 61.5 | NCBI |
| Chromosome | Copepoda |
| GCA_007210705.1 | 191 | 15,806,032 | 6 | 93.5 | [ |
| Chromosome | Copepoda |
| GCA_010645155.1 | 197 | 10,654,335 | 8 | 94.1 | [ |
| Chromosome | Decapoda |
| GCA_013436485.1 | 1,272 | 17,608,299 | 30 | 92.6 | [ |
| Chromosome | Decapoda |
| GCA_013283005.1 | 4,805 | 51,153,954 | 39 | 81.4 | [ |
The size of this assembly is much shorter than the estimated size of the haploid genome (1,300–1,600 Mb).
Taxa and orthologs used in phylogenetic analyses
| Taxon | No. orthologs (%) | AA positions | Accession No. |
|---|---|---|---|
|
| 2,137 (37) | 314,239 | SRR10034703 |
|
| 2,220 (39) | 475,439 | SRR10012027 |
|
| 2,216 (39) | 534,367 | SRR5140130 |
|
| 3,685 (64) | 678,927 | SRR5140144 |
|
| 2,238 (39) | 783,829 | DRR169034, DRR169035, DRR169036 |
|
| 4,232 (74) | 931,663 | SRR8775110 |
|
| 4,571 (80) | 1,137,491 | SRR426837 |
|
| 4,833 (84) | 1,188,730 | SRR4113502 |
|
| 4,919 (86) | 1,653,588 | SRR6818896 |
|
| 5,092 (89) | 1,742,039 | GCA_011947565.2 |
|
| 5,363 (94) | 1,777,080 | SRR5091879, SRR5091880 |
|
| 5,161 (90) | 1,790,600 | SRR10527303 |
|
| 5,221 (91) | 1,790,661 | SRR10523768 |
|
| 5,385 (94) | 1,807,445 | SRR10034703 |
Figure 3: (A) Venn diagram of genes expressed in each life stage. (B) Summary of most significant, enriched GO terms in nauplius DEGs accounting for nested GO terms. (C) Summary of most significant, enriched GO terms in adult DEGs accounting for nested GO terms. (D) Heat map of top 100 DEGs including gene IDs and annotations, clustered according to expression on the y-axis. N 1 = nauplius 1, N 2 = nauplius 2.