| Literature DB >> 33591306 |
Patrick Tran Van1,2, Yoann Anselmetti3, Jens Bast1, Zoé Dumas1, Nicolas Galtier3, Kamil S Jaron1, Koen Martens4,5, Darren J Parker1,2, Marc Robinson-Rechavi1,2, Tanja Schwander1, Paul Simion3,6, Isa Schön4,7.
Abstract
Ostracods are one of the oldest crustacean groups with an excellent fossil record and high importance for phylogenetic analyses but genome resources for this class are still lacking. We have successfully assembled and annotated the first reference genomes for three species of nonmarine ostracods; two with obligate sexual reproduction (Cyprideis torosa and Notodromas monacha) and the putative ancient asexual Darwinula stevensoni. This kind of genomic research has so far been impeded by the small size of most ostracods and the absence of genetic resources such as linkage maps or BAC libraries that were available for other crustaceans. For genome assembly, we used an Illumina-based sequencing technology, resulting in assemblies of similar sizes for the three species (335-382 Mb) and with scaffold numbers and their N50 (19-56 kb) in the same orders of magnitude. Gene annotations were guided by transcriptome data from each species. The three assemblies are relatively complete with BUSCO scores of 92-96. The number of predicted genes (13,771-17,776) is in the same range as Branchiopoda genomes but lower than in most malacostracan genomes. These three reference genomes from nonmarine ostracods provide the urgently needed basis to further develop ostracods as models for evolutionary and ecological research.Entities:
Keywords: zzm321990 Cyprideis torosazzm321990 ; zzm321990 Darwinula stevensonizzm321990 ; ancient asexual; sexual
Year: 2021 PMID: 33591306 PMCID: PMC8049415 DOI: 10.1093/g3journal/jkab043
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Figure 1The phylogenetic position of the Ostracoda among the pancrustaceans and their age estimated from fossil and molecular data. Modified from Oakley . Different pancrustaceans are indicated by branches in different colors. The Ostracoda include the Podocopida, Platycopida, and Myodocopida. Here, three representatives of the Platycopida (indicated in purple) have been sequenced. The phylogenetic clade to which D. stevensoni belongs, is indicated by D, the clade to which C. torosa and N. monacha belong, is indicated by *. Black horizontal bars represent the range of age estimates in myr from Bayesian analyses by Oakley . The letters A–C in the black boxes indicated fossils that were used for calibrations of age estimates.
Quality features of published crustacean genomic assemblies of the last 4 years and of the current study
| Class | Order | Species | Size | No. of scaffolds | N50 | BUSCO | Reference |
|---|---|---|---|---|---|---|---|
| Branchiopoda | Diplostraca |
| 156 | 1,822 | 1,661 | 96 |
|
| Branchiopoda | Diplostraca |
| 130 | 4,193 | 10,124 | 96.7 (C) |
|
| Branchiopoda | Notostraca |
| 73 | 7,167 | 116 | 98.4 (C) |
|
| Branchiopoda | Notostraca |
| 90 | 20,738 | 402 | 97.8 (C) |
|
| Branchiopoda | Spinicaudata |
| 120 | 112 | 18,000 | n.i. |
|
| Copepoda | Cyclopoida |
| 258 | 97,072 | n.i. | 50 (C) |
|
| Copepoda | Cyclopoida |
| 85 | 4,626 | 401 | n.i. |
|
| Copepoda | Harpaticoida |
| 190 | 459 | 298 | 94.5 (C) |
|
| Copepoda | Harpaticoida |
| 197 | 339 | 10,650 | 96 (C) |
|
| Copepoda | Harpaticoida |
| 295 | 270,823 | 159 | 61.1 (C) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| Malacostraca | Amphipoda |
| 4,024 | 100,000 | 69 | n.i. |
|
| Malacostraca | Isopoda |
| 1,725 | 43,451 | 51 | 87.9 (C) |
|
| Malacostraca | Decapoda |
| 3,237 | 508,682 | 33 | 81.3 (C) |
|
| Malacostraca | Decapoda |
| 1,270 | 1,368 | 3,185 | 92.7 (C) |
|
| Malacostraca | Decapoda |
| 9,185 | 28,089,718 | 586 | n.i. |
|
| Malacostraca | Decapoda |
| 1,600 | 1,211,364 | 2 | 96.8 (C + F) |
|
| Malacostraca | Decapoda |
| 1,664 | 4,682 | 606 | 95 |
|
| Malacostraca | Decapoda |
| 924 | 37,192,281 | 1 | 97 |
|
| Malacostraca | Decapoda |
| 3,300 | 3,752,011 | 39 | n.i. |
|
Assembly size is provided in million base pairs, scaffold N50 in kilo base pairs, and BUSCO scores in %. Letters behind BUSCO scores indicate the % of complete single copy genes (C) or % of single and fragmented single copy genes (C + F), respectively. Where BUSCO scores lack brackets, no further information on completeness of single copy genes was provided. n.i. = no information available.
Anchoring of scaffolds in existing genome assembly.
Linkage map available.
Long-read technology.
BAC library available.