Literature DB >> 33591306

First annotated draft genomes of nonmarine ostracods (Ostracoda, Crustacea) with different reproductive modes.

Patrick Tran Van1,2, Yoann Anselmetti3, Jens Bast1, Zoé Dumas1, Nicolas Galtier3, Kamil S Jaron1, Koen Martens4,5, Darren J Parker1,2, Marc Robinson-Rechavi1,2, Tanja Schwander1, Paul Simion3,6, Isa Schön4,7.   

Abstract

Ostracods are one of the oldest crustacean groups with an excellent fossil record and high importance for phylogenetic analyses but genome resources for this class are still lacking. We have successfully assembled and annotated the first reference genomes for three species of nonmarine ostracods; two with obligate sexual reproduction (Cyprideis torosa and Notodromas monacha) and the putative ancient asexual Darwinula stevensoni. This kind of genomic research has so far been impeded by the small size of most ostracods and the absence of genetic resources such as linkage maps or BAC libraries that were available for other crustaceans. For genome assembly, we used an Illumina-based sequencing technology, resulting in assemblies of similar sizes for the three species (335-382 Mb) and with scaffold numbers and their N50 (19-56 kb) in the same orders of magnitude. Gene annotations were guided by transcriptome data from each species. The three assemblies are relatively complete with BUSCO scores of 92-96. The number of predicted genes (13,771-17,776) is in the same range as Branchiopoda genomes but lower than in most malacostracan genomes. These three reference genomes from nonmarine ostracods provide the urgently needed basis to further develop ostracods as models for evolutionary and ecological research.
© The Author(s) 2021. Published by Oxford University Press on behalf of Genetics Society of America.

Entities:  

Keywords:  zzm321990 Cyprideis torosazzm321990 ; zzm321990 Darwinula stevensonizzm321990 ; ancient asexual; sexual

Year:  2021        PMID: 33591306      PMCID: PMC8049415          DOI: 10.1093/g3journal/jkab043

Source DB:  PubMed          Journal:  G3 (Bethesda)        ISSN: 2160-1836            Impact factor:   3.154


Introduction

Relevance of ostracods

Ostracoda are small, bivalved crustaceans, widely occurring in almost all aquatic habitats as part of the meiobenthos and periphyton. There are 2330 formally described species of extant nonmarine ostracods (Meisch ) and at least another 7000 described species of extant marine ostracod species (see Schön and Martens 2016 for an estimate by S. Brandao). Their calcified valves are preserved as microfossils, making them the extant arthropod group with the most extensive fossil record. The group has an estimated (Cambrian) age of c. 500 myr (millions of years) according to a molecular clock (Oakley ), and c. 450 myr (Ordovician; Maddocks 1982) to 509 myr (Wolfe ) according to the fossil record. This makes them one of the oldest extant pancrustacean groups (Figure 1). Because of their excellent fossil data, evolutionary events can be dated with real-time estimates making ostracods ideal models for evolutionary research (Butlin and Menozzi 2000; Oakley and Cunningham 2002; Oakley ; Schön and Martens 2016).
Figure 1

The phylogenetic position of the Ostracoda among the pancrustaceans and their age estimated from fossil and molecular data. Modified from Oakley . Different pancrustaceans are indicated by branches in different colors. The Ostracoda include the Podocopida, Platycopida, and Myodocopida. Here, three representatives of the Platycopida (indicated in purple) have been sequenced. The phylogenetic clade to which D. stevensoni belongs, is indicated by D, the clade to which C. torosa and N. monacha belong, is indicated by *. Black horizontal bars represent the range of age estimates in myr from Bayesian analyses by Oakley . The letters A–C in the black boxes indicated fossils that were used for calibrations of age estimates.

The phylogenetic position of the Ostracoda among the pancrustaceans and their age estimated from fossil and molecular data. Modified from Oakley . Different pancrustaceans are indicated by branches in different colors. The Ostracoda include the Podocopida, Platycopida, and Myodocopida. Here, three representatives of the Platycopida (indicated in purple) have been sequenced. The phylogenetic clade to which D. stevensoni belongs, is indicated by D, the clade to which C. torosa and N. monacha belong, is indicated by *. Black horizontal bars represent the range of age estimates in myr from Bayesian analyses by Oakley . The letters A–C in the black boxes indicated fossils that were used for calibrations of age estimates. Contrary to the extensive focus on this group for palaeontological research, there is a total lack of published ostracod genomes, and even isolated genomic data from ostracods in open access databases are still rare. Thus, the only resources available beyond individual gene sequences are four mitogenomes [the marine ostracods Vargula hilgendorfii (Ogoh and Ohmiya 2004; GenBank accession number NC_005306) and Cypridina dentata (Wang ; NC_042792); and two unpublished mitogenomes from V. tsujii (NC_039175) and Cyprideis torosa (PRJNA302529)]. Also, raw Illumina DNA sequencing reads of the podocopid ostracod Eucypris virens have been generated as part of a study testing DNA extraction methods for high-throughput sequencing in zooplankton (SRX8021019; Beninde ) but these have neither been assembled nor annotated. In studies on crustacean phylogenies and gene expression (see Supplementary Table S1 for details), raw RNA-sequencing reads have been generated for a total of 12 species coming from the three major ostracod lineages (Mydocopida, Halocyprida, and Podocopida), but the number of assembled and annotated ostracod genes in these studies remains very limited, ranging between 4 and 822 genes.

Choice of model species

Extant nonmarine ostracods show a high prevalence of asexual reproduction (Chaplin ; Butlin ; Martens ), which has evolved several times independently in different ostracod lineages and is most frequent in the Cyprididae and the Darwinulidae. Ostracods are thus an ideal group to further study the paradox of sex, which remains one of the most puzzling questions in evolutionary biology (Bell 1982; Otto and Lenormand 2002; Schön ; Neiman ). The most important sets of hypotheses explaining why sex is advantageous despite its direct costs are based on the fact that physical linkage among loci generates different forms of selective interference (recently reviewed in Otto 2020). Genome-wide data are very valuable to test if asexuals indeed are affected by these predictions (e.g., Glémin ; Jaron ) and to develop insights into mechanisms such as gene conversion (Omilian ), DNA repair (Schön and Martens, 1998; Hecox-Lea and Mark Welch 2018), or horizontal gene transfer (Gladyshev ; Danchin ; Boschetti ; Paganini ; Flot ). Such data are also needed to further test for general consequences of asexuality beyond lineage-specific effects (Jaron ). For many animal groups in which asexuality is frequent, genomic data are limited to a few representatives only (Tvedte ) or are totally absent like in the Ostracoda. Of all extant nonmarine ostracods, the Cyprididae (cyprids) are most speciose, comprising 42% of all known species (Meisch ). They would thus be an obvious choice for genomic studies, also because in this ostracod family, mixed reproduction with sexual and asexual females and geographic parthenogenesis is very common (Horne ). Asexual cyprids, however, are often polyploid (Adolfsson ; Symonová ), probably because of hybridization between males and asexual females through accidental mating (Schmit ). Consequently, genome sizes are relatively large (Jeffery ; Gregory 2020) up to 3.13 pg which equals more than 3 Gb. These features are likely to seriously complicate genomic assemblies and annotations in the absence of any genomic resources for ostracods, which is why we did not choose any asexual cypridid ostracods for this genome project. Instead, we have selected three other species of nonmarine ostracods, one putative ancient asexual darwinulid ostracod and two species with obligate sexual reproduction. The ostracod family Darwinulidae is one of the two last remaining animal groups which are still supported as being genuine ancient asexuals (Heethoff ; Schön ; Schwander 2016) and comprises about 35 morphospecies (Meisch ). All darwinulids are brooders with valve dimorphisms between males and females that are detectable in the fossil record. Martens showed that males have been absent in this family for at least 200 myr. One study reported a few males in a single darwinulid species (Smith ) but proof of the functionality of these males for successful mating and meaningful genetic exchange could not been provided. Such (potential) atavistic males have also been reported in other putative ancient asexuals (Heethoff ). The type species of the Darwinulidae, Darwinula stevensoni, has been asexual since c. 20 myr (Straub 1952), occurs on all continents except Antarctica (Schön ) and in a wide range of habitats (Schön ). Darwinula stevensoni is the best investigated darwinulid ostracod so far and has been the subject of ecological (Van Doninck , 2003a, 2003b; Van den Broecke ) and molecular research using DNA sequence data from single genes (Schön ; 2003; Martens ; Schön ). These studies revealed that D. stevensoni is most likely apomictic or functionally mitotic (following the definition of apomixis in animals as in Schön ). The species also has low mutation rates as there appears to be no (Schön ) or low (Schön and Martens 2003; Schön ) allelic divergence within individuals, and genetic differences between populations from different continents can be attributed to ancient vicariant processes (Schön ). It has also been suggested that gene conversion is common in this species (Schön and Martens 1998; 2003), which could be an explanation for the low observed mutation rates. These results, however, were based on a limited number of genes and require further confirmation with genome-wide data. Darwinula stevensoni has a life cycle of 1 year in Belgium (Van Doninck ) and up to 4 years in more northern regions (McGregor 1969 in Northern America; Ranta 1979 in Finland), which is exceptionally long for a nonmarine ostracod. It can survive a wide range of temperatures, salinities (Van Doninck ), and oxygen concentrations (Rossi ). The total genome size of D. stevensoni has been estimated as 0.86–0.93 pg with flow cytometry (Paczesniak, unpublished), approximating 900 Mb. There is no information on the ploidy level of D. stevensoni, except for the study by Tétart (1979) showing 22 dot-like chromosomes. Because of its putative ancient asexuality, no close sexual relatives of D. stevensoni are available for comparative, genomic analyses. We have chosen two fully sexual nonmarine ostracod species from the Cytherideidae and the Notodromadidae with high population densities in Belgium as comparisons to the putative ancient asexual: C. torosa and Notodromas monacha, respectively. Cyprideis torosa inhabits brackish waters and is the only extant species of this genus in Europe (Meisch 2000). It has been the subject of various biological and especially palaeontological and geochemical studies (see for example, Heip, 1976a, 1976b; De Deckker ; Keyser 2005). Frogley and Whittaker (2017) suggested that C. torosa is at least of Pleistocene origin (c. 2.5 myr) but might be older. There are only two molecular studies of this species based on single genes (Schön and Martens 2003; Schön ). No information on the genome size or the karyotype of C. torosa is currently available. The second sexual ostracod species analyzed here, N. monacha, occurs throughout the Northern hemisphere and is a nonmarine ostracod with a most peculiar behavior: it is partially hyponeustonic, hanging upside down attached to the water surface (Meisch 2000). The fossil record of N. monacha goes back to the Miocene (max 23 myr—Janz 1997), and its genome size is at 0.87 pg (Jeffery ; Gregory 2020) very similar to that of D. stevensoni. This species has not yet been the subject of any molecular studies. Our aim here is to provide the first reference genome data of nonmarine ostracods from three different species with varying reproductive modes: the putative ancient asexual D. stevensoni and the two obligate sexuals, C. torosa and N. monacha. We also generate transcriptomes of these species to facilitate genome annotations.

Materials and methods

Sample collection for genome and transcriptome sequencing

All three nonmarine ostracod species were sampled in Belgian lakes where previous research had shown that these species occurred (Schön and Martens 2003; Merckx ). Living ostracods were sampled using a hand net with a mesh size of 150 µm. The hand net was swept in between the vegetation and forcefully right above the surface of the sediment for collecting D. stevensoni and C. torosa. N. monacha was sampled by moving the net on the water surface. Nonmarine ostracods were kept in habitat water. Their taxonomic identity was confirmed, and they were sorted alive under a binocular microscope as described by Martens and Horne (2016). Individual ostracods were picked with a pipette and transferred into sterilized EPA water in which they were maintained until DNA and RNA were extracted. More details on the origin of biological samples are provided in Supplementary Table S2. For generating reference genomes, DNA was extracted from a single female of each species using the QIAamp DNA Micro kit according to the manufacturer’s instructions. The extracted DNA from single females was amplified in two independent reactions using the SYNGIS TruePrime WGA kit and then pooled, to generate sufficient DNA for preparing different libraries. To generate transcriptomes for annotation of reference genomes, RNA was extracted from 40 pooled individuals per species from the same collection batch. For this, individuals were frozen in liquid nitrogen and, after addition of Trizol (Life Technologies), mechanically crushed with beads (Sigmund Lindner). Next, chloroform and ethanol-extraction methods were applied to the homogenized tissue and the aqueous layer transferred to RNeasy MinElute Columns (Qiagen). Subsequent steps of RNA extraction were done following the RNeasy Mini Kit protocol, including DNase digestion. Finally, RNA was eluted into water and stored at −80°C. RNA quantity and quality were estimated with the NanoDrop (Thermo Scientific) and Bioanalyzer (Agilent).

Genome assembly

We prepared five genomic DNA libraries for each reference genome (three 2 × 125 bp paired-end libraries with average insert sizes of 250–300, 550 and 700 bp, and two mate-pair libraries with average insert sizes of 3000 and 5000 bp; see Supplementary Table S3 for more details) with the Illumina TruSeq DNA Library Prep Kit. Reads were generated with the Illumina HiSeq 3000 system for a total coverage between 351× and 386× (Supplementary Table S3). Reads were filtered with Trimmomatic v0.36 (Bolger ) and NxTrim v0.4.1 (O’Connell ). Because of uneven coverage produced by PCR-based whole-genome amplification (Chen ; Oyola ), we first normalized reads using BBMap v36.59 (Bushnell 2014) and then assembled into contigs with SPAdes v3.10.1 (Bankevich ). Scaffolding was performed using SSPACE v3.0 (Boetzer ). Scaffolds identified as contaminants were filtered out using Blobtools v1.0 (Laetsch and Blaxter 2017). The completeness of genomes assemblies was assessed with BUSCO v3.0.2 (Seppey ) against the arthropoda_odb9 dataset. More details of the assembly pipelines and the applied parameters can be found in Supplementary Material SM1.

Protein coding gene annotation

Libraries were prepared using the Illumina TruSeq Stranded RNA kit, following the manufacturer’s instructions. RNA reads were generated with the Illumina HiSeq 2500 system (Supplementary Table S4). Reads were filtered with Trimmomatic v0.36. All trimmed reads were mapped against the genomes with STAR v2.5.3a (Dobin ) and further assembled with Trinity v2.5.1 (Haas ) under the “genome guided” mode to produce transcriptome assemblies. The obtained transcriptomes and protein evidence were used to train and predict protein coding genes using MAKER v2.31.8 (Holt and Yandell 2011). Predicted protein coding genes were functionally annotated with Blast2GO v5.5.1 (Conesa ; Götz ) against the NCBI non-redundant arthropods protein database (v 2018-10). More details of the annotation pipelines and the applied parameters can be found in Supplementary Material SM2.

GenomeScope analyses

The whole genome amplification approach, which we used in the present study because of the small body size of individual ostracods, generated unequal read coverage of ostracod genomes and prevented us from directly estimating genome sizes and levels of heterozygosity from the assemblies. To overcome this problem, we re-sequenced two individual ostracods each of D. stevensoni and N. monacha without whole genome amplification, preparing libraries with the NEBNext® Ultra™ II DNA Library Prep Kit for Illumina. Reads were filtered with Trimmomatic v0.36 and analyzed using GenomeScope v2.0 (Ranallo-Benavidez ) to correctly estimate genome size and heterozygosity. More details on the analyses are provided in the Supplementary Material SM3.

Data availability

Raw sequence reads have been deposited in NCBI’s sequence read archive under the following bioprojects: PRJNA515625 (reference genomes, Supplementary Table S3) and PRJNA631617 (RNA-seq for annotations and resequenced individuals, Supplementary Tables S4 and S5). Genome assemblies and annotations have been deposited in the European Nucleotide Archive (ENA) under the accession number PRJEB38362 (Supplementary Tables S6 and S7). Codes for the analyses are available at: https://github.com/AsexGenomeEvol/Ostracoda_genomes. Additional supplementary material is available at the figshare collection page, including detailed descriptions of the methods (Supplementary Methods SM1–SM3), Tables S1 and S2, and S6 and S8, and the results of the GenomeScope analyses of D. stevensoni and N. monacha (Supplementary Figures S1, A–D). Supplemental Material available at figshare: https://doi.org/10.25387/g3.13858817.

Results and discussion

First ostracod reference genomes and their attributes

We produced the first de novo reference genomes of nonmarine ostracods, namely of the three species D. stevensoni, C. torosa, and N. monacha with different reproductive modes (see Supplementary Material SM1 and Tables S3 and S4 for more details on the assemblies). We used a whole genome amplification approach (WGA), because the TruSeq DNA Nano library prep kit for Illumina sequencing or low input protocols for PacBio (Duncan ) were not available when these assemblies were generated. We would not recommend WGA for future studies because this PCR-based method generated uneven coverage, and consequently, problems for applying routine genome assembly methods and estimates of genome size and heterozygosity. Despite these limitations, our approach produced genome assemblies that are useful for future research as will be outlined below. When assessing the quality of the obtained ostracod de novo genome assemblies, the assembly of the putative ancient asexual, D. stevensoni, had the best contiguity, with the largest N50 although the total number of scaffolds was similar to N. monacha (Table 1, Supplementary Table S6). The genome of the putative ancient asexual is furthermore the most complete as shown by its total BUSCO score of 96% and of 94% for complete single copy genes (Table 1). The quality of the genome from the obligate sexual ostracod C. torosa is the lowest of the three ostracod species as it has the highest number of scaffolds, and the lowest N50; it is also less complete with a total BUSCO score of 92% (Supplementary Table S7) and of 87% for complete single copy genes (Table 1). All three species have similar numbers of predicted genes and transcripts (Supplementary Table S7).
Table 1

Quality features of published crustacean genomic assemblies of the last 4 years and of the current study

ClassOrderSpeciesSizeNo. of scaffoldsN50BUSCOReference
BranchiopodaDiplostraca Daphnia pulexa,b 1561,8221,66196 Ye et al. (2017)
BranchiopodaDiplostraca D. magnaa,b 1304,19310,12496.7 (C) Lee et al. (2019)
BranchiopodaNotostraca Lepidurus arcticus 737,16711698.4 (C) Savojardo et al. (2019)
BranchiopodaNotostraca L. apus lubbocki 9020,73840297.8 (C) Savojardo et al. (2019)
BranchiopodaSpinicaudata Eulimnadia texanac 12011218,000n.i. Baldwin-Brown et al. (2018)
CopepodaCyclopoida Apocyclops royi 25897,072n.i.50 (C) Jørgensen et al. (2019)
CopepodaCyclopoida Oithona nana 854,626401n.i. Madoui et al. 2017
CopepodaHarpaticoida Tigriopus californicusc 19045929894.5 (C) Barreto et al. (2018)
CopepodaHarpaticoida T.japonicusa 19733910,65096 (C) Jeong et al. (2020)
CopepodaHarpaticoida T. kingsejongensis 295270,82315961.1 (C) Kang et al. 2017
Ostracoda Podocopida Cyprideis torosa 335 132,611 19

86.6 (C)

91.9 (C + F)

Current study
Ostracoda Podocopida Darwinula stevensoni 382 62,118 56

93.7 (C)

95.8 (C + F)

Current study
Ostracoda Podocopida Notodromas monacha 377 62,251 42

92.7 (C)

94.4 (C + F)

Current study
MalacostracaAmphipoda Parhyale hawaiensisb,d 4,024100,00069n.i. Kao et al. (2016)
MalacostracaIsopoda Armadillidium vulgarec 1,72543,4515187.9 (C) Chebbi et al. (2019)
MalacostracaDecapoda Cherax quadricarinatus c 3,237508,6823381.3 (C) Tan et al. (2020)
MalacostracaDecapoda Eriocheir japonica sinensis c 1,2701,3683,18592.7 (C) Tang et al. (2020)
MalacostracaDecapoda Palaeomon carinicaudab 9,18528,089,718586n.i. Li et al. (2019)
MalacostracaDecapoda Penaeus monodonc 1,6001,211,364296.8 (C + F) Van Quyen et al. (2020)
MalacostracaDecapoda Litopenaeus vannameib,c 1,6644,68260695 Zhang et al. (2019)
MalacostracaDecapoda Marsupenaeus japonicus 92437,192,281197 Yuan et al. (2018)
MalacostracaDecapoda Procambarus virginalis 3,3003,752,01139n.i. Gutekunst et al. (2018)

Assembly size is provided in million base pairs, scaffold N50 in kilo base pairs, and BUSCO scores in %. Letters behind BUSCO scores indicate the % of complete single copy genes (C) or % of single and fragmented single copy genes (C + F), respectively. Where BUSCO scores lack brackets, no further information on completeness of single copy genes was provided. n.i. = no information available.

Anchoring of scaffolds in existing genome assembly.

Linkage map available.

Long-read technology.

BAC library available.

Quality features of published crustacean genomic assemblies of the last 4 years and of the current study 86.6 (C) 91.9 (C + F) 93.7 (C) 95.8 (C + F) 92.7 (C) 94.4 (C + F) Assembly size is provided in million base pairs, scaffold N50 in kilo base pairs, and BUSCO scores in %. Letters behind BUSCO scores indicate the % of complete single copy genes (C) or % of single and fragmented single copy genes (C + F), respectively. Where BUSCO scores lack brackets, no further information on completeness of single copy genes was provided. n.i. = no information available. Anchoring of scaffolds in existing genome assembly. Linkage map available. Long-read technology. BAC library available. Ostracod genome sizes estimated with flow cytometry are somewhat larger than the estimates that we obtained here from GenomeScope analyses of re-sequenced individual ostracods. The haploid genome size of D. stevensoni was estimated at 420–455 Mb with flow cytometry (Paczesniak, unpublished) while we estimated 362 Mb from sequence reads (Supplementary Figure S1, A and B). Similarly, the size of the haploid genome of N. monacha is estimated at 425 Mb with flow cytometry (Jeffery ; Gregory 2020), which is larger than the 385 Mb (Supplementary Figure S1, C and D) that we obtained from sequence reads. It thus seems that either the genome size estimates by flow cytometry are incorrect or that some parts of each genome are missing from our sequencing reads. Transposons and repeat-rich genomic regions can contribute to gaps in genomic assemblies (Peona ). Some of these missing regions could also be GC rich, a feature which is known to cause a sequencing bias with Illumina technology (see for example, Chen , Botero-Castro ). Acquiring more complete genome assemblies will require the additional application of long-read technologies to ostracods. Genome-wide estimates of heterozygosity are especially interesting for asexual taxa because the absence of recombination is expected to cause accumulation of mutations, resulting in increasing allelic divergences within individuals (Birky 1996). Jaron identified three factors driving intragenomic heterozygosity in asexuals: how the transition to parthenogenesis occurred, which cytological mechanism underlies parthenogenesis and how long asexual reproduction has been ongoing. Based on sequencing reads from individual ostracods, we estimate heterozygosity of the putative ancient asexual ostracod D. stevensoni to be 0.92%–0.99% (Supplementary Figure S1, A and B) and 1.32%–1.43% for the sexual N. monacha (Supplementary Figure S1, C and D). The genome-wide heterozygosity of D. stevensoni matches to some extent an earlier study on intra-individual divergence in three nuclear genes of D. stevensoni (Schön and Martens 2003). The finding of almost 1% heterozygosity in D. stevensoni is remarkable, given that all previous genome-wide estimates for asexual arthropods that did not evolve via hybridization revealed extremely low levels of heterozygosity (Jaron ). Yet heterozygosity is clearly less than the estimates for parthenogenetic species with known hybrid origin (1.73%–8.5%) or polyploidy (1.84%–33.21%) (Jaron ), supporting the view that D. stevensoni is neither a hybrid nor a polyploid. Asexual reproduction in ostracods is thought to be apomictic (Chaplin ), implying that observed heterozygosity levels are largely dependent on the relative impact of heterozygosity losses from gene conversion and heterozygosity gains from new mutations. Given the apparent absence of sex and recombination for millions of years (Straub 1952), it is perhaps surprising that heterozygosity in this putative ancient asexual ostracod is not larger. This may suggest that genome-wide rates of gene conversion and mutation are comparable in this species.

Genome contiguity of ostracod assemblies as compared to other crustaceans

We here compare the qualities of our ostracod genome assemblies to those of 19 other crustacean species (Table 1) published in the last 4 years. We only include studies with complete assemblies and sufficient information to assess assembly qualities. We assessed the contiguity of the three de novo ostracod genome assemblies by the number of scaffolds and their N50. Both features are comparable to those of the copepod Apocylops royi (Jørgensen ) and the amphipod Parhyale hawaiensis (Kao ) (Table 1) and better than for crustaceans with larger genomes such as the decapods Cherax quadricarinatus (Tan ), Palaeomon carinicauda (Li ), Penaeus mondon (Van Quyen ), Marsupenaeus japonicus (Yuan ), and Procamburus virginalis (Gutekunst ; Table 1). Genome assemblies of several other crustaceans, however, have smaller scaffold numbers and higher N50 and thus better contiguities than the assemblies obtained here for nonmarine ostracods. For the two notostracan Lepidurus species (Savojardo ), this can probably be explained by their smaller genome sizes. For other crustaceans, genome assemblies or linkage maps have been available beforehand which have considerably improved assembly qualities (Table 1) as in the examples of the cladocerans Daphnia pulex (Ye ), D. magna (Lee ), and the copepod Tigriopus japonicus (Jeong ). No such genomic resources are currently available for ostracods. Finally, other studies of crustacean genomes with better assembly contiguities (the branchiopod Eulimnadia texana—Baldwin-Brown , and the decapod Erichoir japonica sinensis—Tang , the copepod Tigriopus californicus—Jeong , and the isopod Armadillium vulgare—Chebbi ) have used a combination of Illumina and long-read technologies (Table 1). Long-read technologies such as PacBio used to require a relatively large amount of high-molecular weight DNA (Solares ), which could not be obtained for ostracods with their very low yields of high-molecular weight DNA from individual specimens and their small body sizes as compared to many other crustaceans (Schön and Martens 2016). We hope that low input protocols for PacBio (Duncan ) and other long-read technologies can be successfully applied to ostracods in the future, in which case the genome assemblies obtained here could form the basis for subsequent hybrid assemblies. Optimizing Oxford Nanopore Technology for nonmarine ostracods has already commenced (Schön et al. in prep.).

Genome annotations of ostracods and other crustaceans

Because our de novo ostracod genome assemblies are relatively complete (see BUSCO scores in Table 1), we will here also briefly compare some features of predicted protein coding genes with those of other crustaceans (Supplementary Table S8). We have predicted 13,771–17,776 protein coding genes in the three nonmarine ostracod genomes (Supplementary Tables S7 and S8), with the highest number for the sexual C. torosa and an intermediate estimate for the putative ancient asexual D. stevensoni. The number of annotated protein coding genes in nonmarine ostracods is similar to estimates for various branchiopods and the copepods Oithona nana, Tigriopus californicus, and T. kingsejongensis but lower than in most malacostracans (Supplementary Table S8). Not all genome studies of crustaceans cited here contain information on other features of coding genes, such as the average size of genes, introns, and exons (Supplementary Table S7). Comparisons of these features are therefore limited and will not be further discussed here but we provide available data of these features for ostracods and other crustacean genomes for reference. Gene annotation in general but especially in the crustaceans is challenging; this is for example illustrated by the much lower numbers of protein coding genes (18,440) which are predicted in the novel reference genome of the cladoceran Daphnia pulex by Ye as compared to the first assembly of D. pulex with more than 30,000 predicted genes (Colbourne ). Even more difficult is assigning gene functions to annotated crustacean genomes (Rotllant ). The novel data on predicted genes and transcripts from nonmarine ostracods in the current study will significantly contribute to future genome annotations in crustaceans and other arthropods. The genes and transcripts predicted here can also provide the baseline for future gene expression studies of nonmarine and marine ostracods.

Conclusions

We have successfully obtained de novo genome assemblies for three species of nonmarine ostracods with different reproductive modes. These represent the first quality reference genomes for ostracods. Given the paucity of genome assemblies from crustaceans as compared to insects or other arthropods, these assemblies are important tools to further develop ostracods as models for evolutionary and ecological research, also including marine species. Even if the de novo genome assemblies are somewhat fragmented and not yet at the chromosome level, they have a high level of completeness and will thus facilitate future studies of ostracods. The genomes presented here can also provide the first step toward a genomic assessment of the putative ancient asexual status of nonmarine darwinulid ostracod species.
  64 in total

1.  Multiple lateral gene transfers and duplications have promoted plant parasitism ability in nematodes.

Authors:  Etienne G J Danchin; Marie-Noëlle Rosso; Paulo Vieira; Janice de Almeida-Engler; Pedro M Coutinho; Bernard Henrissat; Pierre Abad
Journal:  Proc Natl Acad Sci U S A       Date:  2010-09-27       Impact factor: 11.205

2.  Ameiotic recombination in asexual lineages of Daphnia.

Authors:  Angela R Omilian; Melania E A Cristescu; Jeffry L Dudycha; Michael Lynch
Journal:  Proc Natl Acad Sci U S A       Date:  2006-11-22       Impact factor: 11.205

3.  Clonal genome evolution and rapid invasive spread of the marbled crayfish.

Authors:  Julian Gutekunst; Ranja Andriantsoa; Cassandra Falckenhayn; Katharina Hanna; Wolfgang Stein; Jeanne Rasamy; Frank Lyko
Journal:  Nat Ecol Evol       Date:  2018-02-05       Impact factor: 15.460

4.  Selective Interference and the Evolution of Sex.

Authors:  Sarah P Otto
Journal:  J Hered       Date:  2021-03-12       Impact factor: 2.645

5.  The genome of the freshwater water flea Daphnia magna: A potential use for freshwater molecular ecotoxicology.

Authors:  Bo-Young Lee; Beom-Soon Choi; Min-Sub Kim; Jun Chul Park; Chang-Bum Jeong; Jeonghoon Han; Jae-Seong Lee
Journal:  Aquat Toxicol       Date:  2019-02-15       Impact factor: 4.964

6.  A general purpose genotype in an ancient asexual.

Authors:  Karine Van Doninck; Isa Schön; Luc De Bruyn; Koen Martens
Journal:  Oecologia       Date:  2002-07-01       Impact factor: 3.225

7.  De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis.

Authors:  Brian J Haas; Alexie Papanicolaou; Moran Yassour; Manfred Grabherr; Philip D Blood; Joshua Bowden; Matthew Brian Couger; David Eccles; Bo Li; Matthias Lieber; Matthew D MacManes; Michael Ott; Joshua Orvis; Nathalie Pochet; Francesco Strozzi; Nathan Weeks; Rick Westerman; Thomas William; Colin N Dewey; Robert Henschel; Richard D LeDuc; Nir Friedman; Aviv Regev
Journal:  Nat Protoc       Date:  2013-07-11       Impact factor: 13.491

8.  The Genome of Armadillidium vulgare (Crustacea, Isopoda) Provides Insights into Sex Chromosome Evolution in the Context of Cytoplasmic Sex Determination.

Authors:  Mohamed Amine Chebbi; Thomas Becking; Bouziane Moumen; Isabelle Giraud; Clément Gilbert; Jean Peccoud; Richard Cordaux
Journal:  Mol Biol Evol       Date:  2019-04-01       Impact factor: 16.240

9.  Evolutionary diversity and novelty of DNA repair genes in asexual Bdelloid rotifers.

Authors:  Bette J Hecox-Lea; David B Mark Welch
Journal:  BMC Evol Biol       Date:  2018-11-28       Impact factor: 3.260

10.  Contribution of lateral gene transfers to the genome composition and parasitic ability of root-knot nematodes.

Authors:  Julien Paganini; Amandine Campan-Fournier; Martine Da Rocha; Philippe Gouret; Pierre Pontarotti; Eric Wajnberg; Pierre Abad; Etienne G J Danchin
Journal:  PLoS One       Date:  2012-11-30       Impact factor: 3.240

View more
  1 in total

1.  A Survey of Transposon Landscapes in the Putative Ancient Asexual Ostracod Darwinula stevensoni.

Authors:  Isa Schön; Fernando Rodriguez; Matthew Dunn; Koen Martens; Michael Shribak; Irina R Arkhipova
Journal:  Genes (Basel)       Date:  2021-03-11       Impact factor: 4.096

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.