| Literature DB >> 35126445 |
Christopher M Austin1,2, Laurence J Croft1,2, Frederic Grandjean3, Han Ming Gan4.
Abstract
Cherax destructor, the yabby, is an iconic Australian freshwater crayfish species, which, similar to other major invertebrate groups, is grossly under-represented in genomic databases. The yabby is also the principal commercial freshwater crustacean species in Australia subject to explotation via inland fisheries and aquaculture. To address the genomics knowledge gap for this species and explore cost effective and efficient methods for genome assembly, we generated 106.8 gb of Nanopore reads and performed a long-read only assembly of the Cherax destructor genome. On a mini-server configured with an ultra-fast swap space, the de novo assembly took 131 h (∼5.5 days). Genome polishing with 126.3 gb of PCR-Free Illumina reads generated an assembled genome size of 3.3 gb (74.6% BUSCO completeness) with a contig N50 of 80,900 bp, making it the most contiguous for freshwater crayfish genome assemblies. We found an unusually large number of cellulase genes within the yabby genome which is relevant to understanding the nutritional biology, commercial feed development, and ecological role of this species and crayfish more generally. These resources will be useful for genomic research on freshwater crayfish and our methods for rapid and super-efficient genome assembly will have wide application.Entities:
Keywords: Parastacidae; annotation; aquacultrure; cellulase; decapoda; genome; nanopore
Year: 2022 PMID: 35126445 PMCID: PMC8807398 DOI: 10.3389/fgene.2021.695763
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
FIGURE 1Adult Cherax destructor. Photo provided by Christopher Austin.
Genome assembly and annotation statistics.
| Parameter | Details |
|---|---|
| Organism | Cherax destructor (Australian yabby) |
| Isolate | CDF2 (female, wild population) |
| Bioproject | PRJNA588861 |
| Biosample | SAMN13258587 |
| Whole-genome GenBank accession | WNWK00000000 |
| Assembled scaffold/contig length | 3,336,744,225 bp/3,336,542,896 bp |
| Scaffold N50 (number of sequences) | 87,184 bp (98,662) |
| Contig N50 (number of sequences) | 80,900 bp (100,635) |
| GC content | 41.43% |
| BUSCO completeness | 74.6% Single-copy, 1.1% Duplicated |
| Arthropoda odb9 ( | 15.1% Fragmented, 9.1% Missing |
| Number of predicted protein-coding genes | 45,673 |
| Number of predicted proteins | 47,377 |
| With InterPro signature | 21,102 (44.5%) |
| With gene ontology (GO) term | 14,068 (29.7%) |
FIGURE 2Statistics of publicly available decapod crustacean genome assemblies. (A) Number of sequences, Genome N50, and total assembled length (B) BUSCO completeness based on the Arthopoda ortholog dataset.
FIGURE 3Identification and phylogenetic analysis of cellulases. (A) Number of identified cellulases in five decapod crustacean and an amphipod proteomes (B) IQTree maximum likelihood tree showing the evolutionary relationships of GH9 cellulases identified from the selected proteomes. The nodes were colored based on ultrafast bootstrap values and the first three letters in each tip label correspond to the species name. Branch lengths indicate number of substitutions per site. Cdes, Cherax destructor; Cqua, Cherax quadricarinatus; Lvan, Litopenaeus vannamei; Phaw, Parhyale hawaiensis; Pmod, Penaeus monodon; Pvir, Procambarus virginalis.