| Literature DB >> 35028429 |
Alex Hayward1, Charlotte Wright2.
Abstract
We present a genome assembly from an individual male Celastrina argiolus) (the holly blue; Arthropoda; Insecta; Lepidoptera; Lycaenidae). The genome sequence is 499 megabases in span. The majority (99.99%) of the assembly is scaffolded into 26 chromosomal pseudomolecules, with the Z sex chromosome assembled. Gene annotation of this assembly on Ensembl has identified 12,199 protein coding genes. Copyright:Entities:
Keywords: Celastrina argiolus; Lepidoptera; chromosomal; genome sequence; holly blue
Year: 2021 PMID: 35028429 PMCID: PMC8729184 DOI: 10.12688/wellcomeopenres.17478.1
Source DB: PubMed Journal: Wellcome Open Res ISSN: 2398-502X
Figure 1. Fore and hind wings of the Celastrina argiolus specimen from which the genome was sequenced.
Dorsal (left) and ventral (right) surface view of wings from specimen EN_OX_1170 (ilCelArgi3) from Oxford, UK, used to generate Pacific Biosciences and 10X genomics data.
Genome data for Celastrina argiolus, ilCelArgi3.1.
|
| |
|---|---|
| Assembly identifier | ilCelArgi3.1 |
| Species |
|
| Specimen | ilCelArgi3 (genome assembly); ilCelArgi1, ilCelArgi4 (RNA-Seq) |
| NCBI taxonomy ID | NCBI:txid203782 |
| BioProject | PRJEB41907 |
| BioSample ID | SAMEA7523268 |
| Isolate information | Male, whole organisms |
|
| |
| PacificBiosciences SEQUEL II | ERR6558180 |
| 10X Genomics Illumina | ERR6002602-ERR6002605 |
| Hi-C Illumina | ERR6002606 |
| Illumina polyA RNA-Seq | ERR6002607, ERR6787413 |
|
| |
| Assembly accession | GCA_905187575.1 |
|
| GCA_905147145.1 |
| Span (Mb) | 499 |
| Number of contigs | 137 |
| Contig N50 length (Mb) | 8 |
| Number of scaffolds | 28 |
| Scaffold N50 length (Mb) | 20 |
| Longest scaffold (Mb) | 29 |
| BUSCO
| C:97.1%[S:96.7%,D:0.5%],F:0.6%,M:2.3%,n:5286 |
*BUSCO scores based on the lepidoptera_odb10 BUSCO set using v5.1.2. C= complete [S= single copy, D=duplicated], F=fragmented, M=missing, n=number of orthologues in comparison. A full set of BUSCO scores is available at https://blobtoolkit.genomehubs.org/view/ilCelArgi3.1/dataset/CAJJIP01/busco.
Figure 2. Genome assembly of Celastrina argiolus, ilCelArgi3.1: metrics.
The BlobToolKit Snailplot shows N50 metrics and BUSCO gene completeness. The main plot is divided into 1,000 size-ordered bins around the circumference with each bin representing 0.1% of the 499,114,119 bp assembly. The distribution of scaffold lengths is shown in dark grey with the plot radius scaled to the longest chromosome present in the assembly (29,052,767 bp, shown in red). Orange and pale-orange arcs show the N50 and N90 chromosome lengths (20,425,925 and 16,318,055 bp), respectively. The pale grey spiral shows the cumulative scaffold count on a log scale with white scale lines showing successive orders of magnitude. The blue and pale-blue area around the outside of the plot shows the distribution of GC, AT and N percentages in the same bins as the inner plot. A summary of complete, fragmented, duplicated and missing BUSCO genes in the lepidoptera_odb10 set is shown in the top right. An interactive version of this figure is available at https://blobtoolkit.genomehubs.org/view/ilCelArgi3.1/dataset/CAJJIP01/snail.
Figure 5. Genome assembly of Celastrina argiolus, ilCelArgi3.1: Hi-C contact map.
Hi-C contact map of the ilCelArgi3.1 assembly, visualised in HiGlass. Chromosomes are shown in size order from left to right and top to bottom.
Chromosomal pseudomolecules in the genome assembly of Celastrina argiolus, ilCelArgi3.1.
| INSDC accession | Chromosome | Size (Mb) | GC% |
|---|---|---|---|
| LR994577.1 | 1 | 29.05 | 36.2 |
| LR994578.1 | 2 | 24.85 | 36 |
| LR994579.1 | 3 | 24.55 | 35.8 |
| LR994580.1 | 4 | 24.51 | 36.2 |
| LR994581.1 | 5 | 24.42 | 35.8 |
| LR994582.1 | 6 | 24.04 | 36.1 |
| LR994584.1 | 7 | 21.46 | 36.1 |
| LR994585.1 | 8 | 21.39 | 35.9 |
| LR994586.1 | 9 | 20.68 | 35.7 |
| LR994587.1 | 10 | 20.43 | 36.3 |
| LR994588.1 | 11 | 19.02 | 36.1 |
| LR994589.1 | 12 | 18.92 | 35.9 |
| LR994590.1 | 13 | 18.52 | 36.4 |
| LR994591.1 | 14 | 18.52 | 36.2 |
| LR994592.1 | 15 | 18.42 | 35.9 |
| LR994593.1 | 16 | 18.32 | 36.1 |
| LR994594.1 | 17 | 17.10 | 36.4 |
| LR994595.1 | 18 | 16.97 | 36 |
| LR994596.1 | 19 | 16.88 | 36.2 |
| LR994597.1 | 20 | 16.52 | 36.5 |
| LR994598.1 | 21 | 16.32 | 36.3 |
| LR994599.1 | 22 | 15.74 | 36.2 |
| LR994600.1 | 23 | 11.45 | 36.9 |
| LR994601.1 | 24 | 10.30 | 37 |
| LR994602.1 | 25 | 6.95 | 36.4 |
| LR994583.1 | Z | 23.77 | 35 |
| LR994603.1 | MT | 18.00 | 18 |
| - | Unplaced | 0.02 | 49 |
Software tools used.
| Software tool | Version | Source |
|---|---|---|
| Hifiasm | 0.7 |
|
| purge_dups | 1.2.3 |
|
| SALSA2 | 2.2 |
|
| longranger align | 2.2.2 |
|
| freebayes | 1.3.1-17-gaa2ace8 |
|
| gEVAL | 2016 |
|
| HiGlass | 1.11.6 |
|
| PretextView | 0.1.x |
|
| BlobToolKit | 2.6.2 |
|