Oskar Lohse, Konrad Lohse, Hannah Augustijnen, Kay Lucek.
Abstract
We present a genome assembly from an individual female Erebia aethiops (the Scotch argus; Arthropoda; Insecta; Lepidoptera; Nymphalidae). The genome sequence is 473 megabases in span. The complete assembly is scaffolded into 20 chromosomal pseudomolecules, with the W and Z sex chromosomes assembled. The complete mitochondrial genome was also assembled and is 15.2 kilobases in length.
Keywords: Erebia aethiops; Lepidoptera; chromosomal; genome sequence; scotch argus
Year: 2022 PMID: 36105557 PMCID: PMC9445563 DOI: 10.12688/wellcomeopenres.17927.1
Source DB: PubMed Journal: Wellcome Open Res ISSN: 2398-502X
Chromosomal pseudomolecules in the genome assembly of Erebia aethiops, ilEreAeth2.1.
| INSDC accession | Chromosome | Size (Mb) | GC% |
|---|---|---|---|
| OV281080.1 | 1 | 33.25 | 37.1 |
| OV281081.1 | 2 | 32.76 | 37.1 |
| OV281082.1 | 3 | 32.72 | 37.2 |
| OV281083.1 | 4 | 30.44 | 37.1 |
| OV281084.1 | 5 | 30.01 | 37.5 |
| OV281085.1 | 6 | 26.26 | 37.4 |
| OV281086.1 | 7 | 25.86 | 37.4 |
| OV281087.1 | 8 | 23.96 | 37.2 |
| OV281088.1 | 9 | 20.72 | 37.3 |
| OV281089.1 | 10 | 20.45 | 37.1 |
| OV281090.1 | 11 | 20.15 | 37.3 |
| OV281091.1 | 12 | 19.45 | 37.3 |
| OV281092.1 | 13 | 19.3 | 37.3 |
| OV281093.1 | 14 | 18.42 | 37.2 |
| OV281094.1 | 15 | 17.95 | 37.3 |
| OV281095.1 | 16 | 17.05 | 37.3 |
| OV281096.1 | 17 | 15.92 | 37.4 |
| OV281097.1 | 18 | 15.76 | 37.7 |
| OV281098.1 | W | 3.11 | 37.7 |
| OV281079.1 | Z | 37.95 | 36.8 |
| OV281099.1 | MT | 0.02 | 19.6 |
| - | Unplaced | 11.97 | 37.5 |
Figure 1. Fore and hind wings of the Erebia aethiops specimen from which the genome was sequenced.
Dorsal (left) and ventral (right) surface views of wings from the specimen SC_EA_1391 (ilEreAeth2) from Carrifran Wildwood, Scotland, used to generate Pacific Biosciences and 10X Genomics data.
Genome data for Erebia aethiops, ilEreAeth2.1.
| | |
|---|---|
| Assembly identifier | ilEreAeth2.1 |
| Species | Erebia aethiops |
| Specimen | ilEreAeth2 (genome assembly); ilEreAeth1 (additional HiFi, 10X reads); ilEreAeth3 (Hi-C) |
| NCBI taxonomy ID | 447833 |
| BioProject | PRJEB47324 |
| BioSample ID | SAMEA7523289 |
| Isolate information | Female, whole organisms (ilEreAeth2, ilEreAeth1); male, whole organism (ilEreAeth3) |
| Pacific Biosciences SEQUEL II | ERR6808048 (ilEreAeth2); ERR6636094-ERR6636096, ERR6808047 (ilEreAeth1) |
| 10X Genomics Illumina | ERR6688769-ERR6688772 (ilEreAeth2); ERR6688764-ERR6688767 (ilEreAeth1) |
| Hi-C Illumina | ERR6688768 (ilEreAeth3) |
| Assembly accession | GCA_923060345.1 |
| | GCA_923062935.1 |
| Span (Mb) | 473 |
| Number of contigs | 80 |
| Contig N50 length (Mb) | 21.4 |
| Number of scaffolds | 54 |
| Scaffold N50 length (Mb) | 25.9 |
| Longest scaffold (Mb) | 33.25 |
| BUSCO* | C:98.5%[S:97.8%,D:0.7%],F:0.4%,M:1.1%,n:5286 |
*BUSCO scores based on the lepidoptera_odb10 BUSCO set using v5.1.2. C= complete [S= single copy, D=duplicated], F=fragmented, M=missing, n=number of orthologues in comparison. A full set of BUSCO scores is available at https://blobtoolkit.genomehubs.org/view/ilEreAeth2.1/dataset/CAKLPR01/busco.
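The BUSCO notation in the footnote can be unpacked programmatically. The following is a minimal sketch, not part of the original analysis: `parse_busco` is a hypothetical helper name, and the regular expression assumes the summary string has exactly the layout shown in the table.

```python
import re


def parse_busco(summary):
    """Split a BUSCO summary string into a dict of category -> value.

    Hypothetical helper for illustration; assumes the
    'C:..%[S:..%,D:..%],F:..%,M:..%,n:..' layout used in the table.
    """
    scores = {}
    for key, value in re.findall(r"([CSDFMn]):([\d.]+)", summary):
        # n is a count of orthologues; all other fields are percentages.
        scores[key] = int(value) if key == "n" else float(value)
    return scores


busco = parse_busco("C:98.5%[S:97.8%,D:0.7%],F:0.4%,M:1.1%,n:5286")

# Complete = single copy + duplicated; complete + fragmented + missing = 100%.
complete_check = busco["S"] + busco["D"]
total_check = busco["C"] + busco["F"] + busco["M"]
```

Here `complete_check` and `total_check` are simple consistency checks on the reported percentages, allowing for floating-point rounding.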
Figure 2. Genome assembly of Erebia aethiops, ilEreAeth2.1: metrics.
The BlobToolKit Snailplot shows N50 metrics and BUSCO gene completeness. The main plot is divided into 1,000 size-ordered bins around the circumference with each bin representing 0.1% of the 473,469,105 bp assembly. The distribution of chromosome lengths is shown in dark grey with the plot radius scaled to the longest chromosome present in the assembly (37,954,409 bp, shown in red). Orange and pale-orange arcs show the N50 and N90 chromosome lengths (25,856,419 and 17,052,335 bp), respectively. The pale grey spiral shows the cumulative chromosome count on a log scale with white scale lines showing successive orders of magnitude. The blue and pale-blue area around the outside of the plot shows the distribution of GC, AT and N percentages in the same bins as the inner plot. A summary of complete, fragmented, duplicated and missing BUSCO genes in the lepidoptera_odb10 set is shown in the top right. An interactive version of this figure is available at https://blobtoolkit.genomehubs.org/view/ilEreAeth2.1/dataset/CAKLPR01/snail.
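The N50 and N90 values quoted in the caption can be approximated from the chromosome table. The sketch below is an illustration rather than the BlobToolKit implementation: `nx_length` is a hypothetical helper, the sizes are the rounded Mb values from the table, and it assumes every unplaced scaffold is shorter than the N90 chromosome, so only the assembled chromosomes need to be listed.

```python
def nx_length(lengths, total, fraction):
    """Return the length at which the running sum of size-sorted
    sequences first reaches `fraction` of the total assembly span."""
    running = 0.0
    for length in sorted(lengths, reverse=True):
        running += length
        if running >= total * fraction:
            return length
    return None  # threshold not reached by the listed sequences


# Chromosome sizes in Mb, copied from the pseudomolecule table
# (autosomes 1-18, then W and Z). Unplaced scaffolds are omitted on the
# assumption that each is shorter than the N90 chromosome.
chrom_mb = [
    33.25, 32.76, 32.72, 30.44, 30.01, 26.26, 25.86, 23.96, 20.72,
    20.45, 20.15, 19.45, 19.3, 18.42, 17.95, 17.05, 15.92, 15.76,
    3.11, 37.95,
]
total_mb = 473.469105  # total assembly span from the figure caption

n50 = nx_length(chrom_mb, total_mb, 0.5)
n90 = nx_length(chrom_mb, total_mb, 0.9)
```

With these inputs the sketch selects the 25.86 Mb and 17.05 Mb chromosomes, in line with the 25,856,419 bp and 17,052,335 bp reported in the caption.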
Figure 5. Genome assembly of Erebia aethiops, ilEreAeth2.1: Hi-C contact map.
Hi-C contact map of the ilEreAeth2.1 assembly, visualised in HiGlass. Chromosomes are arranged in size order from left to right and top to bottom. The interactive Hi-C map can be viewed at https://genome-note-higlass.tol.sanger.ac.uk/l/?d=Es29fT2jTLK_QFHleOj4jQ.
Software tools used.
| Software tool | Version |
|---|---|
| Hifiasm | 0.15.3-r339 |
| purge_dups | 1.2.3 |
| SALSA2 | 2.2 |
| longranger align | 2.2.2 |
| freebayes | 1.3.1-17- |
| MitoHiFi | 2.0 |
| HiGlass | 1.11.6 |
| PretextView | 0.2.x |
| BlobToolKit | 3.0.5 |