| Literature DB >> 35673393 |
Alex Hayward1, Roger Vila2, Konrad Lohse3, Dominik Laetsch3.
Abstract
We present a genome assembly from an individual male Pyrgus malvae (the grizzled skipper; Arthropoda; Insecta; Lepidoptera; Hesperiidae). The genome sequence is 725 megabases in span. The majority (99.97%) of the assembly is scaffolded into 31 chromosomal pseudomolecules, with the Z sex chromosome assembled. Copyright:Entities:
Keywords: Lepidoptera; Pyrgus malvae; chromosomal; genome sequence; grizzled skipper
Year: 2022 PMID: 35673393 PMCID: PMC9156896 DOI: 10.12688/wellcomeopenres.17806.1
Source DB: PubMed Journal: Wellcome Open Res ISSN: 2398-502X
Figure 1. Fore and hind wings of the Pyrgus malvae specimen from which the genome was sequenced.
Top: Dorsal (left) and ventral (right) surface view of wings from specimen RO_PM_973 (ilPyrMalv3) from Suatu, Romania, used to generate Pacific Biosciences and 10X genomics data. Bottom: Dorsal (left) and ventral (right) surface view of wings from specimen RO_PM_838 (ilPyrMalv2) from Cluj-Napoca, Romania, used to generate Hi-C data.
Genome data for Pyrgus malvae, ilPyrMalv3.1.
|
| |
|---|---|
| Assembly identifier | ilPyrMalv3.1 |
| Species |
|
| Specimen | ilPyrMalv3 (genome assembly); ilPyrMalv2 (Hi-C);
|
| NCBI taxonomy ID | NCBI:txid111923 |
| BioProject | PRJEB46857 |
| BioSample ID | SAMEA7523296 |
| Isolate information | Male, whole organism (ilPyrMalv3); unknown sex,
|
|
| |
| PacificBiosciences SEQUEL II | ERR6606794-ERR6606796 |
| 10X Genomics Illumina | ERR6363273-ERR6363276 |
| Hi-C Illumina | ERR6363278 |
| Illumina polyA RNA-Seq | ERR6363277 |
|
| |
| Assembly accession | GCA_911387765.1 |
|
| GCA_911387725.2 |
| Span (Mb) | 725 |
| Number of contigs | 41 |
| Contig N50 length (Mb) | 26.0 |
| Number of scaffolds | 36 |
| Scaffold N50 length (Mb) | 27.0 |
| Longest scaffold (Mb) | 33.2 |
| BUSCO
| C:98.8%[S:98.3%,D:0.4%],F:0.2%,M:1.0%,n:5286 |
|
| |
| Number of protein-coding genes | 12,096 |
| Average length of coding sequence (bp) | 1,534.63 |
| Average number of exons per transcript | 7.83 |
| Average exon size (bp) | 207.85 |
| Average intron size (bp) | 2,914.74 |
*BUSCO scores based on the lepidoptera_odb10 BUSCO set using v5.1.2. C= complete [S= single copy, D=duplicated], F=fragmented, M=missing, n=number of orthologues in comparison. A full set of BUSCO scores is available at https://blobtoolkit.genomehubs.org/view/ilPyrMalv3.1/dataset/CAJVQT01/busco.
Figure 2. Genome assembly of Pyrgus malvae, ilPyrMalv3.1: metrics.
The BlobToolKit Snailplot shows N50 metrics and BUSCO gene completeness. The main plot is divided into 1,000 size-ordered bins around the circumference with each bin representing 0.1% of the 724,649,524 bp assembly. The distribution of chromosome lengths is shown in dark grey with the plot radius scaled to the longest chromosome present in the assembly (33,217,309 bp, shown in red). Orange and pale-orange arcs show the N50 and N90 chromosome lengths (26,976,370 and 16,663,010 bp), respectively. The pale grey spiral shows the cumulative chromosome count on a log scale with white scale lines showing successive orders of magnitude. The blue and pale-blue area around the outside of the plot shows the distribution of GC, AT and N percentages in the same bins as the inner plot. A summary of complete, fragmented, duplicated and missing BUSCO genes in the lepidoptera_odb10 set is shown in the top right. An interactive version of this figure is available at https://blobtoolkit.genomehubs.org/view/ilPyrMalv3.1/dataset/CAJVQT01/snail.
Figure 5. Genome assembly of Pyrgus malvae, ilPyrMalv3.1: Hi-C contact map.
Hi-C contact map of the ilPyrMalv3.1 assembly, visualised in HiGlass. Chromosomes are shown in size order from left to right and top to bottom. The interactive Hi-C map can be viewed here.
Chromosomal pseudomolecules in the genome assembly of Pyrgus malvae, ilPyrMalv3.1.
| INSDC accession | Chromosome | Size (Mb) | GC% |
|---|---|---|---|
| OU426946.1 | 1 | 33.22 | 36.5 |
| OU426948.1 | 2 | 30.61 | 36.9 |
| OU426949.1 | 3 | 30.48 | 36.8 |
| OU426950.1 | 4 | 30.18 | 36.6 |
| OU426951.1 | 5 | 30.06 | 36.5 |
| OU426952.1 | 6 | 29.97 | 36.8 |
| OU426953.1 | 7 | 29.61 | 36.7 |
| OU426954.1 | 8 | 29.23 | 36.7 |
| OU426955.1 | 9 | 28.53 | 36.8 |
| OU426956.1 | 10 | 27.64 | 36.6 |
| OU426957.1 | 11 | 27.02 | 36.7 |
| OU426958.1 | 12 | 26.98 | 36.9 |
| OU426959.1 | 13 | 25.95 | 36.9 |
| OU426960.1 | 14 | 25.59 | 36.9 |
| OU426961.1 | 15 | 25.29 | 36.9 |
| OU426962.1 | 16 | 24.25 | 37.1 |
| OU426963.1 | 17 | 24.17 | 37.5 |
| OU426964.1 | 18 | 23.61 | 37.2 |
| OU426965.1 | 19 | 22.98 | 37.1 |
| OU426966.1 | 20 | 21.93 | 37.2 |
| OU426967.1 | 21 | 19.36 | 38.0 |
| OU426968.1 | 22 | 19.17 | 37.5 |
| OU426969.1 | 23 | 17.50 | 37.3 |
| OU426970.1 | 24 | 16.66 | 37.5 |
| OU426971.1 | 25 | 15.31 | 37.5 |
| OU426972.1 | 26 | 14.10 | 38.0 |
| OU426974.1 | 27 | 10.67 | 39.6 |
| OU426973.1 | 28 | 10.82 | 38.3 |
| OU426975.1 | 29 | 10.54 | 38.4 |
| OU426976.1 | 30 | 10.42 | 38.4 |
| OU426947.1 | Z | 32.47 | 36.4 |
| OU426977.1 | MT | 0.02 | 18.4 |
| - | Unplaced | 0.34 | 41.9 |
Software tools used.
| Software tool | Version | Source |
|---|---|---|
| Hifiasm | 0.15.1 |
|
| purge_dups | 1.2.3 |
|
| SALSA2 | 2.2 |
|
| longranger align | 2.2.2 |
|
| freebayes | 1.3.1-17-gaa2ace8 |
|
| MitoHiFi | 2 |
|
| gEVAL | N/A |
|
| HiGlass | 1.11.6 |
|
| PretextView | 0.2.x |
|
| BlobToolKit | 2.6.4 |
|