| Literature DB >> 35846179 |
Konrad Lohse1, Alex Hayward2, Sam Ebdon1.
Abstract
We present genome assemblies from a male and female Pieris napi (the green-veined white; Arthropoda; Insecta; Lepidoptera; Pieridae). The genome sequences of the male and female are 320 and 319 megabases in span, respectively. The majority of the assembly (99.79% of the male assembly, 99.88% of the female) is scaffolded into 24 autosomal pseudomolecules, with the Z sex chromosome assembled for the male and Z and W chromosomes assembled for the female. Gene annotation of the male assembly on Ensembl has identified 13,221 protein coding genes. Copyright:Entities:
Keywords: Pieris napi; chromosomal; genome sequence; green-veined white
Year: 2021 PMID: 35846179 PMCID: PMC9257262 DOI: 10.12688/wellcomeopenres.17277.1
Source DB: PubMed Journal: Wellcome Open Res ISSN: 2398-502X
Figure 1. Fore and hind wings of Pieris napi specimens from which the genome was sequenced.
( A) Dorsal surface view of wings from specimen SC_PN_1158 (ilPieNapi4, male) used to generate Pacific Biosciences and 10X genomics data. ( B) Ventral surface view of wings from specimen SC_PN_1158 (ilPieNapi4) used to generate Pacific Biosciences and 10X genomics data. ( C) Dorsal surface view of wings from specimen SC_PN_1159 (ilPieNapi1) used to generate Hi-C data. ( D) Ventral surface view of wings from specimen SC_PN_1159 (ilPieNapi5) used to generate HiC data. ( E) Dorsal surface view of wings from specimen SC_PN_1388 (ilPieNapi1) used to generate Pacific Biosciences and 10X genomics data. ( F) Ventral surface view of wings from specimen SC_PN_1388 (ilPieNapi1, female) used to generate Pacific Biosciences and 10X genomics data. All three samples were collected from Carrifran Wildwood, Scotland, UK.
Genome data for Pieris nap i, ilPieNapi4.1 (male) and ilPieNapi1.1 (female).
| Male assembly | Female assembly | |
|---|---|---|
|
| ||
| Assembly identifier | ilPieNapi4.1 | ilPieNapi1.1 |
| Species |
| |
| Specimen | ilPieNapi4 (genome assembly); ilPieNapi5 (Hi-C);
| ilPieNapi1 (genome assembly); ilPieNapi5 (Hi-C);
|
| NCBI taxonomy ID | NCBI:txid78633 | |
| BioProject | PRJEB43012 | |
| BioSample ID | SAMEA7523140 | SAMEA7523287 |
| Isolate information | Male, whole organisms | Female, whole organism (genome assembly);
|
|
| ||
| PacificBiosciences SEQUEL II | ERR6594499 | ERR6594498 |
| 10X Genomics Illumina | ERR6054471-ERR6054474 | ERR6054475-ERR6054478 |
| Hi-C Illumina | ERR6054479 | |
| Illumina PolyA RNAseq | ERR6363261 | ERR6787421 |
|
| ||
| Assembly accession | GCA_905231885.1 | GCA_905475465.1 |
|
| GCA_905231895.1 | GCA_905475415.1 |
| Span (Mb) | 320 | 319 |
| Number of contigs | 76 | 84 |
| Contig N50 length (Mb) | 11 | 8 |
| Number of scaffolds | 49 | 43 |
| Scaffold N50 length (Mb) | 13 | 13 |
| Longest scaffold (Mb) | 15 | 15 |
| BUSCO
| C:99.1%[S:98.5%,D:0.5%],F:0.2%,M:0.7%,n:5286 | C:99.0%[S:98.4%,D:0.6%],F:0.2%,M:0.8%,n:5286 |
|
| ||
| Number of protein coding
| 13,221 | - |
| Average coding sequence
| 1,720 | - |
| Average number of exons per
| 10 | - |
| Average exon size (bp) | 360 | - |
| Average intron size (bp) | 1,929 | - |
*BUSCO scores based on the lepidoptera_odb10 BUSCO set using v5.1.2. C= complete [S= single copy, D=duplicated], F=fragmented, M=missing, n=number of orthologues in comparison. Full sets of BUSCO scores are available at https://blobtoolkit.genomehubs.org/view/ilPieNapi4.1/dataset/CAJNIX01/busco (male) and https://blobtoolkit.genomehubs.org/view/ilPieNapi1.1/dataset/CAJQFT01/busco (female).
Figure 2. Genome assembly of Pieris napi, ilPieNapi4.1 (male, A) and ilPieNapi1.1 (female, B): metrics.
The BlobToolKit Snailplot shows N50 metrics and BUSCO gene completeness. The main plot is divided into 1,000 size-ordered bins around the circumference with each bin representing 0.1% of the 320,004,350 bp (male, A) and 319,202,574 bp (female, B) assemblies. The distribution of chromosome lengths is shown in dark grey with the plot radius scaled to the longest chromosome present in the assembly (15,058,180 bp (male) and 14,821,532 bp (female), shown in red). Orange and pale-orange arcs show the N50 and N90 chromosome lengths (12,982,002 and 11,574,744 bp, male; 13,068,865 and 10,703,985 bp, female), respectively. The pale grey spiral shows the cumulative chromosome count on a log scale with white scale lines showing successive orders of magnitude. The blue and pale-blue area around the outside of the plot shows the distribution of GC, AT and N percentages in the same bins as the inner plot. A summary of complete, fragmented, duplicated and missing BUSCO genes in the lepidoptera_odb10 set is shown in the top right. Interactive versions of this figure are available at https://blobtoolkit.genomehubs.org/view/ilPieNapi4.1/dataset/CAJNIX01/snail (male) and https://blobtoolkit.genomehubs.org/view/ilPieNapi1.1/dataset/CAJQFT01/snail (female).
Figure 5. Genome assembly of Pieris napi, ilPieNapi4.1 (male, A) and ilPieNapi1.1 (female, B): Hi-C contact map.
Hi-C contact map of the ilPieNapi4.1 and ilPieNapi1.1 assemblies, visualised in HiGlass. Chromosomes are given in order of size from left to right and top to bottom.
Chromosomal pseudomolecules in the genome assembly of Pieris napi, ilPieNapi4.1 (male) and ilPieNapi1.1 (female).
| Chromosome | Male | Female | ||||
|---|---|---|---|---|---|---|
| Size (Mb) | GC% | INSDC accession | Size (Mb) | GC% | INSDC accession | |
| 1 | 15.06 | 33.7 | HG993162.1 | 14.82 | 33.7 | FR997694.1 |
| 2 | 14.34 | 33.7 | HG993163.1 | 14.22 | 33.7 | FR997695.1 |
| 3 | 13.84 | 33.7 | HG993166.1 | 13.69 | 33.8 | FR997697.1 |
| 4 | 13.51 | 33.9 | HG993169.1 | 13.66 | 33.9 | FR997698.1 |
| 5 | 13.50 | 33.8 | HG993170.1 | 13.54 | 33.8 | FR997699.1 |
| 6 | 13.82 | 33.8 | HG993167.1 | 13.46 | 33.7 | FR997700.1 |
| 7 | 14.17 | 34.2 | HG993164.1 | 13.4 | 33.7 | FR997701.1 |
| 8 | 13.20 | 33.6 | HG993172.1 | 13.34 | 33.6 | FR997702.1 |
| 9 | 13.39 | 33.5 | HG993171.1 | 13.29 | 33.5 | FR997703.1 |
| 10 | 13.69 | 33.8 | HG993168.1 | 13.26 | 33.7 | FR997704.1 |
| 11 | 12.95 | 33.9 | HG993174.1 | 13.07 | 33.9 | FR997705.1 |
| 12 | 12.84 | 33.8 | HG993176.1 | 12.99 | 33.9 | FR997706.1 |
| 13 | 12.73 | 34 | HG993177.1 | 12.73 | 33.9 | FR997707.1 |
| 14 | 12.98 | 34 | HG993173.1 | 12.71 | 33.8 | FR997708.1 |
| 15 | 12.51 | 34.2 | HG993178.1 | 12.55 | 34.2 | FR997709.1 |
| 16 | 12.49 | 33.6 | HG993179.1 | 12.52 | 33.7 | FR997710.1 |
| 17 | 12.22 | 33.9 | HG993181.1 | 12.2 | 33.7 | FR997711.1 |
| 18 | 12.87 | 35.8 | HG993175.1 | 12.12 | 36 | FR997712.1 |
| 19 | 12.39 | 34.3 | HG993180.1 | 11.73 | 33.9 | FR997713.1 |
| 20 | 11.57 | 34.2 | HG993183.1 | 11.52 | 34.1 | FR997714.1 |
| 21 | 11.75 | 33.8 | HG993182.1 | 11.48 | 33.7 | FR997715.1 |
| 22 | 10.92 | 34.2 | HG993184.1 | 10.7 | 34 | FR997716.1 |
| 23 | 10.42 | 34.6 | HG993185.1 | 10.19 | 34.4 | FR997717.1 |
| 24 | 7.95 | 35.4 | HG993186.1 | 7.39 | 34.9 | FR997718.1 |
| W | - | - | - | 2.68 | 35.2 | FR997719.1 |
| Z | 14.07 | 34 | HG993165.1 | 13.95 | 33.9 | FR997696.1 |
| MT | 0.02 | 19.9 | HG993187.1 | 0.02 | 19.6 | FR997720.1 |
| Unplaced | 0.80 | 41 | - | 1.99 | 35.4 | - |
Software tools used.
| Software tool | Version | Source |
|---|---|---|
| HiCanu | 2.1 |
|
| purge_dups | 1.2.3 |
|
| SALSA2 | 2.2 |
|
| longranger align | 2.2.2 |
|
| freebayes | 1.3.1-17-gaa2ace8 |
|
| MitoHiFi | 1.0 |
|
| gEVAL | N/A |
|
| HiGlass | 1.11.6 |
|
| PretextView | 0.1.x |
|
| BlobToolKit | 2.6.2 |
|