| Literature DB >> 36157970 |
Sam Ebdon1, Alex Mackintosh1, Alex Hayward2, Karl Wotton2.
Abstract
We present a genome assembly from an individual female Colias crocea (also known as Colias croceus; the clouded yellow; Arthropoda; Insecta; Lepidoptera; Pieridae). The genome sequence is 325 megabases in span. The complete assembly is scaffolded into 32 chromosomal pseudomolecules, with the W and Z sex chromosome assembled. Gene annotation of this assembly on Ensembl has identified 13,803 protein coding genes. Copyright:Entities:
Keywords: Colias crocea; Colias croceus; chromosomal; clouded yellow; genome sequence
Year: 2021 PMID: 36157970 PMCID: PMC9490288 DOI: 10.12688/wellcomeopenres.17292.1
Source DB: PubMed Journal: Wellcome Open Res ISSN: 2398-502X
Figure 1. Fore and hind wings of Colias crocea specimen from which the genome was sequenced.
( A) Dorsal surface view of wings from specimen BU_CC_715 (ilColCroc2) from Bujaruelo, Spain, used to generate Pacific Biosciences and 10X genomics data. ( B) Ventral surface view of wings from specimen BU_CC_715 (ilColCroc2) from Bujaruelo, Spain, used to generate Pacific Biosciences and 10X genomics data.
Genome data for Colias crocea, ilColCroc2.1.
|
| |
|---|---|
| Assembly identifier | ilColCroc2.1 |
| Species |
|
| Specimen | ilColCroc2 |
| NCBI taxonomy ID | NCBI:txid72248 |
| BioProject | PRJEB42878 |
| BioSample ID | SAMEA7523360 |
| Isolate information | Female, abdomen/thorax |
|
| |
| PacificBiosciences SEQUEL II | ERR6558184 |
| 10X Genomics Illumina | ERR6054394-ERR6054397 |
| Hi-C Illumina | ERR6054398 |
| Illumina PolyA RNAseq | ERR6054399 |
|
| |
| Assembly accession | GCA_905220415.1 |
|
| GCA_905220445.1 |
| Span (Mb) | 325 |
| Number of contigs | 42 |
| Contig N50 length (Mb) | 11 |
| Number of scaffolds | 33 |
| Scaffold N50 length (Mb) | 11 |
| Longest scaffold (Mb) | 15 |
| BUSCO* genome score | C:99.0%[S:98.6%,D:0.4%],F:0.2%,M:0.8%,n:1658 |
|
| |
| Number of protein-coding genes | 13,830 |
| Average length of protein coding
| 1.631 |
| Average number of exons per
| 8 |
| Average exon size (bp) | 359 |
| Average intron size (bp) | 2,027 |
Figure 2. Genome assembly of Colias crocea, ilColCroc2.1: metrics.
The BlobToolKit Snailplot shows N50 metrics and BUSCO gene completeness. The main plot is divided into 1,000 size-ordered bins around the circumference with each bin representing 0.1% of the 324,912,214 bp assembly. The distribution of chromosome lengths is shown in dark grey with the plot radius scaled to the longest chromosome present in the assembly (17,237,107 bp, shown in red). Orange and pale-orange arcs show the N50 and N90 chromosome lengths (11,204,669 and 7,474,634 bp), respectively. The pale grey spiral shows the cumulative chromosome count on a log scale with white scale lines showing successive orders of magnitude. The blue and pale-blue area around the outside of the plot shows the distribution of GC, AT and N percentages in the same bins as the inner plot. A summary of complete, fragmented, duplicated and missing BUSCO genes in the lepidoptera_odb10 set is shown in the top right. An interactive version of this figure is available at https://blobtoolkit.genomehubs.org/view/ilColCroc2.1/dataset/ilColCroc2_1/snail.
Figure 5. Genome assembly of Colias crocea, ilColCroc2.1: Hi-C contact map.
Hi-C contact map of the ilColCroc2.1 assembly, visualised in HiGlass.
Chromosomal pseudomolecules in the genome assembly of Colias crocea, ilColCroc2.1.
| INSDC accession | Chromosome | Size (Mb) | GC% |
|---|---|---|---|
| HG991959.1 | 1 | 15.09 | 34.1 |
| HG991960.1 | 2 | 13.25 | 33.8 |
| HG991961.1 | 3 | 13.17 | 34 |
| HG991962.1 | 4 | 12.85 | 33.9 |
| HG991963.1 | 5 | 12.69 | 33.4 |
| HG991964.1 | 6 | 12.66 | 33.1 |
| HG991965.1 | 7 | 12.04 | 33.3 |
| HG991966.1 | 8 | 11.65 | 33.2 |
| HG991967.1 | 9 | 11.34 | 33.4 |
| HG991968.1 | 10 | 11.32 | 33.3 |
| HG991969.1 | 11 | 11.31 | 33.4 |
| HG991970.1 | 12 | 11.20 | 33.4 |
| HG991971.1 | 13 | 11.13 | 33.1 |
| HG991972.1 | 14 | 10.75 | 33.3 |
| HG991973.1 | 15 | 10.73 | 33.5 |
| HG991974.1 | 16 | 10.69 | 33.4 |
| HG991975.1 | 17 | 10.66 | 33.6 |
| HG991976.1 | 18 | 10.16 | 33.7 |
| HG991977.1 | 19 | 9.83 | 33.4 |
| HG991978.1 | 20 | 9.70 | 33.3 |
| HG991979.1 | 21 | 9.58 | 33.7 |
| HG991980.1 | 22 | 8.79 | 33.7 |
| HG991981.1 | 23 | 8.06 | 36 |
| HG991982.1 | 24 | 7.97 | 32.9 |
| HG991983.1 | 25 | 7.88 | 32.9 |
| HG991984.1 | 26 | 7.47 | 33.5 |
| HG991985.1 | 27 | 7.27 | 33.3 |
| HG991986.1 | 28 | 5.88 | 33.7 |
| HG991987.1 | 29 | 5.28 | 33.6 |
| HG991988.1 | 30 | 5.08 | 34.5 |
| HG991989.1 | W | 2.16 | 36 |
| HG991958.1 | Z | 17.24 | 33.9 |
| HG991990.1 | MT | 0.02 | 18.7 |
Software tools used.
| Software tool | Version | Source |
|---|---|---|
| Hifiasm | 0.12 |
|
| purge_dups | 1.2.3 |
|
| SALSA2 | 2.2 |
|
| longranger align | 2.2.2 |
|
| freebayes | 1.3.1-17-gaa2ace8 |
|
| MitoHiFi | 1.0 |
|
| gEVAL | N/A |
|
| HiGlass | 1.11.6 |
|
| PretextView | 0.1.x |
|
| BlobToolKit | 2.6.2 |
|