| Literature DB >> 33734373 |
Tomas N Generalovic1, Shane A McCarthy2,3, Ian A Warren1, Jonathan M D Wood2, James Torrance2, Ying Sims2, Michael Quail2, Kerstin Howe2, Miha Pipan4, Richard Durbin2,3, Chris D Jiggins1.
Abstract
Hermetia illucens L. (Diptera: Stratiomyidae), the Black Soldier Fly (BSF) is an increasingly important species for bioconversion of organic material into animal feed. We generated a high-quality chromosome-scale genome assembly of the BSF using Pacific Bioscience, 10X Genomics linked read and high-throughput chromosome conformation capture sequencing technology. Scaffolding the final assembly with Hi-C data produced a highly contiguous 1.01 Gb genome with 99.75% of scaffolds assembled into pseudochromosomes representing seven chromosomes with 16.01 Mb contig and 180.46 Mb scaffold N50 values. The highly complete genome obtained a Benchmarking Universal Single-Copy Orthologs (BUSCO) completeness of 98.6%. We masked 67.32% of the genome as repetitive sequences and annotated a total of 16,478 protein-coding genes using the BRAKER2 pipeline. We analyzed an established lab population to investigate the genomic variation and architecture of the BSF revealing six autosomes and an X chromosome. Additionally, we estimated the inbreeding coefficient (1.9%) of the lab population by assessing runs of homozygosity. This provided evidence for inbreeding events including long runs of homozygosity on chromosome 5. The release of this novel chromosome-scale BSF genome assembly will provide an improved resource for further genomic studies, functional characterization of genes of interest and genetic modification of this economically important species.Entities:
Keywords: zzm321990 Hermetia illucenszzm321990 ; BRAKER2; Black Soldier Fly; Hi-C assembly; PacBio; domestication; genome annotation; genome assembly; inbreeding
Mesh:
Year: 2021 PMID: 33734373 PMCID: PMC8104945 DOI: 10.1093/g3journal/jkab085
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Figure 1Hermetia illucens key life stages. Dorsal view of Hermetia illucens adult male (upper left), adult female (upper right), larvae (5th instar; lower left), and pupa (lower right). Adult sex identified by genital shape and posterior color. Scale bar = 5 mm. Photos: T.N. Generalovic.
Figure 2Curated Hi-C contact map of chromosomal interactions. Super-scaffold chromosomes (n = 7) are highlighted within black frames and annotated with assembled length at the beginning of each chromosome (interactive map available on https://higlass-grit.sanger.ac.uk/l/?d=OjQNRJcmTgKyKfvkk3yODg).
Summary statistics of Hermetia illucens and selection of Diptera genomes
| Species name | Genome size | Scaffold number | Contig N50 | Scaffold N50 | Gaps | N count | GC (%) | BUSCO (%) | ||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| C | S | D | F | M | ||||||||
|
| 1.01 Gb | 20 | 16.01 Mb | 180.36 Mb | 112 | 26,439 | 42.47 | 98.60 | 97.80 | 0.80 | 0.50 | 0.90 |
|
| 1.10 Gb | 2,806 | 231.07 kb | 1.70 Mb | 24,796 | 14,305,322 | 42.46 | 98.90 | 91.10 | 7.80 | 0.60 | 0.50 |
|
| 143.73 Mb | 1,870 | 21.49 Mb | 25.29 Mb | 572 | 1,152,978 | 41.67 | 99.70 | 99.00 | 0.70 | 0.20 | 0.10 |
|
| 169.7 Mb | 13,530 | 5.10 Mb | 31.08 Mb | 166 | 16,600 | 40.43 | 99.10 | 98.10 | 1.00 | 0.40 | 0.50 |
|
| 750.40 Mb | 20,487 | 11.81 kb | 226.57 kb | 102,610 | 58,683,792 | 32.37 | 98.60 | 96.90 | 1.70 | 0.40 | 1.00 |
|
| 971.19 Mb | 12,042 | 11.31 kb | 504.65 kb | 129,113 | 150,529,258 | 40.30 | 98.40 | 97.70 | 0.70 | 1.00 | 0.60 |
|
| 363.11 Mb | 24,071 | 49.77 kb | NA | 0 | 0 | 34.12 | 98.90 | 96.60 | 2.30 | 0.60 | 0.50 |
|
| 1.28 Gb | 2,310 | 11.76 Mb | 409.78 kb | 229 | 22,935 | 38.18 | 98.90 | 94.50 | 4.40 | 0.40 | 0.70 |
|
| 579.04 Mb | 3,171 | 28.55 kb | 486.76 kb | 45,500 | 39,082,744 | 34.89 | 96.70 | 91.80 | 4.90 | 0.80 | 2.50 |
Assembly statistics for Diptera genomes generated using assembly stats script on the associated reference genome. BUSCO score generated from the “insecta_odb9” database (n = 1,658). BUSCO statistics C, complete; S, single-copy; D, duplicated; F, fragmented; M, missing.
Denotes contig number where the assembly contains no scaffolds.
Figure 3BlobToolKit snail plot of the Hermetia illucens assembly. Genome assembly statistics of iHerIll visualized as a snail plot containing BUSCO “insecta_odb9” completeness scores, scaffold assembly statistics, and sequence composition proportions.
Genome annotation and repeat statistics of the Hermetia illucens assembly
| Genome annotation statistics |
| # | |
|---|---|---|---|
| Protein-coding genes | 16,478 | ||
| Transcripts | 17,664 | ||
Annotation statistics for H. illucens assembly. BUSCO scores generated from the “insecta_odb9” and “diptera_odb9” databases .
Figure 4Genome-wide diversity, Tajima D and ROH analysis on chromosome five of the Hermetia illucens EVE population. Tajima D (A), nucleotide diversity (B), and Runs of Homozygosity (ROH) (B) values across genomic positions of chromosome five. Mean 20 kb windows smoothed with local regression and regions of interest highlighted. Both short (light blue) and long (dark blue) ROH is shown across genome position.