| Literature DB >> 35643968 |
Zhipeng Jia1, Surong Hasi2, Claus Vogl3, Pamela A Burger1.
Abstract
Wohlfahrtia magnifica is a pest fly species, invading livestock in many European, African and Asian countries, and causing heavy agroeconomic losses. In the life cycle of this obligatory parasite, adult flies infect the host by depositing the first-stage larvae into body cavities or open wounds. The feeding larvae cause severe (skin) tissue damage and potentially fatal infections if untreated. Despite serious health detriments and agroeconomic concerns, genomic resources for understanding the biology of W. magnifica have so far been lacking. Here, we present a complete genome assembly from a single adult female W. magnifica using a Low-DNA Input workflow for PacBio HiFi library preparation. The de novo assembled genome is 753.99 Mb in length, with a scaffold N50 of 5.00 Mb, consisting of 16,718 predicted protein-encoding genes. Comparative genomic analysis revealed that W. magnifica has the closest phylogenetic relationship to Sarcophaga bullata followed by Lucilia cuprina. Evolutionary analysis of gene families showed expansions of 173 gene families in W. magnifica that were enriched for gene ontology (GO) categories related to immunity, insecticide-resistance mechanisms, heat stress response and cuticle development. In addition, 45 positively selected genes displaying various functions were identified. This new genomic resource contributes to the evolutionary and comparative analysis of dipterous flies and an in-depth understanding of many aspects of W. magnifica biology. Furthermore, it will facilitate the development of novel tools for controlling W. magnifica infection in livestock.Entities:
Keywords: PacBio HiFi sequencing; comparative genomics; de novo assembly; fly species; gene family; low DNA input; positively selected genes
Mesh:
Year: 2022 PMID: 35643968 PMCID: PMC9545800 DOI: 10.1111/1755-0998.13654
Source DB: PubMed Journal: Mol Ecol Resour ISSN: 1755-098X Impact factor: 8.678
Sequencing data statistics of Wohlfahrtia magnifica
| Parameter | Raw data | Converted HiFi data |
|---|---|---|
| Total sequence length (bp) | 408,396,865,291 | 23,467,199,316 |
| Total sequence number | 43,228,999 | 2,197,069 |
| Average sequence length (bp) | 9447 | 10,681 |
| GC content (%) | 33.79857849 | 32.7324404 |
| Max sequence length (bp) | 468,940 | 38,618 |
| N20 (bp) | 14,919 | 15,097 |
| N50 (bp) | 10,949 | 11,580 |
| N90 (bp) | 6089 | 7338 |
Protein‐coding gene annotation statistics of Wohlfahrtia magnifica assembled genome
| Parameter | |
|---|---|
| Genome size | 753.99 Mb |
| Number of contigs | 543 |
| Contig N50 | 5.00 Mb |
| GC content | 32.82% |
| Max scaffold length | 14.66 Mb |
| Number of scaffolds >50 KB | 541 |
| Number of genes | 16,718 |
| Number of mRNAs | 20,017 |
| Number of exons | 87,424 |
| Number of introns | 67,407 |
| Number of CDS | 20,017 |
| Shortest gene | 147 bp |
| Longest gene | 394,287 bp |
| Mean gene length | 9789 bp |
| Mean mRNA length | 11,538 bp |
| Mean exon length | 347 bp |
| Mean intron length | 2978 bp |
| Mean CDS length | 1515 bp |
| Mean exons per mRNA | 4 |
| Mean introns per mRNA | 3 |
| % of genome covered by genes | 21.7% |
| % of genome covered by CDS | 4.0% |
Comparison of summary statistics of genome assembly between Wohlfahrtia magnifica and other dipterous flies
| Species |
|
|
|
|
|
|---|---|---|---|---|---|
| Genome size | 753.99 Mb | 750.40 Mb | 143.73 Mb | 476.29 Mb | 534.08 Mb |
| Number of scaffolds | 543 | 20,487 | 1870 | 42,093 | 3663 |
| Scaffold N50 | 5.00 Mb | 226.57 kb | 25.29 Mb | 55.53 kb | 616.42 kb |
| Reference | In the study | GCA_000371365.1 | GCA_000001215.4 | GCA_005959815.1 | Scott et al. ( |
FIGURE 1BUSCO analysis between Wohlfahrtia magnifica and other nine dipterous flies based on diptera_odb10 gene set. (a) BUSCO assessment results of genomes. (b) BUSCO assessment results of gene sets. C, complete BUSCO genes; S, single‐copy BUSCO genes; D, duplicated BUSCO genes; F, fragmented BUSCO genes; M, missing BUSCO genes; n, total number of BUSCO genes
Repeat element statistics of Wohlfahrtia magnifica assembled genome
| Repeat element | Numbers | Bases | % of genome |
|---|---|---|---|
| SINEs | 0 | 0 | 0.00 |
| LINEs | 313,495 | 97,444,031 | 12.92 |
| LTR | 98,752 | 29,004,031 | 3.85 |
| DNA | 473,914 | 125,334,368 | 16.62 |
| Unclassified | 976,699 | 176,211,233 | 23.37 |
| Simple repeats | 279,951 | 18,690,856 | 2.48 |
| Low complexity | 61,088 | 2,981,425 | 0.40 |
| Total | / | 450,243,625 | 59.71 |
FIGURE 2Gene family analysis of Wohlfahrtia magnifica and other dipterous flies. (a) Distribution of genes in different species. 1:1 orthologues: Single‐copy orthologues; N:N orthologues: Multiple‐copy orthologues; species‐specific orthologues: Present in specific species; other orthologues: The remaining orthologues. (b) Numbers of orthogroups shared or unique in W. magnifica, S. bullata, L. cuprina and D. melanogaster
FIGURE 3Comparative analysis of the genome of Wohlfahrtia magnifica. (a) Phylogenetic relationship, the estimated divergence time (Ma) and gene family of expansion and contraction (the red numbers reflect contracted gene families, whereas the green numbers show expanded gene families.) of W. magnifica and other nine dipterous flies. (b) Top 20 of GO enrichment results of 173 expanded gene families of W. magnifica