| Literature DB >> 30926723 |
Ellen O Martinson1, Justin Peyton2, Yogeshwar D Kelkar3, Emily C Jennings4, Joshua B Benoit4, John H Werren3, David L Denlinger2.
Abstract
The flesh fly, Sarcophaga bullata, is a widely-used model for examining the physiology of insect diapause, development, stress tolerance, neurobiology, and host-parasitoid interactions. Flies in this taxon are implicated in myiasis (larval infection of vertebrates) and feed on carrion, aspects that are important in forensic studies. Here we present the genome of S. bullata, along with developmental- and reproduction-based RNA-Seq analyses. We predict 15,768 protein coding genes, identify orthology in relation to closely related flies, and establish sex and developmental-specific gene sets based on our RNA-Seq analyses. Genomic sequences, predicted genes, and sequencing data sets have been deposited at the National Center for Biotechnology Information. Our results provide groundwork for genomic studies that will expand the flesh fly's utility as a model system.Entities:
Keywords: Diptera; Sarcophaga bullata; diapause; forensics; host-parasitoid interactions; ontogenesis; stress tolerance
Mesh:
Year: 2019 PMID: 30926723 PMCID: PMC6505164 DOI: 10.1534/g3.119.400148
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Summary of Illumina read filtering
| Reads (#) | Bases (#) | Reads (%) | Bases (%) | |
|---|---|---|---|---|
| Raw | 428,355,968 | 40,615,616,800 | 100 | 100 |
| Quality Filtered | 414,083,892 | 37,890,395,975 | 96.67 | 93.29 |
| k-mer Filtered | 376,681,172 | 33,610,109,059 | 87.94 | 82.75 |
Figure 1Phylogenetic placement and genomic comparisons for Sarcophaga bullata and other fly species. Left, The phylogenetic analysis places S. bullata as a sister species to the bottle fly, Lucilia cuprina. The phylogeny is built using RAxML and it is based on amino acid sequences from 343 single-copy genes that are present in all eight species. Bootstrap values are shown for every node. Middle, Benchmarking Universal Single-Copy Orthologs (BUSCO, Simão ) analyses based on the dipteran dataset (odb8). The dataset was searched against the entire genome sequence (blue) and the annotated gene set (red). Right, Orthology-based analyses of protein coding genes between eight fly species determined the number of genes in single-copy core (i.e., found in all species evaluated) clusters, variable-copy number core clusters, paraphyletic clusters (non-core, non-species specific), singleton, and species-specific clusters, based on OrthoFinder (Emms and Kelly, 2015).
Summary of different types of repeat elements
| Type | Number | Bases | % of Genome |
|---|---|---|---|
| SINEs | 33,478 | 7,221,604 | 1.52 |
| LINEs | 179,696 | 32,921,812 | 6.91 |
| LTR | 17,235 | 6,433,260 | 1.35 |
| DNA | 127,299 | 19,969,616 | 4.19 |
| Unclassified | 397,153 | 59,304,762 | 12.45 |
| Total interspersed | 125,851,054 | 26.42 | |
| Small RNA | 323 | 44,904 | 0.01 |
| Simple repeats | 416,660 | 18,229,578 | 3.83 |
| Low complexity | 89,350 | 4,553,468 | 0.96 |
| Total | 148,389,050 | 31.15 |
Comparison of exon and intron content of Sarcophaga bullata to other flies. Source: Dm: dos Santos ; Gm: International Glossina Genome Initiative 2014; Md: Scott
| Species | Genome size (Mb) | Exon Number | Exon Length (bp) | Intron Number | Intron Length (bp) |
|---|---|---|---|---|---|
| 200 | 77,682 | 539 | 58,537 | 1,700 | |
| 590 | 63,000 | 475 | 52,000 | 1,600 | |
| 1021 | 67,886 | 431 | 52,875 | 3,889 | |
| 593 | 66,485 | 422 | 52,110 | 1,989 |
Figure 2Gene expression analysis to determine sex- and developmental-specific gene sets in S. bullata. A) Multidimensional scaling analysis (MDS) showing clustering of sex and developmental stages. The MDS was performed on count data in EdgeR (Robinson ). B-E) Venn diagrams categorizing differentially expressed genes in sex-specific gene sets displaying B) male-specific gene expression, C) female-specific gene expression, D) male carcass-specific gene expression, and E) female carcass-specific gene expression.