| Literature DB >> 27793085 |
Anne A Andere1, Roy N Platt2, David A Ray2, Christine J Picard3.
Abstract
BACKGROUND: Blow flies (Diptera: Calliphoridae) are important medical, veterinary and forensic insects encompassing 8 % of the species diversity observed in the calyptrate insects. Few genomic resources exist to understand the diversity and evolution of this group.Entities:
Keywords: Blow fly; Calliphoridae; Genome; Phormia regina; Sex determination; X chromosome; Y chromosome
Mesh:
Year: 2016 PMID: 27793085 PMCID: PMC5084420 DOI: 10.1186/s12864-016-3187-z
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Comparative assembly statistics of preliminary assemblies generated using CLC-GWB, Velvet and SOAPdenovo
| Assembler | Sex | Optimal kmer (bp) | # contigs | N50 (bp) | % readsa |
|---|---|---|---|---|---|
| CLC-GWB | Female | 60 | 251,115 | 3684 | 96.0 |
| Male | 60 | 306,273 | 2487 | 93.5 | |
| Velvet | Female | 75 | 317,916 | 1095 | 47.0 |
| Male | 75 | 310,045 | 956 | 37.9 | |
| SOAPdenovo | Female | 75 | 1,953,762 | 451 | 68.8 |
| Male | 75 | 2,010,365 | 351 | 58.6 |
For each assembly pipeline, different kmers were used to generate the optimal assembly, and the main metric for determining quality of assembly was the combination of the number of contigs and the N50
a% reads = the number of reads that mapped to each assembled genome
Final draft genome assembly statistics of the male and female genomes following the addition of 454 and PacBio reads, including a measure of the robustness of the assembly in the number of core eukaryotic genes assembled (CEGMA)
| Parameter | Female | Male |
|---|---|---|
| Optimal kmer (bp) | 60 | 50 |
| No. of contigs | 192,662 | 187,700 |
| N50 (bp) | 7918 | 7177 |
| Avg. contig (bp) | 2859 | 2846 |
| Max. contig (bp) | 116,828 | 691,679 |
| % reads used | 98.03 | 97.38 |
| Avg. coverage | 44X | 44X |
| % AT | 72 % | 72 % |
| Genome Size (Mb) | 550 | 534 |
| No. of contigs (> = 1000 bp) | 96,027 | 98,029 |
| Total length Mb (> = 1000 bp) | 485 | 473 |
| No. of N’s per 100 kb | 1565 | 1800 |
| CEGMA % (complete) | 93.95 | 96.77 |
| CEGMA % (partial) | 99.19 | 99.60 |
Male and female P. regina gene predictions including the total number of complete genes, the genic structure characteristics (number and length distribution of the intronic and exonic regions), the proportion genes that produced an NCBI result, and the proportion with characterized identifiable protein domains (InterProScan)
| Female | Male | |
|---|---|---|
| No. of predicted protein-encoding genes | 8312 | 9490 |
| Min length (aa) | 66 | 66 |
| Max length (aa) | 14,023 | 12,632 |
| No. of exons | 31,254 | 33,298 |
| Min length (bases) | 3 | 3 |
| Max length (bases) | 14,580 | 14,589 |
| No. of introns | 22,942 | 23,808 |
| Min length (bases) | 42 | 42 |
| Max length (bases) | 55,816 | 47,389 |
| No. of predicted genes with blast results (NCBI) | 7792 (94 %) | 8789 (93 %) |
| InterProScan Statistics | ||
| With InterPro ID’s | 6909 (83 %) | 7964 (84 %) |
An overview comparing the genic structure and statistics in P. regina (P.reg), L. cuprina (L.cup) and M. domestica (M.dom) genome assemblies
| P.reg (Male) | P.reg (Female) | L.cupa | M.doma | |
|---|---|---|---|---|
| Genes Count | 9490 | 8312 | 14,554 | 15,345 |
| Mean length | 6042 bp | 6869 bp | 12,197 bp | 13,553 bp |
| Exons Count | 33,298 | 31,254 | 65,493 | 67,886 |
| Mean length | 403 bp | 393 bp | 432 bp | 431 bp |
| Introns Count | 23,808 | 22,942 | – | 52,875 |
| Mean length | 871 bp | 936 bp | 2560 bp | 3889 bp |
aStatistics of the genic structure for L. cuprina were obtained from Anstead et al in [38], and for M. domestica from Scott et al. in
Fig. 1GO term classification of the 3 functional categories (biological processes, molecular function and cellular component) of the predicted genes in the male and female genome assemblies
Fig. 2The top 35 KEGG biological pathways of the male and female gene sets extracted from the Blast2GO analysis
Fig. 3Predicted gene structure of the sex determining gene daughterless for the female (F) and male (M) P. regina. The red boxes represent the exon, the grey boxes inclusive of the red represent the mRNA, and the black line represent the intron. Image is not drawn to scale
Fig. 4Transposable element accumulation in the female (a) and male (b) Phormia regina genome assemblies. Kimura 2-parameter distances were calculated between transposable element insertions in the genome and the homologous element in the dipteran Repbase library. Larger divergences indicate elements with larger mutation loads, and by extension, were deposited in the genome in the more distant past. Less than 40 SINE elements are present in either the female or male assemblies and are not shown here. Transposable elements are slightly more abundant in the female genome assembly. The accumulation of female specific repeats (c) follows that of the whole genome in general