| Literature DB >> 32191812 |
Fanming Meng1, Zhuoying Liu1, Han Han1, Dmitrijs Finkelbergs1, Yangshuai Jiang1, Mingfei Zhu2, Yang Wang2, Zongyi Sun2, Chao Chen3, Yadong Guo1, Jifeng Cai1.
Abstract
BACKGROUND: Blowflies (Diptera: Calliphoridae) are the most commonly found entomological evidence in forensic investigations. Distinguished from other blowflies, Aldrichina grahami has some unique biological characteristics and is a species of forensic importance. Its development rate, pattern, and life cycle can provide valuable information for the estimation of the minimum postmortem interval.Entities:
Keywords: zzm321990 Aldrichina grahamizzm321990 ; blowfly; forensic entomology; genome assembly; minimum postmortem interval; necrophagous
Year: 2020 PMID: 32191812 PMCID: PMC7081965 DOI: 10.1093/gigascience/giaa020
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Figure 1:Female adult of Aldrichina grahami on a corpse.
An overview comparison of genome assembly and structure features in 5 calyptratae flies of Diptera
| Parameter |
|
|
|
|
|
|---|---|---|---|---|---|
| Sequencing platform | PacBio | Illumina | 454/Illumina | Illumina | 454/PacBio |
| Genome size (Mb) | 600 | 458 | 366 | 692 | 550 |
| No. of contigs/Scaffolds | 1,604/7 | 74,043/4,436 | 24,071/13,807 | -/20,487 | 192,662/- |
| Contig N50 (kb) | 1,930 | 744.4 | 50 | 12 | 7.9 |
| GC level (%) | 31 | 29.3 | 34.1 | 35.1 | 26.2 |
| Repetitive regions (%) | 48.02 | 57.8 | - | 55 | 8.11 |
| Function annotation (gene No.; %) | 12,791; 99.8 | 12,160; 83.6 | 12,308; 99.5 | 14,180; 92.3 | 7,792; 94 |
| Sequencing depth | 86× | 100× | 160× | 90× | 44× |
| Completeness (BUSCO/CEGMA; %) | 99.2 | 96 | 99 | 98 | 93.6 |
Genome completeness was assessed by BUSCO or CEGMA. Four genomes of calyptratae fly species were selected: L. cuprina [57], G. morsitans [58], M. domestica [59], and P. regina [45]. The genome version of P. regina female adult was chosen. NC: not reported.
Statistics of repeat sequence analysis
| Type | RepeatMasker | LTR finder | RepeatProteinMask | RepeatModeler | Combined TEs | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| Length (Mb) | % in genome | Length (Mb) | % in genome | Length (Mb) | % in genome | Length (Mb) | % in genome | Length (Mb) | % in genome | |
| DNA | 42,174,497 | 7.03 | 0 | 0 | 41,341,346 | 6.89 | 50,464,704 | 8.41 | 69,933,653 | 11.65 |
| LINE | 10,505,716 | 1.75 | 0 | 0 | 19,838,372 | 3.31 | 26,169,690 | 4.36 | 34,333,817 | 5.72 |
| LTR | 4,789,075 | 0.8 | 15,966,332 | 2.66 | 5,778,730 | 0.96 | 1,229,900 | 0.2 | 21,249,831 | 3.54 |
| SINE | 51,914 | 0.01 | 0 | 0 | 0 | 0 | 453,547 | 0.08 | 446,000 | 0.07 |
| Other | 12,169,475 | 2.02 | 0 | 0 | 7,698,698 | 1.28 | 50,424,873 | 8.4 | 78,096,062 | 13.02 |
| Unknown | 161,876 | 0.03 | 0 | 0 | 0 | 0 | 50,424,873 | 16 | 84,103,655 | 14.02 |
| Total | 69,852,553 | 11.64 | 15,966,332 | 2.66 | 74,657,146 | 12.44 | 224,757,686 | 37.45 | 288,163,018 | 48.02 |
Other represents sequences with annotation but not belonging to the above types of repetitive genes, such as satellites, simple repeats, retroposon, artifact, helitron, and low-complexity repeats; unknown represents sequences that cannot be classified. LINE: long interspersed nuclear element; LTR: long terminal repeat; SINE: short interspersed nuclear element.
Function annotation of protein-coding genes of A. grahami
| Type | No. (%) | |
|---|---|---|
|
| Swiss-Prot | 9,648 (75.2) |
| TrEMBL | 12,721 (99.2) | |
| KEGG | 5,247 (40.9) | |
| KOG | 8,252 (64.4) | |
| GO | 7,518 (58.6) | |
| InterProScan | 10,488 (81.8) | |
| Nr | 12,780 (99.7) | |
|
| Annotated | 12,791 (99.8) |
| Gene | 12,823 |
Nr: Non-Redundant Protein Sequence Database.
Figure 2:Gene family comparison between A. grahami and other insect species.
Genome families of A. grahami and other insect species
| Species | Genes No. | Genes No. in families | Unclustered genes No. | Family No. | Unique families No. | Mean genes per family |
|---|---|---|---|---|---|---|
|
| 14,539 | 12,810 | 1,729 | 8,701 | 485 | 1.47 |
|
| 12,823 | 12,033 | 790 | 10,424 | 53 | 1.15 |
|
| 28,670 | 19,323 | 9,347 | 9,449 | 1,286 | 2.04 |
|
| 11,890 | 9,743 | 2,147 | 8,104 | 250 | 1.2 |
|
| 13 | 11 | 2,403 | 9,694 | 235 | 1.18 |
|
| 19 | 12 | 7,517 | 9,867 | 350 | 1.24 |
|
| 15 | 13 | 1,317 | 11,364 | 560 | 1.22 |
|
| 14 | 12 | 1,268 | 10,713 | 133 | 1.21 |
|
| 12 | 10 | 1,437 | 8,961 | 164 | 1.22 |
|
| 14 | 12 | 1,700 | 9,222 | 372 | 1.37 |
|
| 8,312 | 7,536 | 776 | 6,670 | 18 | 1.13 |
|
| 9,490 | 7,781 | 1,709 | 6,838 | 34 | 1.14 |
|
| 13 | 12 | 1,058 | 10 | 115 | 1.19 |
Unclustered genes and unique families represent the specific genes and families corresponding to each species.
Figure 3:Venn diagram of orthologous gene families. (A) The intersection between A. grahami and other Diptera species. (B) The intersection between A. grahami and other non-Diptera species with different dietary habits.
Figure 4:The estimation of divergence times. The numbers beside the dots of topological branches are the divergent time to the present day (million years ago). Numbers between branches represent the calibration time from fossil evidence. The right lists each family name. Numbers in parentheses are 95% confidence interval.
Figure 5:Hi-C interaction matrix maps within and among 6 chromosomes. The contact density is illustrated by the color bar from red (high density) to white (low density).
Figure 6:Collinearity and gene clustering of the A. grahami genome. (A) Collinear relationship between the A. grahami and D. melanogaster genomes. The blue bar represents the A. grahami genome and the grey one represents the fruit fly genome. (B) Gene density distribution on chromosomes of A. grahami. The outer blue circle indicates the chromosomes. The inner yellow, light blue, green, and orange circles represent the LTR, expanded gene families, contracted gene families, and positively selected genes, respectively. Window size = 1 Mb.