| Literature DB >> 27019632 |
Dong Zhang, Liping Yan, Ming Zhang, Hongjun Chu, Jie Cao, Kai Li, Defu Hu, Thomas Pape.
Abstract
The complete mitogenome of the horse stomach bot fly Gasterophilus pecorum (Fabricius) and a near-complete mitogenome of Wohlfahrt's wound myiasis fly Wohlfahrtia magnifica (Schiner) were sequenced. The mitogenomes contain the typical 37 mitogenes found in metazoans, organized in the same order and orientation as in other cyclorrhaphan Diptera. Phylogenetic analyses of mitogenomes from 38 calyptrate taxa with and without two non-calyptrate outgroups were performed using Bayesian Inference and Maximum Likelihood. Three sub-analyses were performed on the concatenated data: (1) not partitioned; (2) partitioned by gene; (3) 3rd codon positions of protein-coding genes omitted. We estimated the contribution of each of the mitochondrial genes for phylogenetic analysis, as well as the effect of some popular methodologies on calyptrate phylogeny reconstruction. In the favoured trees, the Oestroidea are nested within the muscoid grade. Relationships at the family level within Oestroidea are (remaining Calliphoridae (Sarcophagidae (Oestridae, Pollenia + Tachinidae))). Our mito-phylogenetic reconstruction of the Calyptratae presents the most extensive taxon coverage so far, and the risk of long-branch attraction is reduced by an appropriate selection of outgroups. We find that in the Calyptratae the ND2, ND5, ND1, COIII, and COI genes are more phylogenetically informative compared with other mitochondrial protein-coding genes. Our study provides evidence that data partitioning and the inclusion of conserved tRNA genes have little influence on calyptrate phylogeny reconstruction, and that the 3rd codon positions of protein-coding genes are not saturated and therefore should be included.Entities:
Keywords: Calyptratae.; Oestroidea; gene contribution; long-branch attraction; mitogenome; phylogeny; taxon sampling
Mesh:
Year: 2016 PMID: 27019632 PMCID: PMC4807417 DOI: 10.7150/ijbs.12148
Source DB: PubMed Journal: Int J Biol Sci ISSN: 1449-2288 Impact factor: 6.580
Summary of mitogenomes from Calyptratae, and two outgroup species.
| Superfamily | Family | Subfamily | Species | Locus | Reference |
|---|---|---|---|---|---|
| Tabanoidea | Tabanidae | Tabanidae | NC_008756 | [14] | |
| Ephydroidea | Drosophilidae | NC_024511 | [42] | ||
| Oestroidea | Calliphoridae | Calliphorinae | KP872701 | [43] | |
| Calliphorinae | NC_019639 | [44] | |||
| Chrysomyinae | NC_002660 | [45] | |||
| Chrysomyinae | NC_019631 | [44] | |||
| Chrysomyinae | NC_019632 | [44] | |||
| Chrysomyinae | NC_019633 | [44] | |||
| Chrysomyinae | NC_025338 | [46] | |||
| Chrysomyinae | NC_002697 | [47] | |||
| Chrysomyinae | NC_019634 | [44] | |||
| Chrysomyinae | NC_019635 | [44] | |||
| Chrysomyinae | NC_026668 | [48] | |||
| Chrysomyinae | NC_019636 | [44] | |||
| Luciliinae | NC_019638 | [44] | |||
| Luciliinae | NC_019573 | [44] | |||
| Luciliinae | NC_019637 | [44] | |||
| Luciliinae | NC_009733 | [49] | |||
| Polleniinae | JX913761 | [44] | |||
| Sarcophagidae | Paramacronychiinae | — | Present study | ||
| Sarcophaginae | NC_026196 | [50] | |||
| Sarcophaginae | NC_025944 | [51] | |||
| Sarcophaginae | NC_026667 | [52] | |||
| Sarcophaginae | NC_017605 | [53] | |||
| Sarcophaginae | NC_026112 | [54] | |||
| Sarcophaginae | NC_023532 | [55] | |||
| Sarcophaginae | NC_025574 | [56] | |||
| Sarcophaginae | NC_025573 | [57] | |||
| Tachinidae | Exoristinae | NC_018118 | [58] | ||
| Dexiinae | NC_019640 | [44] | |||
| Oestridae | Cuterebrinae | NC_006378 | [59] | ||
| Gasterophilinae | — | Present study | |||
| Hypodermatinae | NC_013932 | [60] | |||
| Muscoid grade | Anthomyiidae | KP901268 | [61] | ||
| Fanniidae | KP901269 | [61] | |||
| Muscidae | Muscinae | NC_007102 | [62] | ||
| Muscinae | NC_024855 | [63] | |||
| Muscinae | DQ533708 | [62] | |||
| Reinwardtiinae | NC_026292 | [64] | |||
| Scathophagidae | Scathophaginae | NC_024856 | [65] |
* Species used as outgroups in subgroup_1.
Summary of datasets used to perform phylogenetic analyses.
| Taxa | Dataset | Subanalyses | Models |
|---|---|---|---|
| subgroup_1 (40 species, 2 outgroups) | ALL: PCGs, rRNA genes, and a concatenation of tRNA genes. | Not partitioned | GTR + I + G |
| Partitioned by genes | P1 (ND2, COI, COII, ATP8, ATP6, COIII, ND3, ND6, CYTB) GTR + I + G | ||
| Without 3rd codon positions of PCGs | GTR + I + G | ||
| PCG&rRNA: PCGs, rRNA genes. | Not partitioned | GTR + I + G | |
| Partitioned by genes | P1 (ND2, COI, COII, ATP8, ATP6, COIII, ND3, ND6, CYTB) GTR + I + G | ||
| Without 3rd codon positions of PCGs | GTR + I + G | ||
| subgroup_2 (38 species, rooted using | ALL: PCGs, rRNA genes, and a concatenation of tRNA genes. | Not partitioned | GTR + I + G |
| Partitioned by genes | P1 (ND2, COI, COII, ATP8, ATP6, COIII, ND3, ND6, CYTB) GTR + I + G | ||
| Without 3rd codon positions of PCGs | GTR + I + G | ||
| PCG&rRNA: PCGs, rRNA genes. | Not partitioned | GTR + I + G | |
| Partitioned by genes | P1 (ND2, COI, COII, ATP8, ATP6, COIII, ND3, ND6, CYTB) GTR + I + G | ||
| Without 3rd codon positions of PCGs | GTR + I + G |
Figure 1Mitochondrial genome maps of Gasterophilus pecorum (Fabricius) and Wohlfahrtia magnifica (Schiner). Gene names without underline indicate that these genes are coded on (+) strand, while those with underline are on the (-) strand. Transfer RNA (tRNA) genes are designated by single-letter amino acid codes. White colored regions indicate those failed to sequence.
Nucleotide composition of Gasterophilus pecorum (Fabricius) / Wohlfahrtia magnifica (Schiner).
| Region | Nucleotide composition (%) | AT-skew | GC-skew | ||||
|---|---|---|---|---|---|---|---|
| T(U) | C | A | G | A+T | |||
| Whole genome | 32.4 / N * | 18.9 / N | 38.4 / N | 10.3 / N | 70.7 / N | 0.08 / N | -0.29 / N |
| Protein-coding genes | 39.5 / 43.3 | 16.2 / 12.8 | 29.0 / 31.3 | 15.3 / 12.6 | 68.5 / 74.6 | -0.15 / -0.16 | -0.03 / -0.01 |
| 1st codon position | 33.9 / 36.6 | 14.6 / 12.3 | 30.1 / 31.6 | 21.3 / 19.5 | 64.0 / 68.2 | -0.06 / -0.07 | 0.19 / 0.23 |
| 2nd codon position | 45.9 / 46.3 | 20.2 / 19.5 | 19.4 / 20.1 | 14.5 / 14.0 | 65.3 / 66.4 | -0.41 / -0.39 | -0.16 / -0.16 |
| 3rd codon position | 38.7 / 46.9 | 13.7 / 6.5 | 37.6 / 42.3 | 10.0 / 4.2 | 76.3 / 89.2 | -0.01 / -0.05 | -0.16 / -0.21 |
| tRNA | 38.1 / N | 10.8 / N | 37.4 / N | 13.6 / N | 75.5 / N | -0.01 / N | 0.11 / N |
| lrRNA | 42.1 / 41.5 | 6.9 / 6.4 | 33.3 / 38.8 | 17.6 / 13.3 | 75.4 / 80.3 | -0.12 / -0.03 | 0.43 / 0.35 |
| srRNA | 38.2 / 37.5 | 9.3 / 8.5 | 34.1 / 37.7 | 18.4 / 16.3 | 72.3 / 75.2 | -0.06 / 0.00 | 0.33 / 0.31 |
| Control Region | 45.1 / N | 4.9 / N | 35.8 / N | 14.3 / N | 80.8 / N | -0.12 / N | 0.49 / N |
* N = Not available.
Figure 2Comparisons of average mitochondrial gene p-distances at different taxonomic levels in 38 species of Calyptratae. Black bars indicate the standard deviation.
Figure 3Percentage of conserved sites (%cons) and percentage of positions experiencing gaps (%gaps) in the alignments of the 13 protein-coding genes.
Figure 4Nucleotide substitution saturation of each 13 PCGs, 1st & 2nd codon positions of PCGs, and 3rd codon positions of PCGs.
Figure 5Phylogeny of subgroup_1, inferred from mitochondrial datasets comprising 13 protein-coding genes, 2 rRNA genes, and concatenation of tRNA genes. A1, Bayesian tree inferred from not partitioned data. A2, Bayesian tree inferred from partitioned data. A3, Bayesian tree inferred from data excluding 3rd codon positions of PCGs. B1, ML tree inferred from not partitioned data. B2, ML tree inferred from partitioned data. B3, ML tree inferred from data excluding 3rd codon positions of PCGs. Numbers at nodes are posterior probabilities (Bayesian trees), and Bootstrap values (ML trees).
Figure 6Phylogeny of subgroup_1, inferred from mitochondrial datasets comprising 13 protein-coding genes and 2 rRNA genes. A1, Bayesian tree inferred from not partitioned data. A2, Bayesian tree inferred from partitioned data. A3, Bayesian tree inferred from data excluding 3rd codon positions of PCGs. B1, ML tree inferred from not partitioned data. B2, ML tree inferred from partitioned data. B3, ML tree inferred from data excluding 3rd codon positions of PCGs. Numbers at nodes are posterior probabilities (Bayesian trees), and Bootstrap values (ML trees).
Figure 7Phylogeny of subgroup_2, inferred from mitochondrial datasets comprising 13 protein-coding genes, 2 rRNA genes, and concatenation of tRNA genes. A1, Bayesian tree inferred from non-partitioned data. A2, Bayesian tree inferred from partitioned data. A3, Bayesian tree inferred from data excluding 3rd codon position of PCGs. B1, ML tree inferred from not partitioned data. B2, ML tree inferred from partitioned data. B3, ML tree inferred from data excluding 3rd codon positions of PCGs. Numbers at nodes are posterior probabilities (Bayesian trees), and Bootstrap values (ML trees).
Figure 8Phylogeny of subgroup_2, inferred from mitochondrial datasets comprising 13 protein-coding genes and 2 rRNA genes. A1, Bayesian tree inferred from not partitioned data. A2, Bayesian tree inferred from partitioned data. A3, Bayesian tree inferred from data excluding 3rd codon positions of PCGs. B1, ML tree inferred from not partitioned data. B2, ML tree inferred from partitioned data. B3, ML tree inferred from data excluding 3rd codon positions of PCGs. Numbers at nodes are posterior probabilities (Bayesian trees), and Bootstrap values (ML trees).
Figure 9Calyptrate mitochondrial gene k-scores calculated by Ktreedist, measuring overall differences in the relative branch length and topology of the phylogenetic trees generated by single protein-coding genes compared to the combined dataset.