| Literature DB >> 23175716 |
Sabrina Simon1, Apurva Narechania, Rob Desalle, Heike Hadrys.
Abstract
The evolution of the diverse insect lineages is one of the most fascinating issues in evolutionary biology. Despite extensive research in this area, the resolution of insect phylogeny especially of interordinal relationships has turned out to be still a great challenge. One of the challenges for insect systematics is the radiation of the polyneopteran lineages with several contradictory and/or unresolved relationships. Here, we provide the first transcriptomic data for three enigmatic polyneopteran orders (Dermaptera, Plecoptera, and Zoraptera) to clarify one of the most debated issues among higher insect systematics. We applied different approaches to generate 3 data sets comprising 78 species and 1,579 clusters of orthologous genes. Using these three matrices, we explored several key mechanistic problems of phylogenetic reconstruction including missing data, matrix selection, gene and taxa number/choice, and the biological function of the genes. Based on the first phylogenomic approach including these three ambiguous polyneopteran orders, we provide here conclusive support for monophyletic Polyneoptera, contesting the hypothesis of Zoraptera + Paraneoptera and Plecoptera + remaining Neoptera. In addition, we employ various approaches to evaluate data quality and highlight problematic nodes within the Insect Tree that still exist despite our phylogenomic approach. We further show how the support for these nodes or alternative hypotheses might depend on the taxon- and/or gene-sampling.Entities:
Mesh:
Year: 2012 PMID: 23175716 PMCID: PMC3542558 DOI: 10.1093/gbe/evs104
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Existing Phylogenetic Hypotheses within the Polyneopteran Lineages
| Relationship | Clade | Criterion | References |
|---|---|---|---|
| Blattodea + Isoptera + Mantodea | Dictyoptera | Morphology | |
| Molecular | |||
| Grylloblattodae + Mantophasmatodea | Xenonomia | Morphology | |
| Molecular | |||
| Orthoptera + Phasmatodea | Orthopterida | Morphology | |
| Molecular | |||
| Phasmatodea + Embioptera | Eukinolabia | Morphology | |
| Molecular | |||
| Zoraptera + Paraneoptera | Parametabola | Morphology | |
| Plecoptera + remaining Neoptera | (Paraphyly Polyneoptera) | Morphology | |
| Dermaptera + Zoraptera | Haplocerata | Morphology | |
| Molecular | |||
| Dermaptera + Plecoptera | — | Molecular | |
| Zoraptera + Dictyoptera | — | Morphology | |
| Molecular | |||
| Zoraptera + Embioptera | Mystroptera | Morphology |
FRAxML topology derived from data matrix M_matrix (53 species, 335 genes, 71369 amino acid positions), PROTCATWAGF model. Support values are derived from 1,000 bootstrap replicates. Bootstrap values are only given for nodes that lack maximum support. Stars indicate nodes for which at least 90% of data (=300 genes) have to be concatenated to recover this specific node based on the RADICAL analyses. Also shown is a barplot indicating the number of putative orthologous genes for each of the taxa in this data set. Color code: primarily wingless hexapods, gray; “palaeopteran” insects, red; polyneopteran insects, green, paraneopteran insects, purple; and holometabolan insects, pink.
Phylogenetic Hypotheses within the Insect Tree Addressed in This Study
| Data set | ML Analyses | RADICAL Analyses | KOGs (RADICAL Analyses Based on P_matrix_g) | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| M_matrix | P_matrix_g | P_matrix_s | M_matrix | P_matrix_g | P_matrix_s | Cell | Info | Meta | Poorly | |
| No. of species | 53 | 62 | 73 | 53 | 62 | 73 | 62 | 62 | 62 | 62 |
| No. of genes | 335 | 285 | 102 | 335 | 285 | 102 | 85 | 80 | 78 | 42 |
| Positions (aa) | 71369 | 79509 | 24507 | 71369 | 79509 | 24507 | 24133 | 19709 | 23964 | 11700 |
| Density (%) | 70 | 75 | 85 | 70 | 75 | 85 | ||||
| Clade support | ||||||||||
| Hexapoda | 100 | 100 | 100 | 15 | 10 | 1 | 20 | 10 | 10 | 10 |
| Ectognatha | n.a. | 100 | 100 | n.a. | 50 | 20 | 85 | 30 | 75 | 40 |
| Pterygota | 100 | 100 | 99 | 15 | 55 | 45 | 45 | 55 | 40 | 42 |
| Metapterygota (=Odonata + Neoptera) | 100 | 99 | 61 | 130 | 110 | 95 | 85 | 35 | 35 | 42 |
| Neoptera | 77 | 99 | 99 | 310 | 150 | 55 | 50 | 55 | 75 | 1 |
| Polyneoptera | 100 | 100 | 97 | 160 | 115 | 40 | 85 | 30 | 75 | 42 |
| Orthoptera, (Blattodea, Isoptera) | 100 | n.a. | 95 | 240 | n.a. | 70 | n.a. | n.a. | n.a. | n.a. |
| (Dermaptera, Plecoptera) | 48 | 43 | 53 | 325 | 275 | 95 | 75 | 65 | 75 | 42 |
| Zoraptera, (remaining Polyneoptera) | 100 | 100 | 97 | 160 | 115 | 40 | 85 | 30 | 75 | 42 |
| (Dermaptera, Plecoptera), (Orthoptera, (Blattodea, Isoptera)) | 67 | 45 | 55 | 300 | 275 | 95 | 40 | 65 | 78 | 42 |
| Eumetabola (=Paraneoptera + Holometabola) | 39 | 54 | 43 | 330 | 275 | 100 | 20 | 30 | 55 | 1 |
| Paraneoptera | 100 | 100 | 33 | 115 | 50 | 100 | 80 | 55 | 20 | 35 |
| Holometabola | 100 | 100 | 100 | 50 | 35 | 30 | 40 | 25 | 55 | 40 |
| Hymenoptera, remaining Holometabla | 100 | 100 | 100 | 50 | 35 | 30 | 40 | 25 | 55 | 40 |
| Coleoptera, (Lepidoptera, Diptera) | 100 | 100 | 100 | 20 | 35 | 30 | 35 | 25 | 35 | 20 |
| (Lepidoptera, Diptera) | 100 | 100 | 100 | 35 | 35 | n.a. | 35 | 20 | 20 | 15 |
Note.—For selected clades (1st column), the bootstrap support values based on the RAxML analyses (2nd column) and the number of genes required for fixation based on the RADICAL analyses (3rd column) for all three matrices are given. In the 4th column, the number of the major KOG genes required for fixation based on the P_matrix_g data set and the RADICAL analyses are given.
* indicates the number of genes for which a node no longer occurs in any randomized concatenation set of that size or larger. For two nodes (Metapterygota and Eumetabola), alternative hypotheses were tested using the P_matrix_g data set and RADICAL based on the major KOG categories (4th column). A: concatenation of 75 meta genes support the Palaeoptera hypothesis; B: concatenation of 15 poorly genes support the Palaeoptera hypothesis. C: concatenation of 75 info genes support a sistergroup relationship of Polyneoptera and Holometabola.
FRAxML topology derived from data matrix P_matrix_g (62 species, 285 genes, 79506 amino acid positions), PROTCATWAGF. Support values are derived from 1,000 bootstrap replicates. Bootstrap values are only given for nodes that lack maximum support. Stars indicate nodes for which at least 90% of data (=256 genes) have to be concatenated to recover this specific node based on the RADICAL analyses. Also shown is a barplot indicating the number of putative orthologous genes for each of the taxa in this data set. Color code: primarily wingless hexapods, gray; “palaeopteran” insects, red; polyneopteran insects, green, paraneopteran insects, purple; and holometabolan insects, pink.
FRADICAL analysis of functional subgroups. AUC values (left column) and fixation points are provided across all total evidence nodes for the functional groups 1) cellular processes and signaling (cell = 85 genes); 2) information storage and processing (info = 80 genes); 3) metabolism (meta = 78 genes); and 4) poorly recognized (poorly = 42 genes). AUC values indicate the proportion of total concatenation space occupied by that node and the fixation point indicates the number of genes required before the node appears in all concatenation sets of that size. The star indicates the number of genes for which a node no longer occurs in any randomized concatenation set of that size or larger.
FRADICAL analysis of nodal support. Shown are the RADICAL curves for each of the five phylogenetic hypotheses (at the top) for the complete P_matrix_g data set and the four functional subgroups based on this data set. The y axis shows the CFI and the x axis the concatenation size (number of genes).