| Literature DB >> 29020740 |
T Fatima Mitterboeck1,2, Shanlin Liu3,4, Sarah J Adamowicz1,2, Jinzhong Fu1, Rui Zhang3, Wenhui Song3, Karen Meusemann5,6,7, Xin Zhou8,9.
Abstract
The evolution of powered flight is a major innovation that has facilitated the success of insects. Previously, studies of birds, bats, and insects have detected molecular signatures of differing selection regimes in energy-related genes associated with flight evolution and/or loss. Here, using DNA sequences from more than 1000 nuclear and mitochondrial protein-coding genes obtained from insect transcriptomes, we conduct a broader exploration of which gene categories display positive and relaxed selection at the origin of flight as well as with multiple independent losses of flight. We detected a number of categories of nuclear genes more often under positive selection in the lineage leading to the winged insects (Pterygota), related to catabolic processes such as proteases, as well as splicing-related genes. Flight loss was associated with relaxed selection signatures in splicing genes, mirroring the results for flight evolution. Similar to previous studies of flight loss in various animal taxa, we observed consistently higher nonsynonymous-to-synonymous substitution ratios in mitochondrial genes of flightless lineages, indicative of relaxed selection in energy-related genes. While oxidative phosphorylation genes were not detected as being under selection with the origin of flight specifically, they were most often detected as being under positive selection in holometabolous (complete metamorphosis) insects as compared with other insect lineages. This study supports some convergence in gene-specific selection pressures associated with flight ability, and the exploratory analysis provided some new insights into gene categories potentially associated with the gain and loss of flight in insects.Entities:
Keywords: 1KITE project; Insect transcriptomes; flight; flight loss; molecular evolution; positive selection
Mesh:
Year: 2017 PMID: 29020740 PMCID: PMC5632299 DOI: 10.1093/gigascience/gix073
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Figure 1:Tree topology and species used in analyses of nuclear genes. Species names followed by a star indicate those species used in positive selection analysis associated with the origin of Pterygota (branch “P”) and other lineages for comparison (branches “U” and “D”). Circles or squares on the branches indicate each of the 11 lineages that were used in positive selection analysis of flight loss, with circles indicating full flight loss and squares indicating female-only flight loss. Triangles indicate related flight-capable branches used for comparison with the lineages representing a loss of flight in positive selection analysis. Note that sub-trees were used for the positive selection tests, and so not all species shown here were included. The colour of the circles or squares indicates the estimated degree of accuracy in the phylogenetic mapping of the flight loss, given the available taxonomic sampling (green = good, orange = fair, grey = approximate). Red lineages (fully flightless) were compared with blue lineages (related flying) in the nuclear gene analyses of relaxed selection (dN/dS ratios) associated with flightlessness, with all other lineages used for a background rate. A similar (smaller) tree was used for mitochondrial gene analyses of relaxed selection where both red (fully flightless) and purple (female-only flightless) lineages were compared with blue (related flying) lineages, with other lineages representing the background rate.
Positively selected genes in the lineage (“P”) leading to Pterygota as over-represented in (A) GO categories from DAVID analysis and (B) Biological Process categories from PANTHER analysis
| (A) DAVID GO results | |||||
|---|---|---|---|---|---|
| 914 total genes | 38 positively selected genes | ||||
| GO term | # in category | Expected # | Observed # |
| Fold enrichment |
| Precatalytic spliceosome | 34 | 1.4 | 7 | 0.00084 | 5.0 |
| mRNA splicing, via spliceosome | 44 | 1.8 | 8 | 0.00087 | 4.4 |
| Catalytic step 2 spliceosome | 30 | 1.2 | 6 | 0.0032 | 4.8 |
| Protein binding | 81 | 3.4 | 8 | 0.035 | 2.4 |
| Protease | 19 | 0.8 | 4 | 0.038 | 5.1 |
| mRNA processing | 9 | 0.4 | 3 | 0.044 | 8.0 |
| (B) PANTHER Biological Process results | |||||
| 894 total genes | 35 positively selected genes | ||||
| PANTHER GO-Slim Biological Process term | # in category | Expected # | Observed # |
| Fold enrichment |
| RNA catabolic process (GO:0 006401) | 9 | 0.4 | 2 | 0.048 | 5.7 |
Terms are for positively selected genes uniquely detected in the “P” lineage and not in 2 control lineages tested (“U” and “D”). Categories with P < 0.05 are shown; full results are given in Table S11 [32]; 954 background genes were mapped to (A) 914 IDs and (B) 894 IDs; 39 unique candidate genes were mapped to (A) 38 IDs and (B) 35 IDs. Statistical over-representation is tested by modified Fisher's exact tests in DAVID and binomial statistics in PANTHER, with raw P values provided here.
Genes detected to be under positive selection in 3 or more lineages with flight loss as over-represented in (A) GO categories from DAVID analysis and (B) Biological Process categories from PANTHER analysis
| (A) DAVID GO results | |||||
|---|---|---|---|---|---|
| 21 candidate positively | |||||
| 1229 total genes | selected genes | ||||
| GO term | # in category | Expected # | Observed # |
| Fold enrichment |
| Coiled coil | 223 | 3.8 | 9 | 0.018 | 2.4 |
| Nucleus | 269 | 4.6 | 9 | 0.048 | 2.0 |
| Dendrite morphogenesis | 21 | 0.4 | 3 | 0.050 | 8.4 |
| (B) PANTHER Biological Process results | |||||
| 21 positively | |||||
| 1207 total genes | selected genes | ||||
| PANTHER GO-Slim Biological Process term | # in category | Expected # | Observed # |
| Fold enrichment |
| Cellular component organization | 113 | 2.0 | 5 | 0.041 | 2.5 |
| Organelle organization | 64 | 1.1 | 4 | 0.0229 | 3.6 |
| Chromatin organization | 18 | 0.3 | 2 | 0.0387 | 6.4 |
Counts of positively selected genes in related flying lineages were removed from counts in flightless lineages to determine candidate genes before functional analysis. In (B), child (sub-categorical) processes are indented below parent processes. Categories with P < 0.05 are shown; full results are given in Table S11 [32]; 1284 total background genes were mapped to (A) 1229 IDs and (B) 1207 IDs; 21 candidate genes were mapped to 21 IDs (A and B).
Genes detected to be under relaxed selection (higher dN/dS ratios) in flightless pterygote lineages as compared to related flying lineages as over-represented in (A) GO categories from DAVID analysis and (B) Biological Process categories from PANTHER analysis
| (A) DAVID GO results | |||||
|---|---|---|---|---|---|
| 1231 total genes | 54 higher dN/dS genes | ||||
| GO term | # in category | Expected # | Observed # |
| Fold enrichment |
| mRNA splicing, via spliceosome | 50 | 2.2 | 8 | 0.0069 | 3.6 |
| Catalytic step 2 spliceosome | 35 | 1.5 | 6 | 0.021 | 3.9 |
| Precatalytic spliceosome | 39 | 1.7 | 6 | 0.033 | 3.5 |
| (B) PANTHER Biological Process results | |||||
| 1209 total genes | 53 higher dN/dS genes | ||||
| PANTHER GO-Slim Biological Process term | # in category | Expected # | Observed # |
| Fold enrichment |
| RNA localization | 11 | 0.5 | 3 | 0.013 | 6.2 |
| Death | 13 | 0.6 | 3 | 0.020 | 5.3 |
| Cell death | 13 | 0.6 | 3 | 0.020 | 5.3 |
| Apoptotic process | 13 | 0.6 | 3 | 0.020 | 5.3 |
| Negative regulation of apoptotic process | 1 | 0.04 | 1 | 0.043 | 22.8 |
| Localization | 144 | 6.3 | 12 | 0.020 | 1.9 |
| Extracellular transport | 1 | 0.04 | 1 | 0.043 | 22.8 |
In (B), child (sub-categorical) processes are indented below parent processes. Categories with P < 0.05 are shown; full results are given in Table S11 [32]; 1285 total background genes were mapped to (A) 1231 IDs and (B) 1209 IDs; 56 candidate genes were mapped to (A) 54 IDs and (B) 53 IDs.
Figure 2:dN/dS ratios in flightless vs related flying lineages for 13 mitochondrial protein-coding genes. In 11 of 13 genes, the dN/dS ratio in the flightless pterygote lineages is higher than the dN/dS ratio of flying lineages. Genes with a significant difference in rates (after Benjamini-Hochberg correction) are marked with an asterisk; in all 5 cases, the dN/dS ratio is higher in the flightless lineages than in their flight-capable counterparts. Dashed lines signify the mean dN/dS values; flightless: 0.031 and flying: 0.021. The tree with lineages tested is provided in the S10 tree file [32].
Figure 3:Positive selection in hexapod lineages in nuclear and mitochondrial genes of interest. The tree is adapted from Misof et al. [2], showing orders and involving 1 species representative per insect order for each gene tested. Orders/lineages that are shaded grey are apterygote (i.e., hexapods that never evolved the ability to fly), and those shaded orange consist entirely of species that are flightless due to a secondary loss of flight after its origin in Pterygota; note Embioptera and Strepsiptera are female flightless only. The lineage marked with “P” represents the lineage leading to the clade Pterygota; “L” = polyneoptera and “H” = holometabola (i.e., complete metamorphosis) insects.