Literature DB >> 35002370

The role of plants in the formation of species-specific features in grass flies (Diptera, Chloropidae, Meromyza).

Tatiana A Triseleva1, Varos G Petrosyan1, Aleksandra A Yatsuk1, Andrey F Safonkin1.   

Abstract

In the current manuscript, we present the results of comparative analysis of seven species of Meromyza flies in the "variegata" cluster and of the evolutionary close species M.inornata, based the following criteria: 1) 14 external key features; 2) shape and area of the anterior processes of postgonites; 3) mtDNA CO1 region and 4) host plant diversity data. We could demonstrate the primary role of host plants in species formation inside genus Meromyza and calculated the timing of the divergence of M.inornata and the species of "variegata" cluster. Based on our estimates of evolution rate for mtDNA CO1 gene, we could conclude that that divergence of herbs happened before the speciation of grass flies Meromyza. Meromyza species, close to the ancestral species of the cluster, are adapted to the wide range of host plants. We revealed the most informative variables h1, S and Plant analysing data with the following statistical methods: linear discriminant analysis - LDA, regularised discriminant analysis - RDA, flexible discriminant analysis - FDA and probabilistic neural network - PNN. The highest classification accuracy was achieved using PNN (99%) and the lowest when using LDA (95.8%). When the Plant trait was excluded, the classification accuracy decreased by 14%. We revealed the significant trends in size change of the anterior process of the postgonite amongst studies species. This morphological structure is an element of male reproductive apparatus critical for the restriction of interspecies mating. We determined three branches of speciation in the "variegata" cluster and five trends in the evolution of this cluster, based on the external morphological features. We showed that M.variegata and especially M.mosquensis, the species closest to the ancestral haplotype, have the largest number of features typical of those of M.inornata. Based on the external features and the area of the anterior process of the postgonite, we reconstructed the phylogenetic position of M.elbergi in the cluster. In accordance with the obtained outcomes, we could conclude that the distribution, species diversity and the adaptation of the grass flies to narrow oligophagy were directly connected to host plant diversity. The adaptation to different host plants could be the main factor in divergence of grass flies and their evolution started later than the diversification in the Pooideae subfamily of grasses. Tatiana A. Triseleva, Varos G. Petrosyan, Aleksandra A. Yatsuk, Andrey F. Safonkin.

Entities:  

Keywords:  Poaceae; co-evolution insect-plant; morphometric; mtDNA CO1; postgonites

Year:  2021        PMID: 35002370      PMCID: PMC8732877          DOI: 10.3897/BDJ.9.e78017

Source DB:  PubMed          Journal:  Biodivers Data J        ISSN: 1314-2828


Introduction

Studies of the mechanisms and factors contributing to species biodiversity are an important part of modern research in zoology (Chesson 2000, Vorobjeva and Striganova 2005, Parmesan 2006, Gaujour et al. 2012, Jeltsch et al. 2013). Modern approaches to phylogenetic reconstructions are based on interaction of cladistics, numerical and genetic phyletics (Scotland et al. 2003, Pavlinov 2003, Pavlinov 2005, Patwardhan et al. 2014). In classical phylogenetics, construction of a cladogram represents the initial stage of phylogenetic studies where reconstruction of an evolutionary scenario requires a set of additional data (Pavlinov 2005). In contrast, the creation of a cladogram in modern phylogenetics is the final stage of phylogenesis reconstruction. Grass flies of the genus (, , Meigen, 1830) represent a perfect model for employment of both modern and classical approaches to phylogenetic reconstructions. To date, includes more than 90 species, distributed throughout the Northern Hemisphere. The identification key of this genus was developed, based on a set of external morphological features and specific features of male genital apparatus (Narchuk and Fedoseeva 2010). The results of genetic analysis made it possible to divide the genus into eight clusters (Safonkin et al. 2016) which triggered the question about the factors contributing to diversification of this genus. The change of host plant is important in speciation of phytophages (Forbes et al. 2017). Assuming that evolution of host plants can be an important factor, we looked closely into a list of host plants for one third of the species of flies (Panteleeva 1989, Nartshuk and Fedoseeva 2011). Additionally, the genetic analysis has allowed us to trace the formation of species groups (clusters) within the genus, as well as the relative time of speciation within these groups and the degree of species nutritional adaptation (Safonkin et al. 2020a). However, no data are available about the development of specific features, in particular, for species during the evolution of genus or the role of host plant. In flies of genus , all host plants, the area, the relative time of speciation, the degree of relationship between species, based on the postgonite structure and the mtDNA COI locus are currently known only for a representative sample of species from the “” cluster. Species of the “” cluster are widespread throughout Europe (Safonkin et al. 2020b). These species show striking similarities despite wide variability of external features, but for species-specific male genital apparatus (Safonkin et al. 2020b). Some species within the cluster have a broad range of host plants, but other species are specialised in limited host plant groups (Panteleeva 1989, Nartshuk and Fedoseeva 2011). In the current manuscript, we show the results from our study of the distribution of various external key features and structural features of male genital apparatus amongst the species of “” cluster. We also present here the results of comparative analysis of seven species of flies in the “” cluster and evolutionary close species Becker, 1910, based on external key features, shape and area of the anterior processes of postgonites, mtDNA CO1 region and host plant diversity data. We demonstrate the important role of host plants in species formation inside genus .

Material and methods

The comparative analysis was based on the original descriptions of Fedoseeva, 1962, Fedoseeva, 1979, Macquart, 1835, Meigeni, 1838, Fedoseeva 1960, Fedoseeva. 1962, Meigeni, 1830 and (Becker 1910, Fedoseeva 1960, Fedoseeva 1962, Hubicka 1970, Fedoseeva 1979) and results of our morphometric study of specimens from the collection of the Severtsov Institute of Ecology and Evolution (Moscow, Russia), partially presented earlier (Safonkin et al. 2020b). Genetic analysis. We analysed the relationship and time of species divergence, based on the nucleotide sequences of mtDNA CO1 locus previously deposited by us in GenBank. A phylogenetic tree was constructed via the Bayesian approach using the BEAST v.1.10.4 software package with default parameters, except for the Tree Prior parameter, for which the G. Yule speciation model was chosen. The numbers in the nodes indicate the replacement for the site for 1 million years (Triseleva et al. 2014, Safonkin et al. 2016, Safonkin et al. 2020b). Methods for species differentiation, based on comparative analysis of external features and male genitals. For comparative analysis in 66 males and 58 females of eight species of grass flies, we have selected the most distinct features (out of 35 analysed): eight quantitative features: h1 (ratio of the gena height to the height of the 3rd antennal segment), h2 (ratio of the length of triangle to the length of head), h3 (ratio of the height of ocellar triangle to the base of triangle), h4 (ratio of the width of hind femur to the width of hind tibia), L (length of the body without abdomen), L1 (ratio of the length of head to the length of mesonotum), L2 (ratio of mesonotum to scutellum), S (area of anterior process of the postgonite) and six qualitative traits (Table 1). Additionally, we analysed three ecological characters: habitat, biotope and species of host plants.
Table 1.

The characteristics of and the species of the " " cluster.

Features M.inornata M.variegata M.laeta M.mosquensis M.femorata M.rufa M.bohemica M.elbergi
N = 20N = 20N = 20N = 20N = 20N = 3N =20N =1
Colour of palpisometimes brown in distal partsometimes brown in distal partblacksometimes brown in distal parthalf blacklightsometimes brown in distal parthalf black
Setae on the lower surface of genalightsometimes blacklightlightsometimes blacklightoften blacksometimes black
Ratio of the gena height to the height of the 3rd antennal segment1.04 ± 0.060.82 ± 0.030.84 ± 0.030.85 ± 0.020.97 ± 0.030.64 ± 0.030.77±0.031.0
Ratio of the height of ocellar triangle to the base of triangle1.01 ± 0.020.94 ± 0.021.05 ± 0.030.92 ± 0.030.96 ± 0.031.05 ± 0.031.17±0.041.35
Ratio of the length of triangle to the length of head0.77 ± 0.010.59 ± 0.010.66 ± 0.010.56 ± 0.010.66 ± 0.010.61 ± 0.020.66±0.010.68
Occipital spotyesnonoyesnononono
Occipital stripsnononoyesnoyesyesyes
Colour of mesonotum stripsblackbrownbrown-blackblackreddishyellow-brownbrownbrown-black
Ratio of the length of head to the length of mesonotum0.68 ± 0.020.62 ± 0.020.63 ± 0.010.61 ± 0.010.62 ± 0.010.71 ± 0.030.64 ± 0.010.73
Mid-strip of mesonotumstrip reaches the scutellumstrip does not reach the scutellumsometimes passes through the scutellumpasses through the scutellumstrip does not reach the scutellumstrip does not reach the scutellumstrip does not reach the scutellumstrip reaches the scutellum
Ratio of mesonotum to scutellum3.00 ± 0.123.14 ± 0.053.27 ± 0.073.23 ± 0.073.17 ± 0.063.37 ± 0.323.19 ± 0.062.92
Ratio of the width of hind femurs to the width of hind tibia3.52 ± 0.103.64 ± 0.093.31 ± 0.093.28 ± 0.094.32 ± 0.153.83 ± 0.203.37 ± 0.064.0
Length of the body without abdomen1.76 ± 0.061.80 ± 0.041.43 ± 0.021.44 ± 0.031.87 ± 0.041.49 ± 0.021.76 ± 0.031.51
Area of anterior process of the postgonite, µm²8440.9 ± 49.59010.2 ± 134.34512.6 ± 91.65507.9 ± 87.07228.7 ± 93.13053.8 ± 296.34365.0 ± 139.44625.08
N = 21N = 20N = 4N = 22N = 23N = 4N = 9N =1
Area of speciesEast AsiaPolyzonal (Europe)Polyzonal (Europe)Boreal (Euro-Siberian)Polyzonal (Europe)Polyzonal (Europe)Polyzonal (Europe)Polyzonal (Europe)
Biotoperiverine meadowgroves, banks, forest edgeriverine meadowmeadowsgroves, banks, swamp meadow side, dry meadowsflood meadowgroves, lowland meadow, near the roadsswamp and forest meadows, gardens, urban habitat

N - number of specimens measured, Х ± SE.

For comparative analysis of the shape of anterior process of the postgonites, we used our previous data (Yatsuk and Safonkin 2018). The analysis of host plants. The species of host plants are taken from the work of Safonkin et al. (2020b). To time the origin of grasses, we used data from the work of Pimentel et al. (2017). Statistical methods for species differentiation, based on quantitative and qualitative traits. Evaluation of the separating ability of the selected quantitative (8) and qualitative (1) traits for the differentiation of 96 individuals, belonging to seven species of grass flies, was carried out using two types of methods. The first type includes various forms of discriminant analysis (DA) (Friedman 1989, Petrosyan 2014, Shitikov and Mastitsky 2017) (linear discriminant analysis - LDA and regularised discriminant analysis - RDA, flexible discriminant analysis - FDA). The second type includes methods based on artificial neural networks, in particular, probabilistic neural network (PNN) (Specht 1990, Petrosyan 2014, Mohebali et al. 2020). The use of different methods of DA is due to the fact that these methods have different requirements for the dataset, in particular, the LDA method assumes that the data distribution in each class is normal and that the intragroup variance and correlation matrices are equal. Another DA method, RDA, builds a species differentiation rule, based on information features by regularising group covariance matrices, allowing a more reliable model to be created taking into account the multicollinearity of the data. This technique is usually useful for large multidimensional data containing highly correlated predictors. The third modification method of DA is FDA, which is a flexible extension to LDA that uses non-linear combinations of predictors, such as splines. The FDA is known to be useful for modelling multidimensional abnormal and non-linear relationships between variables in each group, resulting in a more accurate classification. It is important to note that DA methods work only with quantitative traits. In our case, there is one quality feature (the number of host plant species). For this reason, PNN is used to assess the informative nature of this trait for the separation of species. To determine the set of linear discriminant functions (LDF) separating species, we used a stepwise discriminant analysis procedure with the threshold value of the inclusion of variables F = 4 (Petrosyan 2014). A quantitative assessment of the distinguished discriminant functions was carried out using a set of parameters, including eigenvectors, coefficients of canonical correlations for discriminant functions, Wilkes statistics, С hi-square statistics, as well as P-values of testing hypotheses for the significance of the separation of species (Petrosyan 2014). After the selection of informative features, the quality of the LDA model was assessed using a leave-one-out CV-procedures (Shitikov and Mastitsky 2017). For other forms of discriminant analysis in FDA and RDA, we also used a cross-check procedure for the final assessment of the quality of the models (Shitikov and Mastitsky 2017). In the present study we use PNN, which has four layers: input, pattern, summation and output (Specht 1990, Petrosyan 2014, Mohebali et al. 2020). The task of the input layer is to distribute the input layer data for the pattern layer. The number of input signals is equal to the number of variables that define the samples (individuals). The pattern layer has one element for each sample (individual) from the training dataset. The input layer and the sample layer form a fully-connected structure. The output layer consists of seven neurons that determine whether an individual belongs to one of the seven species based on voting, taking into account the signals received from the summation layer. Effective methods for constructing PNNs are presented in the literature (Specht 1990, Petrosyan 2014, Mohebali et al. 2020). In this PNN, information moves in only one direction - forward from the input nodes (neurons) through the pattern layer to the output layer (neurons). As input variables of the network, we used all the quantitative variables that were used in the LDA, FDA and RDA methods and, additionally, we added the variable plant. The assessment of the classification accuracy was determined by averaging the accuracy of a 24-fold repetition of the network models. At each run, 92 individuals were randomly used as training and four individuals for testing (Petrosyan 2014). The small amount of selection of individuals to test the accuracy of the PNN-models is due to the fact that two species and are represented by a limited number of some features, three and four, respectively. All assessments with the method of discriminant analysis were carried out in the RStudio v. 1.4.1106 using basic, special R-packages (MASS, klaR, mda, tidyverse, caret, dplyr, FactoMineR) and additional programmes in the R language. Probabilistic neural networks were created using the Biosystem office (Petrosyan 2014).

Results

Assessment of species diversity, based on genetic analysis Сlustering of species, based on the mtDNA CO1 gene, revealed two close clusters. The “” cluster included six species and the “” cluster included only the single species because, currently, we have no other allied species. There are three branches in the “” cluster: 1). and , which are closer to the hypothetical ancestor of the cluster; 2). ; and 3). the more recent species and (Fig. 1). The timing of the divergence of and the species of the “” cluster is shown in Table 2. The time was calculated according to the estimated time of the origin of the genus from 15.8 to 4.02 million years (Safonkin et al. 2020a).
Figure 1.

A phylogenetic tree of species of the “” and “” clusters and postgonites shape: a phylogenetic tree, based on the mtDNA CO1, constructed in the programme BEAST v.1.10.4. (partially from fig. 1 by Safonkin et al. 2020a). The numbers in the nodes indicate the replacement for the site for 1 million years. Underlined are the species that gave the name to the clusters. Vertical lines – evolutionary lineages of the “” cluster. * – possible position of in the “” cluster. b shape of anterior process of the postgonites of (1), (2), (3), (4), (5), (6) and (7).

Table 2.

The divergence of and the species of the "" cluster.

SpeciesMa
MaxMin
M.inornata − the "variegata" cluster6.581.67
M.mosquensis 2.270.58
M.femorata 1.570.40
M.bohemica+M.rufa0.580.15
M.bohemica 0.250.06
M.laeta 0.250.06

Based on the BEAST programme, from Safonkin et al. (2020a).

Assessment of species diversity, based on external features and male genitals. Based on the combination of morphological features, we include the seventh species into the “” cluster. We can reveal five trends in the evolution of the “” cluster, based on the external key features (Table 1): Pale colour of the mid-stripe and occipital spot, but bright occipital strips (except for and ); Black setae on the lower surface of gena in , sometimes, in , , ; and are close by the height of gena. The remaining species of the cluster are characterised by a decrease in the height of the gena. Closely related species are strikingly different; Based on the ratio of the height and width of the ocellar triangle, closely related species are separated by head capsule stretch ( – , – , – ); The considerable thickening of the hind femur in , and The other key features are almost non-distinguishable. The area of the anterior process of postgonite significantly differs in most species of the “” cluster. The area of the anterior process of the postgonite is maximal in species close to the common hypothetical ancestor (, ) and decreases in younger species ( - , - , - , ) (Table 1). The shape of the anterior process of the postgonite is similar in and , but different considering the degree of curvature of the upper and lower contours. In , the shape of the anterior process of the postgonite is acuminate. In other species, the anterior process of the postgonite is curved forward (Fig. 1b). The number of grasses suitable for development of flies is maximum in (six plants) and (six or seven plants), fewer in and (three plants for each fly species) and one host plant in (Table 3).
Table 3.

The speciation of host plants of species of the «» cluster.

TribeMaSubtribeMaSpecies of grassesMaSpecies of grass flies
Poeae 33.5† Agrostidinae 22†Agrostiscapillaris L.8.8M.femorata, M.laeta, M.mosquensis
Aveninae Koeleriacristata (L.) Pers. 7.2 M.rufa
Avenasativa L.9.1 M.variegata
Poinae 27†Phleumpratense L.,Ph.phleoides (L.) H. Karst.M.rufa, M.variegata
Alopecuruspratensis L.9.2M.mosquensis, M.variegata
Poa sp., Poapratensis L.9.4 M.mosquensis
Loliinae Festucaovina L.3.6 M.mosquensis
Festucarubra L.1.8-3.3M.femorata, M.laeta, M.mosquensis
Festucapratensis Huds. M.variegata
Loliumperenne L.2.9 M.bohemica
Dactylidinae 8Dactylisglomerata L.4.3M.femorata, M.variegata
Triticeae Hordeinae 16Elymusrepens (L.) Gould8M.mosquensis?, M.variegata
Elymushispidus (Opiz) Melderis8 M.mosquensis

The time of speciation of the grasses and the time of divergence of lineages (†) is given according to Pimentel et al. (2017), − - no data, ? - supposedly.

Discriminant analysis results Using a stepwise selection algorithm, it was determined that six variables (S, h1, h2, L1, L and h4) were significant predictors of species (Tables 4, 5). The order of these variables is given by the importance of their inclusion in the LDA model. The stepwise selection algorithm in LDA showed that four discriminating functions have P-values less than 0.05, i.e. are statistically significant at the 95.0% confidence level (Table 5). In our case, although four functions are statistically significant, nevertheless, the first two functions account for the overwhelming majority of the separation of species, i.e. Wilkes' statistics, the relative percentage of discrimination and the canonical correlation coefficient for the first two functions are 0.003, 85.79%, 0.98 and 0.82, 9.26%, 0.86, respectively (Table 5). For the other two functions, the relative percentages of discrimination are 3.4% and 1.23%, respectively. Canonical correlation coefficients indicate that each subsequent discriminant function contributes less to discrimination than the previous one (Table 4). Unlike the first discriminant function, which has a canonical correlation coefficient of 0.98, for the fifth, this coefficient is 0.28. The classification accuracy using the four LDA functions is presented in Table 6.
Table 4.

Results of the stepwise selection algorithm in the LDA model, including eigenvectors, the relative contribution of each function to species differentiation and the coefficients of canonical correlations.

Discriminant functionEigenvalueRelative percentageCanonical correlation
126.885.790.98
22.899.260.86
31.063.400.72
40.3851.230.53
50.0860.280.28
60.01240.040.11
Table 5.

Characteristics of the statistical significance of the separation of seven species of flies within the selected LDA model.

Discriminant functionWilks Lambda Chi-Squared DFP-Value
10.003515.836<< 0.01
20.082221.625<< 0.01
30.318101.316<< 0.01
40.65737.29<< 0.01
50.918.340.08
60.9881.110.3
Table 6.

Classification table of species based on LDA method (percentage of cases correctly classified - 95.83%).

SpeciesActual number of individualsResults of classification
M.bohemica M.femorata M.inornata M.laeta M.mosquensis M.rufa M.variegata
M.bohemica 9 9 0 0 0 0 0 0
M.femorata 20 0 20 0 0 0 0 0
M.inornata 20 0 0 20 0 0 0 0
M.laeta 4 1 0 0 2 1 0 0
M.mosquensis 20 1 0 0 0 19 0 0
M.rufa 3 0 0 0 0 0 3 0
M.variegata 20 0 1 0 0 0 0 19
Correctly classified9292020219319
Incorrectly classified4 2 1 0 0 1 0 0
The application of the leave-one-out CV procedure showed that the selected LDA discriminant functions allow species classification with an accuracy of 95.83% (Table 6). Species projection onto the first linear discriminants LD1 and LD2 is shown in Fig. 2.
Figure 2.

Species projection on the first (LD1) and second (LD2) discriminant functions.

The use of other methods of DA in the form of RDA and FDA using six traits (S, h1, h2, L1, L and h4) showed that the classification accuracy of these methods does not improve significantly. In the RDA methods of classification, the numbers of correct and incorrect classification of individuals are 94 and 2, respectively (Table 7), i.e. the classification accuracy is 97.92%. Regularised discriminant analysis projection on the first (RDA1) and second (RDA2) canonical axes is shown in Fig. 3.
Table 7.

Classification table of species, based on RDA method (percentage of cases correctly classified - 97.92%).

SpeciesActual number of individualsResults of classification
M.bohemica M.femora M.inorna M.laeta M.mosquensis M.rufa M.variegata
M.bohemica 99000000
M.femorata 2002000000
M.inornata 2000200000
M.laeta 40003100
M.mosquensis 2000002000
M.rufa 30000030
M.variegata 2001000019
Correctly classified 94 92020320319
Incorrectly classified 2 0 1 0 0 1 00
Figure 3.

Regularised discriminant analysis projection on the first (RDA1) and second (RDA2) canonical axes.

The highest classification accuracy is achieved using the FDA method. In the FDA method, the numbers of correct and incorrect classification of individuals are 95 and 1, respectively (Table 8), i.e. the classification accuracy is 98.96%. Flexible discriminant analysis projection of seven species on the first (FDA1) and second (FDA2) canonical axes is shown in Fig. 4.
Table 8.

Classification table of species based on FDA method (percent of cases correctly classified - 98.96%

SpeciesActual number of individualsResults of classification
M.bohemica M.femorata M.inornata M.laeta M.mosquensis M.rufa M.variegata
M.bohemica 9 9 0 0 0 0 0 0
M.femorata 20 0 20 0 0 0 0 0
M.inornata 20 0 0 20 0 0 0 0
M.laeta 4 0 0 0 4 0 0 0
M.mosquensis 20 0 0 0 0 20 0 0
M.rufa 3 0 0 0 0 0 3 0
M.variegata 20 0 1 0 0 0 0 19
Correctly classified 95 92020320319
Incorrectly classified 1 0 1 0 0 0 00
Figure 4.

Flexible discriminant analysis projection of seven species on the first (FDA1) and second (FDA2) canonical axes.

The results of using a PNN. The general architecture of a PNN, which was used to differentiate seven species, based on quantitative (6) and qualitative (1) features, is shown in Fig. 5. Application of PNN to the seven species of flies showed that the use of a qualitative variable plant further improves the classification results.
Figure 5.

General architecture of a PNN.

The conducted 24 realisations of PNN models showed that the classification accuracy on the training samples is 98.5% (± 0.3) and on the verification (testing) samples is 99% (± 1.1). To visualise the species differentiation, as an example, Fig. 6 shows the areas of change in the values of traits of individuals of seven species, which are used by PNN to differentiate individuals. These diagrams represent the areas of change in the values of features that characterise the most informative variables h1, S and Plant. Fig. 6A shows that each species in the plane S and h1 has its own characteristic area, determined by the ranges of variation of the variables S and h1. It is important to note that the diagram in Figure 6A is divided into seven parts, i.e. each species is characterised by a certain area of definition of the variables h1 and S. However, this statement is not fulfilled in terms of informative features of Plant and S (Fig. 6B). In the plane of these variables, the areas for six species, with the exception of , are clearly distinguished. This diagram shows that the area of variation for Plant and S traits for overlaps with that for (see Fig. 7).
Figure 6.

Areas of change in the values of the most important traits h1, S and Plant for individuals of seven species, determined using a PNN with fixed (average) values of the remaining variables.

Figure 7.

Three-dimensional scatter plot of seven species individuals in the space of three informative features (S, h1 and Plant).

In the plane of informative features for Plant and h1, the areas of variation of these variables are presented for five species (Fig. 6C), with the exception of and . This diagram shows that, with the average values of the remaining variables, the area of variation for Plant and h1 traits for overlaps with that for and (Figs 6, 7), which indicates close values of Plant and h1. To check the differentiating importance of the Plant trait in PNN, we built another network with six traits without Plant trait. Assessments, based on 24 realisations of PNN models using six features, showed that the classification accuracy on training samples is 84.8% (± 0.4) and, on testing samples, is 90% (± 2.7). When this trait is excluded, the classification accuracy decreased by 14%. Typical errors in the classification of individuals of the species when using different methods of DA are presented in Tables 6-8. Typical errors in PNN are associated with the assignment of one to the species and vice versa, as well as the individual to another species.

Discussion

Based on the molecular clock of insect mtDNA CO1, the divergence rate is about 1.5 - 4% per one million years (from 0.0075 to 0.012 substitutions per site (Rowan and Hunt 1991, Brower 1994, Jamnongluk et al. 2003) with an average value of 2.75% (Papadopoulou et al. 2010). Based on the known mutation rate of the CO1 gene of mtDNA in Fallén, 1823 we could extrapolate the time of origin of the common ancestor of at 15.8 - 4.02 million years (Safonkin et al. 2020a). We estimate that the rate of gene evolution of flies is more likely closer to the lower value. Thus, the maximum time of divergence of the ancestors of and the “” cluster and the speciation of can be estimated at 6.58 million years and the period of divergence of “lineage” - 2.27, “lineage” - 1.57 million years, respectively. Taxonomic divergence in (the main host plants of flies) began in the middle of the Eocene - the beginning of the Oligocene and resulted in ecological dominance in the Northern Hemisphere at present (Pimentel et al. 2017). Grass diversity increased from the Middle to Late Miocene, during which the formation of open grass biomes of cold and warm climates also took place everywhere on earth (Strömberg 2005, Spriggs et al. 2014). Grass-dominated biomes of Western Eurasia formed 21–20 million years ago (Strömberg 2011). Based on our estimates of evolution rate for the above gene, we can conclude that divergence of herbs was before the speciation of grass flies . For example, the speciation time of host grass L. (Table 3) exceeds the speciation time of the youngest species of the cluster, by ten times. The adaptation to phytophagy in some dipterans was probably linked to climatic deterioration in the Neogene and the formation of new trophic connections. Tamura et al. demonstrated the correlation between the species evolution, the lowering temperature of the paleoclimate and the fragmentation of habitat in the Cenozoic using the group as a model for the analysis (Tamura et al. 2004). Russo et al. assumed utilisation of the variety of new fruits of flowering plants as one of the possible factors in the speciation in the group, resulting in specialisation and their ecological diversity (Russo et al. 2013). We concluded that similar patterns of speciation observed in grass flies due to the drying of the climate at the end of the Miocene and were related less to formation of grass host plants than to the increase in plant abundance. The adaptation of fly larvae to other species of host grasses, especially to the species of another Subtribe, could be a factor triggering a new speciation in the “” cluster. The change of host plant was important in speciation of some dipteran from : Fitch, 1855 (Craig et al. 1993, Stireman et al. 2005) and Walsh,1867 (Forbes et al. 2005). Based on our data and calculations, diverged from earlier than 0.25 MA. feeds on grasses from subfamily while feeds on . One can assume the earlier origin of in evolutionary lineage - - due to the earlier origin of genus Persoon, 1805 (about 7 MA) compared to (2.5 MA). and are adapted to different grass species from genus L. 1753 and L. 1753. partly feeds on the same grasses as and , except for plants from the supertribe . The earliest by origin, Supertribe Triticodae includes sp. grasses. According to Tsvelev (Tsvelev 1987), an unusually wide variability of hybrid grass species, including (L.) Gould, contributed to rapid evolution of the species. Development on evolutionarily more ancient and variable grass species suggests that grass flies should have morphological, physiological and behavioral features close to those of the ancestral species of the cluster, which were wide oligophages. Two oligophage species meet this criteria: is close to ancestral haplotype and represents a separate branch from the ancestral haplotype (Tables 1, 2, 3, Fig. 1). From the results indicated above, a connection can be assumed between the early origin of plants and their species diversity and the distribution and formation of initial groups of species of grass flies. Climate change and variability of ecological conditions has resulted in the divergence of grass flies, as they adapt to evolutionarily younger, more abundant grasses with expanded distribution ranges. Herbert et al. demonstrated that the reproductive isolating barriers in Kulp, 1968 (: ) were associated with different host plants (Herbert et al. 2013). In general, preferences of grass fly feeding are independent from the ecological characteristics of grasses, except for which feeds on firm bunchgrass, - on short grasses and - on tall and semi-tall grasses. Tall grasses (), unlike short grasses, are characterised by high shoots, large and raw stems and leaves, as well as low stooling. and are more xerophilic, which may be more consistent with an early origin. Various methods of statistical analysis demonstrated the set of the most informative differentiating traits of studied grass flies which allowed to differentiate the species with an accuracy of 95 - 99%. The highest classification accuracy is achieved when using PNN (99%) and the lowest when using LDA (95.8%). The accuracy of classification using RDA and FDA is 97.9% and 98.96%, respectively. The outcomes from different methods of analysis lead to the conclusion that the most important differentiating features of species in the “" cluster are the traits S, h1 and Plant. Input data in PNN analysis without Plant trait decreases the classification accuracy indicating the importance of the differentiating role of the trait Plant in this group of flies. Obviously, the contribution of the Plant trait is not limited only to the number of plant species because each species of grass flies in each evolutionary lineage of “” cluster is associated to a specific host plant for its development (Table 3). It is an indication of importance of original plant species in the evolution of grass flies, besides the number of species of host plants. We think that the area of the anterior process of the postgonite is the most significant criterion of species division. The trends of shape change of the anterior process are less visible than the change of its size (Table 1, Fig. 1B). The Mahalanobis distance calculation confirmed the uniformity of the shape of the anterior process of postgonite both between the “” and “” clusters and within the “” cluster () (Yatsuk and Safonkin 2018). However, the non-overlapping dimensions of the anterior process indicate its important role in reproductive isolation. Possibly, geographic isolation between the East Asian and the species of the “” cluster resulted in similarity in shape and size of the anterior process of the postgonite in and (Table 1). The revealed trends in the evolution of the “” cluster, based on the external key features, may be correlated with different adaptations of grass flies to the environment. Pale colour of mid-stripes and occipital spot, but bright occiput stripes (except for and ), may be due to adaptation of flies to the colour range of the host plants. The considerable thickening of hind femurs may be an adaptation of flies mainly to the jumping motion along the stems as a characteristic of the adult behaviour of the genus . Black setae on the lower surface of gena are the characteristic of the West European origin of some species (Safonkin et al. 2018). Many features, similar to those of , have been revealed for and, to a lesser degree for (Table 1), suggesting that these features have been rooted in a common hypothetical ancestor of both clusters. Lacking genetic data for , we can use only external features and the structure of the postgonite for reconstruction of this species phylogenetic position in the cluster (Fig. 1A). The first hypothesis is that could originate from the “lineage” since it is close to by postgonite structure. The second is that could originate from the lineage, since it has a combination of external key features inferred for the ancestral haplotypes of the clusters ( – ).

Conclusions

This study highlights that the adaptation to different host plants could be the main factor in divergence of grass flies of the “” cluster. We think that formation of the specific set of external features, which are rather uniform within a cluster, was associated with the developing of grass flies in similar conditions of the grass biome. Stabilising selection for a set of species external feature resulted in the formation of differences in structure and size, with insignificant change in shape and specific features of the male genital apparatus. The increased availability of host plants could be directly connected to the distribution, diversification and the adaptation of these grass flies to narrow oligophagy, but the development of these changes started later than the diversification in the subfamily of grasses and the distribution of these grasses in biomes in the Middle and Late Miocene.

Acknowledgements

Our thanks to Dr Michael Blackburn, Invasive insect biocontrol and behaviour laboratory, ARS, USDA, Beltsville, MD and Dr Yelena Golubeva, Cancer Genomics Laboratory, DCEG, LBR, NCI for reading and editing the manuscript. The statistical analysis and its interpretation was performed with support from the Russian Science Foundation (Grant No. 21-14–00123).
  17 in total

1.  Host-associated genetic differentiation in phytophagous insects: general phenomenon or isolated exceptions? Evidence from a goldenrod-insect community.

Authors:  John O Stireman; John D Nason; Stephen B Heard
Journal:  Evolution       Date:  2005-12       Impact factor: 3.694

Review 2.  Revisiting the particular role of host shifts in initiating insect speciation.

Authors:  Andrew A Forbes; Sara N Devine; Alaine C Hippee; Eric S Tvedte; Anna K G Ward; Heather A Widmayer; Caleb J Wilson
Journal:  Evolution       Date:  2017-01-16       Impact factor: 3.694

3.  Decoupled taxonomic radiation and ecological expansion of open-habitat grasses in the Cenozoic of North America.

Authors:  Caroline A E Strömberg
Journal:  Proc Natl Acad Sci U S A       Date:  2005-08-11       Impact factor: 11.205

4.  Rates of DNA change and phylogeny from the DNA sequences of the alcohol dehydrogenase gene for five closely related species of Hawaiian Drosophila.

Authors:  R G Rowan; J A Hunt
Journal:  Mol Biol Evol       Date:  1991-01       Impact factor: 16.240

5.  BEHAVIORAL EVIDENCE FOR HOST-RACE FORMATION IN EUROSTA SOLIDAGINIS.

Authors:  Timothy P Craig; Joanne K Itami; Warren G Abrahamson; John D Horner
Journal:  Evolution       Date:  1993-12       Impact factor: 3.694

6.  Temporal patterns of fruit fly (Drosophila) evolution revealed by mutation clocks.

Authors:  Koichiro Tamura; Sankar Subramanian; Sudhir Kumar
Journal:  Mol Biol Evol       Date:  2003-08-29       Impact factor: 16.240

7.  Rapid morphological radiation and convergence among races of the butterfly Heliconius erato inferred from patterns of mitochondrial DNA evolution.

Authors:  A V Brower
Journal:  Proc Natl Acad Sci U S A       Date:  1994-07-05       Impact factor: 11.205

Review 8.  Integrating movement ecology with biodiversity research - exploring new avenues to address spatiotemporal biodiversity dynamics.

Authors:  Florian Jeltsch; Dries Bonte; Guy Pe'er; Björn Reineking; Peter Leimgruber; Niko Balkenhol; Boris Schröder; Carsten M Buchmann; Thomas Mueller; Niels Blaum; Damaris Zurell; Katrin Böhning-Gaese; Thorsten Wiegand; Jana A Eccard; Heribert Hofer; Jette Reeg; Ute Eggers; Silke Bauer
Journal:  Mov Ecol       Date:  2013-08-05       Impact factor: 3.600

9.  Are diversification rates and chromosome evolution in the temperate grasses (Pooideae) associated with major environmental changes in the Oligocene-Miocene?

Authors:  Manuel Pimentel; Marcial Escudero; Elvira Sahuquillo; Miguel Ángel Minaya; Pilar Catalán
Journal:  PeerJ       Date:  2017-09-22       Impact factor: 2.984

10.  C4 photosynthesis promoted species diversification during the Miocene grassland expansion.

Authors:  Elizabeth L Spriggs; Pascal-Antoine Christin; Erika J Edwards
Journal:  PLoS One       Date:  2014-05-16       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.