Literature DB >> 35447769

Population Genetic Structure of Chlorops oryzae (Diptera, Chloropidae) in China.

Xianya Li1, Shunjiao Wu1, Yonghong Xu1, Yinghong Liu1, Jia Wang1.   

Abstract

Frequent outbreaks have made Chlorops oryzae one of the major pests of rice in some regions. In order to understand the ecological adaptation of C. oryzae at the molecular level, and provide a scientific basis for formulating management strategies, we used two molecular markers, COI and ITS1 sequences, to systematically analyze the genetic structure of 31 populations. The higher haplotype diversity and lower nucleotide diversity indicated that the C. oryzae populations experienced rapid expansion after a "Bottleneck effect". The results of the mismatch distribution, neutrality test (Fu's Fs < 0, p < 0.001), and haplotype network analysis suggested that the population has recently undergone an expansion. Although genetic differentiation among C. oryzae populations was found to have existed at low/medium levels (Fst: 0.183 for COI, 0.065 for ITS1), the frequent gene flow presented as well (Nm: 2.23 for COI, 3.60 for ITS1) was supposed to be responsible for frequent local outbreaks.

Entities:  

Keywords:  C. oryzae; COI; ITS1; genetic differentiation; genetic diversity

Year:  2022        PMID: 35447769      PMCID: PMC9032139          DOI: 10.3390/insects13040327

Source DB:  PubMed          Journal:  Insects        ISSN: 2075-4450            Impact factor:   3.139


1. Introduction

Population genetic structure, the most basic genetic information of a species, represents the amount and distribution of genetic variation within and among populations. Moreover, it is the accumulation of the evolutionary history and the basis for the development of future evolutionary adaptations of a species [1,2]. Population genetic structure is usually indicated with genetic diversity, genetic differentiation, and genetic distance [3]. Analysis of population genetic structure is conducive to revealing the extent and pattern of gene flow and establishing phylogenetic relationships among populations, thereby contributing to understanding population dynamics, occurrence trends, and genetic relationships among populations [4,5]. Molecular markers are a direct reflection of genetic diversity, which is a very effective tool to study the genetic structure of species [6]. Presently, molecular markers, such as mitochondrial DNA (mtDNA) [7], ribosomal DNA (rDNA) [8], and microsatellites [9], have been widely used in the analysis of the genetic structure of insect populations. As one of the classical molecular markers, mtDNA supports a better understanding of the process of population dispersal and evolution because of its small size, high level of mutations, strict adherence to matrilineal inheritance, lack of introns, and recombination characteristics [10]. Particularly, the mtDNA cytochrome oxidase I (COI) gene has been widely used owing to its moderate evolutionary rate and readily available advantages [11,12,13,14]. However, the genetic diversity and evolutionary history of mtDNA are not necessarily identical to, or representative of, the organism due to potential introgression, lower effective population size, and potential selection. In light of this, combining mtDNA and other markers can greatly improve the reliability of results [15]. For example, the population dispersal pathways and timing of Meromyza saltatrix were studied in connection with the COI gene and morphological features [16,17]. Moreover, nuclear genes (nDNA) which have genetic information that mtDNA lacks are able to play a complementary role to mtDNA. The rDNA internal transcribed spacer 1 (ITS1), a non-coding sequence located between 18S and 5.8S rDNA with high nucleotide polymorphism in most eukaryotes, has also been widely used as markers to study population genetic structure [18]. For example, the genetic structure of 13 geographic populations of Melipona subnitida in northeastern Brazil was analyzed using ITS1 sequences [19,20]. The analysis of genetic diversity of invasive species Halyomorpha halys in North America and Europe using ITS1 and COI markers suggested that the joint use of ITS1 and COI could improve the accuracy of detection of the source areas of an invasion [21]. Chlorops oryzae (Diptera, Chloropidae), an important rice pest, is widely distributed in Asia, such as China, Japan, and Korea [22]. The larvae of C. oryzae bore into the stem and move to the growing point, where they feed on the developing leaves and young panicles, inhibiting the effectiveness of chemical insecticide [23,24]. In recent years, with changes in agroecology, cultivation and farming system, climate, and control agents, the C. oryzae occurrence areas have been expanding, and the damage levels are geographically different [25]. The frequent outbreaks have made C. oryzae one of the major pests of rice in some areas [26], causing yield losses of 20–50% [27]. In China, C. oryzae occurs 2–5 generations per year in different regions, mainly dependent on diapause induction and duration [23]. Presently, studies on C. oryzae are still limited and have mainly focused on physiology and ecology [28,29]. Till recently, Zhou et al. [30] used COI and ISSR markers to analyze its genetic structure and speculated that frequent gene flow in C. oryzae populations was responsible for outbreaks. However, the genetic mechanisms underlying different occurrence levels and gradual geographic expansions are still unknown. In this study, we used two molecular markers, COI and ITS1 sequences, to systematically analyze the genetic structure of 31 geographic populations collected from the main distribution areas of C. oryzae in China. The results of this study will contribute to understanding the ecological adaptation of C. oryzae at the molecular level, thereby providing a scientific basis for formulating management strategies.

2. Methods

2.1. Sample Collection and DNA Extraction

Samples of 26 C. oryzae populations were collected from Guizhou, Chongqing, and Sichuan Provinces, China, from May 2020–August 2021 (Figure 1 and Table S1). All samples were soaked in 75% ethanol and stored at −20 °C until total DNA extraction. Genomic DNA was extracted according to the method of DeBarro and Driver with minor modification [31]. Briefly, samples were soaked in deionized water to remove alcohol prior to extraction. Individual samples were grounded thoroughly in centrifuge tubes with 30 μL of the lysis buffer (50 mM KCl, 10 mM Tris pH 8.4, 0.45% Tween 20, 0.2% gelatin, 0.45% NP40, 60 μg/mL proteinase K) to form a homogenate, which was incubated at 65 °C for 30 min and then boiled for 10 min to inactivate the proteinase K. Total DNA extraction were stored at −20 °C for subsequent analysis.
Figure 1

Collection sites of populations. Populations data for TY, ZZ, LH, XT, HX, and ZJ were obtained from Zhou et al. [30] and not marked in the figure.

2.2. PCR Amplification and Sequencing

The genomic DNA extracted from C. oryzae was used as a template for COI and ITS1 amplification. The COI fragments were amplified with specific primers COI F (5′-CTA GGT GCT CCA GAT ATA GCA TTT C-3′) and COI R (5′-GGC TAA AAC AAC TCC TGT TAA TCC-3′) [30]. The ITS1 fragments were amplified with primers ITS1 F (5′-CGC ATT ATG TGT TAC GGA TGT T-3′) and ITS1 R (5′-GGT TGC GAA TGT CTC TAA TTC-3′). PCR was performed in 30 μL volumes comprised of 15 μL 2 × Taq PCR MasterMix (Biomed, Beijing, China), 1 μL of each primer (10 mmol/L), 1 μL of template DNA solution, and 12 μL double distilled water. Amplifications were conducted as follows: 34 cycles of denaturation at 94 °C for 30 s, annealing at 55 °C for 30 s, and extension at 72 °C for 1 min. All PCR products were checked by electrophoresis on a 1% agarose gel and bi-directionally sequenced by Sangon Biotech (Shanghai, China). Sequences were deposited in the GenBank under accession numbers OM490688-OM491162 for COI and OM540945-OM541304 for ITS1. COI sequence data of six populations (Hunan: TY, ZZ, LH, XT; Guizhou: HX; Zhejiang: ZJ) were kindly provided by Zhou et al. [30].

2.3. Data Analysis

The sequencing data were edited using SnapGene v.4.2 [32]. All data processing, including basic statistics and calculation of inter- and intraspecific genetic distances (Kimura 2-parameter model for COI and Tamura 3-parameter model for ITS1) and transition/transversion (ts/tv) ratio, were performed using MEGA v.7.0 [33]. The number of haplotypes (h), haplotype diversity (Hd), average number of nucleotide differences (k), nucleotide diversity (Pi), and haplotype analysis were calculated by DnaSP v.5.10 [34]. Neutrality tests (Fu’s Fs [35] and Tajima’s D [36]), F-Statistics (Fst) (Bonferroni correction for significance), gene flow (Nm) (COI: Nm = (1 - Fst)/2Fst; ITS1: Nm = (1 - Fst)/4Fst), and analysis of molecular variance (AMOVA) were performed using Arlequin v.3.15 [37]. The distribution of pairwise differences between individual sequences was analyzed by mismatch distribution analysis using DnaSP. In addition, the statistics of the raggedness (rg) index of the observed distribution and the sum of square deviations (SSD) between the observed and the expected mismatch were also calculated using Arlequin based on the spatial expansion model. The statistical significance of variance components in Arlequin was tested with 1000 permutations. Geographical distances among populations were calculated using MapInfo Professional v.8.5 (Table S2) [38]. Mantel test was conducted by NTsyspc v.2.1 for the natural logarithm of interspecific genetic distance (Fst/(1 - Fst)) and geographic distance [39]. The haplotype networks were constructed using the median-joining method in software Network v.10.2 [40]. Compared with traditional phylogenetic trees, haplotype network diagrams can better reveal the genealogical relationships between conspecifics.

3. Result

3.1. Base Composition and Gene Mutation

A total of 598 COI sequences (475 obtained in this study and 123 from Zhou et al. [30]) representing 31 populations and 360 ITS1 sequences representing 26 populations were used for subsequent analysis (Table S1). The final aligned COI sequence fragments were 720 bp and all alignments were unambiguous, with no insertions or deletions. The average nucleotide composition was T = 36.4%, C = 17.4%, A = 29.6%, and G = 16.6%, showing an obvious AT bias (66.0%). In total, 57 polymorphic sites (7.92%) were detected in all COI sequences, including 34 singleton variable sites and 23 parsimony informative sites. There were 47 transitions and 10 transversions, and the overall ts/tv bias was 6.934. The ts/tv rate ratio was observed to be higher with purines (20.78) than pyrimidines (11.08). The final aligned ITS1 sequence fragments were 607 bp. The average nucleotide composition was T = 31.2%, C = 14.8%, A = 36.5%, G = 17.5% and A + T = 67.7%. All sequences had a total of 20 polymorphic sites (3.29%), including 10 single variable sites and 10 parsimony informative sites. There were 8 transitions and 12 transversions, and the overall ts/tv bias was 0.639. The ts/tv rate ratio was observed to be higher with purines (2.402) than pyrimidines (0.116).

3.2. Genetic Diversity

For COI analysis, the 31 C. oryzae populations had a total of 55 haplotypes, with an overall Hd of 0.346, k of 0.854, and Pi of 0.0012. The Hd, k, and Pi of each population ranged from 0.000–0.638, 0.000–3.985, and 0.0000–0.0052, respectively. LZ and NC populations had higher levels of genetic diversity (Hd > 0.5, Pi > 0.005) (Table 1).
Table 1

Genetic diversity of 31 C. oryzae populations based on COI sequences.

nhkHdPiTajima’s DFu’ Fs
TY 2440.2500.239 ± 0.1130.0004 ± 0.0002−1.7325 **−3.0208 ***
ZZ 2440.2500.239 ± 0.1130.0004 ± 0.0002−1.7325−3.0208 ***
XT 2430.2430.236 ± 0.1090.0003 ± 0.0002−1.2023 *−1.4074 **
LH 2360.4350.395 ± 0.1280.0006 ± 0.0002−1.9921 ***−4.8874 ***
BG 2030.6370.353 ± 0.1230.0009 ± 0.0003−0.65940.2535
DT 2030.2000.195 ± 0.1150.0003 ± 0.0002−1.5128 **−1.8631 **
HX 820.2500.250 ± 0.1800.0004 ± 0.0003−1.0548−0.1820
JC 2050.4000.368 ± 0.1350.0006 ± 0.0002−1.8679 **−3.6541 ***
LQ 2041.0580.432 ± 0.1260.0015 ± 0.0008−1.7892 **0.1219
MG 2040.5790.363 ± 0.1310.0008 ± 0.0003−1.4084 *−1.2369 *
PJ 2040.3000.284 ± 0.1280.0004 ± 0.0002−1.7233 **−2.7493 ***
TJ 2020.1000.100 ± 0.0880.0001 ± 0.0001−1.1644−0.8793 *
TL 2020.3000.100 ± 0.0880.0004 ± 0.0004−1.7233 **0.5439
WQ 2040.3890.363 ± 0.1310.0005 ± 0.0002−1.4407 *−2.1353 ***
YW 2030.2000.195 ± 0.1150.0003 ± 0.0002−1.5128 **−1.8631 **
BZ 2020.1000.100 ± 0.0880.0001 ± 0.0001−1.1644−0.8793 *
DJY 2060.5890.447 ± 0.1370.0008 ± 0.0003−1.7800 **−4.0149 ***
DZ 2020.1890.189 ± 0.1080.0003 ± 0.0002−0.5916−0.0966
GY 2030.4000.195 ± 0.1150.0006 ± 0.0004−1.8679 ***−0.6256
LS 1741.1180.596 ± 0.0990.0016 ± 0.0003−0.16950.0627
LZ 2133.7240.638 ± 0.0580.0052 ± 0.00062.24316.1273
QL 2050.7790.368 ± 0.1350.0011 ± 0.0005−1.7190 **−1.7642 **
YA 2050.8370.368 ± 0.1350.0012 ± 0.0005−1.2429−1.5755 *
ZJ 2040.3890.363 ± 0.1310.0005 ± 0.0002−1.4407 *−2.1353 ***
FL 2042.9630.489 ± 0.1170.0041 ± 0.00100.57793.1379
NC 1233.9850.621 ± 0.0870.0055 ± 0.00062.01354.6229
PS 2140.4670.348 ± 0.1280.0007 ± 0.0003−1.6536 **−1.6755 **
QJ 1740.4710.331 ± 0.1430.0007 ± 0.0003−1.8431 **−1.8636 **
WZ 710.0000.000 ± 0.0000.0000 ± 0.0000NN
XS 2040.3890.363 ± 0.1310.0005 ± 0.0002−1.4407 *−2.1353 **
YY 2070.6000.521 ± 0.1350.0008 ± 0.0003−2.0562 ***−5.6554 ***
Total 598550.8540.346 ± 0.0260.0012 ± 0.0001−2.4784 ***−3.40 × 1038 ***

(n) Number of individuals; (h) number of haplotypes; (Hd) haplotype diversity; (k) average number of nucleotide differences; (Pi) nucleotide diversity; * p < 0.10; ** p < 0.05; *** p < 0.01.

For ITS1 analysis, the 26 C. oryzae populations had a total of 26 haplotypes, with an overall Hd of 0.750, k of 1.551, and Pi of 0.0026. The Hd, k, and Pi of each population ranged from 0.400–0.956, 0.400–2.911, and 0.0007–0.0048, respectively. Except for the YW and SZ populations, all populations had higher levels of Hd (Hd < 0.5) and lower levels of Pi (Pi < 0.005) (Table 2).
Table 2

Genetic diversity of 26 C. oryzae populations based on ITS1 sequences.

nhkHdPiTajima’s DFu’ Fs
BG 2391.3750.838 ± 0.0560.0023 ± 0.0004−0.8686−4.5472 ***
DT 2051.0680.558 ± 0.1140.0018 ± 0.0006−1.1736−0.9454
JC 2461.1450.500 ± 0.1210.0019 ± 0.0006−0.8693−1.5615
LQ 1981.2980.649 ± 0.1080.0021 ± 0.0005−0.2877−0.5202
MG 2281.5760.818 ± 0.0590.0026 ± 0.0004−0.1321−2.8304 **
PJ 1561.5620.848 ± 0.0540.0026 ± 0.00030.0531−1.4952
TJ 2151.2670.595 ± 0.1080.0021 ± 0.0007−0.7553−0.4465
TL 1731.2500.588 ± 0.0930.0021 ± 0.00031.15731.5858
WQ 1052.3560.800 ± 0.1000.0039 ± 0.0009−0.2034−0.2303
YW 1441.0770.495 ± 0.1510.0018 ± 0.0009−1.5407 **−0.2586
BZ 941.9440.806 ± 0.0890.0032 ± 0.00041.30550.3315
DJY 1371.3460.795 ± 0.1090.0022 ± 0.0005−0.5866−3.6278 ***
DZ 1261.6360.848 ± 0.0740.0027 ± 0.0004−0.0430−1.8724 *
GY 1072.9110.933 ± 0.0620.0048 ± 0.00060.7476−2.1342 *
LS 1051.0220.756 ± 0.1300.0017 ± 0.0004−0.1297−2.2036 **
LZ 1131.0550.618 ± 0.1040.0017 ± 0.00031.66480.6938
QL 1082.7330.956 ± 0.0590.0045 ± 0.0006−0.6167−3.8821 ***
YA 1062.2440.889 ± 0.0750.0037 ± 0.00060.2410−1.5332
FL 1771.7790.868 ± 0.0450.0029 ± 0.0005−0.4858−1.9303 *
NC 1341.3850.603 ± 0.1310.0023 ± 0.0010−1.4646 *0.1989
PS 1051.0220.667 ± 0.1630.0017 ± 0.0005−0.1297−2.2036 **
QJ 751.6190.905 ± 0.1030.0027 ± 0.0005−0.0398−2.0192 **
SZ 520.4000.400 ± 0.2370.0007 ± 0.0004−0.81650.0902
WZ 1040.9330.533 ± 0.1800.0015 ± 0.0006−0.4313−1.0204
XS 1771.3600.809 ± 0.0790.0022 ± 0.00040.4581−2.8195 **
YY 1151.3820.618 ± 0.1640.0023 ± 0.0008−0.7301−1.2656 *
Total 360261.5510.750 ± 0.0210.0026 ± 0.0001−1.2621 *−16.0072 ***

(n) Number of individuals; (h) number of haplotypes; (Hd) haplotype diversity; (k) average number of nucleotide differences; (Pi) nucleotide diversity; * p < 0.10; ** p < 0.05; *** p < 0.01.

3.3. Population Demographic History

When all samples were taken as one population, the neutrality test and mismatch analysis of the C. oryzae population were performed based on COI and ITS1 sequences. The Tajima’s D and Fu’s Fs values of the total population were all negative, and all Fu’s Fs values were significant (p < 0.01) (Table 1 and Table 2). Besides, the mismatch distribution of both markers in the total populations showed a single-peaked form, indicating that the population experienced expansion events. In addition, both the statistical reference SSD and rg values did not reach significant level, supporting the spatial expansion model (COI: SSD = 0.0020, p = 0.7000, rg = 0.2316, p = 0.8000; ITS1: SSD = 0.0006, p = 0.8730, rg = 0.0248, p = 0.9050) (Figure 2).
Figure 2

Mismatch distribution of pairwise nucleotide differences for all populations of C. oryzae based on COI sequence (A) and ITS1 sequence (B). (SSD) Sum of squared deviation; (rg) raggedness index.

3.4. Genetic Differentiation

Genetic distances within and between populations were estimated based on COI and ITS1 sequences. For COI analysis, the inter- and intra-population genetic distance ranged from 0.0001–0.0060 and 0.0000–0.0056, respectively (Figure 3A), indicating that genetic distances between populations were higher than those within populations. For ITS1 analysis, the two values ranged from 0.0010–0.0047 and 0.0007–0.0048, respectively (Figure 3B).
Figure 3

Genetic distances between and within C. oryzae populations based on COI sequences using the Kimura’s 2-parameter model (A), and ITS1 sequences using the Tamura 3-parameter model (B).

The results of AMOVA suggested that the genetic variation in C. oryzae populations was mainly from within populations, while less from among populations (Va < Vb, p < 0.001). Moreover, there was some degree of genetic variation within the overall populations (Fst > 0.05, p < 0.001) (Table 3).
Table 3

Analysis of molecular variance (AMOVA) of COI and ITS1 sequences from the C. oryzae populations.

Source of Variation d.f. SSVariance Components%F-Statistic
COI Among populations3056.3360.079 Va **18.32
Within populations567200.0700.353 Vb **81.68
Total597256.4060.432100.00Fst = 0.183 **
ITS1 Among populations2535.5960.050 Va **6.47
Within populations334243.6450.727 Vb **93.53
Total359278.3830.777100.00Fst = 0.065 **

(SS) Sum of squares; (%) percentage of variation; (Va) variance components among populations; (Vb) variance components within populations; ** p < 0.001.

The Fst and Nm values between pairwise populations were calculated based on COI and ITS1 sequences (Table 4 and Table 5). The Fst values ranged from 0.000–0.536 and 0.000–0.307 for COI and ITS1 sequences, respectively. For COI analysis, the genetic differentiation was mainly attributed to individuals in the NC population (Fst > 0.25, Nm < 1). A three-level AMOVA analysis of the NC population with the rest of the populations based on COI markers showed the presence of significant genetic differentiation (Fst = 0.69, p < 0.001).
Table 4

Fst (below diagonal) and Nm (above diagonal) between the 31 C. oryzae populations based on COI sequences.

TYZZXTLHBGDTHXJCLQMGPJTJTLWQYWBZDJYDZGYLSLZQLYAZJFLNCPSQJWZXSYY
TY hun33.84hun84.9755.43infinfinf91.75inf10.23huninfinf7.5319.071.011.96hun22.24inf30.0426.642.050.4532.27inf26.63huninf
ZZ 0.00 infhun15.2055.43huninfinfinfinf12.80huninfhun7.5319.071.011.96hun22.24inf30.0426.642.050.4532.27inf26.63huninf
XT 0.010.00 44.1410.8812.6434.5634.4429.4916.2132.317.3530.8132.3136.436.7514.720.971.9036.2412.6229.4919.2616.211.920.4319.09inf16.2139.3429.77
LH 0.000.000.01 15.6022.66infinfinfhuninf8.16infinfinf9.3825.231.052.37inf36.24inf43.5140.252.120.4946.32inf40.25hunhun
BG 0.010.030.040.03 hun56.0657.7813.8522.3414.9957.04inf14.9959.388.9414.431.843.2217.2611.9213.8515.5713.735.290.8614.20inf13.7419.57hun
DT 0.010.010.040.020.00 20.81inf16.16inf18.48infinf18.4843.798.3515.991.372.5723.0712.1816.1618.3815.273.170.6316.83inf15.2827.80inf
HX 0.000.000.010.000.010.02 hunhun32.59hun7.40infhuninf8.96hun1.092.23inf23.21hun42.20hun2.210.5042.09inf32.59infhun
JC 0.000.000.010.000.010.000.00 huninfhunhuninfhunhun8.9625.491.192.23hun23.17hun42.0932.572.560.5341.98inf32.53huninf
LQ 0.000.000.020.000.030.030.000.00 23.25hun5.8311.98huninf7.4120.921.041.86hun13.73hun33.0123.292.080.4633.47inf23.25infinf
MG 0.010.000.030.000.020.000.020.000.02 hun37.44inf27.9337.328.2818.401.202.3237.1813.7523.2523.1718.442.590.5521.53inf18.4346.85inf
PJ 0.000.000.020.000.030.030.000.000.000.00 6.6154.09hunhun8.1823.201.082.05hun18.48hun37.5527.942.140.4837.87inf27.93infhun
TJ 0.050.040.060.060.010.000.060.000.080.010.07 16.066.618.185.808.361.492.368.185.615.838.286.943.600.677.49hun6.949.74inf
TL 0.000.000.020.000.000.000.000.000.040.000.010.03 54.09inf38.44inf1.753.64inf13.4711.98infinf4.240.84infinfinfinfinf
WQ 0.000.000.020.000.030.030.000.000.000.020.000.070.01 hun9.7323.201.062.05hun18.48hun37.5527.942.140.4837.87inf27.93infinf
YW 0.000.000.010.000.010.010.000.000.000.010.000.060.000.00 9.7327.751.112.42hun28.10infhun37.292.270.5246.14inf37.32hunhun
BZ 0.060.060.070.050.050.060.050.050.060.060.060.080.010.050.05 inf1.18hunhun6.968.80inf8.282.360.598.75inf8.2813.3913.83
DJY 0.030.030.030.020.030.030.000.020.020.030.020.060.000.020.020.00 1.206.58inf15.3520.92inf49.902.460.5919.63inf18.4032.3534.49
DZ 0.33 *0.33 *0.34 *0.32 *0.210.270.310.300.320.290.320.250.220.320.310.300.29 1.181.111.061.041.161.11infinf1.101.881.111.161.36
GY 0.200.200.210.170.130.160.180.180.210.180.200.170.120.200.170.000.070.30 * 3.991.961.866.242.322.080.642.434.162.322.792.70
LS 0.000.000.010.000.030.020.000.000.000.010.000.060.000.000.000.000.000.310.11 28.02huninfhun2.280.5245.97inf37.18hunhun
LZ 0.020.020.040.010.040.040.020.020.040.040.030.080.040.030.020.070.030.320.200.02 13.7318.5813.772.110.48infinf13.7537.8424.40
QL 0.000.000.020.000.030.030.000.000.000.020.000.080.040.000.000.050.020.320.210.000.04 33.0123.292.080.4633.47inf23.25inf80.93
YA 0.020.020.030.010.030.030.010.010.010.020.010.060.000.010.000.000.000.300.070.000.030.01 23.152.370.5525.77inf23.1755.9355.93
ZJ 0.020.020.030.010.040.030.000.020.020.030.020.070.000.020.010.060.010.310.180.000.040.020.02 2.240.5121.54inf18.4446.8037.58
FL 0.200.200.210.190.090.140.180.160.190.160.190.120.110.190.180.170.170.000.190.180.190.190.170.18 7.372.235.002.242.413.05
NC 0.530.53 *0.54 *0.51 *0.370.44 *0.50 *0.480.520.480.510.43 *0.370.51 *0.490.46 *0.46 *0.000.44 *0.49 *0.51 *0.520.470.49 *0.06 0.520.880.510.560.63
PS 0.020.020.030.010.030.030.010.010.010.020.010.060.000.010.010.050.020.310.170.010.000.010.020.020.180.49 inf21.5353.7348.57
QJ 0.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.000.210.110.000.000.000.000.000.090.360.00 infinfinf
WZ 0.020.020.030.010.040.030.020.020.020.030.020.070.000.020.010.060.030.31 *0.180.010.040.020.020.030.180.490.020.00 46.8537.64
XS 0.000.000.010.000.020.020.000.000.000.010.000.050.000.000.000.040.020.30 *0.150.000.010.000.010.010.170.470.010.000.01 inf
YY 0.000.000.020.000.000.000.000.000.000.000.000.000.000.000.000.030.010.270.160.000.020.010.010.010.140.440.010.000.010.00

(inf) Nm is infinite; (hun) Nm > 100; (0.00) Fst ≤ 0.00; * p < 0.02; Nm = (1 - Fst)/2Fst.

Table 5

Fst (below diagonal) and Nm (above diagonal) between the 26 C. oryzae populations based on ITS1 sequences.

BGDTJCLQMGPJTJTLWQYWBZDJYDZGYLSLZQLYAFLPSNCQJSZWZXSYY
BG 11.41inf6.743.291.83inf7.893.58inf1.370.871.800.9612.380.752.231.451.72inf9.085.23infinf6.63inf
DT 0.02 huninf7.092.6949.454.643.49inf1.281.253.521.01inf0.992.211.652.28inf3.3476.44infinfinfinf
JC 0.000.00 7.134.941.6623.4721.045.36inf1.290.841.661.015.090.582.281.482.1261.783.584.37infinf4.90inf
LQ 0.040.000.03 15.249.228.346.287.2317.703.352.17hun1.23inf2.255.773.673.57inf2.36infinfinfinfinf
MG 0.070.030.050.02 inf2.49infinf2.71inf26.09inf5.002.681.11infinfinf5.221.32inf4.273.04infinf
PJ 0.120.090.130.030.00 1.5111.42inf1.41infinfinf5.202.432.52infinfinf3.960.90inf1.601.81infinf
TJ 0.000.010.010.030.090.14 2.572.56inf1.030.811.760.90inf0.901.521.251.29infinf3.69infinf8.81inf
TL 0.030.050.010.040.000.020.09 inf2.6110.312.355.702.591.900.66inf10.47inf4.111.22inf3.052.9610.68inf
WQ 0.070.070.040.030.000.000.090.00 2.30inf9.79infinf2.201.06infinfinf3.791.70inf3.762.8215.20inf
YW 0.000.000.000.010.080.160.000.090.10 0.930.761.570.82inf0.711.631.161.39inf26.123.46infinf7.00inf
BZ 0.150.160.160.070.000.000.190.020.000.21 infinfinf1.211.35infinfinf1.720.74inf1.011.136.882.90
DJY 0.220.170.230.100.010.000.230.100.020.250.00 inf5.610.941.2767.32infinf1.170.56inf0.710.794.172.22
DZ 0.120.070.130.000.000.000.120.040.000.140.000.00 5.763.437.33infinfinf6.951.04inf1.902.19inf76.44
GY 0.210.200.200.170.050.050.220.090.000.230.000.040.04 0.890.8783.64inf27.161.130.809.521.010.931.752.26
LS 0.020.000.050.000.090.090.000.120.100.000.170.210.070.22 1.812.321.421.42inf5.748.77infinfinfinf
LZ 0.25 *0.200.30 *0.100.180.090.220.270.190.260.160.160.030.220.12 1.431.420.941.810.672.630.600.933.571.39
QL 0.100.100.100.040.000.000.140.000.000.130.000.000.000.000.100.15 infinf3.311.11inf3.022.1211.247.59
YA 0.150.130.140.060.000.000.170.020.000.180.000.000.000.000.150.150.00 inf1.970.93inf1.411.368.064.11
FL 0.130.100.110.070.000.000.160.000.000.150.000.000.000.010.150.210.000.00 2.060.85inf1.711.455.865.78
PS 0.000.000.000.000.050.060.000.060.060.000.130.180.030.180.000.120.070.110.11 8.06infinfinfinfinf
NC 0.030.070.070.100.160.220.000.170.130.010.250.310.190.240.040.270.180.210.230.03 1.51infinf2.325.38
QJ 0.050.000.050.000.000.000.060.000.000.070.000.000.000.030.030.090.000.000.000.000.14 4.285.62infinf
SZ 0.000.000.000.000.060.140.000.080.060.000.200.260.120.200.000.290.080.150.130.000.000.06 infhuninf
WZ 0.000.000.000.000.080.120.000.080.080.000.180.240.100.210.000.210.110.150.150.000.000.040.00 infinf
XS 0.040.000.050.000.000.000.030.020.020.030.040.060.000.130.000.070.020.030.040.000.100.000.000.00 inf
YY 0.000.000.000.000.000.020.000.000.000.000.080.100.000.100.000.150.030.060.040.000.040.000.000.000.00

(inf) Nm is infinite; (hun) Nm > 100; (0.00) Fst ≤ 0.00; * p < 0.02; Nm = (1 - Fst)/4Fst.

The Mantel tests based on both markers did not support a significant correlation between geographic distance and genetic distance, thus excluding the effect of distance segregation on genetic differentiation (Figure 4).
Figure 4

Relationship between genetic distance (Fst/(1 - Fst)) and logarithm geographic distance of C. oryzae populations based on COI sequence (A) and ITS1 sequence (B).

3.5. Haplotype Network Analysis

To understand the relationships of identified haplotypes, the median-joining haplotype network was constructed (Figure 5). Among the 55 COI haplotypes, H1 occupied the center of the network and was shared by all populations. H1 was also the most common haplotype, accounting for 80.6% of all samples. The remaining 54 haplotypes were distributed around H1 in a star pattern. H21 and H44 were far away from H1 and mainly shared by NC, LZ, and FL populations (Figure 5A). Among the 26 ITS1 haplotypes, H1 was the ancestral haplotype shared by all populations and occupied a central position in the network. The four haplotypes, H3, H6, H9, and H10, were derived from H1 and collectively accounted for 37.5% of all samples (Figure 5B).
Figure 5

Median-joining network of haplotypes for C. oryzae based on COI haplotypes (A) and ITS1 haplotypes (B). Each circle represents a haplotype, and the area of a circle is proportional to the number of individuals with that haplotype. Colors within nodes refer to C. oryzae sampling regions.

4. Discussion

Genetic diversity, which refers to the sum of genetic variation among populations within a species or individuals within a population [1], is a fundamental guarantee for maintaining species evolution [2]. Genetic diversity is caused by the variation of genetic material and is influenced by mutation rate, effective population size, gene flow, and other factors [41]. A population with rich genetic diversity often possesses a strong adaptability to the environment, thus facilitating population outbreaks and their large-scale spread. Haplotype diversity and nucleotide diversity are the main indicators of genetic diversity. In this study, the ITS1 analysis presented that C. oryzae populations had higher haplotype diversity (Hd > 0.5) and lower nucleotide diversity (Pi < 0.005), suggesting that the population experienced a recent “Bottleneck effect” followed by a short period of rapid population expansion [42]. Meanwhile, the COI analysis revealed low haplotype diversity, probably due to different genetic patterns of molecular markers [43]. This is consistent with the finding that ITS1 possesses greater genetic diversity than COI in Halyomorpha halys [21]. Historical population demographics is one of the core elements of molecular phylogeography. The analysis of historical population demographics is in favor of understanding the effects of external environmental factors on population development and distribution, and also provides a reference for developing pest management strategies [44]. To this end, neutrality tests (Tajima’s D and Fu’s Fs values) and mismatch distributions are commonly used [45,46]. In this study, when all samples were calculated as a population, the Tajima’s D and Fu’s Fs values were significantly negative except for Tajima’s D for ITS1 data, indicating the expansion of the population size, which is consistent with the previous speculation [25]. Additionally, the extremely small Fu’s Fs values (−3.40 × 1038) implied that the population expansion occurred not so long ago as the Fu’s Fs values are more sensitive to recent population expansion [35]. Likewise, the mismatch distribution showed population expansion as well because of the single-peaked curve, insignificant SSD values, and small rg values [46]. Moreover, the haplotype network for COI with a star-shaped distribution further supported speculation of population expansion [47]. Genetic distance, AMOVA [48], Fst value [49], and Nm value [50], are important indicators of the genetic differentiation of populations. The genetic distance between populations was generally close except for NC and LZ for COI, and the genetic variation was mainly from within the population revealed by AMOVA. Similarly, the Fst and Nm values between paired populations indicated that the genetic differentiation between populations was stemmed from a few populations such as NC. In fact, the population density of NC is relatively lower than that of other populations. Whether the worse performance of the NC population is correlated with its genetic background remains unknown and necessitates further investigation. Factors such as geographic isolation and farming patterns often affect the adaptation of populations to the environment and initiate genetic differentiation [51]. The Mantel test showed that there was no significant correlation between genetic distances and geographical distances, indicating that genetic differentiation of these populations is not caused by geographical isolation, but other factors, such as tillage practices and farmland landscape patterns [52]. It has been demonstrated that frequent gene flow can improve the population’s adaptability to the environment and cause outbreaks of pests [53]. In this study, gene flow was found to have existed among C. oryzae populations, which was in line with the previous study [30]. However, the degree of gene flow varied remarkably with geographic populations, and it was probably related to the occurrence level of these populations. For instance, QJ and YY populations that showed intensive gene flow performed much better than other populations, while NC and GY populations which showed restricted gene flow occurred lightly. The genetic structure of C. oryzae has been analyzed previously by Zhou et al. [30]. Likewise, we analyzed the genetic structure of C. oryzae as well, but using more geographic populations collected from a larger area, within which the ecological environment is more diverse and the annual occurrence generation of C. oryzae changes accordingly [27]. Both studies revealed that C. oryzae populations have low or medium levels of genetic differentiation and experienced recent expansion events. However, disparities in the genetic diversity and experience of the “Bottleneck effect” were presented between two studies, probably due to the differences in sample size, molecular markers, and sampling locations. The “Bottleneck effect” refers to the dramatic variation of genetic structure owing to the sharp reduction in population size caused by deteriorated environmental conditions, such as farmland ecology and pesticide application levels [54]. For the past several decades, C. oryzae has been subjected to highly toxic pesticides, such as furadan, oxamyl, and triazophos [25]. We, therefore, speculated that the “Bottleneck effect” of C. oryzae might be caused by the abundant use of these pesticides. In addition, the recent expansion of C. oryzae may be related to changes in factors such as agroecological environments, tillage and cultivation systems, winter temperatures, and control agents [27]. In the future, the investigations of the relationship between the population dynamics of C. oryzae and farmland environments and farming practices may elucidate the causes of the “Bottleneck effect”, genetic differentiation, population expansion, and frequent outbreaks, thereby providing a theoretical basis for formulating management strategies.

5. Conclusions

This study showed that C. oryzae populations suffered from a recent “Bottleneck effect”, followed by a rapid expansion. We also speculated that genetic differentiation and gene flow among populations are responsible for the geographical differences in the occurrence level.
  37 in total

Review 1.  The biogeography of mitochondrial and nuclear discordance in animals.

Authors:  David P L Toews; Alan Brelsford
Journal:  Mol Ecol       Date:  2012-06-27       Impact factor: 6.185

2.  DnaSP v5: a software for comprehensive analysis of DNA polymorphism data.

Authors:  P Librado; J Rozas
Journal:  Bioinformatics       Date:  2009-04-03       Impact factor: 6.937

3.  Measuring local genetic variability in populations of codling moth (Lepidoptera: Tortricidae) across an unmanaged and commercial orchard interface.

Authors:  Eduardo Fuentes-Contreras; Esteban Basoalto; Pierre Franck; Blas Lavandero; Alan L Knight; Claudio C Ramírez
Journal:  Environ Entomol       Date:  2014-04       Impact factor: 2.377

4.  Evaluation of Appropriate Reference Genes For Investigating Gene Expression in Chlorops oryzae (Diptera: Chloropidae).

Authors:  Ping Tian; Lin Qiu; Ailin Zhou; Guo Chen; Hualiang He; Wenbing Ding; Youzhi Li
Journal:  J Econ Entomol       Date:  2019-09-23       Impact factor: 2.381

5.  Paternal comeback in mitochondrial DNA inheritance.

Authors:  John Vissing
Journal:  Proc Natl Acad Sci U S A       Date:  2019-01-11       Impact factor: 11.205

6.  Study on the Genetic Differentiation of Geographic Populations of Calliptamus italicus (Orthoptera: Acrididae) in Sino-Kazakh Border Areas Based on Mitochondrial COI and COII Genes.

Authors:  Ye Xu; Ji-Wei Mai; Bing-Jie Yu; Hong-Xia Hu; Liang Yuan; Roman Jashenko; Rong Ji
Journal:  J Econ Entomol       Date:  2019-08-03       Impact factor: 2.381

7.  DNA barcoding identification of Pseudococcidae (Hemiptera: Coccoidea) using the mitochondrial COI gene.

Authors:  Sizhu Zheng; Yang Li; Xiaojun Yang; Jingyun Chen; Jing Hua; Yuan Gao
Journal:  Mitochondrial DNA B Resour       Date:  2018-04-04       Impact factor: 0.658

8.  The complete mitogenome of Chlorops oryzae Matsumura (Diptera: Chloropidae).

Authors:  Jia Wang; Xian-Ya Li; Run-Bang Du; Ying-Hong Liu
Journal:  Mitochondrial DNA B Resour       Date:  2021-06-03       Impact factor: 0.658

9.  Molecular phylogeny of Anopheles hyrcanus group members based on ITS2 rDNA.

Authors:  Yuan Fang; Wen-Qi Shi; Yi Zhang
Journal:  Parasit Vectors       Date:  2017-09-07       Impact factor: 3.876

10.  The utility of mtDNA and rDNA for barcoding and phylogeny of plant-parasitic nematodes from Longidoridae (Nematoda, Enoplea).

Authors:  J E Palomares-Rius; C Cantalapiedra-Navarrete; A Archidona-Yuste; S A Subbotin; P Castillo
Journal:  Sci Rep       Date:  2017-09-07       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.