Literature DB >> 26163593

Characterization of 12 Novel Microsatellite Markers of Sogatella furcifera (Hemiptera: Delphacidae) Identified From Next-Generation Sequence Data.

Hwa Yeun Nam1, Brad Coates2, Kyung Seok Kim2, Marana Park1, Joon-Ho Lee3.   

Abstract

The white-backed planthopper, Sogatella furcifera (Horváth) (Hemiptera: Delphacidae), is a major pest of rice and has long-range migratory behavior in Asia. Microsatellite markers (simple sequence repeats) have been widely used to determine the origins and genetic diversity of insect pests. We identified novel microsatellite loci for S. furcifera samples collected from Laos, Vietnam, and three localities in Bangladesh from next-generation Roche 454 pyrosequencing data. Size polymorphism at 12 microsatellite loci was verified for 40 adult individuals collected from Shinan, South Korea. The average number of alleles per locus was 7.92. The mean values of observed (H(o)) and expected heterozygosities (H(E)) were 0.615 and 0.757, respectively. These new microsatellite markers will be a resource for future ecological genetic studies of S. furcifera samples across more broad geographic regions in Asia and may assist in estimations of genetic differentiation and gene flow among populations for implementation of more effective management strategies to control this serious rice pest.
© The Author 2015. Published by Oxford University Press on behalf of the Entomological Society of America.

Entities:  

Keywords:  Sogatella furcifera; genetic diversity; microsatellite marker

Mesh:

Year:  2015        PMID: 26163593      PMCID: PMC4535569          DOI: 10.1093/jisesa/iev069

Source DB:  PubMed          Journal:  J Insect Sci        ISSN: 1536-2442            Impact factor:   1.857


The white-backed planthopper (Horvárth) (Hemiptera: Delphacidae) is a major insect pest of rice throughout South and East Asia (Dyck and Thomas 1979). Prior to 1978, Sogatella furcifera caused intermittent damage in cultivated rice (Tang et al). However, as the adoption of hybrid rice varieties extended into South and East Asia, crop damage caused by S. furcifera also increased (Tang et al, Sogawa et al. 2009). Subsequent outbreaks of S. furcifera have been reported in China, Japan, and Korea at the beginning of the 21st century (Zhai and Cheng 2006, Otuka et al. 2007) and has increased concerns over increasing crop damage and reduced yields. This species has long-range migratory patterns, which include known winter hibernation in the Indochina Peninsula followed by northward migration into Southern and Central China by way of the southwest winds from March to July (National Coordinated Research Group for Whiteback Planthopper 1981). In Korea, S. furcifera immigrates by atmospheric currents from mid-June to late-July. This northward movement from overwintering regions coincides with the early stages rice cultivation in nurseries, and the subsequent effects on plant health can be highly devastating (Park 1973, Kisimoto 1976, Asahina and Tsruoka 1986, Uhm et al). In many cases, however, such as in Korea, the primary overwintering S. furcifera source population that migrates into rice production regions remains unclear (Sogawa 1997). Therefore, identification of population genetic structure may provide the migration routes and origin source of S. furcifera. The population structure of S. furcifera has previously been investigated using variation in the mitochondrial cytochrome c oxidase subunit I (COI) sequence Mun et al and inter-simple sequence repeat (ISSR) polymerase chain reaction (PCR) markers (Liu et al. 2010). Mun et al described limited variation in a 823-bp fragment of COI among S. furcifera from four countries (China, Korea, Philippines, and Vietnam), wherein two haplotypes were identified. The only one T COI haplotype appeared to be fixed among individuals from Malaysia (n = 4) might be a result of sampling error (Mun et al). Analogously, Liu et al detected little genetic variation among 47 S. furcifera populations using ISSRs. Microsatellite or simple sequence repeat (SSR) markers detect changes in the number of tandem repeats in arrays of one to six nucleotides at a single locus and are one of widely applied molecular tools for population genetic studies (Valdes et al. 1993, Akkaya et al. 1995, Schuler et al. 1996, Kelkar et al. 2010, Park et al. 2013). Microsatellites have been widely applied for the estimation of genetic diversity and gene flow (Kim et al, 2010), as well as migratory pattern in insects (Zhang et al, Wei et al). In this study, we aimed to use next-generation Roche 454 pyrosequencing technology to develop the microsatellite markers (SSR) which facilitate the rapid identification of tandem repeats among sequences (Csencsics et al). In past, several studies used enrichment analysis to develop molecular markers (SSR), but there are limitation of this analysis: redundancy, multiple-process representation, and poor specificity (Squirrell et al). Also, we may not need long sequence read as achieved by Sanger sequencing and short sequence read of this technology provide sufficient opportunities for development of new application, which benefit from the particular fragment of the molecular (Morozova and Marra 2008). Roche 454 pyrosequencing technology demand shorter time and read longer length compare to other technologies, Illumina Genome Analyzer and AB SOLiD (Shendure and Ji 2008). The polymorphism and utility of these 12 microsatellite markers for future estimation of population genetic parameters were assessed by genotyping S. furcifera samples collected from Shinan, Korea, in 2012.

Materials and Methods

Sample Collection and DNA Extraction

Individuals of the white-backed planthopper, S. furcifera, were collected from nine locations in six countries: Laos, Nepal, Thailand, Vietnam, two sites in Korea, and three sites in Bangladesh (Table 1) by institutions of AFACI (Asian Food and Agriculture Cooperation Initiative) in 2012 and placed in 95% ethanol and stored at −20°C. DNA was extracted from pools of individuals (without abdomen) for each location using the QIAamp DNA Mini Kit (Qiagen, Germany). DNA samples were sent to the USDA-ARS Corn Insect & Crop Genetic Research Unit (CICGRU) in Ames, IA. Agarose gel electrophoresis determined that some DNA samples had decomposed during shipment to the United States such that high molecular weight DNA library construction could not be completed. Thus, DNA samples collected of five locations in three countries (Laos, Vietnam, and three sites in Bangladesh) were determined to be of high quality and were used for Roche 454 sequencing based on the approach described by Puritz and Toonen (2013).
Table 1.

Sampling information of S. furcifera

CountrySample siteSampling dateSample sizeCoordinates
Bangladesh 1Tarash, SirajgonjMay 10, 20124524° 23'45.0” N; 89° 22'39.7” E
Bangladesh 2BRRi R/S, GazipurNov. 7, 20126023° 98'60.3” N; 90° 41'14.6” E
Bangladesh 3Sagordi, BarisalSept. 27, 20126322° 67'71.6” N; 90° 36'43.4” E
LaosVientiane, PhontongAug. 11, 20125018° 30'39.0” N; 102° 25'05.7” E
VietnamNam Dinh, Hai LocSept. 28, 20128720° 10'55.85” N; 106° 20'07.79” E
Sampling information of S. furcifera

Roche 454 Library Preparation and Sequencing

Approximately 0.5 µg of total genomic DNA from each location was digested with 5U each of EcoRI and PstI enzymes in a 40 μl reaction at 37°C for 4 h and then ligated to adapters EcoRI- and PstI-Ad (Vuylsteke et al) as described by Vos et al. Next digested genomic fragments from each population were PCR amplified separately using 4.0 μl of digested and ligated DNA template, 10 μM of each primer PriA_454M04_Eco and PriB_454M11_Pst (Table 2), and 1.0 U LongAmp high-fidelity DNA polymerase (New England Biolabs, Ipswich, MA) according to manufacturer's instructions. Thermocycler conditions included initial denaturation of 95°C for 2 min, then a six touchdown cycles of 95°C for 30 s, 65°C for 30 s −2°C per cycle, and 65°C for 2 min, then 32 cycles of 95°C for 30 s, 55°C for 20 s, and 65°C for 2 min. Resulting PCR products were checked on 1.5% agarose and then residual single-stranded primer digested from successful reaction products with Exonuclease I and shrimp alkaline phosphatase (New England Biolabs) at 37°C for 1 h, then inactivated at 72°C for 5 min. Reaction products were further purified using silica-based columns with the IBI Gel/PCR DNA Fragment Extraction Kit (Peosta, IA) according to manufacturer's instructions and eluted DNA quantified on an NanoDrop2000 (Thermo Scientific, Wilmington, DE). PCR products were combined in approximate equal ratios into a single pool and diluted to 2.0 ng μl−1. The pool of genomic DNA representative of all geographic locations was sent to the University of Illinois, W.M, Keck Center for Comparative and Functional Genomics (Urbana, IL), High-Throughput Sequencing, from which a single barcoded library was generated. Sequence data were generated from this single library on an ∼1/4 of a Roche 454 titanium plate.
Table 2.

Primer sequences used to create pooled plant hopper libraries for sequencing on the Roche 454

Primer sequences used to create pooled plant hopper libraries for sequencing on the Roche 454

Sequence Quality Filtering and Microsatellite Predictions

Three files, .sff, .fna, and .qual, were obtained from the W.M. Keck Center for Comparative and Functional Genomics. The .fna and .qual files were trimmed of quality scores (q) < 20, homopolymer stretches ≥ 6 nt, and minimum length = 100 using the script 454QC.pl from the NGSqcToolkit (Patel and Jain 2012). The resulting filtered reads were used as input for the program SciRoKo (Koefler et al) that identified mismatched microsatellite di-, tri-, tetra-, and pentanucleotide repeat motifs > 3 units in length, and a base substitution mismatch penalty of 5, seed length of 8, and max mismatches of 1. Sequence ± 300 bp of each predicted motif was obtained using a custom PERL script and output into a .fasta file. Since redundant genomic sequences likely remained among the 454 reads, highly similar sequences were assembled into contigs using CAP3 (Huang and Madan 1999). These assemble contigs and singletons containing trinucleotide repeat microsatellite repeats were extracted from this .fasta file using the script SelectSeqs.pl, and output into a separate .fasta file, and used as input for BatchPrimer3 using the SSR screening and design option (program available at http://probes.pw.usda.gov/batchprimer3/index.html).

Microsatellite Marker Development

To verify the utility of microsatellite markers, we used the S. furcifera samples (n = 40) which were collected from Shinan, South Korea in 2012. Genomic DNA was extracted from the whole body of individuals without abdomen using the AccuPrep DNA Extraction Kit (BIONEER). Amplification of microsatellite maker was conducted in total 10 μl reaction volume to genotype the samples: 4.9 μl distilled water, 1.0 μl 10X PCR buffer, 1.0 μl 10 mM dNTP Mixture, 0.5 μl of each primer, 0.1 μl of Taq polymerase, and 2.0 μl template DNA. The PCR was performed under the following conditions: initial denaturation for 4 min at 94°C, followed by 35 cycles of 94°C for 30 s, annealing at 61°C for 30 s, 72°C for 40 s, a final extension was performed at 72°C for 15 min. The forward primer was labeled with fluorescent dye (HEX, 6FAM, or NED dyes) and multiplexed in two groups (Table 3). Amplified PCR mixtures were separated electrophoresed on an ABI Prim 3730 XL DNA Analyzer (Applied Biosystems Inc.) with the GENESCAN-500 (Rox) size standard. The genotype data were analyzed by using GeneMapper version 3.7 (Applied Biosystems Inc.).
Table 3.

Setting for multiplexing of 12 trinucleotide microsatellite markers

Multiplex6-FAM (blue)HEX (green)NED (yellow)
Multiplex 1T11T13, T16T5, T9, T17
Multiplex 2T3, T4, T15T7, T18T8
Setting for multiplexing of 12 trinucleotide microsatellite markers GenAlEx 6.501 (Peakall and Smouse 2006) was used to calculate the three measure of genetic diversity: the number of alleles (A) per locus, observed heterozygosity (Ho), and heterozygosity expected under Hardy–Weinberg assumptions for each locus (HE). The inbreeding coefficient at each locus, FIS, imply a deficiency of heterozygotes compared with that expected under Hardy–Weinberg equilibrium (HWE). HWE was tested for each locus in the population by using GENEPOP version 4.2.1 (Raymond and Rousset 1995). The possible occurrence of null alleles was tested using the program MICROCHECKER (Oosterhout et al). This program estimates the frequency of null alleles and modifies the allele and genotype frequencies of the amplified alleles. We used an admixture model and correlated allele frequency among populations to define the ancestry of individuals. The population structure of S. furcifera in Shinan was examined by the Bayesian clustering procedure implemented in the STRUCTURE 2.3.3 (Pritchard et al). The number of Markov chain Monte Carlo replications was set to 200,000 iterations after a burn-in period of 100,000. Likelihood values, Ln Pr(X/K), was identified for each of the 10 STRUCTURE runs at K values from 1 to 10. Also, the mean posterior probability for each K and delta K analysis of LnP(D) was calculated.

Results

PCR-amplified libraries were successfully PCR amplified from pools S. furcifera individuals and showed approximate size range of 200–500 bp (results not shown) and were used for Roche 454 sequencing. A total of 32.8 million nucleotides were obtained from Roche 454 sequencing of 199,878 raw reads (N50 = 299 bases; minimum length = 40; maximum length = 1,053), whereas after filtering the read count was reduced to 150,895 containing 29.8 million nucleotides (N50 = 339; minimum length = 100; maximum length = 677; mean q = 38.57; Fig. 1). SciRoKo output identified a total of 636 filtered Roche 454 fragments containing 17 unique repeat motifs, wherein a majority (n = 291) contained (AAG) repeats (Table 4, Supp File 1 [online only]). The number of repeat units in each array had a mean of 8.2 ± 4.1 (Supp File 1 [online only]). Because of potential redundancy among independent reads, highly similar sequences were merged into contigs. The CAP3 assembly of microsatellite containing sequences putatively representative of those from the same genetic locus resulting in merger of 506 sequences being merged into 37 contigs (13.7 ± 25.6 sequences and 253 ± 99 bp per contig; Supp File 2 [online only]). Thirty-nine microsatellite containing sequences were unique and were not assembled into contigs (singletons). Redundancy among next-generation sequencing (NGS) reads have been used to predict allelic variation at a microsatellite locus in prior studies (Vukosavljev et al. 2015), clustering assumes that similar sequences are derived from homologous genetic loci and can oftentimes lead to spurious results (see Discussion).Regardless, these fragments in this study containing putative microsatellite motifs ranged in size from 100 to 780 bp (mean = 220.7 ± 80 bp; Supp File 3 [online only]). Filtering of microsatellites further to only those with trinucleotide motifs was conducted using BatchPrimer3 and resulted in oligonucleotide primer pairs being successfully designed for 22 loci with tri-nucleotide repeat motifs (Supp Table 1 [online only]).
Fig. 1.

Roche 454 read length (A) and quality statistics (B) for raw and filtered sequence data.

Table 4.

Summary statistics for microsatellite motifs predicted from filtered S. furcifera 454 read data

MotifCountsAverage_lengthAverage_mismatchesCounts/Mb
AG5733.61.052.51
AC1817.30.220.79
AT1415.90.000.62
AAG29124.60.3112.82
AGG2125.31.050.93
AGC931.11.000.40
ACG917.70.000.40
AAC723.71.290.31
ATC320.00.330.13
AAGG4147.75.881.81
AAAT1221.80.080.53
ACTC1116.60.000.48
ACGT574.00.800.22
AAGT317.00.000.13
ACGTC11216.00.004.94
AACGT453.34.000.18
AGCGC322.71.000.13
Roche 454 read length (A) and quality statistics (B) for raw and filtered sequence data. Summary statistics for microsatellite motifs predicted from filtered S. furcifera 454 read data By processing PCR amplification, primarily we examined electrophoresis to inspect the size range of the markers. Moreover, we used GeneMapper version 3.7 (Applied Biosystems Inc.) program and verified the allele size and the real peak in the size range of the 21 microsatellite marker. As a result, we selected proper 12 microsatellite markers (Table 5), WBPH_T3, WBPH_T4, WBPH_T5, WBPH_T7, WBPH_T8, WBPH_T9, WBPH_T11, WBPH_T13, WBPH_ T15, WBPH_T16, WBPH_T17, and WBPH_T18.
Table 5.

Twelve proper microsatellite markers for S. furcifera

LocusPrimer sequence (5’–3’)Repeat motifMean of NNo. allelesSize range (bp)HoHEDFIS
WBPH_T3F: (6-FAM)- CGACAGCACGTACTCCTGCTTGAG408208–2500.5750.739*0.2337
R: ACACGACGCTCTTCCTCCTTC
WBPH_T4F: (6-FAM)- GGAAGAAACGGATGGAATTACGAGA405103–1560.8750.558*−0.5582
R: ACGACGCTCTTCCTCCTCATC
WBPH_T5F: (NED)- TTCCAATCCTGCTTACAGTCCAATTC407200–2500.4000.655*0.4000
R: GCGTACATGCAGTGGACAGAT
WBPH_T7F: (HEX)- CCCTCTTCTCTCGCCCTCTGAC40869–1260.6750.788*0.1563
R: GTCGTGCTGAGGCTCGTC
WBPH_T8F: (NED)- TCAGCCAGAGCTGTAGAATCAAAGA4010100–1370.2250.804*0.7261
R: CAGCGTCTCTGTCCATTCG
WBPH_T9F: (NED)- GCCGCCCAGTTCTGTAAAGTCGCA40956–990.8750.852*−0.0149
R: CTGATGCTGCCGCTGTTGT
WBPH_T11F: (6-FAM)- CTAAAACGCTCGCGTCTGCGAT409295–3590.6250.703*0.1232
R: GCTCAGTCAGTGATACGTCTTCG
WBPH_T13F: (HEX)- GCCTCCTCTGCTGTTGAGAAAGAA407356–3860.5000.786*0.3604
R: CATTGGCCATCTTGGTGACTG
WBPH_T15F: (6-FAM)- GCGCGCGCATATATACAGTTGCGT408157–2110.5000.786*0.3747
R: AAGCGACGCAAGTGACGATAA
WBPH_T16F: (HEX)- GGGTACACCGTTCGAGTCGTTCGT4010202–2520.6000.833*0.2914
R: CCGCTCAGTCAGTGATACGC
WBPH_T17F: (NED)- TCCTGAGGCACGCTAACTGACATC408340–3770.7000.807*0.1445
R: CTTGTGCGTGGGTCATGAGAT
WBPH_T18F: (HEX)- GTGCGAAGGGAAATGCAGAAGGAA406183–2250.8250.774NS−0.0528
R: TTCTCCATCGCATCTCTTGTTCT
Across loci7.920.6150.757*0.1820

Microsatellite primer sequences with fluorescent labeled dyes, repeat motifs, mean of individuals (N), number of alleles (A), size of PCR products in base pairs (bp), expected heteozygosity (HE), observed heterozygosity (Ho), D, deviation from HWE; NS, not significant.

*Significant deviations from Hardy–Weinberg expectations (P < 0.05) and inbreeding (FIS) are shown.

Twelve proper microsatellite markers for S. furcifera Microsatellite primer sequences with fluorescent labeled dyes, repeat motifs, mean of individuals (N), number of alleles (A), size of PCR products in base pairs (bp), expected heteozygosity (HE), observed heterozygosity (Ho), D, deviation from HWE; NS, not significant. *Significant deviations from Hardy–Weinberg expectations (P < 0.05) and inbreeding (FIS) are shown. These 12 microsatellite loci were genotyped for 40 individual specimens and samples were collected from Shinan, Korea, in 2012. All microsatellite markers were polymorphic, with the alleles per locus ranging from 5 to 11 (mean = 7.92). The mean Ho and HE values were 0.615 (0.225–0.875) and 0.757 (0.558–0.852), respectively (Table 5). The FIS ranged from −0.5582 in WBPH_T4 to 0.7261 in WBPH_T8, with a mean of 0.182 across loci. Most of the loci showed significant deviation from HWE in the direction of heterozygote deficiency, and only one loci WBPH_T18 had no significant deviation. Through the program MICROCHECKER, WBPH_T3 (0.1187), WBPH_T5 (0.1939), WBPH_T8 (0.349), WBPH_T13 (0.1838), WBPH_T15 (0.1769), and WBPH_T16 (0.1345) appeared to harbor possible null alleles; however, further verification including other random mating populations is required since the program MICROCHECKER estimates null alleles for the population assuming the population under HWE. Structure 2.3.3 software was applied to determine the population structure of 40 S. furcifera sampled from Shinan in 2012. In addition, likelihood values and delta K analysis of LnP(D) have been examined (Evanno et al) (Fig. 2). The highest likelihood values in all runs were determined for K = 1, implying that the S. furcifera from Shinan constitute a single genetic cluster. Therefore, the 40 S. furcifera specimen collected from Shinan where the occurrence of the species was reported in Korea have genetically similar ancestry with no distinct genetic structuring among them.
Fig. 2.

Plot of mean posterior probability (LnP(D)) values per clusters (K), based on 10 iterations per K, from STRUCTURE analyses (Pritchard et al. 2000) and delta K analysis of LnP(D) (Evanno et al) to estimate the genetic structure of the 40 S. furcifera specimens sampled from Shinan in 2012.

Plot of mean posterior probability (LnP(D)) values per clusters (K), based on 10 iterations per K, from STRUCTURE analyses (Pritchard et al. 2000) and delta K analysis of LnP(D) (Evanno et al) to estimate the genetic structure of the 40 S. furcifera specimens sampled from Shinan in 2012.

Discussion

The isolation of microsatellite loci from genomic DNA sources had previously been laborious and time-consuming due to methods requiring enrichment, cloning, plasmid purification, and Sanger sequencing (Nunome et al, Techen et al), which reduced the number of candidate loci that could be feasibly identified. In contrast, NGS methods have increased the throughput and cost effectiveness of reduced representation library sequencing and has allowed the rapid identification of tandem repeats among sequences in genomic and expressed sequence tag libraries (Csencsics et al, Agunbiade et al). In this study, we identified 636 candidate microsatellite repeats from Roche 454 read data. Although the proportion of fragments containing tandem repeats was low (636 of 150,895 filtered reads = 0.42%), output from mass sequencing platforms offer sufficient raw data in which to discover microsatellite sequence elements. Microsatellite repeats within these Roch 454 did not contain GC-rich repeats such as (CG) or (CGG) arrays, which may be due to our methods of subsampling the S. furcifera genome from PstI and EcoRI sites. Specifically, microsatellites repeats of a particular nucleotide composition may not be randomly distributed in a genome, such that our methods used for library construction biased the repeats captured within the resulting sequence data. Whole-genome shotgun sequencing approaches may prove to be more appropriate to obtain a more random sampling of loci but was not investigated in this study. These discoveries let to the development of molecular genetic markers for 12 microsatellite loci and have been successfully applied estimates of polymorphism at these loci to obtain population genetics parameters for 40 S. furcifera samples collected in Korea. All markers were developed from loci containing trinucleotide repeats, which, although generally more infrequent compared with dinucleotide repeat microsatellite loci (Schug et al. 1998), tend to produce less Taq polymerase-induced stutter bands that increases accuracy of downstream genotyping (Edwards et al. 1991). Eleven out of 12 microsatellite loci indicated significant heterozygote deficiencies in Shinan population, and MICROCHECKER analysis showed the signs of “null alleles” in six microsatellite loci (WBPH_T3, WBPH_T5, WBPH_T8, WBPH_T13, WBPH_T15, and WBPH_T16). “Null alleles” are not uncommon among microsatellite markers and is generally considered to be caused by the failure of PCR to amplify one or more allele in a population due to mutations in the regions of genomic DNA that flank the tandem repeat, such that oligonucleotide primer binding does not occur. Alternatively, the observed heterozygote deficiencies might possibly owing to nonrandom sampling or actual increased frequency of homozygote genotypes in the population. Therefore, additional studies are required to estimate Ho among a broader sampling of populations and will indicate if these six markers can be applied for population genetics studies after analytical methods are used to correct for null alleles (Chapuis and Estoup 2007, Kim et al. 2009). Although molecular marker-based S. furcifera population genetic studies have been previously conducted, the current set of microsatellite markers comparatively offer distinct advantages including codominant alleles, scalability, and increased assay reproducibility. By comparison, maternally inherited mitochondrial haplotypes allow only the estimation of female movements, and direct Sanger sequencing methods such as those used by Mun et al are not cost effective for large-scale population genetic studies. Additionally, mitochondrial markers have recently been shown incapable of detecting regional variation among S. furcifera populations (Matsumoto et al). ISSR markers were previously applied for estimation of genetic differentiation among S. furcifera populations (Liu et al). Although ISSR methods offer a relatively easy and inexpensive access to molecular genetic study (Cichorz et al), yet several researchers doubts about their reproducibility, ascendancy, and homology (Hansen et al). More recently, 21 microsatellite markers were developed from S. furcifera expressed sequence tag resources and were used in preliminary genotyping of samples from China (Sun et al). Several complications have been encountered in past efforts to develop molecular microsatellite markers of S. furcifera. First, Zhang et al observed that tandem repeats between the two S. furcifera individuals were identical, and only a few nucleotide variations were revealed, and moreover that the flanking sequences of this repeat region between the two S. furcifera individuals were exactly same except for one site difference, which suggests that S. furcifera population may have low overall levels of molecular diversity. Similar results were observed in this study from studying sequence data alone. Specifically, the CAP3-assembled contigs showed little copy number variation among microsatellite repeat arrays, with the exceptions of Contig 11 which showed (TCT)9–11 repeat, Contig 17 with a (GAA)8–11 repeat, and Contig 31 with a (TCT)9–11 repeat (Supp File 2 [online only]). All other mutations were within microsatellite flanking sequences and provided empirical evidence that our data could detect variation at 3 of 37 loci for which redundant sequences had been obtained. Although prior studies have used redundancy among NGS reads to predict allelic variation at a microsatellite locus (Vukosavljev et al. 2015), clustering of sequences makes assumptions that all similar sequences are derived from one homologous genetic loci. Quite to the contrary, it has been shown that microsatellite repeats are often located within highly repetitive genome regions wherein flanking sequences tend to be highly similar (Megle’cz et al. 2004, 2007; Zhang 2004; Van't Hof et al. 2007). This phenomenon may be partially influenced by both integration of transposable DNA elements into microsatellites (Coates et al. 2010) and transposition of microsatellite repeats within transposons themselves (Coates 2015; Coates et al. 2011, 2012). This suggests that microsatellite sequences that are highly prevalent among raw nonnormalized sequence reads may indeed be multicopy within a genome. Second, a S. fucifera repeat unit had also been previously shown to be within the genome of the small brown planthopper, Laodelphax striatellus, which belong to different genera of Delphacidae, and these shared microsatellite sequences were almost identical except for mutations or single nucleotide deletions in a few repeat units (Zhang et al). Therefore, such applications aimed to estimate allelic variation may be flawed due to specious clustering of nonhomologous sequence data. Indeed, validation for variation at a single locus tends to be accurate using empirical PCR-amplified microsatellite marker data (Agunbiade et al. 2012) and was the tactic used in our work. These 12 new microsatellite markers will facilitate the study of the gene flow and migration route of S. furcifera in Asia. For example, these microsatellites could be suitable to elucidate invasion routes of the insects form China to Korea using the Approximate Bayesian Computation method as was done to manifest frequent and in progress introduction of western corn rootworm from North to Europe (Miller et al). These markers can be used to the study migration and genetic structure of insect genetic structure data of migration insect. For example, Llewellyn et al used microsatellite to study genetic variability of the grain aphid, Sitobion avenae, in Britain relate to climate and clonal fluctuation. These markers are also suitable to explain migration routes of insects from Korea and other countries. These markers can characterize the migration patterns, gene flow, and genetic connectivity among geographic populations of S. furcifera in Asia. Successful results from this examination will contribute to map effectively strategies for the migration route and origin of this species.

Supplementary Data

Supplementary data are available at Journal of Insect Science online.
  42 in total

1.  Inference of population structure using multilocus genotype data.

Authors:  J K Pritchard; M Stephens; P Donnelly
Journal:  Genetics       Date:  2000-06       Impact factor: 4.562

2.  A novel class of miniature inverted repeat transposable elements (MITEs) that contain hitchhiking (GTCY)(n) microsatellites.

Authors:  B S Coates; J A Kroemer; D V Sumerford; R L Hellmich
Journal:  Insect Mol Biol       Date:  2010-10-26       Impact factor: 3.585

Review 3.  Applications of next-generation sequencing technologies in functional genomics.

Authors:  Olena Morozova; Marco A Marra
Journal:  Genomics       Date:  2008-08-24       Impact factor: 5.736

4.  AFLP: a new technique for DNA fingerprinting.

Authors:  P Vos; R Hogers; M Bleeker; M Reijans; T van de Lee; M Hornes; A Frijters; J Pot; J Peleman; M Kuiper
Journal:  Nucleic Acids Res       Date:  1995-11-11       Impact factor: 16.971

5.  The distribution and frequency of microsatellite loci in Drosophila melanogaster.

Authors:  M D Schug; K A Wetterstrand; M S Gaudette; R H Lim; C M Hutter; C F Aquadro
Journal:  Mol Ecol       Date:  1998-01       Impact factor: 6.185

6.  Allele frequencies at microsatellite loci: the stepwise mutation model revisited.

Authors:  A M Valdes; M Slatkin; N B Freimer
Journal:  Genetics       Date:  1993-03       Impact factor: 4.562

7.  Cost-effective, species-specific microsatellite development for the endangered Dwarf Bulrush (Typha minima) using next-generation sequencing technology.

Authors:  Daniela Csencsics; Sabine Brodbeck; Rolf Holderegger
Journal:  J Hered       Date:  2010-06-18       Impact factor: 2.645

8.  Spatial and temporal genetic analyses show high gene flow among European corn borer (Lepidoptera: Crambidae) populations across the central U.S. corn belt.

Authors:  Kyung Seok Kim; Mark J Bagley; Brad S Coates; Richard L Hellmich; Thomas W Sappington
Journal:  Environ Entomol       Date:  2009-08       Impact factor: 2.377

9.  Unexpected relationships of substructured populations in Chinese Locusta migratoria.

Authors:  De-Xing Zhang; Lu-Na Yan; Ya-Jie Ji; Godfrey M Hewitt; Zu-Shi Huang
Journal:  BMC Evol Biol       Date:  2009-06-28       Impact factor: 3.260

10.  Genetic structure and demographic history reveal migration of the diamondback moth Plutella xylostella (Lepidoptera: Plutellidae) from the southern to northern regions of China.

Authors:  Shu-Jun Wei; Bao-Cai Shi; Ya-Jun Gong; Gui-Hua Jin; Xue-Xin Chen; Xiang-Feng Meng
Journal:  PLoS One       Date:  2013-04-02       Impact factor: 3.240

View more
  1 in total

1.  Development and Characterization of Novel Microsatellite Markers for the Peach Fruit Moth Carposina sasakii (Lepidoptera: Carposinidae) Using Next-Generation Sequencing.

Authors:  You-Zhu Wang; Li-Jun Cao; Jia-Ying Zhu; Shu-Jun Wei
Journal:  Int J Mol Sci       Date:  2016-03-15       Impact factor: 5.923

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.