Literature DB >> 35651765

Complete Chloroplast Genome of the Inverted Repeat-Lacking Species Vicia bungei and Development of Polymorphic Simple Sequence Repeat Markers.

Ick-Hyun Jo1, Seahee Han2, Donghwan Shim3, Hojin Ryu4, Tae Kyung Hyun5, Yi Lee5, Daeil Kim6, Yoon-Sup So7, Jong-Wook Chung5.   

Abstract

Background: Vicia bungei is an economically important forage crop in South Korea and China. Although detailed genetic and genomic data can improve population genetic studies, conservation efforts, and improved breeding of crops, few such data are available for Vicia species in general and none at all for V. bungei. Therefore, the main objectives of this study were to sequence, assemble, and annotate V. bungei chloroplast genome and to identify simple sequence repeats (SSRs) as polymorphic genetic markers.
Results: The whole-genome sequence of V. bungei was generated using an Illumina MiSeq platform. De novo assembly of complete chloroplast genome sequences was performed for the low-coverage sequence using CLC Genome Assembler with a 200-600-bp overlap size. Vicia bungei chloroplast genome was 130,796-bp long. The genome lacked an inverted repeat unit and thus resembled those of species in the inverted repeat-lacking clade within Fabaceae. Genome annotation using Dual OrganellarGenoMe Annotator (DOGMA) identified 107 genes, comprising 75 protein-coding, 28 transfer RNA, and 4 ribosomal RNA genes. In total, 432 SSRs were detected in V. bungei chloroplast genome, including 64 mononucleotides, 14 dinucleotides, 5 trinucleotides, 4 tetranucleotides, 233 pentanucleotides, 90 hexanucleotides, and 14 complex repeated motifs. These were used to develop 232 novel chloroplast SSR markers, 39 of which were chosen at random to test amplification and genetic diversity in Vicia species (20 accessions from seven species). The unweighted pair group method with arithmetic mean cluster analysis identified seven clusters at the interspecies level and intraspecific differences within clusters.
Conclusion: The complete chloroplast genome sequence of V. bungei was determined. This reference genome should facilitate chloroplast resequencing and future searches for additional genetic markers using population samples. The novel chloroplast genome resources and SSR markers will greatly contribute to the conservation of the genus Vicia and facilitate genetic and evolutionary studies of this genus and of other higher plants.
Copyright © 2022 Jo, Han, Shim, Ryu, Hyun, Lee, Kim, So and Chung.

Entities:  

Keywords:  SSR marker; Vicia bungei; chloroplast genome; inverted repeat-lacking clade; phylogeny

Year:  2022        PMID: 35651765      PMCID: PMC9149428          DOI: 10.3389/fpls.2022.891783

Source DB:  PubMed          Journal:  Front Plant Sci        ISSN: 1664-462X            Impact factor:   6.627


Introduction

Vicia L. is a genus in the Fabaceae family containing approximately 180–210 species. These species are widely distributed across temperate regions of the northern hemisphere and extend to temperate regions of South America and tropical Africa (Hanelt and Mettin, 1989). Vicia species are used as green manure, cover, forage, and honey crops, making it an economically important genus and a valuable genetic resource (Montemurro et al., 2013). Despite their high economic value, very few genetic and genomic data are available for species of Vicia, other than Vicia villosa (hairy vetch) and Vicia faba (broad bean). Vicia bungei, native to South Korea and China, is phenotypically and ecologically similar to Vicia americana (Endo et al., 2000). Although several chloroplast genomes of different Vicia species have been obtained through next-generation sequencing (NGS) (Cooper et al., 2017; Li et al., 2018; Xin and Yang, 2020), to date, no genetic or genomic studies have been conducted on V. bungei, despite the potentially valuable genetic resources present in wild varieties of this species. In addition, more effective molecular markers are required to support the phylogenetic and population genetic studies underlying the identification, conservation, utilization, and breeding of Vicia species. The chloroplast genome has long been a focus of research into plant molecular evolution and systematics because of its small size, high copy number, and conservation among species. It has been extensively characterized at the molecular level (Kim et al., 2005). Recent technical advances in NGS technologies mean that the number of completely sequenced chloroplast genomes has increased rapidly, and such sequences play a progressively important role in the identification of molecular markers and in molecular phylogenetic analyses (Kim et al., 2016). Chloroplast genetic markers are potentially more effective indicators of population subdivision and differentiation than are nuclear markers (Schaal et al., 1998; Petit et al., 2005). Chloroplast simple sequence repeats (cpSSRs), generally defined as microsatellites with tandem repeats of 1–6 bp, are valuable resources for assessing genetic and genome diversity, as well as in phylogenetic and systematic evolutionary analyses; cpSSRs have several advantageous characteristics, including haploidy, non-recombination, uniparental inheritance, and low nucleotide substitution rate (Powell et al., 1995). The chloroplast genome can provide unique insight into evolutionary processes as it retains ancient genetic patterns (Provan et al., 2001). Moreover, as the genetic information in angiosperm chloroplasts is inherited maternally, chloroplast markers serve as useful indicators of maternal ancestry (Raveendar et al., 2015). A new complete chloroplast genome sequence for V. bungei has been generated and compared with sequences of related genera in the Fabaceae family. This comparison enabled the development of cpSSR markers for future population genetics studies, explaining the structure. Our findings help explaining the structure of the complete V. bungei chloroplast genome and reveal evolutionary relationships within the genus Vicia.

Materials and Methods

Plant Material and DNA Extraction

Leaves from 1-year-old V. bungei plants were collected from the Industrial Plant Science and Technology greenhouse at Chungbuk National University (Cheongju, South Korea; 36° 37′ 44.3″ N, 127° 27′ 02.5″ E), immediately snap frozen in liquid nitrogen, and stored at −80°C until analysis. DNA was extracted using automated QIACube system (Qiagen, Hilden, Germany) with a DNeasy Plant Mini Kit (Qiagen, Hilden, Germany) according to the manufacturer’s protocol. Qualitative and quantitative assessment of the DNA samples was performed by spectrophotometry (NanoDrop; Thermofisher, Waltham, MA, United States) before their use in the DNA Sequencing.

Sequencing, Assembly, Phylogenetic Relationships, and Comparison of Chloroplast Genome

DNA Sequencing was conducted using the Illumina MiSeq sequencing platform (Illumina, San Diego, CA, United States), and 2.0 Gb of sequence data was generated. Quality trimming and assembly of the reads were achieved using the dnaLCW method (Kim et al., 2015) and CLC Assembly Cell version 4.21 (CLC Inc., Qiagen, Aarhus, Denmark). Phylogenetic relationships were analyzed with the maximum-likelihood method using 61 conserved chloroplast protein sequences from 20 Fabaceae species (downloaded from GenBank; see Supplementary Figure 1) and V. bungei. The analysis was conducted in MEGA7 (Kumar et al., 2016) with 1,000 bootstrap replicates. The complete chloroplast genome of V. bungei was compared with five published chloroplast genomes (Vicia sativa, V. faba, Vicia sepium, Vicia ramuliflora, and Vicia costata) using the mVISTA program (Mayor et al., 2000).

Chloroplast Simple Sequence Repeat Detection and Primer Design

Simple sequence repeat mining was performed using the MIcroSAtellite identification tool (Thiel et al., 2003). The following search parameters were set for identification: mono-, di-, tri-, tetra-, penta-, and hexa-nucleotide motifs with a minimum of ten, five, four, three, two, and two repeats, respectively. The primers for SSR markers were designed using Primer 3.0 software.[1] The parameters for designing the primers were set as follows: a primer length of 18–22 bp, with 20 bp set as the optimum value; an optimum annealing temperature of 58°C; and a polymerase chain reaction (PCR) product size of 100–300 bp.

Chloroplast Simple Sequence Repeat Marker Validation and Data Analysis

A total of 39 developed CpSSR markers were randomly selected to assess the genetic diversity of Vicia species (20 accessions from seven species; Supplementary Table 1). The PCR mixture (total volume 40 μl) contained 20 ng genomic DNA, 10 pmol each primer, 2.5 mM MgCl2, 0.25 mM dNTPs, and 0.5 U Taq polymerase (Inclone, Deajeon, South Korea). Polymerase chain reaction amplification was performed under the following conditions: 94°C for 1 min; 30 cycles of 94°C for 30 s, 55°C for 30 s, and 72°C for 30 s; and a final extension at 72°C for 5 min. The size of PCR products was analyzed using the Fragment Analyzer (Advanced Analytical Technologies Inc., Ankeny, IA, United States), and allele sizes were scored using the PROSize 2.0 (Advanced Analytical Technologies). The number of alleles, the major allele frequency, the expected heterozygosity and polymorphic information content were calculated using the PowerMarker v3.25.[2] The expected heterozygosity formula is as follows: A closely related diversity measure is the polymorphism information content (PIC): Phylogenetic analysis of Vicia species (20 accessions from seven species) was performed using UPGMA cluster analysis, and unrooted tree construction was based on the CS chord 1967 distance method in PowerMarker v3.25 software.

Results

Genomic Characteristics and Phylogenetic Relationships With Fabaceae

The complete chloroplast genome of V. bungei was 130,796-bp long and lacked an inverted repeat unit (Figure 1). The overall GC content was 34.73%. A total of 107 genes were identified, including 75 protein-coding, 28 transfer RNA, and 4 ribosomal RNA genes (DNA-directed RNA polymerase genes) (Supplementary Table 2). Furthermore, 18 ribosomal subunit genes (ten small subunits and eight large subunits) were detected. Eleven genes, including petB, petD, atpF, ndhA, ndhB, rpl16, rpl2, rps12, rpoC1, clpP, and ycf3, contained one or two introns. Additionally, rps12 was identified as a trans-splicing gene.
FIGURE 1

Schematic map of the chloroplast genome of Vicia bungei.

Schematic map of the chloroplast genome of Vicia bungei. A total of 45 chloroplast genes were detected and involved in photosynthesis, and encoded subunits of NADH oxidoreductase (11 genes), subunits of photosystem I (seven genes), subunits of photosystem II (14 genes), subunits of the cytochrome b6/f complex (seven genes), different subunits of ATP synthase (seven genes), and the large chain of ribulose bisphosphate carboxylase (one gene). In addition, five genes were involved in different functions, which two of them remained unknown (Table 1). Vicia bungei chloroplast genome resembled to other Vicia species plastomes in the inverted repeat-lacking clade (IRLC), as it lacked rpl22, rps16, and one intron of clpP (Kim et al., 2016; Li et al., 2018; Xin and Yang, 2020). The complete chloroplast genome sequence, with gene annotations, was submitted to GenBank (accession number MT362055).
TABLE 1

Genes present in the Vicia bungei chloroplast genome.

RoleGene
Photosystem I psaA, psaB, psaC, psaI, psaJ, ycf3b, ycf4
Photosystem II psbB, psbC, psbD, psbE, psbF, psbH, psbI, psbJ, psbK, psbL, psbM, psbN, psbT, psbZ
Cytochrome b6/f petA, petBa, petDa, petG, petL, petN
ATP synthase atpA, atpB, atpE, atpFa, atpH, atpI
Rubisco rbcL
NADH oxidoreductase ndhAa, ndhBa, ndhC, ndhD, ndhE, ndhF, ndhG, ndhH, ndhI, ndhJ, ndhK
Large subunit ribosomal proteins rpl14, rpl16a, rpl2a, rpl20, rpl23, rpl32, rpl33, rpl36
Small subunit ribosomal proteins rps11, rps12a,c, rps15, rps18, rps19, rps2, rps3, rps4, rps7, rps8
RNA polymerase rpoA, rpoB, rpoC1a, rpoC2
Unknown function protein-coding gene ycf1, ycf2
Other gene accD, ccsA, cemA, clpPa, matK
Ribosomal RNAs rrn16, rrn23, rrn4.5, rrn5
Transfer RNAs trnA-UGCa, trnC-GCA, trnD-GUC, trnE-UUC, trnF-GAA, trnG-GCC, trnH-GUG, trnI-GAUa, trnK-UUUa, trnL-CAA, trnL-UAAa, trnL-UAG, trnM-CAU, trnN-GUU, trnP-UGG, trnQ-UUG, trnR-ACG, trnR-UCU, trnS-GCU,trnS-GGA, trnS-UGA, trnT-CGUa, trnT-GGU, trnT-UGU, trnV-UACa, trnW-CCA, trnY-GUA, trnfM-CAU

Genes present in the Vicia bungei chloroplast genome. Phylogenetic analysis for 21 Fabaceae species, based on 61 conserved plastid protein sequences, showed that V. bungei clustered with two species from the same genus, V. sepium and V. sativa, supported by high bootstrap values in the maximum-likelihood tree (Supplementary Figure 1).

Development of Chloroplast Simple Sequence Repeat Markers

A total of 432 simple sequence repeats (SSRs) were detected, which included 64 mononucleotides, 14 dinucleotides, 5 trinucleotides, 4 tetranucleotides, 233 pentanucleotides, 98 hexanucleotides, and 14 complex repeated motifs, in the chloroplast genome of V. bungei (Supplementary Figure 2). The majority of the identified SSRs were pentanucleotide repeats (53.9%), followed by hexanucleotides repeats (22.7%). A total of 432 potential SSR motifs were identified, 313 (73.45%) of which occurred within the intergenic regions (Supplementary Table 3). A total of 232 pairs of SSR primers were designed from 432 potential SSR motifs (Supplementary Table 4) that showed potential for marker development; these included mono-, di-, tetra-, penta-, and hexanucleotides, and complex repeated motifs. Pentanucleotides (54.31%) were the most abundant group within the selected SSR markers, followed by hexa- (25%), mono- (14.65%), di- (6%), and tetranucleotides (4%) and complex repeated nucleotide (4%) markers (Supplementary Table 4). The most common motif from the pentanucleotide markers was the GAATT/GAAAT (4.76%), followed by CAAAA/CATAA (3.97%), AAAGA/AATGA (3.97%), and TATAT/TATTT (3.17%) (Supplementary Table 4).

Genetic Diversity in Vicia Species

Of the 232 cpSSR markers described above, 39 were selected randomly, in order to evaluate their amplification potential and to assess genetic diversity in the genus Vicia (20 accessions from seven species). Amplification of all the selected cpSSR markers produced clear fragments; 35 fragments showed polymorphisms, and four were monomorphic. The major allele frequencies within the 39 SSR markers across 20 accessions (Vicia spp.) ranged from 0.20 (VBCP38 and VBCP42) to 1.0 (VBCP41, VBC54, VBC65, and VBC164), with a mean value of 0.62. The number of alleles per marker varied between 1 (VBCP41, VBC54, VBC65, and VBC164) and 16 (VBCP42), with a mean value of 3.92. The expected heterozygosity ranged from 0 (VBCP41, VBC54, VBC65, and VBC164) to 0.92 (VBCP39), with a mean value of 0.48. The polymorphic information content ranged from 0 (VBCP41, VBC54, VBC65, and VBC164) to 0.91 (VBCP39), with a mean value of 0.569 (Table 2). The most polymorphic loci (i.e., those exhibiting the highest diversity) were VBCP39 and VBCP42, based on their relatively large allele number (15 and 16, respectively).
TABLE 2

Summary of the 39 polymorphic chloroplast simple sequence repeat markers and their genetic diversity statistics across 20 Vicia spp. accessions.

MarkerLocationRepeat motifLeft sequenceRight sequenceMAFNAHEPIC
VBCP35rpoB(AT)nTATCAACGGGTCTTCCATCTTGCGAGCTATACTTGGGATTCAGG0.504.000.670.62
VBCP37rpoC2(TA)nATAACACAATCGGCAGATCCAATTCTGTAAACACCCGAAATGGA0.704.000.480.44
VBCP38trnK-UUU∼matK(TA)nCACGGCTTTCCCTATGTATACATGCAGAGGTTCCATAGAAATCG0.2011.000.890.87
VBCP39psbB∼petL(TA)nTCAGTGAATACAGACAATGGATACATCATCTGTGACATCTACGCAG0.1615.000.920.91
VBCP40ycf1(TC)nATAGAATGCTCTCCCAAGCTTCTTTCGAGATACTGGGCGACTA0.434.000.660.59
VBCP41psbB(AACC)nAAAGGAACATCCGCTCTAACAAACATCGGTAATAATCCGGCAAA1.001.000.000.00
VBCP42rpl32∼ndhF(ATTA)nTCCTCATCCTCGCTCTATAGATTCTTTGCACAATGGTCCCAATA0.2016.000.900.90
VBCP43ndhD(TATT)nATCAATGGCTTCTCTTGCATTGCGAAATAAATAATTCTCTGGGCCC0.802.000.320.27
VBCP44psaB(TTTC)nCGGTGTTTATCAGTGGTGGTATCAAGCTAAGGAACTGACTCCAA0.802.000.320.27
VBCP53rpoB∼trnC-GCA(AACAA)nTCATTCTTCATCGAATCACATGAACCCGAAGTCTAGGTGAAATTT0.753.000.410.37
VBCP54psbD(AACCC)nTTCATATGATGGGAGTTGCTGGGAGCACTCATCCATAAACCAGT1.001.000.000.00
VBCP56atpB(AACTT)nCCCAGGGAAATATGTTGGTCTACTTTCACTTCTGAATCCCAAACA0.852.000.260.22
VBCP63trnE-UUC∼trnT-GGU(AATGA)nAAGAATTGAGTTGAGGGACAGGACATAGCAACTCATTAACGAACA0.455.000.680.63
VBCP65rpoC2(AATGG)nTGAACTGTTATAACTTGACCCGATGGCAACTTGACAAATTAACTGA1.001.000.000.00
VBCP69psbM∼trnD-GUC(ACCAA)nGGATCTCGATGATATCAAATCGGAAGATCATTTCGAACAGGTATCCC0.603.000.560.49
VBCP75rpoC1(ATAAA)nCCCTACTGTTTCTCCATTAGGTTTTGGCTCTGGAACTGAATCAT0.802.000.320.27
VBCP79trnE-UUC∼trnT-GGU(ATGAT)nGAGATGTCCTAAACCGCTAGACAGATTGGTGATTGGAATGAACAA0.853.000.270.25
VBCP81atpH∼atpI(ATTCA)nTTTCGTTTCTACCCTTGTAGTTTACGGTATGGAACAAACACATGT0.404.000.710.65
VBCP85rbcL∼atpB(ATTTG)nCAAGAACAAGGTCTACTCGACATCACTGTCAAGGTCAAGAGTCT0.454.000.630.55
VBCP88rpoC2(ATTTT)nTTGGTGGAATAATGACGTTATGTTGGGAGAAGCTGTAGGGATTAT0.752.000.380.30
VBCP90rpoC2(CAAAA)nTTGACAACTTTGAGTTCCAGATTACATAGTGCCATCTTGATACCG0.704.000.480.44
VBCP98trnK-UUU∼matK(CATAA)nCACGGCTTTCCCTATGTATACATGCAGAGGTTCCATAGAAATCG0.256.000.820.79
VBCP99matK∼trnK-UUU(CATAA)nCCTCGCTTCTTCCTTCTCATTTCGATTAGTGCTTGCTGTGGAAA0.734.000.430.39
VBCP100petN∼psbM(CATTG)nCTGCTGGTTGTAGTCTGATCATTCGCATTTATAGCTACTGCACT0.602.000.480.36
VBCP106psbC(CTTAT)nACATGTATGGTTGGGTTCCATTAGTAAATGCTTGAGCTTGAGAAG0.852.000.260.22
VBCP108atpI∼rps2(CTTTT)nTGACCTACTTCCACAGCAGATATTTAGATTTGGTTGGGCGGG0.803.000.340.30
VBCP109rpoC2(CTTTT)nCGTTCTTGAATCGATTGGAATGGACTTCGCAAGGATCAAGATCAA0.454.000.630.55
VBCP112trnT-GGU∼psbD(GAAAT)nTCCTTTCATTGTCAGATACTCCTAGATTCTTGCAGAGTGAGAACC0.563.000.590.52
VBCP116rpoC1∼rpoC1(GAATG)nAATTGACCATAGACCCATTCCCCCTAGTTATATCGCGAGCCTTT0.852.000.260.22
VBCP122atpF(GCACT)nTTAGTAAGAAGTCATTCGCCGGCTATCCATAAGAGGAGATGCGC0.552.000.500.37
VBCP123matK(GGATA)nCCAATTACAAAGAAACAGCCGTTCTTCCTTAGAGGAGGCAGAAA0.902.000.180.16
VBCP127trnD-GUC∼trnY-GUA(GTATA)nGACTCGAACCCGCAACTTCCCGAGTCATCCGTGTCGATAAAG0.505.000.690.65
VBCP131trnY-GUA∼trnE-UUC(TACCC)nATTGCCAACGAATTTACAGTCCCATAGTAGAATGGAAGTCGGGC0.802.000.320.27
VBCP144rpoC1∼rpoC1(TCTAA)nTCCTCTCATCCGGCTAAAGTATTTTCTGTCGTAATTTCGAATTGCA0.553.000.570.48
VBCP149rpoC2(TGATT)nGGGACATTAGTTCGTTCTTTCGACCATGGATTCACTTTCTAATGGA0.355.000.770.73
VBCP155petN∼psbM(TTATT)nGGGAAGAAGTGGACTCTAAAGGGGCAACAATTTCAATATTTGTGTG0.255.000.770.73
VBCP156trnE-UUC∼trnT-GGU(TTCAA)nAAGAATTGAGTTGAGGGACAGGACATAGCAACTCATTAACGAACA0.307.000.790.76
VBCP162trnE-UUC∼trnT-GGU(TTGAG)nTTGTATTTCACACTAAGTCGGAAAACCGATTTGAATTGAAGTCATCT0.652.000.460.35
VBCP164atpE∼trnT-CGU(TTTAG)nTAGGACACGAGTAGAGGCTATCGTTCACATGTTTCGTAAAGGGC1.001.000.000.00
Mean0.623.920.480.43
Summary of the 39 polymorphic chloroplast simple sequence repeat markers and their genetic diversity statistics across 20 Vicia spp. accessions. These 39 cpSSR markers were used to further analyze the genetic diversity of the 20 Vicia spp. accessions. The unweighted pair group method with arithmetic mean (UPGMA) cluster phylogenetic analysis clearly distinguished the accessions by genotype and grouped them into seven major clusters that corresponded with the different species (Figure 2). Targeted analysis of cpSSR regions in V. bungei identified a unique chloroplast type for each of the seven species examined (group A: Vicia dasycarpa; group B: Vicia hirsuta; group C: Vicia narbonensis; group D: Vicia angustifolia var. segetilis; group E: V. bungei; group F: Vicia linearifolia; group G: Vicia chosenensis). The UPGMA cluster analysis also revealed intraspecific variation between the 20 Vicia spp. accessions.
FIGURE 2

Phylogenetic relationships between 20 Vicia spp. accessions. The phylogenetic tree was constructed with data from 39 chloroplast simple sequence repeat markers using the unweighted pair group method with arithmetic mean method. Letters A–E on the right-hand side represent each plastome group based on the plastome haplotypes.

Phylogenetic relationships between 20 Vicia spp. accessions. The phylogenetic tree was constructed with data from 39 chloroplast simple sequence repeat markers using the unweighted pair group method with arithmetic mean method. Letters A–E on the right-hand side represent each plastome group based on the plastome haplotypes.

Discussion

Most chloroplast genomes have a circular structure that contains a large single-copy region, an inverted repeat A region, a small single-copy region, and an inverted repeat B region (Raveendar et al., 2015; Zhu et al., 2016). The chloroplast genomes of some legumes, including V. sepium, have lost one of the two inverted repeats, and these species form the IRLC (Li et al., 2018). The chloroplast genome of V. bungei indicates that this economically important crop is also a member of the IRLC, suggesting that it may be part of the same evolutionary clade as V. sepium and V. sativa, which may share a similar evolutionary history. A chloroplast gene is rarely lost arbitrarily. Instead, the gene is either transferred to the nuclear genome, or its function is replaced by a nuclear gene (Wang et al., 2018). Recent sequencing and analyses of some IRLC plastomes (Kim et al., 2005; Li et al., 2018; Moghaddam et al., 2022) have revealed important evolutionary patterns in this clade, including loss of the genes rpl22 and rps16, the deletion of one intron of clpP (Jansen et al., 2008; Li et al., 2020), multiple sequence inversions (Schwarz et al., 2015; Williams et al., 2015), and gene transfers to the nucleus (Moghaddam and Kazempour-Osaloo, 2020; Wu et al., 2021). Vicia bungei plastome lacked one intron of clpP, which was consistent with the finding of Jansen et al. (2008) and confirmed the parallel loss of this clpP intron in V. bungei and in members of the papilionoid IRLC. Recently, Williams et al. (2015) demonstrated that the rates of synonymous and non-synonymous mutations are accelerated in the clpP sequence of Acacia, suggesting there may be a functional nuclear-encoded copy of this gene in at least some mimosoid legumes. The mechanism by which rpl22, rps16, and one intron of clpP gene have been lost from V. bungei requires further in-depth research. Although the structure of plastid genomes is largely conserved across land plants (Wicke et al., 2011), other exceptions to this pattern, beyond these IRLC legumes, include Geraniaceae (Guisinger et al., 2011) and Campanulaceae (Haberle et al., 2008). Some Fabaceae plastomes have been highly rearranged owing to multiple rounds of translocations and/or inversions. As a result, the plastomes of the IRLC have undergone considerable diversification in both gene order and gene/intron content (Wang et al., 2018; Li et al., 2020; Xin and Yang, 2020; Moghaddam et al., 2022). The plastome of V. bungei most closely resembled those of V. sepium and V. sativa. This finding is consistent with previous research that indicated similar evolutionary evaluation analysis of V. sepium and V. sativa (Li et al., 2020). Plastome differences between some major clades provide valuable information for resolving phylogenetic relationships based on DNA sequence analyses. The study of genomic variation and phylogeny of Fabaceae species provides an increased understanding of general chloroplast genome evolution. In our results of multiple alignments among six Vicia species cp genomes, we observed 11 major variant regions of V. bungei in cp genome (Supplementary Figure 3). However, our research focus on the development of cpSSR markers for Vicia species identification. Thus, these variant regions would be very useful for further evolutionary studies as well as cpSSR marker design. In other flowering plant chloroplast genomes, the most common SSR motifs are mono-, di-, and trinucleotide repeats (George et al., 2015). Nevertheless, data mining, under the recursive criteria adopted here, revealed that the majority of SSRs present in the V. bungei chloroplast genome sequences contained relatively long penta- (53.9%) or hexanucleotide (22.7%) motifs. In this study, a total of 432 potential SSR were found in the chloroplast genome of V. bungei. The number of repeat motifs was richer in V. bungei chloroplast genome compared with the V. sepium chloroplast genome (Li et al., 2020). These results provide a firm foundation for further investigations of chloroplast genome evolution in Vicia L. and other IRLC legumes such as Medicago and Pisum and species. Raveendar et al. (2015) used SSR markers to detect genetic diversity in V. sativa and 22 other Vicia species. Li et al. (2018) used chloroplast genome sequences to determine the genetic diversity in V. sepium and 21 closely related Fabaceae species. Han et al. (2021) used barcoding loci (ITS2, matK, and rbcL) as DNA markers to differentiate 19 Vicia taxa. Previous studies of phylogenetic relationships among species of the subgenus Vicia did not detect any intraspecific variation when cDNA SSR, cpSSR, or DNA barcoding markers were applied to representative plants of each species. By contrast, our UPGMA cluster analysis classified seven clusters at the interspecies level and revealed intraspecific differences within clusters. These results suggest that the 39 cpSSR markers developed in V. bungei differentiated efficiently between Vicia species genotypes and also provide estimates of their genetic diversity. These cpSSR markers will thus be useful, not only for authenticating Vicia species but also for providing the baseline data essential for advancing systematic breeding in the field and the development of conservation strategies, as well as for guiding the collection of germplasm.

Conclusion

Complete chloroplast genomes have helped to reveal intraspecies relationships, but also allow to measure divergence within interspecies. Growing genomic resources for Vicia spp. provide tools to extend our knowledge on this critically important forage crop species. To develop cpSSR markers that can be utilized to classify Vicia species and analyze the genetic diversity of related species, potential cpSSR motifs were mined from the chloroplast genome of V. bungei, and finally, 39 cpSSR markers were developed. The UPGMA cluster analysis detected intra-and interspecific variation between 20 accessions with 39 cpSSR markers to distinguish Vicia species. The chloroplast genome and cpSSR markers found in this study would provide useful information for genetic diversity analyses in Vicia species.

Data Availability Statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/Supplementary Material.

Author Contributions

I-HJ and SH performed the experiments, analyzed the data, prepared figures and tables, and wrote the manuscript. DS, HR, TH, YL, DK, Y-SS, and J-WC reviewed drafts of the manuscript. All authors have read and approved the final manuscript.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
  25 in total

Review 1.  Comparative organization of chloroplast, mitochondrial and nuclear diversity in plant populations.

Authors:  Rémy J Petit; Jérôme Duminil; Silvia Fineschi; Arndt Hampe; Daniela Salvini; Giovanni G Vendramin
Journal:  Mol Ecol       Date:  2005-03       Impact factor: 6.185

2.  Comparative analysis of microsatellites in chloroplast genomes of lower and higher plants.

Authors:  Biju George; Bhavin S Bhatt; Mayur Awasthi; Binu George; Achuit K Singh
Journal:  Curr Genet       Date:  2015-05-22       Impact factor: 3.886

3.  VISTA : visualizing global DNA sequence alignments of arbitrary length.

Authors:  C Mayor; M Brudno; J R Schwartz; A Poliakov; E M Rubin; K A Frazer; L S Pachter; I Dubchak
Journal:  Bioinformatics       Date:  2000-11       Impact factor: 6.937

4.  Exploiting EST databases for the development and characterization of gene-derived SSR-markers in barley (Hordeum vulgare L.).

Authors:  T Thiel; W Michalek; R K Varshney; A Graner
Journal:  Theor Appl Genet       Date:  2002-09-14       Impact factor: 5.699

5.  Polymorphic simple sequence repeat regions in chloroplast genomes: applications to the population genetics of pines.

Authors:  W Powell; M Morgante; R McDevitt; G G Vendramin; J A Rafalski
Journal:  Proc Natl Acad Sci U S A       Date:  1995-08-15       Impact factor: 11.205

6.  Extensive rearrangements in the chloroplast genome of Trachelium caeruleum are associated with repeats and tRNA genes.

Authors:  Rosemarie C Haberle; H Matthew Fourcade; Jeffrey L Boore; Robert K Jansen
Journal:  J Mol Evol       Date:  2008-03-11       Impact factor: 2.395

7.  Complete plastid genome sequence of the chickpea (Cicer arietinum) and the phylogenetic distribution of rps12 and clpP intron losses among legumes (Leguminosae).

Authors:  Robert K Jansen; Martin F Wojciechowski; Elumalai Sanniyasi; Seung-Bum Lee; Henry Daniell
Journal:  Mol Phylogenet Evol       Date:  2008-06-27       Impact factor: 4.286

8.  The complete chloroplast genome of an inverted-repeat-lacking species, Vicia sepium, and its phylogeny.

Authors:  Chaoyang Li; Yunlin Zhao; Huimin Huang; Yi Ding; Yinhui Hu; Zhenggang Xu
Journal:  Mitochondrial DNA B Resour       Date:  2018-01-24       Impact factor: 0.658

9.  The complete chloroplast genome of Onobrychis gaubae (Fabaceae-Papilionoideae): comparative analysis with related IR-lacking clade species.

Authors:  Mahtab Moghaddam; Atsushi Ohta; Motoki Shimizu; Ryohei Terauchi; Shahrokh Kazempour-Osaloo
Journal:  BMC Plant Biol       Date:  2022-02-19       Impact factor: 4.215

10.  Plastid Genome Evolution in the Early-Diverging Legume Subfamily Cercidoideae (Fabaceae).

Authors:  Yin-Huan Wang; Susann Wicke; Hong Wang; Jian-Jun Jin; Si-Yun Chen; Shu-Dong Zhang; De-Zhu Li; Ting-Shuang Yi
Journal:  Front Plant Sci       Date:  2018-02-08       Impact factor: 5.753

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.