Literature DB >> 29725599

Development of Chloroplast Genomic Resources in Chinese Yam (Dioscorea polystachya).

Junling Cao1,2,3, Dan Jiang4, Zhenyu Zhao1, Subo Yuan5, Yujun Zhang4, Teng Zhang1, Wenhao Zhong1, Qingjun Yuan1, Luqi Huang1.   

Abstract

Chinese yam has been used both as a food and in traditional herbal medicine. Developing more effective genetic markers in this species is necessary to assess its genetic diversity and perform cultivar identification. In this study, new chloroplast genomic resources were developed using whole chloroplast genomes from six genotypes originating from different geographical locations. The Dioscorea polystachya chloroplast genome is a circular molecule consisting of two single-copy regions separated by a pair of inverted repeats. Comparative analyses of six D. polystachya chloroplast genomes revealed 141 single nucleotide polymorphisms (SNPs). Seventy simple sequence repeats (SSRs) were found in the six genotypes, including 24 polymorphic SSRs. Forty-three common indels and five small inversions were detected. Phylogenetic analysis based on the complete chloroplast genome provided the best resolution among the genotypes. Our evaluation of chloroplast genome resources among these genotypes led us to consider the complete chloroplast genome sequence of D. polystachya as a source of reliable and valuable molecular markers for revealing biogeographical structure and the extent of genetic variation in wild populations and for identifying different cultivars.

Entities:  

Mesh:

Substances:

Year:  2018        PMID: 29725599      PMCID: PMC5872661          DOI: 10.1155/2018/6293847

Source DB:  PubMed          Journal:  Biomed Res Int            Impact factor:   3.411


1. Introduction

Chinese yam (Dioscorea polystachya Turcz.) belongs to section Enantiophyllum in genus Dioscorea, which also includes economically important food yams of tropical origin such as D. alata (water yam) and D. rotundata (white guinea yam) [1]. It is allogamous with fleshy tuber, branched stems, papery to thinly leathery leaves, and its seeds are inserted near middle of capsule and winged all round [2]. Chinese yam originated in China and was domesticated in the Song Dynasty, dating back approximately 1000 years [3]. It has been used as a dietary food and as a traditional medicine for strengthening stomach function, alleviating anorexia, and treating diarrhea [4]. Nowadays, there are mainly 80 cultivars on the Chinese market [5]. For a long time, cultivated yams mainly rely on clonally propagated using vegetative propagation of tubers, which led to serious degradation [3]. Its production systems face the problem that the cultivars have the limited diversity during long-term vegetative reproduction [6]. Detailed analysis of the genetic diversity in this species is important, because an accurate assessment of the genetic structure and diversity of cultivated and wild yams can be invaluable in crop breeding for diverse applications [7]. For example, analysis of the genetic variability among cultivated and wild yams can facilitate understanding of the process of domestication followed by Chinese farmers to generate agricultural biodiversity. However, there is lack of adequate information on the diversity evaluation of Chinese yam. Providing the potential conservation approaches for sustainable use, thereby saving the genetic diversity of this species in nature, is important. Molecular resources have recently been developed in Chinese yam. For example, random-amplified polymorphic DNA (RAPD), inter-simple sequence repeats (ISSR), intron sequence amplified polymorphism (ISAP), and sequence characterized amplified region (SCAR) markers have been used to examine the genetic relationships among different cultivars and identify the most popular cultivar [3, 8–10]. However, these markers have low diversity, stability, and reproducibility. The development of more effective genetic markers will be necessary to assess genetic diversity and identify cultivars. Recently, the chloroplast genome has been developed with the availability of the next-generation sequencing [11]. The chloroplast genomes of higher plants harbor approximately 130 genes in a 120–160 kb sequence [12]. Chloroplast genomes usually have a circular structure consisting of two copies of the large inverted repeat (IR) region separated by small single-copy (SSC) and large single-copy (LSC) regions and exhibit highly conserved gene content and order [13]. The nucleotide substitution rate of chloroplast genes is lower than that of nuclear genes but higher than that of mitochondrial genes [14, 15]. Most protein-coding genes (83 or 81 genes) have been used for phylogenetic analyses and have proven to be effective in resolving difficult phylogenetic relationships [16-18]. Noncoding regions are most likely to evolve faster than coding regions in the chloroplast genome, and, therefore, these mutation “hot spots” have been used to identify species and clarify relationships at lower taxonomic levels [19-23]. Chloroplast genomes are typically uniparentally inherited, which may greatly facilitate the use of chloroplast genome markers in plant population genetic studies [24]. Chloroplast genome markers, such as single nucleotide polymorphisms (SNPs) and simple sequence repeats (SSRs), have been used to monitor gene flow, population differentiation, and cytoplasmic diversity [25-28]. These chloroplast genome markers can also be applied to investigate domestication processes, such as the evolutionary history of Scutellaria baicalensis [29]. Another application of chloroplast genome markers is phylogeographical analysis, because the uniparental inheritance shows a clearer geographical structure than nuclear markers do [30]. The cultivars yam also is clonally propagated. Herein, we sequenced six wild D. polystachya genotypes from different geographical locations using the Illumina HiSeq platform. The first objective was to evaluate the intraspecific variation in this species, and the second objective was to obtain useful chloroplast molecular markers, including SNPs, SSRs, and indels, for evolutionary studies by comparing the chloroplast genomes. The genomic and marker resources developed in this study will not only reveal biogeographical structure and extensive population genetic variation in the wild populations of D. polystachya but also provide a molecular toolkit for cultivar identification.

2. Materials and Methods

2.1. Plant Materials, DNA Extraction, and Sequencing

In total, six genotypes of D. polystachya were used (Table 1). Chinese yam was obtained from Hebei, Shandong, Henan, Beijing, Jiangsu, and Fujian, China, to represent the geographical distribution of this species. Voucher specimens were deposited in herbaria of the Institute of Chinese Materia Medica (CMMI), China Academy of Chinese Medical Sciences. Fresh leaves of each accession were immediately dried with silica gel prior to DNA extraction. Total genomic DNA was isolated from each individual plant using the mCTAB extraction protocol [31] and purified using the Wizard DNA Cleanup System (Promega, Madison, WI, USA). DNA samples were randomly fragmented into 400–600 bp lengths using an ultrasonicator. An Illumina paired-end DNA library with 500 bp insert size was constructed using a NEBNext® Ultra™ DNA Library Prep Kit following the manufacturer's instructions. Paired-end sequencing (2 × 150 bp) was conducted on an Illumina HiSeq X Ten platform.
Table 1

Genes identified in the chloroplast genome of D. polystachya.

GenotypeFLWTSWYTWXSWNJWMHW
LocalityShijiazhuang, Hebei, ChinaTai'an, Shandong, ChinaJiaozuo, Henan, ChinaXiangshan, Beijing, ChinaNanjing, Jiangsu, ChinaMinhou, Fujian, China
Raw data no.70,997,84047,638,57461,275,83664,254,66463,759,00862,610,816
Mapped read no.1,076,604904,0745,925,9161,396,336876,4721,119,774
Percentage of chloroplast genome reads (%)1.52%1.90%9.67%2.17%1.37%1.79%
Chloroplast genome coverage (X)1,0548855,7991,3678581,096
Accession number in GenBankMG267375MG267376MG267379MG267377MG267380MG267378
Size (bp)153,255153,255153,292153,257153,281153,243
LSC (bp)83,45683,45683,49283,45883,48483,431
SSC (bp)18,82118,82118,81618,82118,81518,834
IRs (bp)25,48925,48925,49225,48925,49125,489

2.2. Assembly and Annotation

The paired-end reads were qualitatively assessed and assembled using SPAdes 3.6.1 [32]. Chloroplast genome sequence contigs were selected from the initial assembly by performing a BLAST search using the Dioscorea elephantipes chloroplast genome sequence as a reference (GenBank accession number: EF380353). The selected contigs were assembled with Sequencher 5.4.5 (http://www.genecodes.com/). Gaps in the contigs were filled by PCR amplification and Sanger sequencing. The four junctions between the IRs and the SSC/LSC regions were checked by amplification with specific primers followed by Sanger sequencing [33]. The chloroplast genome annotation was performed with Plann [34] using the D. elephantipes reference sequence from GenBank. The chloroplast genome map was drawn using Genome Vx software [35].

2.3. Molecular Marker Development and Validation

All sequenced D. polystachya chloroplast genomes were aligned using MAFFT v7 [36], assuming collinear genomes for the full alignment, and then adjusted manually using Se-Al 2.0 [37]. Variable and parsimony-informative base sites across the complete chloroplast genomes were calculated using MEGA 6.0 software [38]. The chloroplast genome sequences were analyzed to identify potential microsatellites (simple sequence repeats) using MISA software (http://pgrc.ipk-gatersleben.de/misa/). The minimum numbers (thresholds) for the SSR motifs were 10, 5, 4, 3, 3, and 3 for mono-, di-, tri-, tetra-, penta-, and hexanucleotide repeats, respectively. All of the repeats found were manually verified, and redundant results were removed. Based on the aligned sequence matrix, the microstructural events were checked manually and were further divided into three categories: (i) SSR, (ii) non-SSR-related indels (common indels), and (iii) inverted sequences. Using the XSW genotype genome sequence as the standard reference, the size, location, and evolutionary direction of the microstructural events were counted. The proposed secondary structures of the inverted regions were analyzed using mfold software [39].

2.4. Phylogenetic Reconstruction

Phylogenetic analysis was conducted using the chloroplast genome sequences of six genotypes of D. polystachya and four other Dioscorea species with available chloroplast genome sequences from GenBank (D. nipponica, D. villosa, D. zingiberensis, and D. elephantipes). Tacca chantrieri was used as an outgroup. Sequence alignments were carried out using MAFFT v7 [36] and then adjusted manually using Se-Al 2.0 [37]. We performed independent phylogenetic analyses using Bayesian inference (BI) and maximum likelihood (ML). RAxML version 8.0.20 was used for ML analyses with the GTR + G model. Node support values were determined with 500 rapid bootstrapping replicates. MrBayes 3.2.2 [40] was used to perform a BI analysis. The Markov chain Monte Carlo (MCMC) analysis was run for 2 × 5,000,000 generations. The average standard deviation of split frequencies remained below 0.01 after the fifty percent burn-in. The remaining trees were used to build a 50% majority-rule consensus tree.

3. Results

3.1. Chloroplast Genome Sequencing, Characterization, and Annotation

Using the Illumina HiSeq X Ten system, the total DNA from six genotypes of D. polystachya was sequenced to produce 47,638,574–70,997,840 paired-end raw reads (150 bp average read length) per genotype. After de novo and reference-guided assembly, the finished, high-quality chloroplast genome sequences of these six genotypes of D. polystachya were obtained. The chloroplast genome sequences were deposited in GenBank (Table 1). The D. polystachya chloroplast genomes ranged from 153,243 to 153,292 base pairs in length. The chloroplast genome can be divided into four regions: a pair of IR regions, a LSC region, and a SSC region. The overall GC content of the chloroplast genome was 37%, which is consistent with those of previously reported Dioscorea species [41]. The GC contents of the LSC and SSC regions were 34.8% and 30.9%, respectively, while that of the IR region was 42.9% (Table 1). A total of 112 unique genes were identified in the D. polystachya chloroplast genome, including 79 protein-coding genes, 29 tRNA genes, and 4 ribosomal RNA genes (Figure 1 and Table 2). A total of 62 protein-coding and 22 tRNA genes were located in the LSC region, while 12 protein-coding genes and one tRNA gene were located in the SSC region. All the rRNA genes were located in the IR region, along with six protein-coding (ndhB, rpl23, rps7, rps12, ycf2, and rpl2) genes and eight tRNA (trnA-UGC, trnH-GUG, trnI-CAU, trnI-GAU, trnL-CAA, trnN-GUU, trnR-ACG, and trnV-GAC) genes.
Figure 1

Map of the Dioscorea polystachya chloroplast genome. The genes inside and outside of the circle are transcribed in the clockwise and counterclockwise directions, respectively. Genes belonging to different functional groups are shown in different colors. Thick lines indicate the extent of the inverted repeats (IRa and IRb) that separate the small single-copy (SSC) and large single-copy (LSC) regions of the genome.

Table 2

Details of the complete chloroplast genomes of six D. polystachya genotypes.

Category for genesGroup of gene Name of gene
Photosynthesis related genesRubisco rbcL
Photosystem I psaA, psaB, psaC, psaI, psaJ
Assembly/stability of photosystem Iycf3, ycf4
Photosystem II psbA, psbB, psbC, psbD, psbE, psbF, psbH, psbI, psbJ, psbK, psbL, psbM, psbN, psbT, psbZ
ATP synthase atpA, atpB, atpE, atpF, atpH, atpI
cytochrome b/f complex petA, petB, petD, petG, petL, petN
Cytochrome c synthesis ccsA
NADPH dehydrogenasendhA, ndhB, ndhC, ndhD, ndhE, ndhF, ndhG, ndhH, ndhI, ndhJ, ndhK

Transcription and translation related genesTranscription rpoA, rpoB, rpoC1, rpoC2
Ribosomal proteins rps2, rps3, rps4, rps7, rps8, rps11, rps12, rps14, rps15, rps16, rps18, rps19,rpl2, rpl14, rpl16, rpl20, rpl22, rpl23, rpl32, rpl33, rpl36
Translation initiation factor infA

RNA genesRibosomal RNA rrn5, rrn4.5, rrn16, rrn23
Transfer RNAtrnAUGC, trnCGCA, trnDGUC, trnEUUC, trnFGAA, trnGGCC, trnGUCC, trnHGUG, trnICAU, trnIGAU,trnKUUU, trnLCAA, trnLUAA, trnLUAG, trnfMCAUI, trnMCAU, trnNGUU, trnPUGG, trnQUUG, trnRACG, trnRUCU, trnSGCU, trnSGGA, trnSUGA, trnTGGU, trnTUGU, trnVGAC, trnVUAC, trnWCCA, trnYGUA

Other genesRNA processing matK
Carbon metabolism cemA
Fatty acid synthesis accD
ProteolysisclpP

Genes of unknown functionConserved reading frames ycf1, ycf2

Intron-containing genes are marked by asterisks (∗).

The D. polystachya chloroplast genome contained 18 intron-containing genes. Among them, sixteen genes had a single intron (ten protein-coding and six tRNA genes) and two genes (clpP and ycf3) contained two introns. The trnK-UUU gene had the largest intron, which contained the matK gene. The rps12 gene was trans-spliced, with the 5′ end located in the LSC region and the duplicated 3′ end in the IR region.

3.2. Numbers and Pattern of SNP Mutations

The length of the alignment of the six chloroplast genomes was 153,497 bp. In total, 141 SNPs were detected, 84 of which were found in the LSC region, 7 in the IR region, and 43 in the SSC region (Table S1). A total of 134 of these SNPs were found in the IRs, 54 of which were in intergenic spacers, 70 in coding region, and 10 in intron regions. Twenty coding regions harbored SNPs; ycf1 had the highest number of SNPs (19), followed by rpoC2 (five), and rpoB (five). Five intron regions harbored SNPs (four in atpF, two in trnG and rpoC1, and one in trnV and rps16). The pattern of SNP mutations is shown in Figure 2. A total of 88 transitions (Ts) and 53 transversions (Tv) were present, and the Tv to Ts ratio was 1 : 0.6, indicating a bias in favor of transitions. The most frequently occurring SNP mutations were from C to T and from G to A; mutations from C to G and from G to C exhibited the lowest frequency.
Figure 2

The patterns of nucleotide substitution among the six D. polystachya chloroplast genomes. The patterns were divided into six types as indicated by the six non-strand-specific base-substitution types (i.e., numbers of considered G to A and C to T sites for each respective set of associated mutation types).

3.3. Microsatellites

With MISA analysis, 66 SSR loci were detected in the D. polystachya chloroplast genome. These SSRs included 37 mononucleotide motifs, which ranged in length from 10 to 16 nucleotides, and 11 dinucleotide, 7 trinucleotide, 4 tetranucleotide, and 7 pentanucleotide SSRs (Figure 3). Among the 48 mononucleotide and dinucleotide SSRs, 46 contained only A or T. Most SSRs were located in the noncoding portions of the LSC and SSC regions. After in silico comparative analysis, twenty-four SSR loci showed polymorphisms among the six genotypes of D. polystachya (Table 3). The clpP intron had the highest number of polymorphic SSRs (three), followed by matK-trnK and psbE-petL with two polymorphic SSRs. The other fifteen spacers and the rpl16 intron contained only one polymorphic SSR each (Table S2). We designed primer pairs to amplify those SSRs and the other 42 SSR loci (Table S3).
Figure 3

Analyses of simple sequence repeats (SSRs) in the D. polystachya chloroplast genomes. (a) Number of different SSR types detected by MISA. (b) Number of SSRs in the LSC, SSC, and IR regions. (c) Number of SSRs in spacers, exons, and introns. (d) Frequency of identified SSR motifs in the different repeat classes.

Table 3

SSRs identified from in silico comparative analysis of the chloroplast genomes of six D. polystachya genotypes.

PositionRegionLocationSSR typeForward sequenceReverse sequencePRODUCT1 size (bp)
matK-trnK LSCspace(TATAT)3CCGAGGACAAGGAATCCAATCAAGGTTCTCCTGAGAGTGAACCA270
matK-trnK LSCspace(A)10CCGAATTGGGCCATAAGACTCTACCATGACTGATCCTGAAAGGT223
atpA-atpF LSCspace(A)12TGCCATCACTTCATCAAGACCACCTCGGAGCCATGGAAGAAATA253
atpH-atpI LSCspace(A)10TACAGCCAATCCAGCAGCAATATGAGTTACTTCTCCACCCGATG161
rps2-rpoC2 LSCspace(A)15ACCAAATCAATGATCGGACCAATAGTGCACCGTTCAAGACAAGA255
rpoB-trnC LSCspace(A)11AGACAGAATAATTGGGGGTAGGAACCCCATCTATGTTTAGGTTGCT273
petN-psbM LSCspace(T)12TGGACCAGTTCTTAACAGAATAATGGGACATATGGCCGTCGAAAGAA138
trnE-trnT LSCspace(T)11CGATGTCGGATTGGTACACGTAGCATATGCACTCATTCAAGGACA183
trnS-psbZ LSCspace(TA)6TTCAAGACCGGAGCTATCAACCGCATGTGGTCGAGGAGAGTTTA232
trnF-ndhJ LSCspace(TA)7GCTCCCTCTTTCTCCTTTGTTCTACCGCGCACATCACTTAGAT280
petA-psbJ LSCspace(A)12CTTGGCATCTGTGATTTTGGCATGTTCCTTTCATTTATCCCGTCA221
psbE-petL LSCspace(A)11CCAAGCTTTACTGTACCGAATCCTGTGTGTGTCGTGTAGCTTGAT215
psbE-petL LSCspace(A)10ATCAAGCTACACGACACACACAAGCAGCCAACAGAAAACCAAAA199
clpP LSCintron 1(T)10CACCCTTCCTTTCGTTGGAGTAATCGGGAGTACATTTCAGCGTC213
clpP LSCintron 2(T)11CACCTTTGGATGCATACGGTTCTATAGTATAGGGCGGGGTCCAA163
clpP LSCintron 2(T)12CCGGGTAAAGATCTGTCCGAATAGCGTGAAGTGCAATTAGATCA276
rps11-rpl36 LSCspace(T)12ACCAATACGTCCATTCCTACGGTAGGCGTGGACGAATTATGGTG238
rps8-rpl14 LSCspace(T)10TCCCTACCCATGACGAACTAGAACTCGAGTTTTTGGTGCGATTC259
rpl16 LSCintron(T)10GCTCCTCGCGAATGAAATGATTGCTCGCGAAACCCTTGTTTATT275
rpl16-rps3 LSCspace(T)12CGAGTCACACACTAAGCATAGCGTTCCCCTACAAACAATTCGCG279
rps12-trnV IRspace(ATA)4g(TAA)6TGGTTCTGCTTCCCCTCTTTTTGCAAAGGGTCGAGAAACTCAAC274
ycf1-rps15 SSCspace(T)16CCATTCAACTGGATCTAGGAGGATGTGGATTTTACCGATCGGGAA241
rpl32-ndhF SSCspace(T)10TATCTATACTTATTGCACCAATAACCAAGTATTAACCAGTGTTAA176

3.4. Indels

The indels involving SSR polymorphisms were filtered out of these analyses. We retrieved 44 common indels from the D. polystachya chloroplast genomes (Table S4). No indels were found in the coding regions. A total of 27 spacer regions harbored indels; the psbM-trnD and rbcL-accD spacer had the highest number of indels (three), followed by trnK-trnQ, psbI-trnS, trnS-trnG, petN-psbM, trnT-psbD, trnF-ndhJ, psbE-petL, and trnL-rpl32, all containing two indels. The other spacer regions contained only one indel (Table S4). Five indels were located in intronic regions, including the atpF (two indels) and clpP (three indels) introns. The sizes of the indels ranged from 1 to 28 bp, with one bp indels being the most common (Figure 4). The largest indel, found in the atpF intron with a 28 bp length, was a deletion in the MHW genotype. The second longest, which was found in rbcL-accD, was an insertion in the YTW genotype. Finally, 13 insertion and 9 deletion indels were specific to the NJW genotype, 12 insertion and 5 deletion indels to YTW, one insertion in the psbZ-trnG region to XSW, and one insertion in trnL-rpl32 region and one deletion in atpF intron to MHW. Two deletions in petN-psbM and psbM-trnD independently occurred in the YTW and NJW genotypes.
Figure 4

Indels identified in the chloroplast genomes of D. polystachya. (a) Numbers of individual indels shown by sequence length. (b) Relative frequency of indel occurrence in the LSC, SSC, and IR regions.

3.5. Small Inversions

Five small inversions of 2 to 51 bp were identified in the D. polystachya chloroplast genomes. All of the inversions and their flanking inverted repeat sequences could form stem-loop structures. The flanking repeats were from 6 to 22 bp in length (Table 4). Two inversions occurred in the LSC region and three in the SSC region. Inversions in the trnK-matK spacer, ndhA intron, and ndhD occurred in the NJW genotype, while an inversion in ccsA-trnL occurred in YTW. An inversion in the trnL intron occurred in the YTW and MHW genotypes.
Table 4

The locations, directions, and lengths of five small inversions.

LocationRegionLength of inversions in cpg (bp)Direction of the small inversions
Length of inversion Length of inverted repeatFLWTSWYTWXSWNJWMHW
trnK-matK LSC5113----Inverted-
trnL intronLSC422--Inverted--Inverted
ndhA intronSSC214----Inverted-
ndhD SSC26----Inverted-
ccsA-trnL SSC320--Inverted---

3.6. Phylogenetic Analysis

The phylogenetic position of D. polystachya within the genus Dioscorea was established using complete chloroplast genomes (Figure 5). The chloroplast genome of Tacca chantrieri was used as the outgroup. The ML and BI trees reconstructed were congruent, and both phylogenetic trees had high support. The six Dioscorea species were grouped into two branches with 100% bootstrap support, and the NJW genotype was the earliest diverging lineage in D. polystachya. The XSW, TSW, and FLW genotypes formed a monophyletic clade.
Figure 5

Phylogenetic relationships among Dioscorea species constructed from complete chloroplast genome sequences using maximum likelihood (ML) and Bayesian inference (BI). The ML topology is shown, with the ML bootstrap support value/Bayesian posterior probability given for each node.

4. Discussion

In this study, we obtained the chloroplast genomes of six D. polystachya genotypes using NGS methods, which provided important resources for the discovery of molecular markers. Understanding the genetic relationship of D. polystachya is vital to breeding programs and conservation strategies. The D. polystachya chloroplast genome exhibited a typical circular structure and was similar in genome size and GC content to the other published Dioscorea chloroplast genomes [41]. Using these chloroplast genome data, we were able to develop genetic resources, including SNPs, microsatellites (simple sequence repeats), indels, and small inversions, that constitute essential tools for studies of evolution, population genetics, and the origin of domestication in this species. This information will facilitate the establishment of an effective DNA-barcoding-based identification method and provide valuable markers to study the population genetics of D. polystachya. Among the six genotypes examined, only 141 SNPs were detected. Despite the higher AT content in chloroplast genomes, AT to TA and GC to CG transversions were found to occur significantly less frequently among the four types of transversions (Figure 2). This result clearly indicates a bias in chloroplast genome evolution. In general, most SNPs occurred in the noncoding regions of the D. polystachya plastid genomes, which may undergo less natural selection. However, no significant difference was present in the distribution of mutations among the genome regions (Table S1). Variations in mutation rates can be related to the function of genes. ycf1 had the highest number of SNPs (19) in the D. polystachya chloroplast genome, while atp, psa, and psb exhibited the lowest evolutionary rates (Table S1). The ycf1 gene is the second longest gene; it is essential for plant viability and encodes Tic214, a vital component of the Arabidopsis TIC complex [42]. The two parts of ycf1 in the SSC region (ycf1a and ycf1b) were highly variable in flowering plants [19, 43] and are suitable as markers for phylogeny and species identification [44]. Moreover, indels are another important class of genetic variation. A total of 43 common indels were identified in the D. polystachya chloroplast genomes, all in noncoding regions (Table S3). The indel sizes ranged from 1 to 28 bp. According to our results, the mutation rates of these indels were lower than those of nucleotide substitutions. Most indels were specific to individual genotypes, and many were informative for evolutionary studies. trnL-F, rbcL-accD, and trnS-trnG constitute the most frequently applied markers in plant molecular systematics and DNA barcoding [45-47]. As in previous reports, the variable regions psbM-trnD and rbcL-accD contained the most indels in these D. polystachya chloroplast genomes [19]. Adding indels to phylogenetic analyses significantly increases resolution and support compared to simple substitution-based matrices of chloroplast DNA sequences [48]. SSRs, which consist of tandemly repeated motifs of six base pairs (bp) or less, have become widely used as chloroplast genome markers due to their ability to generate highly informative DNA markers. The most common types are mononucleotide repeats, ranging in size from 10 to 15 nucleotides; the occurrence of di-, tri-, tetra-, penta-, and hexanucleotide repeats is less common [28]. After in silico comparative analysis, we identified 24 SSR loci showing polymorphisms, which may allow investigation of spontaneous gene flow among wild and domesticated D. polystachya and phylogeographical studies. Because chloroplast genome sequences are highly conserved, chloroplast genome SSRs are transferable across species; thus, these loci can likely be used in studies of other Dioscorea species [28]. In this study, we identified SNPs, indels, microsatellites, and small inversions in Chinese yam by comparative analyses of six chloroplast genomes. These resources will allow the identification of commercial cultivars of Chinese yam and the determination of their purity. Furthermore, chloroplast genomic resources are important for further studies of domestication, population genetics, and phylogenetic analysis, possibly in combination with other informative molecular markers from the mitochondrial and/or nuclear genomes.
  35 in total

1.  Chloroplast simple sequence repeats (cpSSRs): technical resources and recommendations for expanding cpSSR discovery and applications to a wide array of plant species.

Authors:  Daniel Ebert; Rod Peakall
Journal:  Mol Ecol Resour       Date:  2009-01-28       Impact factor: 7.090

2.  Uncovering the protein translocon at the chloroplast inner envelope membrane.

Authors:  Shingo Kikuchi; Jocelyn Bédard; Minako Hirano; Yoshino Hirabayashi; Maya Oishi; Midori Imai; Mai Takase; Toru Ide; Masato Nakai
Journal:  Science       Date:  2013-02-01       Impact factor: 47.728

3.  MAFFT multiple sequence alignment software version 7: improvements in performance and usability.

Authors:  Kazutaka Katoh; Daron M Standley
Journal:  Mol Biol Evol       Date:  2013-01-16       Impact factor: 16.240

4.  Analysis of 81 genes from 64 plastid genomes resolves relationships in angiosperms and identifies genome-scale evolutionary patterns.

Authors:  Robert K Jansen; Zhengqiu Cai; Linda A Raubeson; Henry Daniell; Claude W Depamphilis; James Leebens-Mack; Kai F Müller; Mary Guisinger-Bellian; Rosemarie C Haberle; Anne K Hansen; Timothy W Chumley; Seung-Bum Lee; Rhiannon Peery; Joel R McNeal; Jennifer V Kuehl; Jeffrey L Boore
Journal:  Proc Natl Acad Sci U S A       Date:  2007-11-28       Impact factor: 11.205

5.  MrBayes 3.2: efficient Bayesian phylogenetic inference and model choice across a large model space.

Authors:  Fredrik Ronquist; Maxim Teslenko; Paul van der Mark; Daniel L Ayres; Aaron Darling; Sebastian Höhna; Bret Larget; Liang Liu; Marc A Suchard; John P Huelsenbeck
Journal:  Syst Biol       Date:  2012-02-22       Impact factor: 15.683

6.  Mutation rates in plastid genomes: they are lower than you might think.

Authors:  David Roy Smith
Journal:  Genome Biol Evol       Date:  2015-04-13       Impact factor: 3.416

Review 7.  Chloroplast genomes: diversity, evolution, and applications in genetic engineering.

Authors:  Henry Daniell; Choun-Sea Lin; Ming Yu; Wan-Jung Chang
Journal:  Genome Biol       Date:  2016-06-23       Impact factor: 13.583

8.  Chloroplast genome resources and molecular markers differentiate rubber dandelion species from weedy relatives.

Authors:  Yingxiao Zhang; Brian J Iaffaldano; Xiaofeng Zhuang; John Cardina; Katrina Cornish
Journal:  BMC Plant Biol       Date:  2017-02-02       Impact factor: 4.215

9.  Sampling strategy and potential utility of indels for DNA barcoding of closely related plant species: a case study in taxus.

Authors:  Jie Liu; Jim Provan; Lian-Ming Gao; De-Zhu Li
Journal:  Int J Mol Sci       Date:  2012-07-13       Impact factor: 6.208

10.  Sequencing angiosperm plastid genomes made easy: a complete set of universal primers and a case study on the phylogeny of saxifragales.

Authors:  Wenpan Dong; Chao Xu; Tao Cheng; Kui Lin; Shiliang Zhou
Journal:  Genome Biol Evol       Date:  2013       Impact factor: 3.416

View more
  12 in total

1.  First reported chloroplast genome sequence of Punica granatum (cultivar Helow) from Jabal Al-Akhdar, Oman: phylogenetic comparative assortment with Lagerstroemia.

Authors:  Abdul Latif Khan; Sajjad Asaf; In-Jung Lee; Ahmed Al-Harrasi; Ahmed Al-Rawahi
Journal:  Genetica       Date:  2018-08-29       Impact factor: 1.082

2.  Comparative analyses of 32 complete plastomes of Tef (Eragrostis tef ) accessions from Ethiopia: phylogenetic relationships and mutational hotspots.

Authors:  Girma Eshetu Teshome; Yeshitila Mekbib; Guangwan Hu; Zhi-Zhong Li; Jinming Chen
Journal:  PeerJ       Date:  2020-06-19       Impact factor: 2.984

3.  Paternity Assignment in White Guinea Yam (Dioscorea Rotundata) Half-Sib Progenies from Polycross Mating Design Using SNP Markers.

Authors:  Prince E Norman; Agre A Paterne; Agyemang Danquah; Pangirayi B Tongoona; Eric Y Danquah; David De Koeyer; Ugochukwu N Ikeogu; Robert Asiedu; Asrat Asfaw
Journal:  Plants (Basel)       Date:  2020-04-19

4.  Complete Chloroplast Genomes of Anthurium huixtlense and Pothos scandens (Pothoideae, Araceae): Unique Inverted Repeat Expansion and Contraction Affect Rate of Evolution.

Authors:  Claudia L Henriquez; Furrukh Mehmood; Monica M Carlsen; Madiha Islam; Mohammad Tahir Waheed; Peter Poczai; Thomas B Croat; Ibrar Ahmed
Journal:  J Mol Evol       Date:  2020-07-09       Impact factor: 2.395

5.  The complete chloroplast genome sequence of the Dioscorea esculenta (Lour.) Burkill (Dioscoreaceae).

Authors:  Xin Chen; Lijuan Cai; Yu Zhang; Wenzheng Su; Bicong Li; Qinghong Zhou; Qianglong Zhu
Journal:  Mitochondrial DNA B Resour       Date:  2020-11-20       Impact factor: 0.658

6.  The complete chloroplast genome of a new candidate cultivar, Dae Ryun, of Abeliophyllum distichum Nakai (Oleaceae).

Authors:  Jongsun Park; Juhyeon Min; Yongsung Kim; Hong Xi; Woochan Kwon; Taewon Jang; Gwanho Kim; Jae-Ho Park
Journal:  Mitochondrial DNA B Resour       Date:  2019-10-23       Impact factor: 0.658

7.  Evolutionary directions of single nucleotide substitutions and structural mutations in the chloroplast genomes of the family Calycanthaceae.

Authors:  Wenpan Dong; Chao Xu; Jun Wen; Shiliang Zhou
Journal:  BMC Evol Biol       Date:  2020-07-31       Impact factor: 3.260

8.  Development of nuclear SSR and chloroplast genome markers in diverse Liriodendron chinense germplasm based on low-coverage whole genome sequencing.

Authors:  Bin Li; Furong Lin; Ping Huang; Wenying Guo; Yongqi Zheng
Journal:  Biol Res       Date:  2020-05-14       Impact factor: 5.612

9.  Characterization of the Complete Chloroplast Genome of Acer truncatum Bunge (Sapindales: Aceraceae): A New Woody Oil Tree Species Producing Nervonic Acid.

Authors:  Qiuyue Ma; Yanan Wang; Lu Zhu; Changwei Bi; Shuxian Li; Shushun Li; Jing Wen; Kunyuan Yan; Qianzhong Li
Journal:  Biomed Res Int       Date:  2019-11-24       Impact factor: 3.411

Review 10.  An underutilized orphan tuber crop-Chinese yam : a review.

Authors:  Janina Epping; Natalie Laibach
Journal:  Planta       Date:  2020-09-21       Impact factor: 4.116

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.