| Literature DB >> 29725599 |
Junling Cao1,2,3, Dan Jiang4, Zhenyu Zhao1, Subo Yuan5, Yujun Zhang4, Teng Zhang1, Wenhao Zhong1, Qingjun Yuan1, Luqi Huang1.
Abstract
Chinese yam has been used both as a food and in traditional herbal medicine. Developing more effective genetic markers in this species is necessary to assess its genetic diversity and perform cultivar identification. In this study, new chloroplast genomic resources were developed using whole chloroplast genomes from six genotypes originating from different geographical locations. The Dioscorea polystachya chloroplast genome is a circular molecule consisting of two single-copy regions separated by a pair of inverted repeats. Comparative analyses of six D. polystachya chloroplast genomes revealed 141 single nucleotide polymorphisms (SNPs). Seventy simple sequence repeats (SSRs) were found in the six genotypes, including 24 polymorphic SSRs. Forty-three common indels and five small inversions were detected. Phylogenetic analysis based on the complete chloroplast genome provided the best resolution among the genotypes. Our evaluation of chloroplast genome resources among these genotypes led us to consider the complete chloroplast genome sequence of D. polystachya as a source of reliable and valuable molecular markers for revealing biogeographical structure and the extent of genetic variation in wild populations and for identifying different cultivars.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29725599 PMCID: PMC5872661 DOI: 10.1155/2018/6293847
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Genes identified in the chloroplast genome of D. polystachya.
| Genotype | FLW | TSW | YTW | XSW | NJW | MHW |
|---|---|---|---|---|---|---|
| Locality | Shijiazhuang, Hebei, China | Tai'an, Shandong, China | Jiaozuo, Henan, China | Xiangshan, Beijing, China | Nanjing, Jiangsu, China | Minhou, Fujian, China |
| Raw data no. | 70,997,840 | 47,638,574 | 61,275,836 | 64,254,664 | 63,759,008 | 62,610,816 |
| Mapped read no. | 1,076,604 | 904,074 | 5,925,916 | 1,396,336 | 876,472 | 1,119,774 |
| Percentage of chloroplast genome reads (%) | 1.52% | 1.90% | 9.67% | 2.17% | 1.37% | 1.79% |
| Chloroplast genome coverage (X) | 1,054 | 885 | 5,799 | 1,367 | 858 | 1,096 |
| Accession number in GenBank | MG267375 | MG267376 | MG267379 | MG267377 | MG267380 | MG267378 |
| Size (bp) | 153,255 | 153,255 | 153,292 | 153,257 | 153,281 | 153,243 |
| LSC (bp) | 83,456 | 83,456 | 83,492 | 83,458 | 83,484 | 83,431 |
| SSC (bp) | 18,821 | 18,821 | 18,816 | 18,821 | 18,815 | 18,834 |
| IRs (bp) | 25,489 | 25,489 | 25,492 | 25,489 | 25,491 | 25,489 |
Figure 1Map of the Dioscorea polystachya chloroplast genome. The genes inside and outside of the circle are transcribed in the clockwise and counterclockwise directions, respectively. Genes belonging to different functional groups are shown in different colors. Thick lines indicate the extent of the inverted repeats (IRa and IRb) that separate the small single-copy (SSC) and large single-copy (LSC) regions of the genome.
Details of the complete chloroplast genomes of six D. polystachya genotypes.
| Category for genes | Group of gene |
|
|---|---|---|
| Photosynthesis related genes | Rubisco |
|
| Photosystem I |
| |
| Assembly/stability of photosystem I | | |
| Photosystem II |
| |
| ATP synthase |
| |
| cytochrome b/f complex |
| |
| Cytochrome c synthesis |
| |
| NADPH dehydrogenase | | |
|
| ||
| Transcription and translation related genes | Transcription |
|
| Ribosomal proteins |
| |
| Translation initiation factor |
| |
|
| ||
| RNA genes | Ribosomal RNA |
|
| Transfer RNA | | |
|
| ||
| Other genes | RNA processing |
|
| Carbon metabolism |
| |
| Fatty acid synthesis |
| |
| Proteolysis | | |
|
| ||
| Genes of unknown function | Conserved reading frames |
|
Intron-containing genes are marked by asterisks (∗).
Figure 2The patterns of nucleotide substitution among the six D. polystachya chloroplast genomes. The patterns were divided into six types as indicated by the six non-strand-specific base-substitution types (i.e., numbers of considered G to A and C to T sites for each respective set of associated mutation types).
Figure 3Analyses of simple sequence repeats (SSRs) in the D. polystachya chloroplast genomes. (a) Number of different SSR types detected by MISA. (b) Number of SSRs in the LSC, SSC, and IR regions. (c) Number of SSRs in spacers, exons, and introns. (d) Frequency of identified SSR motifs in the different repeat classes.
SSRs identified from in silico comparative analysis of the chloroplast genomes of six D. polystachya genotypes.
| Position | Region | Location | SSR type | Forward sequence | Reverse sequence | PRODUCT1 size (bp) |
|---|---|---|---|---|---|---|
|
| LSC | space | (TATAT)3 | CCGAGGACAAGGAATCCAATCA | AGGTTCTCCTGAGAGTGAACCA | 270 |
|
| LSC | space | (A)10 | CCGAATTGGGCCATAAGACTCT | ACCATGACTGATCCTGAAAGGT | 223 |
|
| LSC | space | (A)12 | TGCCATCACTTCATCAAGACCA | CCTCGGAGCCATGGAAGAAATA | 253 |
|
| LSC | space | (A)10 | TACAGCCAATCCAGCAGCAATA | TGAGTTACTTCTCCACCCGATG | 161 |
|
| LSC | space | (A)15 | ACCAAATCAATGATCGGACCAA | TAGTGCACCGTTCAAGACAAGA | 255 |
|
| LSC | space | (A)11 | AGACAGAATAATTGGGGGTAGGA | ACCCCATCTATGTTTAGGTTGCT | 273 |
|
| LSC | space | (T)12 | TGGACCAGTTCTTAACAGAATAATG | GGACATATGGCCGTCGAAAGAA | 138 |
|
| LSC | space | (T)11 | CGATGTCGGATTGGTACACGTA | GCATATGCACTCATTCAAGGACA | 183 |
|
| LSC | space | (TA)6 | TTCAAGACCGGAGCTATCAACC | GCATGTGGTCGAGGAGAGTTTA | 232 |
|
| LSC | space | (TA)7 | GCTCCCTCTTTCTCCTTTGTTC | TACCGCGCACATCACTTAGAT | 280 |
|
| LSC | space | (A)12 | CTTGGCATCTGTGATTTTGGCA | TGTTCCTTTCATTTATCCCGTCA | 221 |
|
| LSC | space | (A)11 | CCAAGCTTTACTGTACCGAATCC | TGTGTGTGTCGTGTAGCTTGAT | 215 |
|
| LSC | space | (A)10 | ATCAAGCTACACGACACACACA | AGCAGCCAACAGAAAACCAAAA | 199 |
|
| LSC | intron 1 | (T)10 | CACCCTTCCTTTCGTTGGAGTA | ATCGGGAGTACATTTCAGCGTC | 213 |
|
| LSC | intron 2 | (T)11 | CACCTTTGGATGCATACGGTTC | TATAGTATAGGGCGGGGTCCAA | 163 |
|
| LSC | intron 2 | (T)12 | CCGGGTAAAGATCTGTCCGAAT | AGCGTGAAGTGCAATTAGATCA | 276 |
|
| LSC | space | (T)12 | ACCAATACGTCCATTCCTACGG | TAGGCGTGGACGAATTATGGTG | 238 |
|
| LSC | space | (T)10 | TCCCTACCCATGACGAACTAGA | ACTCGAGTTTTTGGTGCGATTC | 259 |
|
| LSC | intron | (T)10 | GCTCCTCGCGAATGAAATGATT | GCTCGCGAAACCCTTGTTTATT | 275 |
|
| LSC | space | (T)12 | CGAGTCACACACTAAGCATAGC | GTTCCCCTACAAACAATTCGCG | 279 |
|
| IR | space | (ATA)4g(TAA)6 | TGGTTCTGCTTCCCCTCTTTTT | GCAAAGGGTCGAGAAACTCAAC | 274 |
|
| SSC | space | (T)16 | CCATTCAACTGGATCTAGGAGGA | TGTGGATTTTACCGATCGGGAA | 241 |
|
| SSC | space | (T)10 | TATCTATACTTATTGCACCAATA | ACCAAGTATTAACCAGTGTTAA | 176 |
Figure 4Indels identified in the chloroplast genomes of D. polystachya. (a) Numbers of individual indels shown by sequence length. (b) Relative frequency of indel occurrence in the LSC, SSC, and IR regions.
The locations, directions, and lengths of five small inversions.
| Location | Region | Length of inversions in cpg (bp) | Direction of the small inversions | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Length of inversion | Length of inverted repeat | FLW | TSW | YTW | XSW | NJW | MHW | ||
|
| LSC | 51 | 13 | - | - | - | - | Inverted | - |
|
| LSC | 4 | 22 | - | - | Inverted | - | - | Inverted |
|
| SSC | 2 | 14 | - | - | - | - | Inverted | - |
|
| SSC | 2 | 6 | - | - | - | - | Inverted | - |
|
| SSC | 3 | 20 | - | - | Inverted | - | - | - |
Figure 5Phylogenetic relationships among Dioscorea species constructed from complete chloroplast genome sequences using maximum likelihood (ML) and Bayesian inference (BI). The ML topology is shown, with the ML bootstrap support value/Bayesian posterior probability given for each node.