| Literature DB >> 34975990 |
Yu Fan1,2, Ya'nan Jin2,3, Mengqi Ding2, Yu Tang4, Jianping Cheng1, Kaixuan Zhang2, Meiliang Zhou2.
Abstract
Buckwheat (Fagopyrum genus, Polygonaceae), is an annual or perennial, herbaceous or semi-shrub dicotyledonous plant. There are mainly three cultivated buckwheat species, common buckwheat (Fagopyrum esculentum) is widely cultivated in Asia, Europe, and America, while Tartary buckwheat (F. tataricum) and F. cymosum (also known as F. dibotrys) are mainly cultivated in China. The genus Fagopyrum is taxonomically confusing due to the complex phenotypes of different Fagopyrum species. In this study, the chloroplast (cp) genomes of three Fagopyrum species, F. longistylum, F. leptopodum, F. urophyllum, were sequenced, and five published cp genomes of Fagopyrum were retrieved for comparative analyses. We determined the sequence differentiation, repeated sequences of the cp genomes, and the phylogeny of Fagopyrum species. The eight cp genomes ranged, gene number, gene order, and GC content were presented. Most of variations of Fagopyrum species cp genomes existed in the LSC and SSC regions. Among eight Fagopyrum chloroplast genomes, six variable regions (ndhF-rpl32, trnS-trnG, trnC, trnE-trnT, psbD, and trnV) were detected as promising DNA barcodes. In addition, a total of 66 different SSR (simple sequence repeats) types were found in the eight Fagopyrum species, ranging from 8 to 16 bp. Interestingly, many SSRs showed significant differences especially in some photosystem genes, which provided valuable information for understanding the differences in light adaptation among different Fagopyrum species. Genus Fagopyrum has shown a typical branch that is distinguished from the Rumex, Rheum, and Reynoutria, which supports the unique taxonomic status in Fagopyrum among the Polygonaceae. In addition, phylogenetic analysis based on the cp genomes strongly supported the division of eight Fagopyrum species into two independent evolutionary directions, suggesting that the separation of cymosum group and urophyllum group may be earlier than the flower type differentiation in Fagopyrum plants. The results of the chloroplast-based phylogenetic tree were further supported by the matK and Internal Transcribed Spacer (ITS) sequences of 17 Fagopyrum species, which may help to further anchor the taxonomic status of other members in the urophyllum group in Fagopyrum. This study provides valuable information and high-quality cp genomes for identifying species and evolutionary analysis for future Fagopyrum research.Entities:
Keywords: Fagopyrum; Polygonaceae; chloroplast genome; comparative analysis; phylogenetic relationship
Year: 2021 PMID: 34975990 PMCID: PMC8715082 DOI: 10.3389/fpls.2021.799904
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
FIGURE 1The morphological characters of plants and flowers of eight Fagopyrum species. (A) F. tataricum; (B) F. cymosum; (C) F. esculentum; (D) F. esculentum subsp. ancestrale; (E) F. luojishanense; (F) F. urophyllum; (G) F. leptopodum; (H) F. longistylum.
FIGURE 2Gene map of the eight Fagopyrum species. The genes shown outside of the circle are transcribed clockwise, while those inside are transcribed counterclockwise. Genes belonging to different functional groups are color-coded. The dashed area in the inner circle indicates the GC content of the chloroplast genome.
Comparison of the complete chloroplast genomes for eight Fagopyrum species.
|
|
|
|
|
|
|
| |||
| Accession number |
|
|
|
|
|
|
|
| |
| Total | Total length (bp) | 159,272 | 159,320 | 159,576 | 159,599 | 159,325 | 159,376 | 159,350 | 159,265 |
| GC (%) | 37.87 | 37.93 | 37.95 | 37.99 | 37.82 | 37.79 | 37.78 | 37.84 | |
| LSC | Length (bp) | 84,397 | 84,422 | 84,875 | 84,885 | 84,417 | 84,282 | 84,250 | 84,431 |
| GC (%) | 36.20 | 36.26 | 36.29 | 36.32 | 36.03 | 36.02 | 36.01 | 36.05 | |
| Length (%) | 52.99 | 52.99 | 53.19 | 53.19 | 52.98 | 52.88 | 52.87 | 53.01 | |
| SSC | Length (bp) | 13,241 | 13,264 | 13,331 | 13,344 | 13,226 | 13,402 | 13,406 | 13,094 |
| GC (%) | 32.78 | 32.90 | 32.99 | 32.96 | 32.17 | 32.10 | 31.97 | 32.36 | |
| Length (%) | 8.31 | 8.33 | 8.35 | 8.36 | 8.30 | 8.41 | 8.41 | 8.22 | |
| IRa/IRb | Length (bp) | 30,817 | 30,817 | 30,685 | 30,685 | 30,841 | 30,846 | 30,847 | 30,870 |
| GC (%) | 41.26 | 41.29 | 41.38 | 41.37 | 41.48 | 41.45 | 41.46 | 41.45 | |
| Length (%) | 19.35 | 19.34 | 19.23 | 19.23 | 19.36 | 19.35 | 19.36 | 19.38 |
Genes contained in the chloroplast genome of eight Fagopyrum species.
| Category for genes | Groups of genes | Name of genes |
| Photosynthesis related genes | Rubisco |
|
| Photosystem I |
| |
| Assembly/stability of photosystem I |
| |
| Photosystem II |
| |
| ATP synthase |
| |
| Cytochrome b/f complex |
| |
| Cytochrome c synthesis |
| |
| NADPH dehydrogenase |
| |
| Transcription and translation related genes | Transcription |
|
| Ribosomal proteins |
| |
| Translation initiation factor |
| |
| RNA genes | Ribosomal RNA |
|
| Transfer RNA |
| |
| Other genes | RNA processing |
|
| Carbon metabolism |
| |
| Fatty acid synthesis |
| |
| Proteolysis |
| |
| Genes of unknown function | Conserved reading frames |
|
| Pseudogenes |
|
Codon content of amino acids and stop codon of eight Fagopyrum species.
| Amino acid | Codon | RSCUa | |||||||||||||||||
|
|
|
| F. |
|
|
|
|
|
|
|
|
|
|
| |||||
| Ala | GCA | 1.12 | 1.13 | 1.14 | 1.11 | 1.16 | 1.16 | 1.16 | 1.15 | Ile | AUA | 0.97 | 0.97 | 0.97 | 0.96 | 0.97 | 0.97 | 0.98 | 0.98 |
| GCC | 0.76 | 0.76 | 0.77 | 0.78 | 0.71 | 0.7 | 0.71 | 0.71 | AUC | 0.57 | 0.57 | 0.56 | 0.56 | 0.55 | 0.55 | 0.55 | 0.55 | ||
| GCG | 0.48 | 0.48 | 0.47 | 0.47 | 0.49 | 0.5 | 0.49 | 0.49 | AUU | 1.46 | 1.46 | 1.47 | 1.48 | 1.48 | 1.48 | 1.47 | 1.48 | ||
| GCU | 1.64 | 1.64 | 1.63 | 1.64 | 1.64 | 1.64 | 1.64 | 1.64 | Lys | AAA | 1.5 | 1.5 | 1.49 | 1.49 | 1.5 | 1.5 | 1.5 | 1.51 | |
| Arg | AGA | 1.67 | 1.67 | 1.65 | 1.66 | 1.71 | 1.7 | 1.7 | 1.71 | AAG | 0.5 | 0.5 | 0.51 | 0.51 | 0.5 | 0.5 | 0.5 | 0.49 | |
| AGG | 0.75 | 0.75 | 0.77 | 0.76 | 0.73 | 0.74 | 0.73 | 0.73 | Met | AUG | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| CGA | 1.45 | 1.45 | 1.48 | 1.48 | 1.46 | 1.47 | 1.45 | 1.45 | Phe | UUC | 0.65 | 0.66 | 0.67 | 0.67 | 0.66 | 0.65 | 0.65 | 0.66 | |
| CGC | 0.36 | 0.36 | 0.36 | 0.36 | 0.35 | 0.33 | 0.35 | 0.34 | UUU | 1.35 | 1.34 | 1.33 | 1.33 | 1.34 | 1.35 | 1.35 | 1.34 | ||
| CGG | 0.45 | 0.44 | 0.43 | 0.45 | 0.43 | 0.44 | 0.45 | 0.43 | Pro | CCA | 1.08 | 1.08 | 1.06 | 1.08 | 1.08 | 1.09 | 1.08 | 1.08 | |
| CGU | 1.32 | 1.32 | 1.31 | 1.3 | 1.33 | 1.32 | 1.32 | 1.33 | CCC | 0.74 | 0.74 | 0.76 | 0.78 | 0.77 | 0.77 | 0.77 | 0.77 | ||
| Asn | AAC | 0.47 | 0.47 | 0.48 | 0.48 | 0.47 | 0.48 | 0.48 | 0.47 | CCG | 0.61 | 0.61 | 0.62 | 0.6 | 0.63 | 0.62 | 0.61 | 0.63 | |
| AAU | 1.53 | 1.53 | 1.52 | 1.52 | 1.53 | 1.52 | 1.52 | 1.53 | CCU | 1.57 | 1.57 | 1.56 | 1.54 | 1.53 | 1.52 | 1.54 | 1.53 | ||
| Asp | GAC | 0.42 | 0.42 | 0.42 | 0.42 | 0.41 | 0.41 | 0.41 | 0.42 | Ser | AGC | 0.43 | 0.43 | 0.44 | 0.44 | 0.42 | 0.42 | 0.43 | 0.43 |
| GAU | 1.58 | 1.58 | 1.58 | 1.58 | 1.59 | 1.59 | 1.59 | 1.58 | AGU | 1.14 | 1.14 | 1.15 | 1.15 | 1.14 | 1.15 | 1.14 | 1.13 | ||
| Cys | UGC | 0.58 | 0.58 | 0.58 | 0.57 | 0.6 | 0.61 | 0.6 | 0.61 | UCA | 1.2 | 1.19 | 1.22 | 1.21 | 1.19 | 1.18 | 1.19 | 1.19 | |
| UGU | 1.42 | 1.42 | 1.42 | 1.43 | 1.4 | 1.39 | 1.4 | 1.39 | UCC | 0.94 | 0.95 | 0.94 | 0.94 | 0.96 | 0.97 | 0.95 | 0.96 | ||
| Gln | CAA | 1.53 | 1.53 | 1.52 | 1.51 | 1.52 | 1.52 | 1.51 | 1.52 | UCG | 0.65 | 0.66 | 0.63 | 0.62 | 0.64 | 0.65 | 0.65 | 0.64 | |
| CAG | 0.47 | 0.47 | 0.48 | 0.49 | 0.48 | 0.48 | 0.49 | 0.48 | UCU | 1.63 | 1.63 | 1.61 | 1.63 | 1.65 | 1.63 | 1.64 | 1.64 | ||
| Glu | GAA | 1.47 | 1.47 | 1.46 | 1.47 | 1.47 | 1.47 | 1.48 | 1.48 | Thr | ACA | 1.21 | 1.21 | 1.22 | 1.22 | 1.21 | 1.22 | 1.22 | 1.21 |
| GAG | 0.53 | 0.53 | 0.54 | 0.53 | 0.53 | 0.53 | 0.52 | 0.52 | ACC | 0.74 | 0.75 | 0.74 | 0.73 | 0.71 | 0.71 | 0.71 | 0.71 | ||
| Gly | GGA | 1.53 | 1.52 | 1.52 | 1.52 | 1.57 | 1.57 | 1.57 | 1.57 | ACG | 0.54 | 0.53 | 0.53 | 0.53 | 0.58 | 0.56 | 0.57 | 0.58 | |
| GGC | 0.5 | 0.5 | 0.51 | 0.51 | 0.52 | 0.52 | 0.52 | 0.52 | ACU | 1.52 | 1.51 | 1.51 | 1.52 | 1.5 | 1.51 | 1.5 | 1.5 | ||
| GGG | 0.74 | 0.75 | 0.75 | 0.74 | 0.69 | 0.69 | 0.7 | 0.69 | Trp | UGG | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| GGU | 1.23 | 1.23 | 1.22 | 1.23 | 1.22 | 1.22 | 1.22 | 1.22 | Tyr | UAC | 0.44 | 0.44 | 0.42 | 0.41 | 0.42 | 0.42 | 0.42 | 0.42 | |
| His | CAC | 0.47 | 0.47 | 0.47 | 0.47 | 0.47 | 0.47 | 0.48 | 0.47 | UAU | 1.56 | 1.56 | 1.58 | 1.59 | 1.58 | 1.58 | 1.58 | 1.58 | |
| CAU | 1.53 | 1.53 | 1.53 | 1.53 | 1.53 | 1.53 | 1.52 | 1.53 | Val | GUA | 1.41 | 1.41 | 1.41 | 1.41 | 1.39 | 1.41 | 1.41 | 1.4 | |
| Leu | CUA | 0.85 | 0.84 | 0.85 | 0.84 | 0.84 | 0.85 | 0.85 | 0.84 | GUC | 0.58 | 0.58 | 0.58 | 0.59 | 0.59 | 0.58 | 0.58 | 0.58 | |
| CUC | 0.41 | 0.41 | 0.41 | 0.41 | 0.41 | 0.41 | 0.41 | 0.42 | GUG | 0.54 | 0.54 | 0.54 | 0.54 | 0.54 | 0.53 | 0.53 | 0.54 | ||
| CUG | 0.38 | 0.38 | 0.41 | 0.4 | 0.42 | 0.41 | 0.41 | 0.42 | GUU | 1.46 | 1.47 | 1.46 | 1.46 | 1.48 | 1.48 | 1.48 | 1.48 | ||
| CUU | 1.27 | 1.27 | 1.25 | 1.25 | 1.27 | 1.27 | 1.28 | 1.27 | Stop | UAA | 1.68 | 1.68 | 1.74 | 1.74 | 1.63 | 1.68 | 1.63 | 1.63 | |
| UUA | 1.87 | 1.87 | 1.86 | 1.87 | 1.86 | 1.86 | 1.85 | 1.85 | UAG | 0.74 | 0.74 | 0.63 | 0.63 | 0.79 | 0.74 | 0.79 | 0.74 | ||
| UUG | 1.22 | 1.23 | 1.22 | 1.22 | 1.2 | 1.2 | 1.2 | 1.19 | UGA | 0.58 | 0.58 | 0.63 | 0.63 | 0.58 | 0.58 | 0.58 | 0.63 | ||
FIGURE 3Sequence alignment of chloroplast genome among eight Fagopyrum species and three Polygonaceae species (Rumex hypogaeus, Reynoutria japonica, and Rheum officinale) with F. tataricum as a reference by using mVISTA. The Y-scale represents the percentage of identity ranging from 50 to 100%. Coding and non-coding regions are marked in purple and pink, respectively.
FIGURE 4Comparison of nucleotide diversity (Pi) values among the eight Fagopyrum species. X-axis, position of the midpoint of each window; Y-axis, nucleotide diversity (Pi) of each window.
FIGURE 5Comparison of the junctions between the LSC, SSC, and IR regions among eight Fagopyrum species chloroplast genomes. The figure is not scaled LSC, SSC, and IR.
FIGURE 6Analysis of SSRs in the eight Fagopyrum species cp genomes. (A) Distribution of SSRs in genic and intergenic. (B) Distribution of SSRs in LSC, IRa/IRb, and SSC. (C) The number of different SSR types detected in the eight Fagopyrum species cp genomes. (D) The number of different base SSR types in the eight Fagopyrum species cp genomes.
FIGURE 7Phylogenetic tree obtained for eight Fagopyrum species inferred from ML analysis constructed by the complete chloroplast genomes. The number on the branches displayed the bootstrap support values.