| Literature DB >> 33005000 |
Wei Tan1, Han Gao1, Weiling Jiang1, Huanyu Zhang1, Xiaolei Yu1, Erwei Liu2, Xiaoxuan Tian3.
Abstract
Chloroplast genomes have been widely considered an informative and valuable resource for molecular marker development and phylogenetic reconstruction in plant species. This study evaluated the complete chloroplast genomes of the traditional Chinese medicine Gleditsia sinensis and G. japonica, an adulterant of the former. The complete chloroplast genomes of G. sinensis and G. japonica were found to be of sizes 163,175 bp and 162,391 bp, respectively. A total of 111 genes were identified in each chloroplast genome, including 77 coding sequences, 30 tRNA, and 4 rRNA genes. Comparative analysis demonstrated that the chloroplast genomes of these two species were highly conserved in genome size, GC contents, and gene organization. Additionally, nucleotide diversity analysis of the two chloroplast genomes revealed that the two short regions of ycf1b were highly diverse, and could be treated as mini-barcode candidate regions. The mini-barcode of primers ZJ818F-1038R was proven to precisely discriminate between these two species and reflect their biomass ratio accurately. Overall, the findings of our study will shed light on the genetic evolution and guide species identification of G. sinensis and G. japonica.Entities:
Mesh:
Substances:
Year: 2020 PMID: 33005000 PMCID: PMC7529812 DOI: 10.1038/s41598-020-73392-7
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Gene map of the complete chloroplast genomes of the two Gleditsia species. Genes on the inside of the circle are transcribed clockwise, whereas those outside are transcribed counter-clockwise. The dark gray and light gray shading within the inner circle correspond to the percentages of G + C and A + T contents, respectively.
Comparison of the chloroplast genome organization of the two Gleditsia species.
| Genome features | ||
|---|---|---|
| Total reads (bp) | 29,589,646 | 28,279,048 |
| Size (bp) | 163,175 | 162,391 |
| LSC (bp) | 91,540 | 91,449 |
| SSC (bp) | 19,249 | 19,449 |
| IR (bp) | 26,193 | 25,866 |
| Number of genes | 111 | 111 |
| Protein-coding genes | 77 | 77 |
| tRNA genes | 30 | 30 |
| rRNA genes | 4 | 4 |
| Total G + C content (%) | 35.6 | 35.5 |
Gene contents in the chloroplast genomes of the two Gleditsia species.
| Gene category | Gene groups | Names of genes |
|---|---|---|
| Self-replicating | Large subunit of ribosome (LSU) | |
| Small subunit of ribosome (SSU) | ||
| DNA dependent RNA polymerase | ||
| rRNA genes | ||
| tRNA genes | ||
| Photosynthesis | Photosystem I | |
| Photosystem II | ||
| NADH dehydrogenase | ||
| Cytochrome b/f complex | ||
| Subunits of ATP synthase | ||
| Large subunit of Rubisco | ||
| Other genes | Protease | |
| Maturase | ||
| Envelop membrane protein | ||
| Subunit of acetyl-CoA | ||
| C-type cytochrome synthesis gene | ||
| Unknown function | Proteins of unknown function |
agenes containing a single intron.
bgenes containing two introns.
c genes divided into two independent transcription units.
Figure 2Analysis of repeated sequences in the two Gleditsia species. (A) The numbers of different SSR types, including mononucleotide, dinucleotide, and trinucleotide; (B) Number of different SSR repeat units. (C) Frequency of repeat sequences in the two chloroplast genomes as determined by REPuter; (D) Frequency of tandem repeat sequences by length.
Figure 3Codon contents of the 20 amino acids and stop codons in all protein-coding genes in the chloroplast genomes of the two Gleditsia species.
Figure 4Visual alignment of the chloroplast genomes of the two Gleditsia species. VISTA-based identity plot showing sequence identity among the two species, using G. japonica as reference.
Figure 5Phylogenetic tree reconstruction of the 155 species inferred from maximum likelihood (ML) based on 75 protein-coding genes of the complete chloroplast genomes. (A) Phylogenetic relationship of Leguminosae, the orange dots at nodes on the tree indicate bootstrap values (= 100). (B) Phylogenetic relationship of Caesalpiniaceae, numbers at nodes on the tree represent bootstrap values.
Figure 6Nucleotide diversity (Pi) based on sliding window analysis of G. sinensis and G. japonica using 130 chloroplast genes. X-axis, the position of the midpoint of a window; Y-axis, nucleotide diversity of each window.
Two pairs of primers of ycf1b mini-barcodes.
| Primer name | ZJ818F-1038R | ZJ1118F-1287R |
|---|---|---|
| Forward primer sequence 5′ to 3 | CTTCCAAAACGAAGAT | CTAGTTTGTCAACTTTTC |
| Reverse primer sequence 5′ to 3 | AGCATTTTCAAATCGA | AAATTCCTTATCTAGAGC |
| Amplicon size (bp) | 221 | 170 |
| Sequence size excluding primers (bp) | 189 | 134 |
Features of nine marker regions in these two Gleditsia species.
| Number | Marker region | Aligned length (bp) | Variable sites | K2P | References | |
|---|---|---|---|---|---|---|
| Number | % | |||||
| 1 | 189 | 6 | 3.175 | 0.03260 | This paper | |
| 2 | 134 | 6 | 4.478 | 0.04690 | This paper | |
| 3 | 1500 | 8 | 0.533 | 0.00536 | Wojciechowski et al | |
| 4 | 703 | 4 | 0.569 | 0.00571 | Chen et al | |
| 5 | 438 | 5 | 1.142 | 0.01240 | Liu et al | |
| 6 | 2098 | 11 | 0.524 | 0.00527 | Schnabel et al | |
| 7 | 1122 | 11 | 0.980 | 0.01040 | Schnabel et al | |
| 8 | 566 | 5 | 0.883 | 0.00890 | Herendeen et al | |
| 9 | 499 | 7 | 1.402 | 0.01440 | Herendeen et al | |
Figure 7The relationship between biomass and read counts of the products amplified by the two primer pairs (ZJ818F-1038R and ZJ1118F-1287R) in the two species. X-axis, the proportion of biomass; Y-axis, the proportion of reads.