| Literature DB >> 34895202 |
Fang Li1,2, Xuena Xie2, Rong Huang2, Enwei Tian2, Chan Li2, Zhi Chao3,4.
Abstract
BACKGROUND: Whole chloroplast genome (cpDNA) sequence is becoming widely used in the phylogenetic studies of plant and species identification, but in most cases the cpDNA were acquired from silica gel dried fresh leaves. So far few reports have been available to describe cpDNA acquisition from crude drugs derived from plant materials, the DNA of which usually was seriously damaged during their processing. In this study, we retrieved cpDNA from the commonly used crude drug Eriobotryae Folium (Pipaye in Chinese, which is the dried leaves of Eriobotrya japonica, PPY) using genome skimming technique.Entities:
Keywords: Chloroplast genome; Crude drug; Eriobotrya japonica; Eriobotryae Folium; Genome skimming; Identification
Mesh:
Substances:
Year: 2021 PMID: 34895202 PMCID: PMC8666020 DOI: 10.1186/s12896-021-00728-0
Source DB: PubMed Journal: BMC Biotechnol ISSN: 1472-6750 Impact factor: 2.563
Fig. 1Chloroplast genome map of E. japonica. The genes outside of the circle are transcribed clockwise, while those inside are transcribed counterclockwise. Small single copy (SSC), large single copy (LSC), and inverted repeats (IRa, IRb) are indicated
Summary of cpDNA characteristics of 11 samples
| PPY-1 | PPY-2 | PPY-3 | SP | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| DNA concentration (ng/μL) | 273.1 | 310.1 | 41.9 | 29.4 | 33.6 | 9.76 | 88 | 21.9 | 47.5 | 43.8 | 362.7 |
| Coverage | 309.71 ± 45.78 | 747.45 ± 99.19 | 60.02 ± 20.91 | 140.14 ± 28.56 | 188.80 ± 35.90 | 59.99 ± 20.96 | 505.66 ± 44.15 | 579.12 ± 94.80 | 492.26 ± 54.56 | 359.61 ± 71.08 | 258.53 ± 36.78 |
| Size (bp) | 159,115 | 159,156 | 159,155 | 159,156 | 159,156 | 159,202 | 159,270 | 159,393 | 159,177 | 163,250 | 159,690 |
| Number of genes | 112 | 112 | 112 | 112 | 112 | 112 | 112 | 112 | 112 | 113 | 112 |
| Number of protien-coding genes | 78 | 78 | 78 | 78 | 78 | 78 | 78 | 78 | 78 | 79 | 78 |
| Number of tRNA genes | 30 | 30 | 30 | 30 | 30 | 30 | 30 | 30 | 30 | 30 | 30 |
| Number of rRNA genes | 4 | 4 | 4 | 4 | 4 | 4 | 4 | 4 | 4 | 4 | 4 |
| Overall GC content | 36.70% | 36.70% | 36.70% | 36.70% | 36.70% | 36.70% | 36.70% | 36.70% | 31.00% | 36.10% | 39.3% |
| LSC length (bp) | 87,222 | 87,223 | 87,222 | 87,223 | 87,223 | 87,271 | 87,250 | 87,401 | 87,330 | 90,907 | 87,776 |
| IR length (bp) | 26,326 | 26,326 | 26,326 | 26,326 | 26,326 | 26,325 | 26,335 | 26,330 | 26,317 | 26,497 | 26,580 |
| SSC length (bp) | 19,281 | 19,281 | 19,281 | 19,281 | 19,281 | 19,281 | 19,350 | 19,332 | 19,213 | 19,349 | 18,754 |
| GC content in IR (%) | 42.70% | 42.70% | 42.70% | 42.70% | 42.70% | 42.70% | 42.70% | 42.70% | 42.70% | 42.80% | 42.70% |
| Reference sequence | KY085905, MN577877, NC034639 for cpDNA MH711704, MG938044, MF096288, KX675082 for ITS/ITS2 | MN577877 for cpDNA KJ170784, KP093136 for ITS/ITS2 | MK920282 for cpDNA MG938042, JQ392434 for ITS/ITS2 | MN577877 for cpDNA MH246945, KP093137 for ITS/ITS2 | NC042740 for cpDNA AY096031 for ITS/ITS2 | JN867587, NC020318, MN990594 for cpDNA | |||||
Fig. 2Comparison of the border regions of the LSC, SSC and IR regions among the 11 cp genomes. The genes cross the LSC/IRb or IRb/SSC regions, indicating that the LSC/IRb boundary has moved backward or the IRb/SSC boundary moves forward in these species
Indexes of codon usage bias in 11 samples representing 6 species
| PPY-1 | PPY-2 | PPY-3 | SP | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| PCG length (bp) | 78,612 | 78,612 | 78,612 | 78,612 | 78,612 | 78,612 | 78,630 | 78,630 | 78,600 | 77,301 | 77,811 |
| Codon Number | 26,204 | 26,204 | 26,204 | 26,204 | 26,204 | 26,204 | 26,210 | 26,210 | 26,200 | 25,767 | 26,122 |
| Amino acid No | 26,120 | 26,120 | 26,120 | 26,120 | 26,120 | 26,120 | 26,126 | 26,126 | 26,116 | 25,682 | 26,038 |
| SC No | 25,052 | 25,052 | 25,052 | 25,052 | 25,052 | 25,052 | 25,058 | 25,058 | 25,049 | 24,632 | 24,959 |
| ENC | 49.52 | 49.52 | 49.52 | 49.52 | 49.52 | 49.52 | 49.51 | 49.52 | 49.52 | 49.33 | 50.66 |
| CAI | 0.166 | 0.166 | 0.166 | 0.166 | 0.166 | 0.166 | 0.166 | 0.166 | 0.165 | 0.165 | 0.167 |
| CBI | − 0.106 | − 0.106 | − 0.106 | − 0.106 | − 0.106 | − 0.106 | − 0.106 | − 0.106 | − 0.107 | − 0.114 | − 0.096 |
| FOP | 0.350 | 0.350 | 0.350 | 0.350 | 0.350 | 0.350 | 0.350 | 0.350 | 0.350 | 0.346 | 0.357 |
| GC content (%) | 0.378 | 0.378 | 0.378 | 0.378 | 0.378 | 0.378 | 0.378 | 0.378 | 0.378 | 0.375 | 0.392 |
| GC3 content (%) | 0.267 | 0.267 | 0.267 | 0.267 | 0.267 | 0.267 | 0.267 | 0.267 | 0.267 | 0.264 | 0.288 |
Fig. 3Amino acid frequencies in 11 samples protein-coding sequences
Comparison of simple repeats (SSR) in 11 cp genomes
| Sample | Mononucleotides | Dinucleotides | Trinucleotides | Tetranucleotides | Pentanucleotides | Hexanucleotides | Total |
|---|---|---|---|---|---|---|---|
| 70 | 15 | 0 | 6 | 1 | 0 | 92 | |
| 70 | 15 | 0 | 6 | 1 | 0 | 92 | |
| PPY-1 | 70 | 15 | 0 | 6 | 1 | 0 | 92 |
| PPY-2 | 70 | 15 | 0 | 6 | 1 | 0 | 92 |
| PPY-3 | 70 | 15 | 0 | 6 | 1 | 0 | 92 |
| SP | 70 | 15 | 0 | 6 | 1 | 0 | 92 |
| 70 | 15 | 0 | 4 | 1 | 0 | 90 | |
| 83 | 17 | 0 | 6 | 1 | 1 | 108 | |
| 75 | 17 | 0 | 6 | 0 | 0 | 98 | |
| 58 | 18 | 6 | 9 | 1 | 1 | 93 | |
| 30 | 9 | 3 | 9 | 0 | 2 | 53 | |
| Total | 736 | 166 | 9 | 70 | 9 | 4 | 994 |
| Ratio | 74.04% | 16.70% | 0.91% | 7.04% | 0.91% | 0.40% | 100% |
Comparison of long repeat sequences in 11 cp genomes
| Sample | Tandem repeat | Complement repeat | Forward repeat | Palindromic repeat | Reverse repeat | Total |
|---|---|---|---|---|---|---|
| 131 | 0 | 25 | 20 | 3 | 179 | |
| 131 | 0 | 25 | 20 | 3 | 179 | |
| PPY-1 | 131 | 0 | 25 | 20 | 3 | 179 |
| PPY-2 | 131 | 0 | 25 | 20 | 3 | 179 |
| PPY-3 | 131 | 0 | 25 | 20 | 3 | 179 |
| SP | 133 | 0 | 26 | 20 | 3 | 182 |
| 133 | 2 | 23 | 18 | 7 | 183 | |
| 132 | 1 | 22 | 16 | 11 | 182 | |
| 130 | 0 | 22 | 17 | 11 | 180 | |
| 216 | 1 | 19 | 19 | 8 | 263 | |
| 49 | 0 | 11 | 16 | 0 | 76 | |
| total | 1448 | 4 | 248 | 206 | 55 | 1961 |
Fig. 4Comparative chloroplast genomic analysis. The red area represents the non-coding area, and the purple area represents the coding area. The large twists and turns indicate large variations
Fig. 5Comparative analysis of the nucleotide diversity (Pi) value of the cp genomes among the 11 species. A Coding regions, B non-coding regions
List of genes found in Eriobotrya japonica cpDNA
| Gene category | Gene group | Gene name |
|---|---|---|
| Photosynthesis related genes | Photosystem I | |
| photosystem II | ||
| Cytochrome b/f complex | ||
| ATP synthase | ||
| NADH dehydrogenase | ||
| RubisCO large subunit | ||
| Transcription and translation related genes | Ribosomal proteins(SSU) | |
| Ribosomal proteins(LSU) | ||
| RNA genes | Ribosomal RNAs | |
| Transfer RNAs | ||
| RNA polymerase | ||
| Other genes | ||
| Proteins of unknown function | ycf |
Fig. 6Phylogenetic tree constructed using ML based on complete cp genomes. The number above the branches are bootstrap support values
Fig. 7Phylogenetic tree constructed using ML tree based on 20 ITS sequences. The numbers above the branches are bootstrap support values
Information of samples
| Samples | Collecting site locality | Geographical coordinates | Specimen voucher/batch no. | GenBank accession of cp genome |
|---|---|---|---|---|
| Medicinal Plant Garden of Southern Medical University | 23° 19′ 45″ N, 113° 34′ 37″ E | Chao Zhi EJ201403 | MT479167 | |
| South China Botanical Garden | 23° 19′ 23″ N, 113° 37′ 18″ E | Chao Zhi EJ201910 | MT473726 | |
| Wuhan Botanical Garden | 30° 54′ 49″ N, 114° 43′ 30″ E | Chao Zhi 201,812 | MT473722 | |
| Guangdong Tree Park | 23° 20′ 13″ N, 113° 38′ 05″ E | Chao Zhi ED201812 | MT473724 | |
| Chenhedong Nature Reserve, Guangdong | 23° 44′ 02″ N, 113° 50′ 64″ E | Chao Zhi EF201903 | MT473725 | |
| South China Botanical Garden | 23° 18′ 51″ N, 113° 36′ 77″ E | Chao Zhi DT201403 | MT473723 | |
| Medicinal Plant Garden of Southern Medical University | 23° 19′ 45″ N, 113° 34′ 37″ E | Chao Zhi MG201403 | MT473732 | |
| SP | prepared from | – | – | MT473731 |
| PPY-1 | Kangmei Pharmaceutical Co., Ltd, Guangdong | – | YC20181201 | MT473727 |
| PPY-2 | Dongfang Pharmacy, Guangzhou | – | YC20181202 | MT473728 |
| PPY-3 | Henglu Pharmacy, Guangzhou | – | YC20181203 | MT473730 |