Soo-Rang Lee1, Kyeonghee Kim2, Byoung-Yoon Lee2, Chae Eun Lim3.
Abstract
BACKGROUND: The genus Hosta is a group of economically appreciated perennial herbs consisting of approximately 25 species that is endemic to eastern Asia. Due to considerable morphological variability, the genus has been well recognized as a group with taxonomic problems. The chloroplast is a cytoplasmic organelle with its own genome, which is the genome most commonly used for phylogenetic and genetic diversity analyses of land plants. To understand the genomic architecture of Hosta chloroplasts and examine the level of nucleotide and size variation, we newly sequenced four species (H. clausa, H. jonesii, H. minor, and H. venusta) and analyzed six Hosta species (these four plus H. capitata and H. yingeri) distributed throughout South Korea.
Keywords: Chloroplast genome; Codon usage; Hosta; Phylogeny; Repeats; Sequence divergence
Year: 2019 PMID: 31706273 PMCID: PMC6842461 DOI: 10.1186/s12864-019-6215-y
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Sample information and summary of chloroplast genome characteristics for four Hosta species in Korea. The species acronyms are as follows: CLA, H. clausa; MIN, H. minor; VEN, H. venusta; JON, H. jonesii
| Category | CLA | MIN | VEN | JON |
|---|---|---|---|---|
| Collection site | Mt. Daeam, Gangwon-do | Mt. Gaejwa, Busan-si | Seoguipo, Jeju-do | Yanga-ri, Gyeongsangnam-do |
| Voucher No. | NIBR-VP 000063279 | NIBR-VP 0000538762 | NIBR-VP 0000632798 | NIBR-VP 0000538843 |
| NCBI accession No. | MK732315 | MK732316 | MK732314 | MK732318 |
| Reads after trimming | 6,690,938 | 12,171,518 | 5,497,667 | 10,194,165 |
| Mean coverage | 258.7 | 246.7 | 278.1 | 166.9 |
| Total length (bp) | 156,624 | 156,671 | 156,676 | 156,708 |
| LSC length (bp) | 85,004 | 85,094 | 85,099 | 85,088 |
| SSC length (bp) | 18,228 | 18,225 | 18,225 | 18,244 |
| IRa length (bp) | 26,696 | 26,676 | 26,676 | 26,698 |
| IRb length (bp) | 26,696 | 26,676 | 26,676 | 26,698 |
| Total GC content (%) | 37.81 | 37.80 | 37.80 | 37.80 |
| Total number of genes | 132 | 132 | 132 | 131 |
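
Because a chloroplast genome is made up of the LSC, SSC and two IR regions, the region lengths in the table above can be cross-checked against the reported total length. The following is a minimal sketch of that arithmetic; the dictionary layout and function names are illustrative assumptions, and the numbers are copied from the table as printed here.

```python
# Region lengths (bp) copied from the table above; the dictionary layout
# and function names are illustrative, not part of the original study.
REGIONS = {
    "CLA": {"total": 156624, "LSC": 85004, "SSC": 18228, "IRa": 26696, "IRb": 26696},
    "MIN": {"total": 156671, "LSC": 85094, "SSC": 18225, "IRa": 26676, "IRb": 26676},
    "VEN": {"total": 156676, "LSC": 85099, "SSC": 18225, "IRa": 26676, "IRb": 26676},
    "JON": {"total": 156708, "LSC": 85088, "SSC": 18244, "IRa": 26698, "IRb": 26698},
}


def region_sum(entry):
    """Sum of the four regions of the quadripartite genome."""
    return entry["LSC"] + entry["SSC"] + entry["IRa"] + entry["IRb"]


def check_lengths(table):
    """Return, per species, (sum of regions) minus (reported total length)."""
    return {sp: region_sum(entry) - entry["total"] for sp, entry in table.items()}


if __name__ == "__main__":
    for species, diff in check_lengths(REGIONS).items():
        status = "consistent" if diff == 0 else f"differs by {diff} bp"
        print(f"{species}: {status}")
```

With the values as listed, CLA, MIN and VEN sum exactly to their totals, while the JON regions sum to 156,728 bp against a listed total of 156,708 bp. This suggests that one of the JON values may have been altered in transcription; the table alone does not show which.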
Fig. 1 Chloroplast map of six Hosta species in Korea. The colored boxes represent conserved chloroplast genes. Genes shown inside the circle are transcribed clockwise, whereas genes outside the circle are transcribed counter-clockwise. The small grey bar graphs in the inner circle show the GC content
List of genes within chloroplast genomes of six Hosta species in Korea. ×2 refers to genes duplicated in the IR regions
| Category | Group of genes | Names of genes |
|---|---|---|
| Transcription & Translation | Ribosomal protein, LSU | |
| | Ribosomal protein, SSU | |
| | RNA polymerase | |
| | Ribosomal RNAs | |
| | Transfer RNAs | |
| Photosynthesis | Photosystem I | |
| | Photosystem II | |
| | NADH dehydrogenase | |
| | Cytochrome b6/f complex | |
| | ATP synthase | |
| | Rubisco large subunit | |
| | ATP-dependent protease subunit P | |
| Other genes | Chloroplast envelope membrane protein | |
| | Maturase | |
| | c-type | |
| | Subunit Acetyl-CoA-Carboxylate | |
| | Photosystem I assembly & stability | |
| | Conserved ORFs | |
| | Pseudogenes | |
Abbreviations: LSU rRNA Large subunit ribosomal ribonucleic acid, SSU rRNA Small subunit ribosomal ribonucleic acid
Genes marked with asterisks contain a single (a) or two (b) introns. ψ Pseudogenes are present in the species indicated in parentheses. See Table 1 legend for the species acronyms
Fig. 2 Plots of percent sequence identity of the chloroplast genomes of six Korean Hosta species with H. ventricosa (NCBI accession number: NC_032706.1) as a reference. The percentages of sequence identity were estimated and the plots were visualized in mVISTA
Fig. 3 Plot of sliding window analysis on the whole chloroplast genome for nucleotide diversity (pi) compared among six Hosta species in Korea. The dashed lines are the borders of LSC, SSC and IR regions
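
To illustrate what a sliding window analysis of nucleotide diversity computes, the sketch below slides a fixed-size window along a set of aligned sequences and reports pi, the mean number of pairwise differences per site, for each window. It is only an illustration: the function names, the window and step sizes, and the treatment of gaps as an ordinary character are assumptions, not the settings or software used in the study.

```python
from itertools import combinations


def nucleotide_diversity(seqs):
    """Mean pairwise differences per site among equal-length aligned sequences."""
    length = len(seqs[0])
    pairs = list(combinations(seqs, 2))
    if not pairs or length == 0:
        return 0.0
    # Count mismatching columns for every pair of sequences.
    # Simplification: a gap character is compared like any other symbol.
    total_diffs = sum(
        sum(1 for x, y in zip(a, b) if x != y) for a, b in pairs
    )
    return total_diffs / (len(pairs) * length)


def sliding_pi(seqs, window, step):
    """Return (window midpoint, pi) for each window along the alignment."""
    length = len(seqs[0])
    results = []
    for start in range(0, length - window + 1, step):
        chunk = [s[start:start + window] for s in seqs]
        results.append((start + window // 2, nucleotide_diversity(chunk)))
    return results


if __name__ == "__main__":
    # Toy alignment; real input would be the aligned chloroplast genomes.
    alignment = ["AAAA", "AAAT", "AATT"]
    for midpoint, pi in sliding_pi(alignment, window=2, step=2):
        print(midpoint, round(pi, 4))
```

Windows with high pi mark candidate variable regions, while windows near zero correspond to conserved stretches of the alignment.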
Fig. 4 Comparisons of LSC, SSC and IR region boundaries among the chloroplast genomes of six Korean Hosta species
Distribution of simple sequence repeats (SSRs) in six Hosta species in Korea. Values are the number of SSRs, with the number of polymorphic SSRs in parentheses. c denotes compound SSRs, which comprise more than two SSRs adjacent to each other. An SSR was counted as polymorphic when it was polymorphic in at least one species
| Species | Unit size 1 | Unit size 2 | Unit size 3 | Unit size 4 | Unit size 5 | Unit size 6 | c | Total |
|---|---|---|---|---|---|---|---|---|
| | 34 (11) | 10 (1) | 0 | 4 | 1 | 0 | 2 (1) | 51 (12) |
| | 36 (14) | 12 (4) | 0 | 4 | 1 (1) | 0 | 4 (3) | 57 (22) |
| | 35 (14) | 10 (2) | 0 | 5 (1) | 1 (1) | 0 | 2 (1) | 53 (19) |
| | 36 (15) | 10 (2) | 0 | 5 (1) | 1 (1) | 0 | 1 | 53 (19) |
| | 40 (19) | 10 (2) | 0 | 4 | 2 (2) | 0 | 3 (2) | 59 (25) |
| | 39 (18) | 10 (2) | 0 | 4 | 2 (2) | 0 | 2 (1) | 57 (23) |
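
As a rough illustration of how SSRs with unit sizes 1 to 6 can be located in a sequence, the sketch below scans for tandem repeats of each unit size. The minimum repeat counts in `MIN_REPEATS` are hypothetical thresholds chosen for the example, not the criteria used in the study, and the sketch does not merge neighbouring repeats into compound SSRs.

```python
# Hypothetical minimum repeat counts per unit size (assumption, not from the source).
MIN_REPEATS = {1: 10, 2: 5, 3: 4, 4: 3, 5: 3, 6: 3}


def is_primitive(unit):
    """True if the motif is not itself a repeat of a shorter motif (e.g. 'ATAT')."""
    n = len(unit)
    for k in range(1, n):
        if n % k == 0 and unit[:k] * (n // k) == unit:
            return False
    return True


def find_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return sorted (start, unit, repeat_count) tuples for simple tandem repeats."""
    seq = seq.upper()
    hits = []
    for size, min_rep in sorted(min_repeats.items()):
        i = 0
        while i + size <= len(seq):
            unit = seq[i:i + size]
            # Skip ambiguous bases and motifs such as 'AA' that belong to a smaller unit size.
            if not set(unit) <= set("ACGT") or not is_primitive(unit):
                i += 1
                continue
            # Count how many times the unit repeats back to back from position i.
            reps = 1
            while seq[i + reps * size:i + (reps + 1) * size] == unit:
                reps += 1
            if reps >= min_rep:
                hits.append((i, unit, reps))
                i += reps * size  # continue after the repeat tract
            else:
                i += 1
    return sorted(hits)


if __name__ == "__main__":
    example = "G" + "A" * 10 + "C" + "AT" * 5 + "G"
    for start, unit, reps in find_ssrs(example):
        print(start, unit, reps)
```

Running the same scan on each species and comparing repeat counts at homologous positions is one way the per-species totals and polymorphic counts in a table like the one above could be tallied.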
Fig. 5 Phylogenetic relationships among the six Hosta species and 15 species in Agavoideae (Asparagaceae) inferred using neighbor-joining (NJ) and maximum likelihood (ML) methods based on the whole chloroplast genomes. The values presented on each node are the bootstrap supports