| Literature DB >> 33237925 |
Jie Li1, Guang-Ying Ye1, Hai-Lin Liu1, Zai-Hua Wang1.
Abstract
Abelmoschus is an economically and phylogenetically valuable genus in the family Malvaceae. Owing to coexistence of wild and cultivated form and interspecific hybridization, this genus is controversial in systematics and taxonomy and requires detailed investigation. Here, we present whole chloroplast genome sequences and annotation of three important species: A. moschatus, A. manihot and A. sagittifolius, and compared with A. esculentus published previously. These chloroplast genome sequences ranged from 163121 bp to 163453 bp in length and contained 132 genes with 87 protein-coding genes, 37 transfer RNA and 8 ribosomal RNA genes. Comparative analyses revealed that amino acid frequency and codon usage had similarity among four species, while the number of repeat sequences in A. esculentus were much lower than other three species. Six categories of simple sequence repeats (SSRs) were detected, but A. moschatus and A. manihot did not contain hexanucleotide SSRs. Single nucleotide polymorphisms (SNPs) of A/T, T/A and C/T were the largest number type, and the ratio of transition to transversion was from 0.37 to 0.55. Abelmoschus species showed relatively independent inverted-repeats (IR) boundary traits with different boundary genes compared with the other related Malvaceae species. The intergenic spacer regions had more polymorphic than protein-coding regions and intronic regions, and thirty mutational hotpots (≥200 bp) were identified in Abelmoschus, such as start-psbA, atpB-rbcL, petD-exon2-rpoA, clpP-intron1 and clpP-exon2.These mutational hotpots could be used as polymorphic markers to resolve taxonomic discrepancies and biogeographical origin in genus Abelmoschus. Moreover, phylogenetic analysis of 33 Malvaceae species indicated that they were well divided into six subfamilies, and genus Abelmoschus was a well-supported clade within genus Hibiscus.Entities:
Mesh:
Substances:
Year: 2020 PMID: 33237925 PMCID: PMC7688171 DOI: 10.1371/journal.pone.0242591
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Chloroplast genome map of three Abelmoschus species.
Genes shown outside the circle are transcribed clockwise and those inside counterclockwise. Genes belonging to different functional groups are color-coded.
Summary statistics for the chloroplast genomes of Abelmoschus species.
| Genome features | ||||
|---|---|---|---|---|
| Genome size (bp) | 163430 | 163428 | 163453 | 163121 |
| LSC size (bp) | 88243 | 88194 | 88314 | 88071 |
| SSC size (bp) | 18931 | 18934 | 18815 | 19032 |
| IR size (bp) | 28128 | 28150 | 28162 | 28009 |
| Number of genes | 132(112) | 132(112) | 132(112) | 131(111) |
| Protein genes [unique] | 87(78) | 87(78) | 87(78) | 87(78) |
| tRNA genes [unique] | 37(30) | 37(30) | 37(30) | 36(29) |
| rRNA genes [unique] | 8(4) | 8(4) | 8(4) | 8(4) |
| Duplicated genes in IR | 20 | 20 | 20 | 20 |
| GC content (%) | 36.71 | 36.70 | 36.69 | 36.74 |
| GC content in LSC (%) | 34.47 | 34.48 | 34.45 | 34.55 |
| GC content in SSC (%) | 31.55 | 31.55 | 31.59 | 31.48 |
| GC content in IR (%) | 41.95 | 41.93 | 41.90 | 41.97 |
a Data was from the A. esculentus chloroplast genome (NC_035234.1), and the number of genes should add one because gene trnH-GUG was not annotated.
List of annotated genes in the chloroplast genomes of A. moschatus, A. manihot, A. sagittifolius and A. esculentus.
| Category | Gene group | Gene name |
|---|---|---|
| Photosynthesis | Subunits of photosystem I (5) | |
| Subunits of photosystem II (15) | ||
| Subunits of NADH dehydrogenase (12) | ||
| Subunits of cytochrome b/f complex (6) | ||
| Subunits of ATP synthase (6) | ||
| Large subunit of rubisco (1) | ||
| Self-replication | Proteins of large ribosomal subunit (12) | |
| Proteins of small ribosomal subunit (16) | ||
| Subunits of RNA polymerase (4) | ||
| Ribosomal RNAs (8) | ||
| Transfer RNAs (37) | ||
| Other genes | Maturase (1) | |
| Protease (1) | ||
| Envelope membrane protein (1) | ||
| Acetyl-CoA carboxylase (1) | ||
| c-type cytochrome synthesis gene (1) | ||
| unknown function | Conserved hypothetical chloroplast ORF (5) |
Gene*: Gene with one intron; Gene**: Gene with two introns; Gene (×2): Number of copies of multi-copy gene.
Information on 18 intron-containing genes in the chloroplast genomes of Abelmoschus species.
| Gene | Location | Exon I (bp) | Intron I (bp) | Exon II (bp) | Intron II (bp) a | Exon III (bp) |
|---|---|---|---|---|---|---|
| LSC | 37 | 2571/2563/2573/2576 | 35 | |||
| LSC | 40 | 862/865/870/ 856 | 227 | |||
| LSC | 23 | 803/809/804/ 811 | 48 | |||
| LSC | 145 | 815/815/816/814 | 410 | |||
| LSC | 432 | 773/774/781/780 | 1626 | |||
| LSC | 124 | 791/790/791/790 | 230 | 809/804/813/ 834 | 153 | |
| LSC | 35 | 557/558/559/556 | 50 | |||
| LSC | 38 | 590/590/590/608 | 35 | |||
| LSC/IRb | 114 | - | 232 | 536/536/536/536 | 26 | |
| LSC | 71 | 676 / 679/ 677/676 | 292 | 943/942/943/948 | 228 | |
| LSC | 6 | 812/812/812/821 | 642 | |||
| LSC | 8 | 757/757/757/757 | 475 | |||
| IR | 9 | 1148/1147/1147/ 1142 | 399 | |||
| IR | 391 | 698/698/698/696 | 434 | |||
| IR | 777 | 683/683/683/683 | 756 | |||
| IR | 37 | 957/957/957/957 | 35 | |||
| IR | 38 | 794/794/794/794 | 35 | |||
| SSC | 553 | 1119/1120/1119/1119 | 539 |
a The fragment size of intron is in the order of A. moschatus / A. manihot / A. sagittifolius / A.esculentus.
b The rps12 gene is divided into 5'-rps12 in the LSC region and 3'-rps12 in the IR region.
Fig 2Comparison of SSRs and repeat sequences among four Abelmoschus species.
(a) Numbers of different types of SSRs; Mono-: mononucleotide, Di-: dinucleotide, Tri-: trinucleotide, Tetra-: tetranucleotide, Penta-: pentanucleotide, Hexa-: hexanucleotide; (b) Location of SSRs in different chloroplast genome regions. LSC: large single copy, SSC: small single copy, IR: inverted-repeat region. IGS: Intergenic spacer regions, CDS: coding DNA sequences, Intron: intronic regions; (c) Different types of repeat sequences. Total: total numbers of all repeats. F: forward repeats, P: palindromic repeats, R: reverse repeats, C: complementary repeats; (d) Number of repeats present in different locations of chloroplast genomes. LSC/IR: one copy of repeat present in LSC and another in IR, LSC/SSC: one copy of repeat present in LSC and another in SSC, SSC/IR: one copy present in SSC and another in IR; (e) Number of repeats in different size. For example, 30–34 represent the numbers of repeats with the size from 30 to 34 bp; (f) Number of repeats in different regions of chloroplast genomes. IGS/Intron: one copy of repeat present in intergenic spacer regions and another in intronic regions. IGS/CDS: one copy of repeat present in intergenic spacer regions and another in coding regions.
Fig 3Comparison of SNPs and Indels in four Abelmoschus species.
A. moschatus was used as reference for SNPs and Indels detection. (a) The number of different types of SNPs. (b) The number of SNPs in LSC, IR and SSC regions. (c) The number of Indels in LSC, IR and SSC regions. (d) Total length of Indels in LSC, IR and SSC regions. (e) Average length of Indels in LSC, IR and SSC regions. LSC: large single copy, SSC: small single copy, IR: inverted-repeat region.
Fig 4Nucleotide diversity (π) of different regions among A. moschatus, A. manihot, A. sagittifolius and A. esculentus.
Regions with 0 nucleotide diversity were ignored. The x-axis represents chloroplast genome regions, and the y-axis represents nucleotide diversity.
Mutational hotspots in four Abelmoschus species.
| Region | Genetic divergence | Total Number of mutations | Region length |
|---|---|---|---|
| 0.0165 | 23 | 642 | |
| 0.0111 | 30 | 1154 | |
| 0.0097 | 6 | 266 | |
| 0.0089 | 14 | 671 | |
| 0.0078 | 5 | 292 | |
| 0.0063 | 11 | 743 | |
| 0.0063 | 33 | 2343 | |
| 0.0058 | 11 | 862 | |
| 0.0056 | 3 | 228 | |
| 0.0045 | 5 | 475 | |
| 0.0044 | 24 | 2380 | |
| 0.0040 | 16 | 1907 | |
| 0.0037 | 4 | 464 | |
| 0.0035 | 13 | 1653 | |
| 0.0032 | 3 | 396 | |
| 0.0032 | 2 | 267 | |
| 0.0032 | 6 | 802 | |
| 0.0031 | 4 | 597 | |
| 0.0028 | 6 | 1155 | |
| 0.0027 | 4 | 631 | |
| 0.0026 | 7 | 1267 | |
| 0.0026 | 7 | 1169 | |
| 0.0025 | 9 | 1515 | |
| 0.0025 | 6 | 1132 | |
| 0.0023 | 12 | 2196 | |
| 0.0023 | 5 | 948 | |
| 0.0022 | 5 | 1023 | |
| 0.0021 | 2 | 410 | |
| 0.0021 | 1 | 207 | |
| 0.0021 | 5 | 1043 |
Fig 5Comparative analysis of boundary regions: IR, SSC and LSC among four Abelmoschus species and three related species in Malvaceae.
Fig 6Co-linear analysis of seven Malvaceae chloroplast genomes.
The Abelmoschus moschatus genome is shown at top as the reference. Within each of the alignment, local collinear blocks are represented by blocks of the same color connected by lines.
Fig 7Maximum likelihood phylogenetics tree of 33 species in family Malvaceae based on chloroplast genomes (Indels removed).