| Literature DB >> 35711937 |
Conglong Xia1,2, Manjiong Wang1, Yunhui Guan2, Jian Li1,2.
Abstract
Aconitum is an important medicinal group of the Ranunculaceae family and has been used as conventional medicine in Bai, Yi, and other ethnic groups of China. There are about 350 Aconitum species globally and about 170 species in China. It is challenging to identify the species in morphology, and the lack of molecular biology information hinders the identification and rational utilization of the germplasm of this genus. Therefore, it is necessary to increase the molecular data of Aconitum species. This paper acquired the complete chloroplast (CP) genome sequence of ten medicinal plants of Aconitum species from Yunnan by Illumina paired-end (PE) sequencing technology and compared it with other species in the same family and genus. These CP genomes exhibited typical circular quadripartite structure, and their sizes ranged from 155,475 (A. stylosum) to 155,921 bp (A. vilmoinianum), including a large single-copy region (LSC), a small single-copy region (SSC), and two inverted repeat regions (IRs). Their gene content, order, and GC content (38.1%) were similar. Moreover, their number of genes ranged from 129 (A. vilmoinianum) to 132 (A. ramulosum), including 83 to 85 protein-coding genes (PCGs), 37 tRNA genes (tRNAs), eight rRNA genes (rRNAs), and two pseudogenes. In addition, we performed repeated sequence analysis, genomic structure, and comparative analysis using 42 Aconitum chloroplast genomes, including ten Aconitum chloroplast genomes and other sequenced Aconitum species. A total of 48-79 simple sequence repeats (SSRs) and 17 to 77 long repeat sequences were identified. IR regions showed higher variability than the SSC region and LSC region. Seven mutational hotspots were screened out, including trnK-UUU-trnQ-UGG, psbD, ndhJ-ndhK, clpP, psbH-petB, ycf1, and trnA-UGC-trnI-GAU, respectively. The phylogenetic trees of ten Aconitum species and other Aconitum species revealed that the complete CP genome was beneficial in determining the complex phylogenetic relationships among Aconitum species. This study provides a potential molecular marker and genomic resource for phylogeny and species identification of Aconitum species and an important reference and basis for Ranunculaceae species identification and phylogeny.Entities:
Keywords: Aconitum; chloroplast genome; comparative analysis; medicinal plants; phylogenomics
Year: 2022 PMID: 35711937 PMCID: PMC9194378 DOI: 10.3389/fgene.2022.878182
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.772
Information on the collection of ten Aconitum spp.
| Sample ID | Species | Collection dates | Collection locations | Altitude | Longitude and Latitude | Preservation method |
|---|---|---|---|---|---|---|
| WT001 |
| 2020.09 | Zhongdian county | 3315.91 | 27°90′38″ N 99°63′82″ E | Refrigerator (−20°C) |
| WT002 |
| 2020.09 | Deqing county | 4285.73 | 27°90′38″ N 99°63′82″ E | Refrigerator (−20°C) |
| WT003 |
| 2020.09 | Weixi county | 2752.50 | 26°85′99″ N 99°73′16″ E | Refrigerator (−20°C) |
| WT004 |
| 2020.09 | Lijiang county | 3633.98 | 27°04′77″ N 100°19′34″ E | Refrigerator (−20°C) |
| WT005 |
| 2020.09 | Heqing county | 3109.36 | 26°48′01″ N 100°06′02″ E | Refrigerator (−20°C) |
| WT006 |
| 2020.09 | Lushui county | 3130.40 | 25°51′26″ N 99°01′22″ E | Refrigerator (−20°C) |
| WT007 |
| 2020.09 | Bingchuan county | 2533.86 | 25°97′50″ N 100°68′26″ E | Refrigerator (−20°C) |
| WT008 |
| 2020.09 | Deqing county | 4285.73 | 28°33′97″ N 99°07′51″ E | Refrigerator (−20°C) |
| WT009 |
| 2020.09 | Heqing county | 3109.36 | 26°48′01″ N 100°06′02″ E | Refrigerator (−20°C) |
| WT0010 |
| 2020.09 | Heqing county | 3109.36 | 26°48′01″ N 100°06′02″ E | Refrigerator (−20°C) |
FIGURE 1Pictures of original plants of ten Aconitum spp.
Basic characteristics of complete chloroplast genomes in ten Aconitum plants.
| Species names |
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|---|
| Total length (bp) | 155,921 | 155,475 | 155,853 | 155,858 | 155,872 | 155,732 | 155,479 | 155,799 | 155,733 | 155,841 |
| LSC length (bp) | 86,524 | 86,098 | 86,443 | 86,449 | 86,493 | 86,313 | 86,318 | 86,420 | 86,362 | 86,470 |
| SSC length (bp) | 16,925 | 16,929 | 16,938 | 16,937 | 16,921 | 16,983 | 16,943 | 16,903 | 16,913 | 16,913 |
| IR length (bp) | 52,472 | 52,448 | 52,472 | 52,472 | 52,458 | 52,436 | 52,218 | 52,476 | 52,458 | 52,458 |
| Coding (bp) | 79,014 | 78,906 | 78,996 | 78,996 | 78,978 | 78,984 | 78,891 | 78,324 | 78,258 | 78,366 |
| Non-coding (bp) | 76,907 | 76,569 | 76,857 | 76,862 | 76,894 | 76,748 | 76,588 | 77,475 | 77,475 | 77,475 |
| Total number of genes | 129 | 129 | 129 | 129 | 129 | 129 | 129 | 132 | 132 | 132 |
| Total number of unique genes | 111 | 111 | 111 | 111 | 111 | 111 | 111 | 114 | 114 | 114 |
| protein-coding genes (duplicated) | 83 (7) | 83 (7) | 83 (7) | 83 (7) | 83 (7) | 83 (7) | 83 (7) | 85 (7) | 85 (7) | 85 (7) |
| tRNA gene (duplicated) | 37 (7) | 37 (7) | 37 (7) | 37 (7) | 37 (7) | 37 (7) | 37 (7) | 37 (7) | 37 (7) | 37 (7) |
| rRNA gene (duplicated) | 8 (4) | 8 (4) | 8 (4) | 8 (4) | 8 (4) | 8 (4) | 8 (4) | 8 (4) | 8 (4) | 8 (4) |
| Pseudogenes | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | 2 | 2 |
| GC content (%) | 38.1 | 38.1 | 38.1 | 38.1 | 38.1 | 38.1 | 38.1 | 38.1 | 38.1 | 38.1 |
| GC content of LSC (%) | 36.2 | 36.2 | 36.2 | 36.2 | 36.2 | 36.2 | 36.2 | 36.2 | 36.2 | 36.2 |
| GC content of IR (%) | 43.0 | 43.0 | 43.0 | 43.0 | 43.0 | 43.0 | 43.0 | 43.0 | 43.0 | 43.0 |
| GC content of SSC (%) | 32.6 | 32.6 | 32.6 | 32.6 | 32.6 | 32.6 | 32.6 | 32.6 | 32.6 | 32.6 |
| A (bp) | 47,737 | 47,598 | 47,630 | 47,712 | 47,728 | 47,674 | 47,585 | 47,840 | 47,827 | 47,839 |
| C (bp) | 29,836 | 29,781 | 30,123 | 29,833 | 29,824 | 29,833 | 29,803 | 30,197 | 30,170 | 30,212 |
| G (bp) | 29,517 | 29,466 | 29,058 | 29,515 | 29,524 | 29,513 | 29,481 | 29,141 | 29,138 | 29,144 |
| T (bp) | 48,825 | 48,624 | 48,391 | 48,798 | 48,796 | 48,712 | 48,610 | 48,621 | 48,598 | 48,646 |
FIGURE 2Gene map of the complete chloroplast genomes of Aconitum species. Genes inside the circle are transcribed clockwise, and those on the outside are transcribed counter-clockwise. Genes belonging to different functional groups have been colour-coded. The darker grey area in the inner circle corresponds to GC content, whereas the lighter grey corresponds to AT content.
Base composition of coding regions.
| Protein coding | Frist position codon | Second position codon | Third position codon | |||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| T(U)% | C% | A% | G% | Total | T(U)% | C% | A% | G% | Total | T(U)% | C% | A% | G% | Total | T(U)% | C% | A% | G% | Total | |
|
| 31.1 | 17.8 | 30.5 | 20.5 | 79,014 | 23.5 | 18.9 | 30.6 | 27.0 | 26,338 | 32.2 | 20.5 | 29.4 | 18.0 | 26,338 | 37.7 | 14.2 | 31.6 | 16.5 | 26,338 |
|
| 31.2 | 17.9 | 30.5 | 20.5 | 78,906 | 23.5 | 18.9 | 30.6 | 27.0 | 26,302 | 32.3 | 20.5 | 29.3 | 18.0 | 26,302 | 37.7 | 14.2 | 31.6 | 16.5 | 26,302 |
|
| 31.1 | 17.9 | 30.5 | 20.5 | 78,456 | 23.5 | 18.9 | 30.6 | 27.1 | 26,152 | 32.2 | 20.5 | 29.3 | 18.0 | 26,152 | 37.7 | 14.2 | 31.6 | 16.5 | 26,152 |
|
| 31.1 | 17.8 | 30.5 | 20.5 | 78,996 | 23.5 | 18.9 | 30.6 | 27.0 | 26,332 | 32.2 | 20.5 | 29.3 | 18.0 | 26,332 | 37.7 | 14.2 | 31.6 | 16.5 | 26,332 |
|
| 31.1 | 17.8 | 30.5 | 20.5 | 78,996 | 23.5 | 18.9 | 30.6 | 27.0 | 26,332 | 32.2 | 20.5 | 29.3 | 18.0 | 26,332 | 37.7 | 14.2 | 31.6 | 16.5 | 26,332 |
|
| 31.1 | 17.8 | 30.5 | 20.5 | 78,996 | 23.5 | 18.9 | 30.6 | 27.0 | 26,332 | 32.2 | 20.5 | 29.3 | 18.0 | 26,332 | 37.7 | 14.2 | 31.6 | 16.5 | 26,332 |
|
| 30.6 | 18.4 | 30.0 | 21.0 | 63,372 | 22.8 | 19.5 | 29.9 | 27.8 | 21,124 | 31.5 | 21.0 | 29.1 | 18.4 | 21,124 | 37.5 | 14.7 | 31.0 | 16.8 | 21,124 |
|
| 30.5 | 18.3 | 30.1 | 21.0 | 63,543 | 22.7 | 19.4 | 30.0 | 27.9 | 21,181 | 31.4 | 20.9 | 29.2 | 18.4 | 21,181 | 37.5 | 14.7 | 31.1 | 16.8 | 21,181 |
|
| 31.1 | 17.8 | 30.5 | 20.5 | 79,014 | 23.5 | 18.9 | 30.6 | 27.0 | 26,338 | 32.2 | 20.5 | 29.4 | 18.0 | 26,338 | 37.7 | 14.2 | 31.6 | 16.5 | 26,338 |
|
| 31.2 | 17.8 | 30.5 | 20.5 | 78,978 | 23.5 | 18.9 | 30.6 | 27.0 | 26,326 | 32.2 | 20.5 | 29.3 | 18.0 | 26,326 | 37.7 | 14.2 | 31.6 | 16.5 | 26,326 |
List of genes in the chloroplast genome of ten Aconitum species.
| Category | Group genes | Name of genes |
|---|---|---|
| Transcription and translation | Large subunit of ribosome (LSU) |
|
| Small subunit of ribosome (SSU) |
| |
| RNA polymerase |
| |
| Translational initiation factor |
| |
| rRNA genes |
| |
| tRNA genes |
| |
| Photosynthesis | Photosystem I |
|
| Photosystem II |
| |
| NADH oxidoreductase |
| |
| Cytochrome b6/f complex |
| |
| ATP synthase |
| |
| RubiscoCO large subunit |
| |
| ATP-dependent protease subunit gene |
| |
| Other genes | Maturase |
|
| Envelop membrane protein |
| |
| Subunit Acetyl- CoA-Carboxylate |
| |
| c-type cytochrome synthesis gene |
| |
| Unknown | Conserved Open reading frames |
|
“×2” indicated duplication of the gene in the IR, region; “*” indicated that the gene contains an intron; “**” indicated that the gene contains two introns; “ψ” indicated the gene as a pseudogene.
FIGURE 3Frequency of amino acids encoded in the coding region by codons of chloroplast genes from ten Aconitum species.
FIGURE 4Analysis of codon bias in the chloroplast genome of ten Aconitum species. (A) Analysis of terminal bases at codons in chloroplast genomes; (B) RSCU value analysis of codons in chloroplast genomes.
FIGURE 5Analysis of simple sequence repeat (SSR) in the chloroplast genomes of 42 Aconitum species.
FIGURE 6Distribution types and numbers of SSRs in LSC, SSC, IR regions in the chloroplast genomes of 42 Aconitum spp.
FIGURE 7Type and number of long repeat sequences distributed in the chloroplast genomes of 42 Aconitum species.
FIGURE 8Comparison of LSC, SSC, and IR region borders among CP genomes of 42 Aconitum species.
FIGURE 9Global chloroplast genome alignments for 42 Aconitum species using the mVISTA program, with A. vilmorinianum as the reference. Y-axis shows the range of sequence identity (50–100%).
FIGURE 10Sliding window analyses of the whole plastomes of Aconitum species. X-axis: position of the midpoint of a window, Y-axis: nucleotide diversity (Pi) of each window. (A): Sliding window analysis of ten Aconitum species sequenced in this study. (B): Sliding window analyses the whole plastomes of 42 Aconitum species (comprising ten species for this study).
FIGURE 11Phylogenetic relationship of 42 Aconitum species inferred from Maximum Likelihood (ML) based on two data sets. (A) ML tree constructed based on complete chloroplast genome. (B) ML tree constructed based on protein-coding genes (PCGs).