| Literature DB >> 35795179 |
Changhong Li1, Yu Liu1,2, Furong Lin1, Yongqi Zheng1, Ping Huang1.
Abstract
Dalbergia spp. are numerous and widely distributed in pantropical areas in Asia, Africa and America, and most of the species have important economic and ecological value as precious timber. In this study, we determined and characterized six complete chloroplast genomes of Dalbergia species (Dalbergia obtusifolia, D. hupeana, D. mimosoides, D. sissoo, D. hancei, D. balansae), which displayed the typical quadripartite structure of angiosperms. The sizes of the genomes ranged from 155,698 bp (D. hancei) to 156,419 bp (D. obtusifolia). The complete chloroplast genomes of Dalbergia include 37 tRNA genes, eight rRNA genes and 84 protein-coding genes. We analysed the sequence diversity of Dalberigia chloroplast genomes coupled with previous reports. The results showed 12 noncoding regions (rps16-accD, trnR-UCU-trnG-UCC, ndhE-ndhG, trnG-UCC-psbZ, rps8-rpl14, trnP-UGG-psaJ, ndhH-rps15, trnQ-UUG-rps16, trnS-GCU-psbI, rps12-clpP, psbA-trnK-UUU, trnK-UUU-intron), and four coding regions (rps16, ycf1, rps15 and ndhF) showed many nucleotide variations that could be used as potential molecular markers. Based on a site-specific model, we analysed the selective pressure of chloroplast genes in Dalbergia species. Twenty-two genes with positively selected sites were detected, involving the photosynthetic system (ndhC, adhD, ndhF, petB, psaA, psaB, psbB, psbC, psbK and rbcL), self-replication category of genes (rpoA, rpoC2, rps3, rps12 and rps18) and others (accD, ccsA, cemA, clpP, matK, ycf1 and ycf2). Additionally, we identified potential RNA editing sites that were relatively conserved in the genus Dalbergia. Furthermore, the comparative analysis of cp genomes of Dalbergieae species indicated that the boundary of IRs/SSC was highly variable, which resulted in the size variation of cp genomes. Finally, phylogenetic analysis showed an inferred phylogenetic tree of Papilionoideae species with high bootstrap support and suggested that Amorpheae was the sister of the clade Dalbergieae. Moreover, three genera of the Pterocarpus clade showed a nested evolutionary relationship. These complete cp genomes provided valuable information for understanding the genetic variation and phylogenetic relationship of Dalbergia species with their relatives. ©2022 Li et al.Entities:
Keywords: Chloroplast genome; Dalbergia genus; Gene selective pressure; Phylogenetic relationship; Sequence diversity
Year: 2022 PMID: 35795179 PMCID: PMC9252178 DOI: 10.7717/peerj.13570
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 3.061
Figure 1Circular map of the cp genome for Dalbergia chloroplast genome.
The gray arrows show that genes inside the cycle are transcribed clockwise, and genes outside the circle are transcribed counterclockwise. The innermost shaded areas inside the inner circle correspond to the GC content in the cp genome. Genes in different functional groups are color coded. The boundaries of four regions (IRa, IRb, LSC, SSC) are noted in the inner circle.
Comparison of features of cp genomes of six Dalbergia species.
| Taxon | Accession | Genome size (bp) | GC | LSC | IRs | SSC | Forward repeat(s) | Complement repeat(s) | Reverse repeat(s) | Palindromic repeat(s) | Tandem repeat(s) | SSRs |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
| 156,419 | 36.00% | 85,724 | 25,699 | 19,297 | 32 | 19 | 15 | 40 | 15 | 158 |
|
|
| 155,858 | 36.20% | 85,330 | 25,680 | 19,168 | 19 | 0 | 6 | 32 | 10 | 156 |
|
|
| 155,773 | 36.20% | 85,425 | 25,708 | 18,932 | 17 | 3 | 3 | 26 | 6 | 151 |
|
|
| 155,701 | 36.10% | 85,387 | 25,664 | 18,986 | 27 | 4 | 2 | 29 | 12 | 176 |
|
|
| 155,698 | 36.20% | 85,421 | 25,690 | 18,897 | 18 | 1 | 0 | 25 | 8 | 149 |
|
|
| 155,868 | 36.20% | 85,270 | 25,672 | 19,254 | 19 | 0 | 8 | 31 | 7 | 153 |
List of genes encoded in the cp genomes of the genus Dalbergia.
| Function | Genes |
|---|---|
| RNA transfer | trnA-UGC |
| RNA ribosomal | rrn16, rrn23, rrn4.5, rrn5 |
| RNA polymerase | rpoC1 |
| Clpp, Matk | clpP |
| Ribosomal proteins (SSU) | rps2, rps3, rps4, rps7, rps8, rps11, rps12 |
| Ribosomal proteins (LSU) | rpl2 |
| Hypothetical chloroplast reading frames (ycf) | ycf1, ycf1 |
| ATP synthase | atpA, atpB, atpE, atpF |
| Photosystem I | psaA, psaB, psaC, psaI, psaJ |
| Photosystem II | psbA, psbB, psbC, psbD, psbE, psbF, psbH, psbI, psbJ, psbK, psbL, psbM, psbN, psbT, psbZ |
| RubisCO large subunit | rbcL |
| Cytochrome complex | petA, petB |
| NADH dehydrogenase | ndhA |
| Others | accD, cemA, ccsA, |
Notes.
Genes containing one intron.
genes containing two introns.
pseudogene.
Log-likelihood values of the site-specific models, with detected sites having non-synonymous/ synonymous (dN/dS) values >1.
| Gene | Models | Number of | lnL | Likelihood ratio | Number of positively | Positively selected site (s) |
|---|---|---|---|---|---|---|
| accD | M8 | 45 | −2720.664276 | 0.000000000 | 2 | 102 Q 0.978 |
| M7 | 43 | −2780.691566 | ||||
| ccsA | M8 | 45 | −516.193509 | 0.000000000 | 1 | 94 N 0.990 |
| M7 | 43 | −520.088006 | ||||
| cemA | M8 | 45 | −1157.782999 | 0.000000000 | 1 | 205 L 0.956 |
| M7 | 43 | −1219.150432 | ||||
| clpP | M8 | 45 | −889.260891 | 0.004273146 | 1 | 24 Y 0.997 |
| M7 | 43 | −894.716296 | ||||
| matK | M8 | 45 | −2376.612487 | 0.014212023 | 2 | 76 A 0.987 |
| M7 | 43 | −2380.866154 | ||||
| ndhC | M8 | 45 | −594.667603 | 0.000000000 | 2 | 23 L 0.980 |
| M7 | 43 | −651.981618 | ||||
| ndhD | M8 | 45 | −2693.372607 | 0.000000000 | 4 | 220 L 1.000 |
| M7 | 43 | −2750.835062 | ||||
| ndhF | M8 | 45 | −3489.422379 | 0.000283610 | 3 | 372 N 0.969 |
| M7 | 43 | −3497.590288 | ||||
| petB | M8 | 45 | −1002.104915 | 0.000000000 | 1 | 204 P 0.976 |
| M7 | 43 | −1060.388486 | ||||
| psaA | M8 | 45 | −3497.811392 | 0.000000000 | 1 | 321 E 0.999 |
| M7 | 43 | −3563.246297 | ||||
| psaB | M8 | 45 | −3515.015299 | 0.000000000 | 1 | 206 Y 0.999 |
| M7 | 43 | −3577.728010 | ||||
| psbB | M8 | 45 | −2306.906597 | 0.000000000 | 1 | 296 Q 0.992** |
| M7 | 43 | −2381.411208 | ||||
| psbC | M8 | 45 | −2252.030367 | 0.000000000 | 1 | 209 I 0.961 |
| M7 | 43 | −2293.950845 | ||||
| psbK | M8 | 45 | −277.920155 | 0.000735872 | 1 | 43 V 1.000 |
| M7 | 43 | −285.134609 | ||||
| rbcL | M8 | 45 | −2439.780528 | 0.000000000 | 4 | 23 T 0.991 |
| M7 | 43 | −2508.204270 | ||||
| rpoA | M8 | 45 | −1665.905867 | 0.021426153 | 2 | 140 T 0.982 |
| M7 | 43 | −1669.749010 | ||||
| rpoC2 | M8 | 45 | −7455.004187 | 0.000115194 | 2 | 515 L 0.999 |
| M7 | 43 | −7464.073079 | ||||
| rps3 | M8 | 45 | −1135.452527 | 0.000000000 | 1 | 66 K 0.999 |
| M7 | 43 | −1200.963047 | ||||
| rps12 | M8 | 45 | −536.439203 | 0.014760302 | 1 | 116 K 0.973 |
| M7 | 43 | −540.655017 | ||||
| rps18 | M8 | 45 | −390.409751 | 0.009407171 | 1 | 84 I 0.994 |
| M7 | 43 | −395.076034 | ||||
| ycf1 | M8 | 45 | −12407.532472 | 0.000000000 | 15 | 118 Q 0.980 |
| M7 | 43 | −12451.520712 | ||||
| ycf2 | M8 | 45 | −9803.132649 | 0.000000000 | 3 | 660 N 1.000 |
| M7 | 43 | −9824.817329 |
Notes.
p < 0.05.
p < 0.01.
Predicted RNA editing sites in the cp genomes of Dalbergia species.
| Taxon |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| total |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
| 1 | 1 | 1 | 2 | 1 | 2 | 3 | 9 | 1 |
| 2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | 3 | 35 |
|
| 1 | 1 |
| 2 | 1 | 2 | 3 | 10 | 2 | 2 | 3 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | 3 | 39 |
|
| 1 | 1 | 1 | 2 | 1 | 2 | 3 | 10 | 2 | 2 | 3 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | 2 | 39 |
|
| 1 | 1 | 1 | 2 | 1 | 2 | 3 | 9 | 1 |
| 2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | 2 | 34 |
|
| 1 | 1 | 2 | 2 | 1 | 2 | 3 | 9 | 2 | 2 | 3 | 1 | 1 | 1 |
| 1 | 1 | 1 | 2 | 2 | 38 |
|
| 1 | 1 |
| 2 | 1 | 2 | 3 | 9 | 2 | 1 | 3 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | 3 | 37 |
|
| 1 | 1 | 1 | 2 | 1 | 2 | 4 | 9 | 1 | 1 | 2 | 1 | 1 | 2 |
| 1 | 1 | 1 | 1 | 3 | 36 |
|
| 2 |
| 1 | 2 | 1 | 2 | 5 | 10 | 2 | 3 | 3 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | 3 | 43 |
|
| 1 | 1 | 1 | 2 | 1 | 2 | 3 | 9 | 2 | 3 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | 2 | 38 |
|
| 1 | 1 | 1 | 2 | 1 | 2 | 4 | 9 | 2 | 1 | 4 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | 3 | 40 |
|
| 1 | 1 | 1 | 2 | 1 | 2 | 4 | 9 | 2 | 1 | 3 | 1 | 1 | 1 |
| 1 | 1 | 1 | 1 | 3 | 37 |
|
| 1 | 1 | 1 | 1 | 1 | 2 | 4 | 9 | 2 |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | 32 |
|
| 1 | 1 | 1 | 1 | 1 | 2 | 4 | 9 |
| 2 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | 34 |
|
| 1 | 1 | 1 | 2 | 1 | 2 | 3 | 9 | 2 |
| 2 | 1 | 1 | 1 |
| 1 | 1 |
| 2 | 2 | 33 |
|
| 1 | 1 | 1 | 2 | 1 | 2 | 3 | 9 | 1 | 2 | 3 | 1 | 1 | 1 |
| 1 | 1 | 1 | 2 | 2 | 36 |
|
| 1 | 1 | 1 | 2 | 1 | 2 | 3 | 8 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | 2 | 34 |
|
| 1 | 1 | 1 | 1 | 1 | 2 | 4 | 9 | 1 | 2 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | 2 | 36 |
|
| 1 | 1 | 1 | 2 | 1 | 2 | 3 | 9 | 2 | 4 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | 2 | 39 |
|
| 1 | 1 | 1 | 2 | 1 | 2 | 3 | 9 | 1 | 2 | 2 | 1 | 1 | 1 | 2 | 1 | 1 | 1 | 2 | 2 | 37 |
|
| 1 | 1 | 1 | 2 | 1 | 2 | 3 | 9 | 1 | 2 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | 2 | 36 |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 4 | 9 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | 1 |
| 1 | 2 | 2 | 33 |
Figure 2Comparisons of sequence identity of cp genomes for 43 Dalbergieae species.
Vertical axis represents identity ranging from 50 to 100%. Each arrow indicates the annotated gene and its transcriptional direction. Genome regions are color coded as an exon, mRNA or tRNA, untranslated region (UTR) and conserved noncoding sequence (CNS).
Figure 3Percentages of variable sites in homologous regions across 21 Dalbergia species.
(A) The introns and spacers (IGS); and (B) protein coding sequences (CDS).
Figure 4Comparison of the boundaries between LSC, SSC, and two IR regions in cp genomes of Dalbergieae species.
The number of base pairs (bp) represents the distance from the boundary to the end of the gene.
Figure 5Phylogenetic tree reconstruction of 171 taxa based on 77 genes in the chloroplast genome sequences.
(A) Maximum likelihood (ML) method. (B) Maximum parsimony (MP) method.