| Literature DB >> 36140690 |
Jialong Pei1, Yong Wang2, Juan Zhuo1, Huibin Gao2, Naresh Vasupalli1, Dan Hou1, Xinchun Lin1.
Abstract
Dendrocalamus farinosus is one of the essential bamboo species mainly used for food and timber in the southwestern region of China. In this study, the complete chloroplast (cp) genome of D. farinosus is sequenced, assembled, and the phylogenetic relationship analyzed. The cp genome has a circular and quadripartite structure, has a total length of 139,499 bp and contains 132 genes: 89 protein-coding genes, eight rRNAs and 35 tRNAs. The repeat analyses showed that three types of repeats (palindromic, forward and reverse) are present in the genome. A total of 51 simple sequence repeats are identified in the cp genome. The comparative analysis between different species belonging to Dendrocalamus revealed that although the cp genomes are conserved, many differences exist between the genomes. The analysis shows that the non-coding regions were more divergent than the coding regions, and the inverted repeat regions are more conserved than the single-copy regions. Moreover, these results also indicate that rpoC2 may be used to distinguish between different bamboo species. Phylogenetic analysis results supported that D. farinosus was closely related to D. latiflorus. Furthermore, these bamboo species' geographical distribution and rhizome types indicate two evolutionary pathways: one is from the tropics to the alpine zone, and the other is from the tropics to the warm temperate zone. Our study will be helpful in the determination of the cp genome sequences of D. farinosus, and provides new molecular data to understand the Bambusoideae evolution.Entities:
Keywords: Bambusoideae; Dendrocalamus farinosus; chloroplast genome; phylogenomics
Mesh:
Year: 2022 PMID: 36140690 PMCID: PMC9498922 DOI: 10.3390/genes13091519
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.141
Figure 1The circle gene map of the D. farinosus cp genome. Genes on the outside and inside the map are transcribed clockwise and counterclockwise, respectively. Genes belonging to different functional groups are color-coded. The darker grey in the inner circle corresponds to GC content. The SSC region, LSC region, and inverted repeats (IRa and IRb) are indicated.
Genes present in the cp genome of D. farinosus.
| Category | Group of Genes | Name of Genes |
|---|---|---|
| Genes for photosynthesis | Subunits of ATP synthase |
|
| Subunits of NADH-dehydrogenase | ||
| Subunits of cytochrome b/f complex | ||
| Subunits of photosystem I |
| |
| Subunits of photosystem II |
| |
| Subunit of rubisco |
| |
| photosystem assembly/stability factors | ||
| Self replication | Large subunit of ribosome | |
| Small subunit of ribosome | ||
| DNA dependent RNA polymerase |
| |
| tRNA Genes | ||
| rRNA Genes | ||
| Other genes | c-type cytochrom synthesis gene |
|
| Envelop membrane protein |
| |
| Protease |
| |
| Translational initiation factor |
| |
| Maturase |
| |
| Component of TIC complex | ||
| Subunit acetyl-coA carboxylase |
|
‘+’—Gene with one intron, ‘++’—gene with two introns and ‘a’—gene with copies.
The introns in the genes of the D. farinosus cp genome.
| S. No | Gene | Location | Exon I (bp) | Intron I (bp) | Exon II (bp) | Intron II (bp) | Exon III (bp) |
|---|---|---|---|---|---|---|---|
| 1 |
| LSC | 36 | 846 | 222 | ||
| 2 |
| LSC | 158 | 837 | 409 | ||
| 3 |
| LSC | 132 | 732 | 226 | 724 | 161 |
| 4 |
| IR | 405 | 660 | 432 | ||
| 5 |
| IR | 777 | 712 | 756 | ||
| 6 |
| IR | 232 | 537 | 36 | ||
| 7 |
| SSC | 548 | 1026 | 541 | ||
| 8 |
| LSC | 38 | 2505 | 34 | ||
| 9 |
| LSC | 23 | 673 | 48 | ||
| 10 |
| LSC | 35 | 539 | 50 | ||
| 11 |
| LSC | 39 | 596 | 37 | ||
| 12 |
| IR | 42 | 946 | 35 | ||
| 13 |
| IR | 38 | 811 | 35 |
Figure 2Codon content of 20 amino acids in all protein-coding genes of the D. farinosus cp genome.
Distribution of SSRs in the D. farinosus cp genome.
| SSR Type | Unit | Length | Number | Position on Genome (bp) |
|---|---|---|---|---|
| P1 | A | 10 | 4 | 7647–7656, 12,227–12,236, 31,304–31,313, 39,008–39,017 |
| A | 11 | 3 | 49,902–49,912, 107,247–107,257, 139,481–139,491 | |
| A | 12 | 2 | 45,950–45,961, 46,921–46,932 | |
| A | 13 | 1 | 110,286–110,298 | |
| T | 10 | 4 | 38,212–38,221, 43,628–43,637, 67,161–67,170, 82,514–82,523 | |
| T | 11 | 2 | 34,229–34,239, 79,557–79,567 | |
| T | 12 | 1 | 51,765–51,776 | |
| P2 | AT | 10 | 1 | 27,587–27,596 |
| TA | 10 | 2 | 87,344–87,353, 135,177–135,186 | |
| TC | 10 | 1 | 117,486–117,495 | |
| p3 | AAT | 12 | 1 | 26,292–26,303 |
| CAG | 12 | 1 | 684–695 | |
| TTC | 12 | 1 | 67,459–67,470 | |
| P4 | AAAG | 12 | 1 | 18,988–18,999 |
| AACG | 12 | 1 | 101,658–101,669 | |
| AATA | 12 | 1 | 110,503–110,514 | |
| ATAC | 12 | 1 | 17,899–17,910 | |
| GAAA | 12 | 1 | 65,129–65,140 | |
| GTAG | 16 | 1 | 53,971–53,986 | |
| TATT | 12 | 1 | 50,161–50,172 | |
| TCCT | 12 | 1 | 44,873–44,884 | |
| TCGT | 12 | 1 | 120,860–120,871 | |
| p5 | TTTTA | 15 | 1 | 31,515–31,529 |
| C | (A)10/(A)11 | 50 | 1 | 35,536–35,585 |
| (T)10/(T)10 | 28 | 1 | 78,948–78,975 | |
| (T)10/(T)10 | 32 | 1 | 8601–8632 | |
| (T)10/(A)13 | 106 | 1 | 48,475–48,580 | |
| (AGAA)3/(T)11 | 39 | 1 | 70,751–70,789 | |
| (TTTA)3/(A)13 | 61 | 1 | 74,099–74,159 | |
| (TCT)4/(T)11 | 95 | 1 | 82,955–83,049 | |
| (T)10/(AATA)3/(TATT)3 | 143 | 1 | 58,342–58,484 |
Figure 3Sequence alignment of the six cp genomes of Dendrocalamus by mVISTA, using D. farinosus as the reference. The six genomes of Dendrocalamus are D. farinosus, D. fugongensis, D. sinicus, D. membranaceus, D. latiflorus and D. strictus. The horizontal axis indicates the coordinates within the cp genome. Genome regions are color-coded as follows; genes are represented in blue; conserved non-coding sequences are represented in dark pink; forward and backward arrows represent the direction of the gene.
Figure 4Sequence alignment of the six cp genomes of Bambusoideae with mVISTA using D. farinosus as a reference. The six genomes of Bambusoideae are D. farinosus, D. bambusoides, Y. brevipaniculata, Ph. edulis, F. edulis and B. flexuosa. The horizontal axis indicates the coordinates within the cp genome. Genome regions are color-coded as follows; genes are represented in blue; conserved non-coding sequences are represented in dark pink; forward and backward arrows represent the direction of the gene.
Figure 5The nonsynonymous (dN)/synonymous (dS) ratio values of 19 genes from six Bambusoideae cp genomes (D. farinosus, Y. brevipaniculata, Ph. edulis, F. edulis, D. bambusoides and B. flexuosa).
Figure 6Comparison of the borders of the SSC, LSC, IRa and IRb regions among six cp genomes of Bambusoideae. JLB: junction of the LSC and the IRb. JSB: junction of the SSC and the IRb. JSA: junction of the SSC and the IRa. JLA: junction of the LSC and the IRa.
Figure 7The maximum-likelihood tree based on the 19 cp genomes of Bambuseae and the distribution of the main geographical provenances in China. (A) The bootstrap value based on 1000 replicates is shown on each node. (B) Distribution of the main geographical provenances of 34 bamboo species in China. Red circle: the distribution of bamboos in high altitude regions; Yellow circle: the distribution of bamboos in the warm temperate zone; Green circle: the distribution of bamboos in the subtropics; Blue circle: the distribution of bamboos in the tropics.