| Literature DB >> 31380427 |
Yingnan Chen1, Nan Hu1, Huaitong Wu1.
Abstract
Salix wilsonii is an important ornamental willow tree widely distributed in China. In this study, an integrated circular chloroplast genome was reconstructed for S. wilsonii based on the chloroplast reads screened from the whole-genome sequencing data generated with the PacBio RSII platform. The obtained pseudomolecule was 155,750 bp long and had a typical quadripartite structure, comprising a large single copy region (LSC, 84,638 bp) and a small single copy region (SSC, 16,282 bp) separated by two inverted repeat regions (IR, 27,415 bp). The S. wilsonii chloroplast genome encoded 115 unique genes, including four rRNA genes, 30 tRNA genes, 78 protein-coding genes, and three pseudogenes. Repetitive sequence analysis identified 32 tandem repeats, 22 forward repeats, two reverse repeats, and five palindromic repeats. Additionally, a total of 118 perfect microsatellites were detected, with mononucleotide repeats being the most common (89.83%). By comparing the S. wilsonii chloroplast genome with those of other rosid plant species, significant contractions or expansions were identified at the IR-LSC/SSC borders. Phylogenetic analysis of 17 willow species confirmed that S. wilsonii was most closely related to S. chaenomeloides and revealed the monophyly of the genus Salix. The complete S. wilsonii chloroplast genome provides an additional sequence-based resource for studying the evolution of organelle genomes in woody plants.Entities:
Year: 2019 PMID: 31380427 PMCID: PMC6662467 DOI: 10.1155/2019/5190425
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Figure 1Assembly of Salix wilsonii cp genome. (a) Gene map of the chloroplast genome of S. wilsonii. (b) Dot matrix alignment of cp genomes between S. wilsonii and S. babylonica.
Genes present in the cp genome of Salix wilsonii.
| Gene category | Group of genes | Name of genes | ||||
|---|---|---|---|---|---|---|
| Self-replication | Ribosomal RNA genes |
|
|
|
| |
| Transfer RNA genes |
|
|
|
|
| |
|
|
|
|
|
| ||
|
|
|
|
|
| ||
|
|
|
|
|
| ||
|
|
|
|
|
| ||
|
|
|
|
|
| ||
| Large subunit of ribosome (LSU) |
|
|
|
|
| |
|
|
|
| ||||
| Small subunit of ribosome (SSU) |
|
|
|
|
| |
|
|
|
|
|
| ||
|
| ||||||
| RNA polymerase |
|
|
|
| ||
|
| ||||||
| Genes for photosynthesis | Photosystem h |
|
|
|
|
|
| Photosystem II |
|
|
|
|
| |
|
|
|
|
|
| ||
|
|
|
|
|
| ||
| Cytochrome b/f complex |
|
|
|
|
| |
|
| ||||||
| ATP synthase |
|
|
|
|
| |
|
| ||||||
| ATP-dependent protease subunit p |
| |||||
| Large subunit of rubisco |
| |||||
| NADH dehydrogenase |
|
|
|
|
| |
|
|
|
|
|
| ||
|
| ||||||
|
| ||||||
| Other genes | Maturase |
| ||||
| Envelop membrane protein |
| |||||
| Subunit of acetyl-CoA-carboxylase |
| |||||
| C-type cytochrome synthesis gene |
| |||||
|
| ||||||
| Unknown function | Hypothetical chloroplast reading frames |
|
|
|
|
|
|
|
| |||||
| Pseudogene |
|
|
| |||
The relative synonymous codon usage in the Salix wilsonii cp genome.
| Amino | Codon | Number | RSCU | Amino | Codon | Number | RSCU |
|---|---|---|---|---|---|---|---|
| acid | acid | ||||||
| Ala | GCU | 614 | 1.83 | Leu | UUA | 843 | 1.82 |
| GCA | 371 | 1.11 | CUU | 578 | 1.25 | ||
| GCC | 210 | 0.63 | UUG | 568 | 1.23 | ||
| GCG | 146 | 0.44 | CUA | 393 | 0.85 | ||
| Asn | AAU | 949 | 1.52 | CUC | 207 | 0.45 | |
| AAC | 299 | 0.48 | CUG | 187 | 0.4 | ||
| Asp | GAU | 808 | 1.57 | Lys | AAA | 972 | 1.44 |
| GAC | 223 | 0.43 | AAG | 374 | 0.56 | ||
| Arg | AGA | 482 | 1.87 | Met | AUG | 622 | 1 |
| CGA | 355 | 1.38 | Phe | UUU | 979 | 1.29 | |
| CGU | 321 | 1.24 | UUC | 543 | 0.71 | ||
| AGG | 164 | 0.64 | Pro | CCU | 424 | 1.56 | |
| CGG | 117 | 0.45 | CCA | 307 | 1.13 | ||
| CGC | 109 | 0.42 | CCC | 202 | 0.74 | ||
| Cys | UGU | 208 | 1.37 | CCG | 157 | 0.58 | |
| UGC | 95 | 0.63 | Ser | UCU | 574 | 1.67 | |
| Gln | CAA | 669 | 1.49 | AGU | 408 | 1.19 | |
| CAG | 228 | 0.51 | UCA | 404 | 1.17 | ||
| Gly | GGA | 706 | 1.58 | UCC | 341 | 0.99 | |
| GGU | 554 | 1.24 | UCG | 200 | 0.58 | ||
| GGG | 333 | 0.75 | AGC | 136 | 0.4 | ||
| GGC | 194 | 0.43 | Thr | ACU | 528 | 1.59 | |
| Glu | GAA | 1003 | 1.48 | ACA | 413 | 1.25 | |
| GAG | 352 | 0.52 | ACC | 248 | 0.75 | ||
| His | CAU | 471 | 1.51 | ACG | 136 | 0.41 | |
| CAC | 151 | 0.49 | Trp | UGG | 449 | 1 | |
| Ile | AUU | 1091 | 1.48 | Tyr | UAU | 782 | 1.64 |
| AUA | 682 | 0.92 | UAC | 174 | 0.36 | ||
| AUC | 442 | 0.6 | Val | GUA | 532 | 1.52 | |
| GUU | 500 | 1.43 | |||||
| GUG | 202 | 0.58 | |||||
| GUC | 169 | 0.48 |
Note: ∗ relative synonymous codon usage, RSCU.
Genes with introns in the cp genome of Salix wilsonii.
| Gene | Location | Exon I | Intron I | Exon II | Intron II | Exon III |
|---|---|---|---|---|---|---|
| (bp) | (bp) | (bp) | (bp) | (bp) | ||
|
| LSC | 145 | 731 | 410 | ||
|
| LSC | 69 | 829 | 291 | 598 | 228 |
|
| SSC | 564 | 1074 | 546 | ||
|
| IR | 777 | 682 | 756 | ||
|
| LSC | 5 | 221 | 643 | ||
|
| LSC | 9 | 782 | 489 | ||
|
| IR | 399 | 629 | 471 | ||
|
| LSC | 9 | 1114 | 402 | ||
|
| LSC | 453 | 779 | 1617 | ||
|
| trans | 114 | - | 231 | 537 | 30 |
|
| IR | 38 | 800 | 35 | ||
|
| LSC | 23 | 703 | 48 | ||
|
| IR | 37 | 947 | 35 | ||
|
| LSC | 37 | 2558 | 29 | ||
|
| LSC | 37 | 583 | 50 | ||
|
| LSC | 39 | 607 | 37 | ||
|
| LSC | 129 | 722 | 228 | 716 | 153 |
Numbers of SSRs identified in the cp genome of Salix wilsonii.
| SSR repeat type | SSR repeat unit | Number of repeats | Total | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | |||
| Monomer | A/T | 31 | 36 | 18 | 9 | 4 | 3 | 2 | 1 | 1 | 105 | |||||
| C/G | 1 | 1 | ||||||||||||||
| Dimer | TA | 1 | 1 | |||||||||||||
| Tripolymer | TAT | 1 | 1 | |||||||||||||
| Tetramer | AATG | 1 | 1 | |||||||||||||
| AGAA | 1 | 1 | ||||||||||||||
| TAGA | 1 | 1 | ||||||||||||||
| TATT | 1 | 1 | ||||||||||||||
| TTCA | 1 | 1 | ||||||||||||||
| TTTA | 2 | 2 | ||||||||||||||
| TTTC | 1 | 1 | ||||||||||||||
| Pentamer | AATTT | 1 | 1 | |||||||||||||
| ATTAA | 1 | 1 | ||||||||||||||
| Compound | 37 | |||||||||||||||
| Total | 155 | |||||||||||||||
Figure 2Comparison of IR boundaries among the cp genomes of four rosid plants. “Ψ” means pseudogene.
Figure 3Maximum likelihood tree of willows and outgroups based on whole cp genome sequences. The branch length (≥0.0002) and the bootstrap value that supported each node (in bold) are shown above the branch. ∗ indicates the species selected for genome comparison analysis.
Figure 4Complete chloroplast genome comparison of 12 Salix species using mVISTA program with S. arbutifolia as a reference. Cp genome regions are color-coded as protein-coding (exon), rRNA, tRNA, and conserved noncoding sequences (CNS).