| Literature DB >> 36076168 |
Jing-Yi Peng1, Xiao-Shuang Zhang2, Dai-Gui Zhang1,3, Yi Wang1, Tao Deng2, Xian-Han Huang2, Tian-Hui Kuang2, Qiang Zhou4,5.
Abstract
BACKGROUND: Sinosenecio B. Nordenstam (Asteraceae) currently comprises 44 species. To investigate the interspecific relationship, several chloroplast markers, including ndhC-trnV, rpl32-trnL, matK, and rbcL, are used to analyze the phylogeny of Sinosenecio. However, the chloroplast genomes of this genus have not been thoroughly investigated. We sequenced and assembled the Sinosenecio albonervius chloroplast genome for the first time. A detailed comparative analysis was performed in this study using the previously reported chloroplast genomes of three Sinosenecio species.Entities:
Keywords: Comparison of the chloroplast genome; Phylogenetic analysis; Sinosenecio
Mesh:
Year: 2022 PMID: 36076168 PMCID: PMC9454173 DOI: 10.1186/s12864-022-08872-3
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 4.547
Fig. 1Gene map of the chloroplast genomes of S. albonervius. Genes inside the circle are transcribed clockwise, and those on the outside are transcribed counter-clockwise. Genes belonging to different functional groups have been colour-coded. The darker grey area in the inner circle corresponds to GC content, whereas the lighter grey corresponds to AT content
The gene composition of S. albonervius chloroplast genome, "a" labeled genes have intron
| Group of genes | Name of genes |
|---|---|
| ATP synthase | |
| Photosystem II | |
| NADPH dehydrogenase | |
| Cytochrome b/f compelx | |
| C-type cytochrome synthesis | |
| Photosystem I | |
| Photosystem biogenesis factor | |
| Large subunit of rubisco | |
| Small ribosomal units | |
| Large ribosomal units | |
| RNA polymerase sub-units | |
| Translation initiation factor | |
| Ribosomal RNA | |
| Transfer RNA | |
| Acetyl-CoA-carboxylase sub-unit | |
| Envelope membrane protein | |
| Protease | |
| Maturase | |
| Hypothetical genes reading frames |
Genes with introns in the chloroplast genomes of S. albonervius as well as the lengths of the exons and introns
| Gene | Location | Exon 1 (bp) | Intron 1 (bp) | Exon 2 (bp) | Intron 2 (bp) | Exon 3 (bp) |
|---|---|---|---|---|---|---|
| LSC | 37 | 2560 | 35 | |||
| LSC | 41 | 841 | 214 | |||
| LSC | 432 | 719 | 1635 | |||
| LSC | 145 | 704 | 410 | |||
| LSC | 23 | 725 | 47 | |||
| LSC | 124 | 696 | 230 | 740 | 153 | |
| LSC | 37 | 452 | 50 | |||
| LSC | 38 | 573 | 37 | |||
| LSC / IR | 114 | 530 | 232 | 26 | ||
| LSC | 71 | 806 | 291 | 606 | 229 | |
| LSC | 6 | 772 | 642 | |||
| LSC | 8 | 718 | 475 | |||
| LSC | 9 | 1061 | 399 | |||
| IR | 393 | 667 | 435 | |||
| IR | 777 | 671 | 756 | |||
| IR | 42 | 772 | 35 | |||
| IR | 38 | 821 | 35 | |||
| SSC | 553 | 1072 | 539 |
Comparison of four Sinosenecio species chloroplast genomes
| Characteristics | ||||
|---|---|---|---|---|
| Accession number | OL678114 | NC057061 | MZ325394 | NC057622 |
| Total length (bp) | 151,224 | 151,257 | 151,315 | 150,926 |
| LSC length (bp) | 83,355 | 83,373 | 83,445 | 83,092 |
| SSC length (bp) | 18,173 | 18,178 | 18,172 | 18,130 |
| IR length (bp) | 24,848 | 24,853 | 24,849 | 24,852 |
| Total number of genes | 134 | 134 | 134 | 134 |
| Protein coding genes | 87 | 87 | 87 | 87 |
| tRNA genes | 37 | 37 | 37 | 37 |
| rRNA genes | 8 | 8 | 8 | 8 |
| Total GC content | 37.4% | 37.4% | 37.4% | 37.3% |
| GC content in IRs | 43.0% | 43.0% | 43.0% | 43.0% |
| GC content in LSC | 35.5% | 35.5% | 35.5% | 35.4% |
| GC content in SSC | 30.6% | 30.6% | 30.6% | 30.6% |
Fig. 2Simple sequence repeats. A Proportion of SSR types in S. albonervius chloroplast genome. B The number of SSRs in LSC, SSC and IRs in Sinosenecio. C SSR types in Sinosenecio. D Specific forms of SSRs in Sinosenecio
Fig. 3The repeat sequence types in Sinosenecio
Codon usage for S. albonervius chloroplast genome by using 54 CDS
| Amino Acid | Codon | Number | RSCU | Amino Acid | Codon | Number | RSCU |
|---|---|---|---|---|---|---|---|
| Phe | UUU | 828 | 1.37 | Ser | UCU | 478 | 1.81 |
| UUC | 382 | 0.63 | UCC | 231 | 0.87 | ||
| Leu | UUA | 738 | 1.94 | UCA | 324 | 1.22 | |
| UUG | 472 | 1.24 | UCG | 126 | 0.48 | ||
| CUU | 490 | 1.29 | Pro | CCU | 342 | 1.55 | |
| CUC | 136 | 0.36 | CCC | 159 | 0.72 | ||
| CUA | 301 | 0.79 | CCA | 262 | 1.19 | ||
| CUG | 144 | 0.38 | CCG | 120 | 0.54 | ||
| Ile | AUU | 897 | 1.47 | Thr | ACU | 427 | 1.63 |
| AUC | 328 | 0.54 | ACC | 197 | 0.75 | ||
| AUA | 601 | 0.99 | ACA | 330 | 1.26 | ||
| Met | AUG | 518 | 1 | ACG | 92 | 0.35 | |
| Val | GUU | 424 | 1.49 | Ala | GCU | 533 | 1.77 |
| GUC | 123 | 0.43 | GCC | 189 | 0.63 | ||
| GUA | 433 | 1.53 | GCA | 343 | 1.14 | ||
| GUG | 155 | 0.55 | GCG | 139 | 0.46 | ||
| Tyr | UAU | 670 | 1.64 | Cys | UGU | 166 | 1.39 |
| UAC | 148 | 0.36 | UGC | 72 | 0.61 | ||
| TER | UAA | 32 | 1.78 | TER | UGA | 12 | 0.67 |
| UAG | 10 | 0.56 | Trp | UGG | 383 | 1 | |
| His | CAU | 373 | 1.49 | Arg | CGU | 285 | 1.36 |
| CAC | 128 | 0.51 | CGC | 85 | 0.41 | ||
| Gln | CAA | 594 | 1.53 | CGA | 277 | 1.33 | |
| CAG | 180 | 0.47 | CGG | 84 | 0.4 | ||
| Asn | AAU | 830 | 1.59 | Ser | AGU | 340 | 1.28 |
| AAC | 217 | 0.41 | AGC | 89 | 0.34 | ||
| Lys | AAA | 836 | 1.51 | Arg | AGA | 389 | 1.86 |
| AAG | 273 | 0.49 | AGG | 134 | 0.64 | ||
| Asp | GAU | 671 | 1.58 | Gly | GGU | 490 | 1.33 |
| GAC | 177 | 0.42 | GGC | 178 | 0.48 | ||
| Glu | GAA | 834 | 1.50 | GGA | 565 | 1.53 | |
| GAG | 275 | 0.50 | GGG | 242 | 0.66 |
Fig. 4Codon content of amino acids and stop codons in 54 CDS of S. albonervius
RNA editing sites in the S. albonervius chloroplast genome
| Gene Name | Nt pos | AA pos | Align Col | Effect | Score |
|---|---|---|---|---|---|
| 451 | 151 | 162 | CAC (H) = > UAC (Y) | 1 | |
| 824 | 275 | 304 | UCG (S) = > UUG (L) | 0.8 | |
| 1225 | 409 | 450 | CCA (P) = > UCA (S) | 1 | |
| 1433 | 478 | 519 | CCU (P) = > CUU (L) | 1 | |
| 773 | 258 | 258 | UCA (S) = > UUA (L) | 1 | |
| 791 | 264 | 264 | CCC (P) = > CUC (L) | 1 | |
| 629 | 210 | 213 | UCA (S) = > UUA (L) | 1 | |
| 110 | 37 | 39 | CCA (P) = > CUA (L) | 0.86 | |
| 370 | 124 | 127 | CCC (P) = > UCC (S) | 0.86 | |
| 284 | 95 | 108 | UCU (S) = > UUU (F) | 0.86 | |
| 637 | 213 | 229 | CAU (H) = > UAU (Y) | 1 | |
| 1240 | 414 | 430 | CAU (H) = > UAU (Y) | 1 | |
| 566 | 189 | 189 | UCA (S) = > UUA (L) | 1 | |
| 1073 | 358 | 358 | UCC (S) = > UUC (F) | 1 | |
| 149 | 50 | 50 | UCA (S) = > UUA (L) | 1 | |
| 467 | 156 | 156 | CCA (P) = > CUA (L) | 1 | |
| 586 | 196 | 196 | CAU (H) = > UAU (Y) | 1 | |
| 611 | 204 | 204 | UCA (S) = > UUA (L) | 0.8 | |
| 737 | 246 | 246 | CCA (P) = > CUA (L) | 1 | |
| 746 | 249 | 249 | UCU (S) = > UUU (F) | 1 | |
| 830 | 277 | 277 | UCA (S) = > UUA (L) | 1 | |
| 836 | 279 | 279 | UCA (S) = > UUA (L) | 1 | |
| 1481 | 494 | 494 | CCA (P) = > CUA (L) | 1 | |
| 359 | 120 | 128 | UCA (S) = > UUA (L) | 1 | |
| 575 | 192 | 200 | UCA (S) = > UUA (L) | 1 | |
| 854 | 285 | 293 | UCA (S) = > UUA (L) | 1 | |
| 863 | 288 | 296 | CCC (P) = > CUC (L) | 1 | |
| 1286 | 429 | 437 | UCA (S) = > UUA (L) | 0.8 | |
| 290 | 97 | 97 | UCA (S) = > UUA (L) | 1 | |
| 1340 | 447 | 447 | UCU (S) = > UUU (F) | 1 | |
| 166 | 56 | 56 | CAU (H) = > UAU (Y) | 0.8 | |
| 314 | 105 | 105 | ACA (U) = > AUA (I) | 0.8 | |
| 418 | 140 | 140 | CGG (R) = > UGG (W) | 1 | |
| 611 | 204 | 204 | CCA (P) = > CUA (L) | 1 | |
| 77 | 26 | 26 | UCU (S) = > UUU (F) | 1 | |
| 308 | 103 | 103 | UCA (S) = > UUA (L) | 0.86 | |
| 824 | 275 | 279 | UCA (S) = > UUA (L) | 1 | |
| 983 | 328 | 345 | GCG (A) = > GUG (V) | 1 | |
| 511 | 171 | 171 | CCC (P) = > UCC (S) | 1 | |
| 1592 | 531 | 548 | GCA (A) = > GUA (V) | 0.86 | |
| 2039 | 680 | 710 | CCC (P) = > CUC (L) | 1 | |
| 2701 | 901 | 1101 | CAU (H) = > UAU (Y) | 1 | |
| 3695 | 1232 | 1452 | UCG (S) = > UUG (L) | 0.86 | |
| 248 | 83 | 83 | UCA (S) = > UUA (L) | 1 | |
| 80 | 27 | 27 | UCA (S) = > UUA (L) | 1 | |
| 149 | 50 | 53 | CCA (P) = > CUA (L) | 1 |
Fig. 5The chloroplast genomes comparison of four Sinosenecio species is visualized with S. oldhamianus as a reference. The X-axis represents the coordinate in the chloroplast genome. The Y-axis shows different species names, and sequence similarity of aligned regions is displayed as horizontal bars, which expresses as a percentage within 50–100%
Fig. 6Comparison of connection sites of LSC, IRb, SSC, and IRa in the chloroplast genomes. JLB (IRB/LSC), JSB (IRB/SSC), JSA (SSC/IRA), and JLA (IRA/LSC) represent the junction sites between two adjacent regions in the genome
Fig. 7Sliding window analyses of Sinosenecio chloroplast genomes using a window length of 600 bp and step size of 200 bp. The nucleotide diversity (Pi) value of each window is shown on Y-axis, and positions are shown on X-axis
Fig. 8The ML tree based on the chloroplast genomes sequences with GenBank accession numbers. The supported values of each node are shown in this tree, and red fonts indicate the phylogenetic position of Sinosenecio