| Literature DB >> 32407424 |
Chun-Jiao Li1, Ruo-Nan Wang2, De-Zhu Li1.
Abstract
The conflicts exist between the phylogeny of Campanulaceae based on nuclear ITS sequence and plastid markers, particularly in the subdivision of Cyanantheae (Campanulaceae). Besides, various and complicated plastid genome structures can be found in species of the Campanulaceae. However, limited availability of genomic information largely hinders the studies of molecular evolution and phylogeny of Campanulaceae. We reported the complete plastid genomes of three Cyanantheae species, compared them to eight published Campanulaceae plastomes, and shed light on a deeper understanding of the applicability of plastomes. We found that there were obvious differences among gene order, GC content, gene compositions and IR junctions of LSC/IRa. Almost all protein-coding genes and amino acid sequences showed obvious codon preferences. We identified 14 genes with highly positively selected sites and branch-site model displayed 96 sites under potentially positive selection on the three lineages of phylogenetic tree. Phylogenetic analyses showed that Cyananthus was more closely related to Codonopsis compared with Cyclocodon and also clearly illustrated the relationship among the Cyanantheae species. We also found six coding regions having high nucleotide divergence value. Hotpot regions were considered to be useful molecular markers for resolving phylogenetic relationships and species authentication of Campanulaceae.Entities:
Year: 2020 PMID: 32407424 PMCID: PMC7224561 DOI: 10.1371/journal.pone.0233167
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Plastid genomes of Cyananthus flavus (A), Cyclocodon parviflorus (B), and Codonopsis hongii (C). Genes inside the circle are transcribed clockwise, and genes outside the circle are transcribed counter-clockwise. The dark-gray inner circle corresponds to the GC content, and the light-gray represents the AT content.
Fig 2Percentages of variable sites in protein-coding regions.
The blue line indicates the comparison of eleven species among the family Campanulaceae; the gray line indicates the comparison of six Cyanantheae species; the orange line indicates the Campanuleae species; the yellow line indicates the out-group. X axis: position of the midpoint of a window. Y axis: nucleotide diversity of each window.
Fig 3Comparison of the borders between IR and LSC/SSC regions and the gene composition of IR regions.
Comparison of plastome features of Campanulaceae species.
| Species | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| MT074354 | MT074353 | MN849357 | KX352464 | MH018574 | KY587457 | KP889213 | KU198434 | EU090187 | MF770635 | MF770625 | |
| 165675 | 166588 | 169524 | 171818 | 169447 | 169321 | 171724 | 169341 | 162321 | 166019 | 178956 | |
| 82501 | 83994 | 85326 | 79112 | 85253 | 85506 | 105555 | 102323 | 100110 | 81503 | 79041 | |
| 8120 | 8014 | 7912 | 7840 | 8060 | 8067 | 11295 | 7744 | 7661 | 7792 | 8085 | |
| 37527 | 37290 | 38143 | 42433 | 38067 | 37874 | 27437 | 29637 | 27276 | 38362 | 45915 | |
| 88242 | 88587 | 88296 | 90144 | 88761 | 87332 | 76245 | 78771 | 71508 | 89784 | 92511 | |
| 53.26 | 53.18 | 52.08 | 52.46 | 52.38 | 51.58 | 44.39 | 46.52 | 44.05 | 54.08 | 51.69 | |
| 77433 | 78001 | 81228 | 81674 | 80686 | 81989 | 95479 | 90570 | 90813 | 76235 | 86445 | |
| 136(25) | 135(25) | 135(24) | 146(32) | 138(24) | 133(24) | 140(21) | 134(18) | 147(22) | 139(25) | 150(33) | |
| 89(13) | 91(14) | 90(13) | 95(20) | 95(19) | 86(12) | 82(7) | 83(7) | 83(7) | 89(13) | 99(21) | |
| 37(7) | 36(7) | 37(7) | 36(7) | 38(7) | 37(7) | 37(5) | 36(5) | 44(7) | 36(7) | 36(7) | |
| 8(4) | 8(4) | 8(4) | 8(4) | 8(4) | 8(4) | 8(4) | 8(4) | 8(4) | 8(4) | 8(4) | |
| 38.1 | 37.9 | 38.2 | 38.1 | 38.2 | 38.3 | 38.8 | 38.8 | 38.3 | 39.0 | 36.8 | |
| 36.6 | 36.7 | 36.9 | 37.2 | 36.9 | 36.9 | 37.5 | 37.8 | 37.1 | 38.4 | 35.8 | |
| 31.1 | 31.3 | 32.0 | 31.0 | 32.4 | 32.4 | 34.9 | 32.6 | 32.2 | 33.5 | 31.0 | |
| 40.4 | 40.0 | 40.4 | 36.9 | 40.4 | 40.5 | 42.0 | 41.4 | 41.4 | 40.2 | 38.2 |
List of genes present in three newly sequenced plastomes.
| Category of genes | Group of gene | Name of gene |
|---|---|---|
| Ribosomal RNA genes | ||
| Transfer RNA genes | ||
| Small subunit of ribosome | ||
| Large subunit of ribosome | ||
| DNA-dependent RNA polymerase | ||
| Subunits of NADH-dehydrogenase | ||
| Subunits of photosystem I | ||
| Subunits of photosystem II | ||
| Subunits of cytochrome b/f complex | ||
| Subunits of ATP synthase | ||
| Subunits of rubisco | ||
| Maturase | ||
| Protease | ||
| Envelope membrane protein | ||
| C-type cytochrome synthesis gene | ||
| Conserved open reading frames |
a gene is in Cyananthus flavus; c gene is in Cyclocodon parviflorus; d gene is in Codonopsis hongii;
* gene contains one intron;
** gene contains two introns; (×2) indicates that the number of the repeat unit is 2.
Fig 4Codon contents of 20 amino acids and stop codons in all protein-coding genes of the Campanulaceae plastomes.
The color of the histogram corresponds to the color of codons.
Fig 5Heatmap analysis for codon distribution of all protein-coding genes of all considered species.
Colour key: Higher red values indicate higher RSCU values, and lower blue values indicate lower RSCU values.
Fig 6Frequency of three types of dispersed repeats by length.
(F: forward, P: palindrome, R: reverse).
Fig 7The distribution maps of simple sequence repeats (SSR).
Classification of SSRs by repeat types. p1, mononucleotides (mono-); p2, dinucleotides (di-); p3, trinucleotides (tri-); p4, tetranucleotides (tetra-); p5, pentanucleotides (penta-); p6, hexanucleotides (hex-); c, compound.
Fig 8Repeat number in the different regions of Campanulaceae plastomes, including Tandem repeats, Dispersed repeats and SSR repeats.
The yellow line refers to SSC regions, the gray lines refers to IRa regions, the orange line refers to LSC regions, and the blue line refers to the total plastome.
Fig 9Phylogenetic relationship of all Campanulaceae species by using the 76 protein-coding genes, based on the Maximum likelihood (ML) analysis and Bayesian inference (BI) analysis.