| Literature DB >> 31934501 |
Zhanjun Wang1, Beibei Xu1,2, Bao Li1, Qingqing Zhou1, Guiyi Wang1, Xingzhou Jiang1, Chenchen Wang1, Zhongdong Xu1.
Abstract
Euphorbiaceae plants are important as suppliers of biodiesel. In the current study, the codon usage patterns and sources of variance in chloroplast genome sequences of six different Euphorbiaceae plant species have been systematically analyzed. Our results revealed that the chloroplast genomes of six Euphorbiaceae plant species were biased towards A/T bases and A/T-ending codons, followed by detection of 17 identical high-frequency codons including GCT, TGT, GAT, GAA, TTT, GGA, CAT, AAA, TTA, AAT, CCT, CAA, AGA, TCT, ACT, TAT and TAA. It was found that mutation pressure was a minor factor affecting the variation of codon usage, however, natural selection played a significant role. Comparative analysis of codon usage frequencies of six Euphorbiaceae plant species with four model organisms reflected that Arabidopsis thaliana, Populus trichocarpa, and Saccharomyces cerevisiae should be considered as suitable exogenous expression receptor systems for chloroplast genes of six Euphorbiaceae plant species. Furthermore, it is optimal to choose Saccharomyces cerevisiae as the exogenous expression receptor. The outcome of the present study might provide important reference information for further understanding the codon usage patterns of chloroplast genomes in other plant species. ©2020 Wang et al.Entities:
Keywords: Chloroplast genome; Codon usage bias; Euphorbiaceae plants
Year: 2020 PMID: 31934501 PMCID: PMC6951282 DOI: 10.7717/peerj.8251
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Genomic features of chloroplast genomes of six Euphorbiaceae plant species.
L_aa means the total number of amino acids; GC1, GC2 and GC3 indicate the GC content at the first, second and third codon positions.
| Accession No. |
|
|
|
|
|
|
| CDSs number (before processing) | 85 | 84 | 84 | 83 | 86 | 85 |
| CDSs number (after processing) | 53 | 55 | 58 | 55 | 55 | 57 |
| L_aa | 23,157 | 23,918 | 24,407 | 21,902 | 24,249 | 24,582 |
| GC1 | 0.453 | 0.451 | 0.454 | 0.454 | 0.451 | 0.454 |
| GC2 | 0.372 | 0.372 | 0.375 | 0.375 | 0.374 | 0.374 |
| GC3 | 0.287 | 0.292 | 0.294 | 0.286 | 0.299 | 0.296 |
| Average GC at three locations | 0.371 | 0.372 | 0.374 | 0.372 | 0.375 | 0.375 |
Figure 1ENc-plot of chloroplast genomes of six Euphorbiaceae plant species.
(A) Euphorbia esula; (B) Hevea brasiliensis; (C) Jatropha curcas; (D) Manihot esculenta; (E) Ricinus communis; (F) Vernicia fordii.
Figure 2PR2-plot of chloroplast genomes six Euphorbiaceae plant species.
(A) Euphorbia esula; (B) Hevea brasiliensis; (C) Jatropha curcas; (D) Manihot esculenta; (E) Ricinus communis; (F) Vernicia fordii.
Figure 3Neutrality plot of chloroplast genomes six Euphorbiaceae plant species.
(A) Euphorbia esula; (B) Hevea brasiliensis; (C) Jatropha curcas; (D) Manihot esculenta; (E) Ricinus communis; (F) Vernicia fordii.
Figure 4Correspondence analysis of chloroplast genomes of six Euphorbiaceae plant species.
(A) Euphorbia esula; (B) Hevea brasiliensis; (C) Jatropha curcas; (D) Manihot esculenta; (E) Ricinus communis; (F) Vernicia fordii.
Correlation analysis of axis 1 and codon usage index of chloroplast genomes of six Euphorbiaceae plant species.
CAI means codon adaptation index; GC3s indicates the GC content at the third codon position of synonymous codons; L_aa is defined as total number of amino acids.
| CAI | −0.023 | −0.114 | −0.035 | −0.185 | 0.152 | −0.307 |
| GC3s | 0.421 | −0.320 | −0.581 | −0.352 | 0.284 | 0.324 |
| L_aa | 0.165 | −0.059 | −0.020 | −0.059 | 0.068 | 0.216 |
Notes.
P < 0.05.
P < 0.01.