| Literature DB >> 33327935 |
Zenan Shen1,2, Zhimeng Gan3, Fa Zhang1,2, Xinyao Yi4, Jinzhi Zhang3, Xiaohua Wan5,6.
Abstract
BACKGROUND: Codon usage is an important determinant of gene expression levels that can help us understand codon biology, evolution and mRNA translation of species. The majority of previous codon usage studies have focused on single species analysis, although few studies have focused on the species within the same genus. In this study, we proposed a multispecies codon usage analysis workflow to reveal the genetic features and correlation in citrus.Entities:
Keywords: Citrus; Codon usage; Correlation; Evolution; GC biology
Mesh:
Substances:
Year: 2020 PMID: 33327935 PMCID: PMC7739459 DOI: 10.1186/s12864-020-6641-x
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
GC content of CDS across 8 Citrus Species
| Atalantia | 59755 | 43.51 | 50.70 | 40.47 | 39.35 | 37.08 | 52.48 | |
| Mandarin | 48789 | 43.80 | 50.85 | 40.53 | 40.01 | 37.74 | 52.47 | |
| Pummelo | 38039 | 43.79 | 50.67 | 40.56 | 40.13 | 37.87 | 52.55 | |
| Sweet | 40773 | 43.50 | 50.52 | 40.12 | 39.87 | 37.59 | 52.44 | |
| Citron | 40808 | 43.70 | 50.63 | 40.49 | 39.98 | 37.70 | 52.63 | |
| Mandarin | 36852 | 43.59 | 50.51 | 40.32 | 39.94 | 37.66 | 52.65 | |
| Papeda | 36936 | 43.77 | 50.58 | 40.54 | 40.20 | 37.93 | 52.59 | |
| Mandarin | 29687 | 43.73 | 50.57 | 40.35 | 40.28 | 38.02 | 52.83 | |
| Average | - | 41455 | 43.67 | 50.63 | 40.42 | 39.97 | 37.70 | 52.58 |
Genes represents the number of sequences after filtering; GC1, GC2 and GC3 represent the GC content of the first, second, third base of codon; GC3s represents the GC content of the third synonymous position; ENC represents the effective number of codons
Fig. 1Neutrality plot of 8 citrus species. The green solid line represents the regression line. a Atlantia buxifolia, the regression line is y=−0.0258x+46.5950,R2=0.0418. b Fortunella hindsii, the regression line is y=0.0781x+42.5627,R2=0.1218. c Citrus grandis, the regression line is y=0.0921+41.9104,R2=0.1288. d Citrus sinensis, the regression line is y=0.2712x+34.4916,R2=0.3047. e Citrus medica, the regression line is y=−0.0275x+46.6589,R2=0.0494. f Citrus reticulata ‘Mangshan’, the regression line is y=−0.0954x+49.2216,R2=0.1476. g Citrus ichangensis, the regression line is y=0.0174x+44.8579,R2=0.0341. h Citrus clementina, the regression line is y=−0.2456x+51.3338,R2=0.0341
Fig. 2Neutrality plot of 8 citrus species. ENCs were plotted against GC content at the third position. The green solid line represents the expected curve of positions of genes when the codon usage was only determined by the GC3s composition. a Atlantia buxifolia. b Fortunella hindsii. c Citrus grandis. d Citrus sinensis. e Citrus medica. f Citrus reticulata ‘Mangshan’. g Citrus ichangensis. h Citrus clementina
Fig. 3Frequency distribution of (ENCexp-ENCobs)/ENCexp. ENCexp represents expected ENC values and ENCobs represents ENC observed values. The peak located in 0 to 0.1
The top five high-frequency codons
| Citrus Species | codon(RSCU) | N | ||||
|---|---|---|---|---|---|---|
| AGA(1.93) | GCT(1.70) | GTT(1.68) | TCT(1.61) | TTG(1.55) | 15 | |
| AGA(1.89) | GCT(1.62) | GTT(1.63) | TCT(1.56) | TTG(1.54) | 11 | |
| AGA(1.93) | GCT(1.65) | GTT(1.65) | TCT(1.56) | TTG(1.54) | 11 | |
| AGA(1.96) | GCT(1.66) | GTT(1.65) | TCT(1.57) | TTG(1.54) | 12 | |
| AGA(1.95) | GCT(1.66) | GTT(1.66) | TCT(1.58) | TTG(1.54) | 14 | |
| AGA(1.97) | GCT(1.66) | GTT(1.66) | TCT(1.57) | TTG(1.55) | 13 | |
| AGA(1.95) | GCT(1.66) | GTT(1.65) | TCT(1.57) | TTG(1.54) | 13 | |
| AGA(1.94) | GCT(1.66) | GTT(1.65) | TCT(1.57) | TTG(1.55) | 13 | |
N: the number of high-frequency codons of each citrus species
Fig. 4Heat map of RSCU of 59 codons from 30 species using Euclidean distance and average clustering module. GC and GC3 distribution in ORFs from 30 plant genomes
Fig. 5Heat map of pearson correlation coefficient among Citrus species. Ab: Atlantia buxifolia; Fh: Fortunella hindsii; Cg: Citrus grandis; Cs: Citrus sinensis; Cm: Citrus medica; Cr: Citrus reticulata ‘Mangshan’; Ci: Citrus ichangensis; Cc: Citrus Clementina
Fig. 6Process of workflow