| Literature DB >> 23315666 |
Touqeer Ahmad1, Gaurav Sablok, Tatiana V Tatarinova, Qiang Xu, Xiu-Xin Deng, Wen-Wu Guo.
Abstract
Citrus, as one of the globally important fruit trees, has been an object of interest for understanding genetics and evolutionary process in fruit crops. Meta-analyses of 19 Citrus species, including 4 globally and economically important Citrus sinensis, Citrus clementina, Citrus reticulata, and 1 Citrus relative Poncirus trifoliata, were performed. We observed that codons ending with A- or T- at the wobble position were preferred in contrast to C- or G- ending codons, indicating a close association with AT richness of Citrus species and P. trifoliata. The present study postulates a large repertoire of a set of optimal codons for the Citrus genus and P. trifoliata and demonstrates that GCT and GGT are evolutionary conserved optimal codons. Our observation suggested that mutational bias is the dominating force in shaping the codon usage bias (CUB) in Citrus and P. trifoliata. Correspondence analysis (COA) revealed that the principal axis [axis 1; COA/relative synonymous codon usage (RSCU)] contributes only a minor portion (∼10.96%) of the recorded variance. In all analysed species, except P. trifoliata, Gravy and aromaticity played minor roles in resolving CUB. Compositional constraints were found to be strongly associated with the amino acid signatures in Citrus species and P. trifoliata. Our present analysis postulates compositional constraints in Citrus species and P. trifoliata and plausible role of the stress with GC3 and coevolution pattern of amino acid.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23315666 PMCID: PMC3628444 DOI: 10.1093/dnares/dss039
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Genomic composition of coding region of Citrus species and P. trifoliata at GC, GC1, GC2, GC3, and GC3
| No. | Genes/ESTa | Unigenes count | Genesb | GC | GC1 | GC2 | GC3 | GC3 | |
|---|---|---|---|---|---|---|---|---|---|
| 1 | 44 275 | – | 41362 | 43.92 | 50.4 | 40.33 | 41.03 | 38.8 | |
| 2 | 8219 | 7550 | 4715 | 47.78 | 52.96 | 43.87 | 46.52 | 44.73 | |
| 3 | 14 584 | 11 952 | 6787 | 46.93 | 52.47 | 42.59 | 45.72 | 43.94 | |
| 4 | 118 365 | 51 591 | 33 765 | 47.04 | 51.84 | 42.51 | 46.76 | 44.99 | |
| 5 | 1843 | 1283 | 677 | 44.66 | 51.67 | 39.77 | 42.55 | 40.47 | |
| 6 | 1017 | 858 | 701 | 45.86 | 51.79 | 41.13 | 44.66 | 42.67 | |
| 7 | 2924 | 1628 | 588 | 46.11 | 52.67 | 41.42 | 44.25 | 42.35 | |
| 8 | 8188 | 7933 | 3964 | 46.47 | 50.76 | 42.75 | 45.9 | 44.25 | |
| 9 | 11 045 | 9761 | 4256 | 45.82 | 51.13 | 41.37 | 44.96 | 43.17 | |
| 10 | 1115 | 896 | 711 | 45.65 | 51.91 | 40.9 | 44.13 | 42.15 | |
| 11 | 5768 | 3735 | 2644 | 45.2 | 51.51 | 40.57 | 43.52 | 41.5 | |
| 12 | 55 980 | 46 170 | 30 816 | 46.85 | 52.3 | 42.77 | 45.48 | 43.62 | |
| 13 | 5823 | 3685 | 2253 | 46.48 | 52.41 | 41.64 | 45.41 | 43.41 | |
| 14 | 1837 | 1522 | 992 | 46.07 | 51.67 | 41.41 | 45.15 | 43.23 | |
| 15 | 5216 | 4688 | 1746 | 47.58 | 51.51 | 43.58 | 47.64 | 46.17 | |
| 16 | 62 695 | 35 740 | 19 445 | 46.78 | 52.33 | 42.67 | 45.35 | 43.52 | |
| 17 | 19 072 | 9289 | 4592 | 45.67 | 51.72 | 41.14 | 44.15 | 42.21 | |
| 18 | 8039 | 4621 | 2517 | 45.32 | 52.19 | 40.51 | 43.27 | 41.3 | |
| 19 | 7954 | 3335 | 2596 | 45.23 | 51.96 | 40.77 | 42.98 | 40.96 |
Means of GC% were calculated at the first, second, and third positions. GC3 represents the GC at the third synonymous position.
aIn C. sinensis, genome-predicted coding regions are used, whereas for the rest of the species, the number represents the EST count in the column (genes/EST).
bSelected genes above 300 bp threshold.
Figure 1.The distribution of GC3 content in Citrus species, P. trifoliata and A. thaliana genes. The GC3 content showed unimodal distribution.
Figure 2.Nc versus GC3 plot of Citrus species and P. trifoliata genes. The solid black line indicates the expected Nc value, if the codon bias is only due to GC3s. Species order: A = C. sinensis; B = C. aurantifolia; C = C. aurantium; D = C. clementina; E = C. clementina × C. tangerina; F = C. jambhiri; G = C. japonica var. margarita; H = C. limettioides; I = C. limonia; J = C. medica; K = C. reshni; L = C. reticulata; M = C. reticulata × C. temple; N = C. sinensis × P. trifoliata; O = C. sunki; P = P. trifoliata; Q = C. unshiu; R = C. paradisi; and S = C. paradisi × P. trifoliata.
Figure 3.Heat map of the average RSCU of the 59 degenerate codons in the Citrus species and P. trifoliata using Euclidean distance and average linkage clustering module.
Star map showing optimal codon distribution in Citrus species and P. trifoliata
| Codon/AA | A | B | C | D | E | F | G | H | I | J | K | L | M | N | O | P |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| TTT/Phe | * | * | * | * | ||||||||||||
| TTC/Phe | * | * | * | * | * | * | * | * | * | * | ||||||
| TTA/Leu | * | * | * | * | ||||||||||||
| TTG/Leu | * | * | * | * | ||||||||||||
| CTT/Leu | * | * | * | * | * | * | * | * | * | * | * | * | * | * | ||
| CTC/Leu | * | * | * | * | * | * | * | * | * | * | * | |||||
| CTA/Leu | * | * | * | * | * | |||||||||||
| CTG/Leu | * | * | * | |||||||||||||
| ATT/Ile | * | * | * | * | * | |||||||||||
| ATC/Ile | * | * | * | * | * | * | * | * | * | * | * | * | * | |||
| ATA/Ile | * | * | * | |||||||||||||
| GTT/Val | * | * | * | * | * | * | * | * | ||||||||
| GTC/Val | * | * | * | * | * | * | * | * | * | * | * | * | * | |||
| GTA/Val | * | * | * | * | ||||||||||||
| GTG/Val | * | * | ||||||||||||||
| TCT/Ser | * | * | * | * | * | * | * | * | * | * | * | * | * | * | ||
| TCC/Ser | * | * | * | * | * | * | * | * | * | |||||||
| TCA/Ser | * | * | * | * | * | * | * | * | * | * | * | |||||
| TCG/Ser | * | * | ||||||||||||||
| AGT/Ser | * | * | * | * | * | |||||||||||
| AGC/Ser | * | * | * | |||||||||||||
| CCT/Pro | * | * | * | * | * | * | * | * | * | * | * | * | * | * | * | |
| CCC/Pro | * | * | * | * | * | * | * | |||||||||
| CCA/Pro | * | * | * | * | * | * | * | * | * | * | * | * | * | * | * | |
| CCG/Pro | * | * | ||||||||||||||
| ACT/Thr | * | * | * | * | * | * | * | * | * | * | * | * | * | * | * | |
| ACC/Thr | * | * | * | * | * | * | * | * | * | * | * | |||||
| ACA/Thr | * | * | * | * | * | |||||||||||
| ACG/Thr | * | * | ||||||||||||||
| GCT/Ala | * | * | * | * | * | * | * | * | * | * | * | * | * | * | * | * |
| GCC/Ala | * | * | * | * | * | * | * | * | * | * | * | |||||
| GCA/Ala | * | * | * | * | * | * | ||||||||||
| GCG/Ala | ||||||||||||||||
| TAT/Tyr | * | * | * | |||||||||||||
| TAC/Tyr | * | * | * | * | * | * | * | |||||||||
| CAT/His | * | * | * | * | * | * | ||||||||||
| CAC/His | * | * | ||||||||||||||
| CAA/Gln | * | * | * | * | ||||||||||||
| CAG/Gln | * | * | * | * | * | * | ||||||||||
| AAT/Asn | * | * | * | * | * | * | * | * | * | |||||||
| AAA/Lys | * | * | * | |||||||||||||
| AAG/Lys | * | * | * | * | * | * | * | * | * | * | * | * | ||||
| GAT/Asp | * | * | * | * | * | * | * | |||||||||
| GAA/Glu | * | * | * | * | ||||||||||||
| GAG/Glu | * | * | * | * | * | * | ||||||||||
| TGT/Cys | * | * | * | |||||||||||||
| TGC/Cys | * | * | * | * | * | * | * | * | * | * | ||||||
| CGT/Arg | * | * | * | * | * | * | * | * | * | * | * | * | * | |||
| CGC/Arg | * | * | * | * | * | * | * | * | * | * | * | * | ||||
| CGA/Arg | * | * | * | * | * | * | ||||||||||
| CGG/Arg | ||||||||||||||||
| AGA/Arg | * | * | * | * | * | * | * | |||||||||
| AGG/Arg | * | * | ||||||||||||||
| GGT/Gly | * | * | * | * | * | * | * | * | * | * | * | * | * | * | * | * |
| GGC/Gly | * | * | * | * | * | |||||||||||
| GGA/Gly | * | * | * | * | * | * | * | * | * |
Species are in the following order: A = C. sinensis; B = C. aurantifolia; C = C. aurantium; D = C. clementina; E = C. jambhiri; F = C. limettioides; G = C. limonia; H = C. medica; I = C. reshni; J = C. reticulata; K = C. reticulata × C. temple; L = C. sinensis × P. trifoliata; M = P. trifoliata; N = C. unshiu; O = C. paradisi; and P = C. paradisi × P. trifoliata. AA represents amino acid.
Figure 4.GC3 gradient across Citrus species and P. trifoliata. Species order: A = C. sinensis; B = C. aurantifolia; C = C. aurantium; D = C. clementina; E = C. clementina × C. tangerina; F = C. jambhiri; G = C. japonica var. margarita; H = C. limettioides; I = C. limonia; J = C. medica; K = C. reshni; L = C. reticulata; M = C. reticulata × C. temple; N = C. sinensis × P. trifoliata; O = C. sunki; P = P. trifoliata; Q = C. unshiu; R = C. paradisi; and S = C. paradisi × P. trifoliata. These plots show gradients of GC3 for 5% of GC3-rich and GC3-poor genes for all analysed genomes. Gradients from 5′ and 3′ ends of the CDS are shown in the same plot, separated by a vertical line. For all genomes, GC3-rich genes become more GC3 rich towards the middle of the CDS.
Figure 5.Distribution of stress-related genes in Citrus species and P. trifoliata according to the GO of Arabidopsis.
Figure 6.Hamming distance versus change in GC3 visualization across Citrus species and P. trifoliata. Relative difference GC3 composition between genes A and B, defined as GC3 (A)-GC3 (B)/[GC3 (A) + GC3 (B)] is positively correlated with the Hamming distance between corresponding protein sequences, calculated as 1–[I (AB) + I (BA)]/[I (AA) + I (BB)], where I (AB) is the number of identities in alignment of protein A to protein B. The resulting Hamming distance was rounded to 1 DP, and an average difference in GC3 was computed for each category.