| Literature DB >> 18827062 |
Pamela Mukhopadhyay1, Surajit Basak, Tapash Chandra Ghosh.
Abstract
Intra-genomic variation between housekeeping and tissue-specific genes has always been a study of interest in higher eukaryotes. To-date, however, no such investigation has been done in plants. Availability of whole genome expression data for both rice and Arabidopsis has made it possible to examine the evolutionary forces in shaping codon usage pattern in both housekeeping and tissue-specific genes in plants. In the present work, we have taken 4065 rice-Arabidopsis homologous gene pairs to study evolutionary forces responsible for codon usage divergence between housekeeping and tissue-specific genes. In both rice and Arabidopsis, it is mutational bias that regulates error minimization in highly expressed genes of both housekeeping and tissue-specific genes. Our results show that, in comparison to tissue-specific genes, housekeeping genes are under strong selective constraint in plants. However, in tissue-specific genes, lowly expressed genes are under stronger selective constraint compared with highly expressed genes. We demonstrated that constraint acting on mRNA secondary structure is responsible for modulating codon usage variations in rice tissue-specific genes. Thus, different evolutionary forces must underline the evolution of synonymous codon usage of highly expressed genes of housekeeping and tissue-specific genes in rice and Arabidopsis.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18827062 PMCID: PMC2608846 DOI: 10.1093/dnares/dsn023
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Average values of synonymous substitution rates for housekeeping and tissue-specific classes of genes in highly expressed (HEG) and lowly expressed genes (LEG) of rice
| Housekeeping | Tissue specific | Level of significance (b) | |
|---|---|---|---|
| HEG | 3.12 | 3.74 | |
| LEG | 3.34 | 3.41 | NS |
| Level of significance (a) |
Level of significance (a) indicates significance of the difference between highly (HEG) and lowly (LEG) expressed housekeeping and tissue-specific genes of rice.
Level of significance (b) indicates significance of the difference between highly (HEG) expressed housekeeping and tissue-specific genes of rice and lowly (LEG) expressed housekeeping and tissue-specific genes of rice.
NS indicates not-significant.
RSCU values of highly expressed (HEG) and lowly expressed (LEG) housekeeping and tissue-specific genes in rice
| AA | Codons | RSCU (HEG) | RSCU (LEG) | tRNA copy number of | AA | Codons | RSCU (HEG) | RSCU (LEG) | tRNA copy number of |
|---|---|---|---|---|---|---|---|---|---|
| Phe | TTT | 0.78 (0.57) | 0.94 (0.72) | 0 | Ala | GCT | 1.12 (0.63) | 1.09 (0.76) | 25 |
| 1.22 (1.43) | 1.06 (1.28) | 15 | 1.17 (1.43) | 1.13 (1.34) | 0 | ||||
| Tyr | TAT | 0.81 (0.59) | 1 (0.76) | 0 | GCA | 0.82 (0.54) | 0.95 (0.72) | 11 | |
| 1.19 (1.41) | 1 (1.24) | 16 | GCG | 0.88 (1.4) | 0.84 (1.17) | 13 | |||
| His | CAT | 0.96 (0.74) | 1.04 (0.91) | 0 | Gly | GGT | 1.01 (0.69) | 1.05 (0.73) | 0 |
| 1.04 (1.26) | 0.96 (1.09) | 11 | 1.3 (1.83) | 1.05 (1.63) | 24 | ||||
| Asn | AAT | 0.88 (0.84) | 1.14 (0.89) | 0 | GGA | 0.86 (0.69) | 0.95 (0.78) | 13 | |
| 1.12 (1.16) | 0.86 (1.11) | 14 | GGG | 0.82 (0.79) | 0.95 (0.86) | 8 | |||
| Asp | GAT | 1.05 (0.84) | 1.18 (0.92) | 0 | Leu | TTA | 0.37 (0.38) | 0.55 (0.41) | 7 |
| 0.95 (1.16) | 0.82 (1.08) | 28 | TTG | 0.98 (0.88) | 1.22 (0.97) | 9 | |||
| Cys | TGT | 0.69 (0.39) | 0.98 (0.66) | 0 | CTT | 1.3 (0.75) | 1.28 (0.95) | 19 | |
| 1.31 (1.61) | 1.02 (1.34) | 10 | 1.57 (1.94) | 1.34 (1.74) | 0 | ||||
| Gln | 0.7 (0.87) | 0.85 (0.78) | 16 | CTA | 0.41 (0.49) | 0.56 (0.51) | 8 | ||
| CAG | 1.3 (1.13) | 1.15 (1.22) | 13 | CTG | 1.37 (1.56) | 1.05 (1.43) | 6 | ||
| Lys | AAA | 0.56 (0.55) | 0.79 (0.67) | 10 | Ser | TCT | 1.14 (0.73) | 1.21 (0.92) | 17 |
| 1.44 (1.45) | 1.21 (1.33) | 22 | TCC | 1.2 (1.24) | 1.17 (1.23) | 0 | |||
| Glu | GAA | 0.7 (0.64) | 0.83 (0.7) | 15 | TCA | 1.06 (0.68) | 1.19 (0.94) | 10 | |
| 1.3 (1.36) | 1.17 (1.3) | 29 | TCG | 0.76 (1.18) | 0.64 (0.94) | 7 | |||
| Val | GTT | 1.21 (0.7) | 1.26 (0.87) | 21 | AGT | 0.75 (0.84) | 0.84 (0.68) | 0 | |
| 1.09 (1.24) | 0.92 (1.23) | 0 | AGC | 1.1 (1.34) | 0.95 (1.28) | 13 | |||
| GTA | 0.39 (0.32) | 0.56 (0.39) | 4 | Arg | 0.8 (0.61) | 0.63 (0.56) | 16 | ||
| GTG | 1.32 (1.74) | 1.27 (1.52) | 10 | 1.37 (1.71) | 1.26 (1.42) | 0 | |||
| Pro | CCT | 1.16 (0.68) | 1.14 (0.89) | 16 | CGA | 0.39 (0.34) | 0.48 (0.49) | 4 | |
| 0.83 (0.98) | 0.86 (0.8) | 0 | CGG | 0.88 (1.05) | 0.94 (1.12) | 7 | |||
| CCA | 1.1 (0.78) | 1.16 (0.99) | 11 | AGA | 0.96 (0.65) | 1.19 (0.96) | 9 | ||
| CCG | 0.91 (1.56) | 0.83 (1.32) | 10 | AGG | 1.6 (1.63) | 1.51 (1.45) | 10 | ||
| Thr | ACT | 1.09 (0.79) | 1.12 (0.84) | 9 | Ile | ATT | 1.12 (0.76) | 1.24 (0.98) | 23 |
| 1.26 (1.44) | 0.96 (1.23) | 0 | 1.32 (1.8) | 1 (1.39) | 0 | ||||
| ACA | 1.05 (0.7) | 1.38 (0.96) | 8 | ATA | 0.56 (0.45) | 0.76 (0.63) | 6 | ||
| ACG | 0.61 (1.07) | 0.54 (0.98) | 0 |
RSCU values within parenthesis represent tissue-specific genes of rice, and the values outside represent housekeeping rice genes. Arrows indicate the correspondence between codon and their isoaccepting tRNA based on revised wobble rules. Codons marked with asterisk hold a perfect correspondence with most abundant tRNA gene copy number in both housekeeping and tissue-specific genes. Codons marked with superscript H shows higher preference in highly expressed housekeeping genes. Codons marked with superscript T shows higher preference in highly expressed tissue-specific genes.
RSCU values of highly expressed (HEG) and lowly expressed (LEG) housekeeping and tissue-specific genes in Arabidopsis
| AA | Codons | RSCU (HEG) | RSCU (LEG) | tRNA copy number of Arabidopsis | AA | Codons | RSCU (HEG) | RSCU (LEG) | tRNA copy number of Arabidopsis |
|---|---|---|---|---|---|---|---|---|---|
| Phe | TTT | 0.87 (1.05) | 1.04 (1.13) | 0 | Ala | 1.9 (1.8) | 1.73 (1.76) | 16 | |
| 1.13 (0.95) | 0.96 (0.87) | 16 | 0.7 (0.65) | 0.57 (0.57) | 0 | ||||
| Tyr | TAT | 0.81 (0.97) | 1.12 (1.13) | 0 | GCA | 0.95 (1.1) | 1.18 (1.07) | 10 | |
| 1.19 (1.03) | 0.88 (0.87) | 76 | GCG | 0.45 (0.45) | 0.52 (0.6) | 7 | |||
| His | CAT | 1.01 (1.18) | 1.24 (1.32) | 0 | Gly | 1.55 (1.32) | 1.33 (1.33) | 1 | |
| 0.99 (0.82) | 0.76 (0.68) | 10 | GGC | 0.5 (0.54) | 0.46 (0.46) | 23 | |||
| Asn | AAT | 0.84 (1) | 1.04 (1.1) | 0 | GGA | 1.47 (1.53) | 1.57 (1.53) | 12 | |
| 1.16 (1) | 0.96 (0.9) | 16 | GGG | 0.48 (0.61) | 0.64 (0.69) | 5 | |||
| Asp | GAT | 1.26 (1.28) | 1.38 (1.38) | 0 | Leu | TTA | 0.57 (0.69) | 0.82 (0.87) | 6 |
| 0.74 (0.72) | 0.62 (0.62) | 26 | TTG | 1.36 (1.39) | 1.16 (1.38) | 10 | |||
| Cys | TGT | 1.16 (1.11) | 1.21 (1.21) | 0 | CTT | 1.73 (1.57) | 1.62 (1.53) | 12 | |
| TGC | 0.84 (0.89) | 0.79 (0.79) | 15 | CTC | 1.25 (1.01) | 1.12 (0.93) | 1 | ||
| Gln | CAA | 0.94 (1.01) | 1.14 (1.13) | 8 | CTA | 0.48 (0.58) | 0.68 (0.66) | 10 | |
| 1.06 (0.99) | 0.86 (0.87) | 9 | CTG | 0.62 (0.76) | 0.6 (0.64) | 3 | |||
| Lys | AAA | 0.75 (0.86) | 1.07 (1.03) | 13 | Ser | TCT | 1.71 (1.68) | 1.61 (1.65) | 37 |
| 1.25 (1.14) | 0.93 (0.97) | 18 | 0.82 (1.64) | 0.69 (0.7) | 1 | ||||
| Glu | GAA | 0.88 (0.97) | 1.06 (1.09) | 12 | TCA | 1.1 (1.23) | 1.24 (1.28) | 9 | |
| 1.12 (1.03) | 0.94 (0.91) | 13 | TCG | 0.66 (0.58) | 0.77 (0.59) | 4 | |||
| Val | GTT | 1.74 (1.58) | 1.69 (1.76) | 15 | AGT | 0.87 (1) | 0.94 (0.99) | 0 | |
| 0.83 (0.88) | 0.7 (0.66) | 0 | AGC | 0.83 (0.87) | 0.75 (0.78) | 13 | |||
| GTA | 0.39 (0.42) | 0.6 (0.63) | 7 | Arg | 1.33 (1.28) | 1 (0.97) | 9 | ||
| GTG | 1.04 (1.12) | 1.01 (0.96) | 8 | 0.46 (0.45) | 0.34 (0.43) | 0 | |||
| Pro | CCT | 1.56 (1.57) | 1.52 (1.72) | 16 | CGA | 0.53 (0.67) | 0.69 (0.78) | 6 | |
| CCC | 0.5 (0.41) | 0.39 (0.46) | 0 | CGG | 0.34 (0.34) | 0.72 (0.5) | 4 | ||
| CCA | 1.32 (1.38) | 1.33 (1.28) | 45 | AGA | 1.87 (1.87) | 2.16 (2.25) | 9 | ||
| CCG | 0.62 (0.64) | 0.76 (0.54) | 5 | AGG | 1.47 (1.39) | 1.08 (1.08) | 8 | ||
| Thr | ACT | 1.53 (1.47) | 1.39 (1.44) | 10 | Ile | ATT | 1.29 (1.26) | 1.25 (1.24) | 19 |
| 1.02 (1.01) | 0.7 (0.83) | 0 | 1.3 (1.14) | 1.07 (0.98) | 0 | ||||
| ACA | 0.96 (0.95) | 1.33 (1.21) | 8 | ATA | 0.42 (0.6) | 0.68 (0.78) | 5 | ||
| ACG | 0.49 (1.58) | 0.59 (0.52) | 6 |
RSCU values within parenthesis represent tissue-specific genes of Arabidopsis, and the values outside represent housekeeping Arabidopsis genes. Arrows indicate the correspondence between codon and their isoaccepting tRNA based on revised wobble rules. Codons marked with asterisk hold perfect correspondence with most abundant tRNA copy number in both housekeeping and tissue-specific genes. Codons marked with superscript H show significantly (P < 0.05) higher preference in highly expressed housekeeping genes.
Average error minimization values (wRn) of housekeeping and tissue-specific classes of genes in highly expressed (HEG) and lowly expressed genes (LEG) of rice
| Housekeeping (wRn) | Tissue specific (wRn) | |
|---|---|---|
| HEG | −0.39463 | −0.26440 |
| LEG | −0.28266 | −0.24700 |
| Level of significance | NS |
Level of significance between highly expressed (HEG) and lowly expressed (LEG) housekeeping and tissue-specific genes of rice is shown. NS indicates average values of error minimization (wRn) not significant between highly and lowly expressed tissue-specific genes of rice.
Average error minimization values (wRn) of housekeeping and tissue-specific classes of genes in highly expressed (HEG) and lowly expressed genes (LEG) of Arabidopsis
| Housekeeping (wRn) | Tissue specific (wRn) | |
|---|---|---|
| HEG | −0.1937 | −0.1514 |
| LEG | −0.0059 | 0.0531 |
| Level of significance |
Level of significance between highly expressed (HEG) and lowly expressed (LEG) housekeeping and tissue-specific genes of Arabidopsis is shown.
Average GC3 values for housekeeping and tissue-specific classes of genes in highly expressed (HEG) and lowly expressed genes (LEG) of Arabidopsis
| Housekeeping | Tissue specific | |
|---|---|---|
| HEG | 45.64 | 45.34 |
| LEG | 41.88 | 40.46 |
| Level of significance |
Level of significance between highly expressed (HEG) and lowly expressed (LEG) housekeeping and tissue-specific genes of Arabidopsis is shown.
Average GC3 values for housekeeping and tissue-specific classes of genes in highly expressed (HEG) and lowly expressed genes (LEG) of rice
| Housekeeping | Tissue specific | |
|---|---|---|
| HEG | 69.12 | 68.71 |
| LEG | 62.17 | 65.84 |
| Level of significance | NS |
Level of significance between highly expressed (HEG) and lowly expressed (LEG) housekeeping and tissue-specific genes of rice is shown. NS indicates average values of GC3 not significant between highly and lowly expressed tissue-specific genes of rice.