| Literature DB >> 17974594 |
Xiang Jia Min1, Donal A Hickey.
Abstract
Variations in GC content between genomes have been extensively documented. Genomes with comparable GC contents can, however, still differ in the apportionment of the G and C nucleotides between the two DNA strands. This asymmetric strand bias is known as GC skew. Here, we have investigated the impact of differences in nucleotide skew on the amino acid composition of the encoded proteins. We compared orthologous genes between animal mitochondrial genomes that show large differences in GC and AT skews. Specifically, we compared the mitochondrial genomes of mammals, which are characterized by a negative GC skew and a positive AT skew, to those of flatworms, which show the opposite skews for both GC and AT base pairs. We found that the mammalian proteins are highly enriched in amino acids encoded by CA-rich codons (as predicted by their negative GC and positive AT skews), whereas their flatworm orthologs were enriched in amino acids encoded by GT-rich codons (also as predicted from their skews). We found that these differences in mitochondrial strand asymmetry (measured as GC and AT skews) can have very large, predictable effects on the composition of the encoded proteins.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17974594 PMCID: PMC2779903 DOI: 10.1093/dnares/dsm019
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Partition of the vertebrate mitochondrial genetic code into GT-rich, CA-rich and ‘other’ codons
GT-rich codons (italic) include GT, TG, GG, TT codons at the first two codon positions. TTA and TTG are excluded as there are other codons for Leu. CA-rich codons (bold) include CA, AC, CC, AA codons at the first two codon positions. Different codon assignments in Mammals and Platyhelminthes are underlined. In Platyhelminthes, AGA and AGG code for Ser, ATA for Ile, and AAA for Asn. The numbers following each codon are codon usage per thousand codons in the 11 conserved proteins of mammals (the first number) and Platyhelminthes (the second number).
Figure 1Contrasting patterns of DNA strand asymmetry in the mitochondrial coding sequences of mammals and Platyhelminthes. (A) The GC skew, (G − C)/(G + C), is negative for all mammalian species (shown in blue), and positive for all Platyhelminthe species (shown in red). (B) The AT skew, (A − T)/(A + T), is positive for mammals (shown in blue) and negative for Platyhelminthes (shown in red).
Figure 2The amino composition of mitochondrial proteins reflects the DNA strand asymmetry. (A) The proportions of cysteine (C), Valine (V), phenylalanine (F), Glycine (G), and Tryptophan (W) are relatively low in mammalian proteins (shown in blue) and relatively high in flatworms (shown in red). These amino acids are encoded by GT-rich codons. (B) The proportions Glutamine (Q), Threonine (T), Proline (P), Histidine (H), Asparagine (N), and Lysine (K) are, in contrast, relatively high in mammals (shown in blue) and relatively low in flatworms (shown in red). These amino acids are encoded by CA-rich codons.
Figure 3The proportions of individual amino acids that were most affected by DNA strand asymmetry in mammals and Platyhelminthes. (A) The proportions of cysteine and Valine are low in mammalian proteins (shown in blue) and high in flatworm proteins (shown in red). (B) The proportions of Glutamine and Threonine are high in mammals (shown in blue) and low in flatworms (shown in red).