| Literature DB >> 15113406 |
Jan C Biro1, Josephine M K Biro.
Abstract
BACKGROUND: There are two different theories about the development of the genetic code. Woese suggested that it was developed in connection with the amino acid repertoire, while Crick argued that any connection between codons and amino acids is only the result of an "accident". This question is fundamental to understand the nature of specific protein-nucleic acid interactions.Entities:
Mesh:
Substances:
Year: 2004 PMID: 15113406 PMCID: PMC394317 DOI: 10.1186/1471-2105-5-30
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1ClustalW alignment of PstI sequences and the PstI recognition sequence. The PstI sequence was compared to PstI RS repeats (direct, D and reverse, R readings) by two different ways: as nucleic acids [NA, N] or as overlappingly translated [OTS, P] and regularly translated [PROT] sequences. The result of the alignments is visualized by Jalview where the shaded areas emphasize sequence identity. Only a short fragment of the alignments is shown.
Figure 2Recognition sequence-like sites in the restriction enzymes: Multiple Sequence Alignment (MSA) of 12 REs (PstI isoschizomers) and their common RS. The protein sequences of the enzymes and the overlappingly translated sequences (OTS) of the RS direct (Dir, D) and reverse (Rev, R) readings were compared. The first alignment includes all 14 sequences (colored by conservation) while the other 12 alignments indicate individual comparisons of the enzymes to their common RS (colored by conservation). The section of the alignments seen corresponds to the entire protein sequence (326 amino acids) of PstI. The vertical lines between the alignments indicate the conserved residues which align to the direct (red) and reverse (green) readings of the RS.
Figure 3Statistical evaluation of MSA (RS/RE) The number of conserved residues shown in Figure 2 was counted (RS-D/RE and RS-R/RE). In the control experiments similar alignments were constructed but the RS was replaced by shuffled RS (non-RS/RE) or the RE was replaced by shuffled RE (RS/non-RE). Each bar represents the mean +/- S.E.M, n = 12.
Restriction enzymes with known crystallographic structure
| BamHI | G†GATCC | GDISPRLA | 1BHM | X55285 | 642 | P23940 | 213 | GGCCTA-1, GCCTA-1, TAGG-1, GGAT-2, AGGC-1, GATC-1 |
| BgIII | A†GATCT | RNISLX | 1DFM | U49842 | 672 | - | 223 | AGATC-1, TAGAT-1, AGAT-5, GATC-2, CTAG-2 |
| EcoRI | G†AATTC | ENIPSR | 1ERI | J01675 | 909 | P00642 | 302 | CGAATT-1, AGCTTA-1, TTCGA-2, AAGCT-3, AATTC-2, AATT-8, TTAA-6, TAAG-3, CTTA-3, TCGA-3, GAAT-4, AAGC-3, TTCG-1 |
| EcoRV | GAT†ATC | NIPSR | 1AZO | X00530 | 738 | P04390 | 245 | ATATCG-1, ATATC-1, GATAT-3, TATAG-1, ATAT-11, TATC-2, GATA-2, TATA-4, CTAT-2 |
| NaeI | GCC†GGC | APRG | 1IAW | U09581 | 954 | - | 317 | GGCGCCGG-1, GCCGCGG-1, CGGCGC-2, GCCGCG-1, CGCGC-2, GCCGG-1, CGCGG-2, GCGGC-2, GGCCG-2, CGGCC-1, CGGC-3, CCGG-5, GGCG3, CCGC-4, GCCG-3, GGCC-1, CGCG-4 |
| NgoMIV | G†CCGGC | APRG | 1FIU | M86915 | 861 | P31032 | 286 | CCGGCG-1, GCCGC-4, GGCG-1, GCCG-3, CGCC-2, CCGC-1, GCGG-2, CGGC-2 |
| PvuII | CAG†CTG | OSALCA | 1PVI | AF305615 | 474 | P23657 | 157 | AGCT-2, GCTG-1 |
AC#: accession number, N.A.: nucleic acid, A.A.: amino acid, †: cut site
Figure 4The location of RS-like sequences in the 3D structure of REs. The figure shows one subunit of the RE and the dsDNA (the RS). The color code of the ribbon backbone indicates the length of the RS-like strings: yellow = 3, orange = 4, pink = 5, red >= 6. The solid spirals indicate the dsDNA; the red and blue lines are the RSs while the white parts are not RSs. (The EcoRI structure is an exception, there is only a single DNA strand and only one enzyme subunit).
Figure 5Expected (E) vs. Found (F) RS-like oligonucleotides in the REs. Expected and observed numbers (N) of RS-like nucleotides from 4 to 8 residues long are shown. Statistically significant E – F differences are indicated. NS: not significant, *: single value. For details see the Results.
Figure 6Amino acids in REs coded by RS-like codons and co-located with RSs. The solid spirals indicate the dsDNA; the red/pink and blue/light blue lines are the RSs while the white lines are not RSs. (EcoRI is an exception, there is only a single DNA strand), a.a.: amino acid. The ribbons belonging to the amino acids are green.
Codon-amino acid co-location in 7 REs: The proportion of the involved codon residues
| 41.5 +/- 3.6 | 20.5 +/- 3.1 | 2.2 +/- 0.4 | 5 +/- 0.4 | 18.2 +/- 2.8 | 18.7 +/- 3.1 |
| p < 0.001 | NS | NS | |||
A (1st), B (2nd), C (3rd)-residues in the codon-like sequences of the RSs. C-: nucleic acids in the codon-like sequences of the control (shuffled) RSs. Mean +/- S.E.M., n = 7 (i.e. 7 groups corresponding to the 7 RE). NS: not significant.
Codon-amino acid co-location: The shortest atomic distances (Å)
| BamHI | 6.2_0.9 (9) | NS |
| BgiII | 5.9_0.4 (9) | NS |
| EcoRI | 5.1_0.8 (5) | NS |
| EcoRV | 6.9_0.8 (8) | NS |
| NaeI | 3.9_0.5 (9) | P < 0.05 |
| NgoMIV | 5.4_0.6 (10) | NS |
| PvuII | 5.0_0.6 (5) | NS |
| Average | 5.5_0.2 (55) | |
| Hydrophobic | 7.3_0.3 (15) | p < 0.01 |
| Hydrophilic | 5.8_0.3 (19) | NS |
| Positively charged | 3.4_0.2 (15) | p < 0.001 |
| Negatively charged | 5.3_0.6 (6) | NS |
* compared to the average, NS: not significant
Figure 7Co-location of codon-like triplets and amino acids in RE-RS complexes. Examples are taken from Figure 6.
Figure 8Co-location of codon-like triplets and amino acids in RE-RS complexes. Examples are taken from Figure 6.