Agnieszka Kiliszek1, Katarzyna Banaszak1, Zbigniew Dauter2, Wojciech Rypniewski3. 1. Institute of Bioorganic Chemistry, Polish Academy of Sciences, Noskowskiego 12/14, 61-704 Poznan, Poland. 2. Synchrotron Radiation Research Section, MCL, National Cancer Institute, Argonne National Laboratory, Argonne, IL 60439, USA. 3. Institute of Bioorganic Chemistry, Polish Academy of Sciences, Noskowskiego 12/14, 61-704 Poznan, Poland wojtekr@ibch.poznan.pl.
Abstract
PNA is a promising molecule for antisense therapy of trinucleotide repeat disorders. We present the first crystal structures of RNA-PNA duplexes. They contain CUG repeats, relevant to myotonic dystrophy type I, and CAG repeats associated with poly-glutamine diseases. We also report the first PNA-PNA duplex containing mismatches. A comparison of the PNA homoduplex and the PNA-RNA heteroduplexes reveals PNA's intrinsic structural properties, shedding light on its reported sequence selectivity or intolerance of mismatches when it interacts with nucleic acids. PNA has a much lower helical twist than RNA and the resulting duplex has an intermediate conformation. PNA retains its overall conformation while locally there is much disorder, especially peptide bond flipping. In addition to the Watson-Crick pairing, the structures contain interesting interactions between the RNA's phosphate groups and the Π electrons of the peptide bonds in PNA.
PNA is a promising molecule for antisense therapy of trinucleotide repeat disorders. We present the first crystal structures of RNA-PNA duplexes. They contain CUGrepeats, relevant to myotonic dystrophy type I, and CAGrepeats associated with poly-glutamine diseases. We also report the first PNA-PNA duplex containing mismatches. A comparison of the PNA homoduplex and the PNA-RNA heteroduplexes reveals PNA's intrinsic structural properties, shedding light on its reported sequence selectivity or intolerance of mismatches when it interacts with nucleic acids. PNA has a much lower helical twist than RNA and the resulting duplex has an intermediate conformation. PNA retains its overall conformation while locally there is much disorder, especially peptide bond flipping. In addition to the Watson-Crick pairing, the structures contain interesting interactions between the RNA's phosphate groups and the Π electrons of the peptide bonds in PNA.
RNA-mediated diseases are genetic disorders in which a mutation results in toxic mRNA rather than gene silencing or malfunctioning protein. In a large class of the RNA-mediated diseases, known as Tri-nucleotideRepeat Expansion Disorders (TREDs), the mutated gene contains an abnormally expanded microsatellite sequence consisting of trinucleotiderepeats (1). Tetra-, penta- or even hexanucleotiderepeats also are known to be pathogenic (2). In a normal gene, the number of repeated units is usually less than 30 but when it exceeds 40–50, pathology begins (3).Myotonic dystrophy type 1 was the first neurological disorder attributed to pathogenic RNA. It is linked to CTGrepeats located in the 3′ untranslated region (3′UTR) of DMPK gene (4). The transcript contains extended CUGrepeats and shows abnormal affinity for several cellularproteins, including MBNL—the regulator of alternative splicing which acts antagonistically to CUG-binding protein (CUG-BP). The sequestration of MBNL lowers the level of available protein while the level of CUG-BP increases. The combined effect is a change in the alternative splicing pattern (5). Although splicing proceeds, the mRNA is incorrect for the type or age of the cell.Another type of pathogenic microsatellites consist of CAGrepeats. They are located mainly in coding regions and their translation leads to toxic proteins containing polyglutamine (polyQ) runs. However, there are indications that RNA is also involved in the pathogenesis (6–9). Transcripts containing CAGrepeats co-localize with MBNL and otherproteins (i.e. SRSF6, nucleolin, U2AF65) involved in splicing or otherprocesses affecting the function of the cell (10–12).Myotonic dystrophy and polyQ diseases usually affect adults. They are progressive with no effective treatment but in recent years therapeutic approaches are under intensive development (13,14). One approach relies on antisense oligonucleotides (ASO) which serve as blockers binding directly to complementary target to prevent interactions of the repeat RNA with nuclearproteins (15). They can also inhibit the expression of toxic polyQproteins. To be effective they should exhibit high affinity and target selectivity, good cellular uptake and stability in the cell. Natural oligomers are not suitable for this purpose because they are sensitive to nucleases. Several modified ASOs have been proposed instead. A promising class of compounds are peptide nucleic acids (PNA) which mimic nucleic acids while containing charge-neutral N-(2-aminoethyl)glycine instead of electronegative sugar phosphodiester backbone (Figure 1) (16). Despite significantly different chemical nature, PNA hybridizes with high affinity to complementary RNA or DNA, obeying the canonical Watson–Crick rules (17). On the other hand, PNA exhibits outstanding base-pairing selectivity, which is manifest by low tolerance to even single mismatches. Mismatches destabilize PNA–DNA duplexes by at least a factor of two, in terms of free energy, compared to the effect of mismatches on DNA–DNA duplexes (18–21). PNA is resistant to nucleases and proteases (22) and can be easily modified to adjust its permeability across the cell membrane (23). Promising results of using PNA against CAGrepeats were obtained by Corey and colleagues (24,25). They targeted mutant transcripts of the huntingtin gene associated with Huntington disease (HD) and mutated mRNA of ATX3related to spinocerebellar ataxia type 3 (SCA3). One of the antisense PNA, composed of six CTGrepeats inhibited selectively the translation of the polyQprotein in HD cells, while not affecting the expression of genes containing normal CAGrepeats and not causing cellulartoxicity.
Figure 1.
The chemical structure of PNA (A) and RNA (B). ‘B’ stands for a nucleobase.
The chemical structure of PNA (A) and RNA (B). ‘B’ stands for a nucleobase.Structural studies aimed at understanding the molecular bases of the pathogenic mechanism of TREDs revealed that extended CUG and CAGrepeats formed hairpin structures. The main part of the hairpin is the double-stranded stem which interacts with proteins. Crystallography revealed that duplexes of the CNG tracks, representing the stem of the hairpin, have the form of A-RNA consisting of G–C and C–G pairs interrupted by non-canonical N–N pairs which bestow specific properties on the repeats (26).The main goal of the research presented here was to solve the crystal structures of RNA containing CUG and CAGrepeats in complex with antisense PNA. We wanted to see how the PNA recognized the RNA and how it affected the structure of the repeats. We also wanted to understand PNA's outstanding sequence selectivity. Here we present crystal structures of two duplexes of RNA, containing CUG and CAGrepeats, with complementary PNA; and one PNA–PNA duplex with T–T mismatches. To our knowledge these are the first crystallographic RNA–PNA structures. The PNA–PNA model is the first example of a PNA duplex containing non-Watson–Crick pairs.
MATERIALS AND METHODS
Synthesis, purification and crystallization of the RNA–PNA and PNA–PNA duplexes
The PNA oligomers were synthesized and purified commercially in the laboratory of prof. PiotrRekowski from the University of Gdańsk. Crude RNA oligomers were purchased from Future Synthesis and were subsequently purified using the TLC method on silica gel plates with ammonia/1-propanol/water solvent. RNA was eluted with water and lyophilized under vacuum using Speed-Vac.All oligomers were dissolved in water and then mixed to the final concentration of 1 mM each. MgCl2 was added only to the solution of r(GCAGCAGC)-p(GCTGCTGC). Before the crystallization, all PNA and RNA mixtures were annealed for 3–5 min at 95°C, then cooled to ambient temperature within 2–3 h. Crystals were grown by the sitting drop method at 19°C. The crystals of r(GCAGCAGC)-p(GCTGCTGC) grew in 40 mM MgCl2, 50 mM HEPES pH 7.0 and 1.6 M (NH4)2SO4. The crystallization media of the second r(GCUGCUGC)-p(GCAGCAGC) duplex contained 200 mM KCl, 50 mM MgCl2, 50 mM TRIS pH 7.5 and 10% PEG 4000. The p(GCTGCTGC)2 duplex crystallized in 6 mM MgCl2, 0.03 M HEPES-Na pH 7.0 and 2.4 M LiCl.
X-ray data collection, structure solution and refinement
X-ray diffraction data were collected on the BL 14.2 beam line at the BESSY synchrotron in Berlin and on EMBL P13, PETRA III, Hamburg. Only the PNA–PNA crystal was additionally cryoprotected by 3M sodium formate (v/v) in the mother liquor. All three crystals diffracted to atomic resolution (Table 1). The data were integrated and scaled using the program suite XDS (27) or DENZO and SCALEPACK (28). The space group and the cell parameters of the two RNA–PNA crystals were virtually the same (Table 1). The structure of r(GCAGCAGC)-p(GCTGCTGC) was solved by molecularreplacement (MR) using PHASER (29). The search model was one RNA strand of GGCAGCAGCC oligomer (PDB code: 3NJ6). The phases from MR were used in a free-atom refinement using ARP/wARP (30). The R-factor/R-free statistics were improving continuously from the first cycle of adding/removing atoms and after 40 cycles they reached values of 34/37%. The free-atom electron density map showed the RNA and PNA atoms clearly resolved. The manual rebuilding and map inspection were done using Coot (31). The second r(GCAGCAGC)-p(GCTGCTGC) structure was solved by taking the phases from the first r(GCAGCAGC)-p(GCTGCTGC) model. The final refinement of all structures was performed in Refmac5 (32).
Table 1.
Data collection and refinement statistics
rCAG-pCTG
rCUG-pCAG
p(GCTGCTGC)2
Data collection
Space group
P3121
P3121
P212121
Cell dimensions
a, b, c (Å)
35.2, 35.2, 68.9
35.1, 35.1, 69.1
28.8, 42.9, 64.3
Resolution (Å)
30.0–1.15 (1.22–1.15)a
30.0–1.14 (1.21–1.14)
30.0–1.06 (1.08–1.06)
Rsym
0.064 (0.75)
0.051 (0.90)
0.068 (0.60)
I / σI
18.7 (3.0)
23.1 (2.0)
23.9 (2.2)
Completeness (%)
98.2 (96.4)
98.4 (90.3)
99.8 (99.7)
Redundancy
9.0 (8.7)
9.8 (7.4)
4.5 (4.1)
Refinement
Resolution (Å)
30.0–1.15 (1.22–1.15)
30.0–1.14 (1.21–1.14)
30.0–1.06 (1.08–1.06)
No. reflections
16914 (891)
17374 (1109)
35227 (1464)
Rwork / Rfree
13.9 / 17.7
12.1 / 15.3
16.3 / 19.3
No. atoms
RNA or PNA
555
416
1681
Ligand/ion
2
3
21
Water
124
121
229
B-factors (Å2)
RNA or PNA
12.7
12.5
11.4
Ligand/ion
46.9
24.5
20.6
Water
26.1
26.4
24.3
R.m.s. deviations
Bond lengths (Å)
0.024
0.025
0.02
Bond angles (°)
2.8
2.6
2.8
PDB code
5EME
5EMF
5EMG
aValues in parentheses are for highest-resolution shell.
aValues in parentheses are for highest-resolution shell.In the case of PNA–PNA, the space group was assigned as P212121. POINTLESS (33) indicated that the space group could be Pbca but closer inspection revealed reflections which would be systematically absent in the centrosymmetric space group. Solution of the p(GCTGCTGC)2 model was obtained by direct methods using SHELXD (34). A run of 1000 phase trials produced clear solutions with 62 phase sets characterized by the correlation coefficient (CC) all/weak larger than 59/32%, whereas all remaining trials had these values smaller than 43/21%. The CC all/weak values are the correlation coefficients between the observed (Eobs) and calculated (Ecalc) normalized structure factors for 6757 reflections used for phase evaluation and a group of weakerreflections, not taking part in phase estimation, respectively. The best SHELXD phase trial (with CC all/weak of 61.63/36.02%) produced the atomic model comprising 340 atoms. The electron density map computed from this model was clearly interpretable and allowed building of the complete and correct sequence of residues in the PNA molecule, that was submitted to furtherrefinement. Despite the high resolution, the PNA–PNA structure had a high level of disorder, especially in the backbone. Most of it could be modeled as two alternative conformation. Restrains for the PNA backbone were generated based on the small-molecule database CSD (35). The helical parameters were calculated with 3DNA (36). All pictures were drawn using UCSF Chimera (37) and PyMOL v0.99rc6 (38). Atomic coordinates of the crystallographic models have been deposited with the Protein Data Bank (accession codes 5EME, 5EMF and 5EMG).
RESULTS
rCUG-PNA and rCAG-PNA
X-ray stpan class="Chemical">ructures of two RNA–PNA duplexes were analyzed: r(GCUGCUGC)-p(GCAGCAGC) and r(GCAGCAGC)-p(GCTGCTGC), abbreviated here as rCUG-PNA and rCAG-PNA. The statistics of the data and the final model are shown in Table 1. Both duplexes are fully complementary with Watson–Crick base pairs. They have the form of A-RNA with an unusually low twist: 26° forrCUG-PNA and rCAG-PNA, compared to 33° for a canonical A-RNA (Figure 2B and C). The rise is low (2.4 Å) compared to values found for A-helices 2.6–3.3 Å. The base pair inclination angle is 14°, somewhat lower than for a typical A-RNA (16°). Average X-displacement, ≈−6 Å, exceeds typical values of A-RNA (−5 Å) or B-DNA (−4 Å) (Table 2).
Figure 2.
Comparison of the overall structure of mixed RNA–PNA duplex against PNA and RNA (pdb code 3glp) duplexes. PNA has a markedly lower twist than RNA and the mixed duplex has an intermediate value of the twist.
Table 2.
Helical parameters
rCUG-PNA rCAG-PNA
p(GCTGCTGC)2
A-RNAa
P-PNAb
Form
A
P
A
P
Twist(°)
26
20 (18 for G–C; 24 for T–T)
33
19
Rise (Å)
2.4
3.9
2.6-3.3
3.5
Inclination (°)
14
12 (9 for G–C; 19 for T–T)
16
3
x-displacement (Å)
−6
−9 (−6 for G–C; -15 for T–T)
−5
−7
aParameters for A-RNA were taken from Saenger (1984) (40).
bThe parameters were calculated using seven available 3D structures of PNA-PNA duplexes (PDB codes: 1PUP, 1RRU, 1QPY, 1HZS, 2K4G, 3MBS, 3MBU) (35,44–48).
Comparison of the overall structure of mixed RNA–PNA duplex against PNA and RNA (pdb code 3glp) duplexes. PNA has a markedly lower twist than RNA and the mixed duplex has an intermediate value of the twist.aParameters for A-RNA were taken from Saenger (1984) (40).bThe parameters were calculated using seven available 3D structures of PNA-PNA duplexes (PDB codes: 1PUP, 1RRU, 1QPY, 1HZS, 2K4G, 3MBS, 3MBU) (35,44–48).Crystal packing is similar in both duplexes. The 3′-end of each duplex is stacked end-to-end against a molecule related to it by the crystallographic 2-fold axis. Therefore, the 3′-end of the RNA strand is next to the N-terminus of the PNA strand of the other duplex, and vice versa. The other end of the duplex makes no base-stacking interactions but adjoins the PNA strand of a neighboring molecule (Supplementary Figure S1). The phosphate group of the 3′-terminal RNA residue interacts with the last peptide link on the PNA's C-terminal side. The distance between the phosphate O atom and the carbonyl C atom is 3.0 Å—closer than the sum of the Van der Waals radii, and the O-C line is perpendicular to the peptide plane. The O atom also forms a salt bridge with the terminal amine group of another PNA molecule (distance = 2.9 Å) and a water molecule (2.8 Å) (Figure 3A). In the rCUG-PNA structure, the two P-O bonds appear to be unequal: the length of the interacting O atom is 1.47 Å (e.s.d. = 0.02 Å), the other bond is 1.42 Å long (e.s.d. = 0.03 Å). The difference, if true, would indicate an increased polarity of the longer bond. In the rCAG-PNA duplex, the distances cannot be measured reliably because of a double conformation of the RNA backbone. In the otherphosphate groups, the P-O bonds appear equivalent. Another interaction (at 3.1 Å) between carbonyl C and O from a phosphate group is observed between guanine 4 and cytosine 5 residue from a neighboring duplex. At the other end of the duplex, the N-terminal carbonyl C from the carboxymethyl group is approached (3.1–3.2 Å) in a similar way by the O2' atom at the 5′ RNA end of another duplex (Figure 3B). The carboxyl groups of PNA are exposed to a large solvent-filled cavity and interact only with ordered water molecules.
Figure 3.
Stereo pairs showing Π…lone-pair interactions implied by the close contacts between the peptide plane of PNA and: the phosphate group of RNA (A), the hydroxyl O2′ atom of the ribose moiety (B). Both O atoms make a close approach to the electron-depleted C atom of the peptide bond. Distances are marked in Å. Water molecules are shown as red spheres.
Stereo pairs showing Π…lone-pair interactions implied by the close contacts between the peptide plane of PNA and: the phosphate group of RNA (A), the hydroxyl O2′ atom of the ribose moiety (B). Both O atoms make a close approach to the electron-depleted C atom of the peptide bond. Distances are marked in Å. Water molecules are shown as red spheres.
PNA–PNA
Crystals of p(GCTGCTGC)2 were obtained from a mixture of p(GCTGCTGC) and r(GCUGCUGC). Instead of RNA–PNA duplexes containing U-T mismatches, we obtained crystals of PNA–PNA with T–T non-canonical base pairs (Table 1).PNA forms semi-infinite columns in which the pan class="Chemical">right-handed and left-handed duplexes p(GCTGCTGC)2 are stacked alternately (Supplementary Figure S2). Each duplex contains two T–T mismatches lodged between standard C–G and G–C pairs. The T–T pairs can be classified according to Leontis and Westhof as WC–WC pairs (39) or an asymmetric homo-pyrimidine XVI (40) (Figure 4). There are two hydrogen bonds between the thyminerings: N3H···O4 (2.7–2.9 Å donor–acceptor distance) and O2···HN3 (3.1–3.3 Å). One of the rings is inclined toward the minor groove more than the other. The distance between the PNA strands, measured between the C8' atoms, is 10.7 Å for the C–G pairs and 9.0 Å for the T–T pairs. The helical twist is low: 18° for the canonical base pairs and 24° for the T–T pairs. Rise is very high: 3.9 Å/b.p.. Angle is 5–11° for the canonical base pairs and 17–20° for T–T. The T–T pairs are associated with a jump in X-displacement between each T and the following G (Table 2; Figure 2A).
Figure 4.
T–T base pairing interactions in the PNA–PNA duplex. Hydrogen bonds and donor–acceptor distances are indicated. The inter-strand distance is also shown, measured between the C8' atoms. Contours represent the 2F electron density map at the 1 σ level.
T–T base pairing interactions in the PNA–PNA duplex. Hydrogen bonds and donor–acceptor distances are indicated. The inter-strand distance is also shown, measured between the C8' atoms. Contours represent the 2F electron density map at the 1 σ level.One of the T–T pairs forms an inner complex with a Na+ ion in the ‘major groove’ (the term major groove is used by analogy with nucleic acids, but this side of the PNA duplex is in fact convex). The other T–T pair is associated with a Na+ ion which interacts with it though ordered water molecules in the minor groove and forms an inner complex with the carbonyl O atom of the less inclined T. There are 12 ordered Cl− ions: most interacting in the minor groove with the exo-amino groups of guaninerings or on the other side of the duplex, interacting with the exo-amino groups of the cytosinerings. Some are wedged between the backbones of the neighboring molecules and interact with the carbonyl C atoms.
PNA backbone conformation
The carbonyl O atoms of the carboxymethyl moieties point in the direction of the C-terminus and most make intra-strand CH···O bonds with the C8 atoms of guanine/adenine orC6 atoms of cytosine/thyminerings. In some residues, they also participate in CH···O interactions with the C8' atoms (donor–acceptor distance 3.1–3.3 Å) (Figure 5). All the amide bonds are trans and can be observed in one of two conformations differing by a 180° flip of the peptide plane. In one, the carbonyl O atom points toward the minor groove, in the other it points outward (Figure 5). The conformation of the peptide bond has little effect on the overall structure. This is clear from a comparison of the two duplexes, in the two different crystal structures, which are closely superposable although they have different distributions of the two conformations. Some residues are statically disordered, showing both conformations. The backbone dihedral angles at the Cα atom of the glycine moieties correspond to the two most common conformations found on the Ramachandran (ψ–φ) diagram forglycineresidues in proteins. A detailed comparison of the PNA backbone conformations in this and other solved structures is given in Supplementary Table S1.
Figure 5.
Intra-strand interactions in PNA. The carbonyl O atoms (red) of the carboxymethyl moiety usually make a H-bond with the neighboring nucleobase. The peptide planes are found in two possible conformations, differing by 180°. Some residues show static disorder with both conformations superposed, having only a local effect on the structure.
Intra-strand interactions in PNA. The carbonyl O atoms (red) of the carboxymethyl moiety usually make a H-bond with the neighboring nucleobase. The peptide planes are found in two possible conformations, differing by 180°. Some residues show static disorder with both conformations superposed, having only a local effect on the structure.
Comparison of PNA–RNA and PNA–PNA duplexes
A comparison with nucleic acids is not straightforward because the duplexes containing PNA have only one groove, corresponding to the minor groove of nucleic acids. The other side of the duplex has the form of a crest formed by the nucleobases (Supplementary Figure S3). The groove is filled with ordered water molecules.PNA strands are hydrated in a characteristic way: in the minor groove each residue has an ordered water molecule wedged between the base and the peptide backbone. The base always acts as a hydrogen bond acceptor (N3 in purines, O2 in pyrimidines) while the backbone supplies either an NH group or a carbonyl O, depending on the orientation of the peptide plane (Figure 6). Hydration of RNA strands in the minor groove consists of water molecules interacting with the 2′ hydroxyl groups and with the functional groups of the base rings (same as above). The ribosering and the bases can share one water molecule or they can interact with separate waters.
Figure 6.
Hydration patterns in RNA (A) and PNA chains (B). Water molecules are shown as red spheres, hydrogen bonds are indicated with dashed lines. In PNA, water molecules link the nucleobases with the peptide backbone. Flipping of the peptide bond does not affect the position of the water molecule.
Hydration patterns in RNA (A) and PNA chains (B). Water molecules are shown as red spheres, hydrogen bonds are indicated with dashed lines. In PNA, water molecules link the nucleobases with the peptide backbone. Flipping of the peptide bond does not affect the position of the water molecule.Base-stacking is observed primarily within steps CG/GC, in both PNA–RNA (9–11 Å2) and PNA–PNA (8 Å2) duplexes. On the other hand, pairs T–T in PNA–PNA interact the least, and the effect is mainly cross-strand with guaninerings in step TG/CT (4 Å2) (Supplementary Figure S4).
DISCUSSION
PNA is a fully synthetic molecule that has raised hopes and expectations as a tool in biotechnology and medicine (16). It can be delivered to the living cell and is resistant to proteases and nucleases, which makes it a promising therapeutic agent. PNA oligomers hybridize in a sequence-specific manner to DNA and RNA. Their strong sequence specificity is apparent also in this study. Only fully complementary RNA–PNA duplexes could be obtained, while co-crystallization trials of mismatching RNA with PNA resulted only in crystals of mismatched PNA–PNA. Such sequence selectivity is a desirable property for antisense therapeutic strategies against TREDs, which target anomalous RNA structures containing mismatches while minimizing the risk of interfering with the genetic material.PNA has been described as a flexible analogue of nucleic acids but it is clear that it has a well-defined intrinsic architecture, which has to be considered when explaining PNA's properties or designing strategies for applying it.A comparison of the PNA–RNA and PNA–PNA structures reveals PNA's own conformational preferences and the effect it exerts within a heteroduplex. A homoduplex of PNA is strongly unwound (20°/bp, 18 bp/turn) compared to RNA or DNA, but when it accommodates complementary RNA, the resulting double helix shows an intermediate conformation (twist = 26°) between pure PNA and typical A-RNA (≈33°). The two studied RNA–PNA duplexes are isomorphic, having identical helical parameters despite different sequences. The comparisons indicate that RNA and PNA have a similar ease of adapting to one another and the resulting duplex is halfway between PNA–PNA and RNA–RNA structures.The PNA p(GCTGCTGC) duplex can also be compared to an RNA duplex of similar sequence: (GCUGCUGC)2 (41) (Figure 2). In addition to having a lower helical twist, PNA shows a greater backbone flexibility, allowing the thyminerings to come closer (inter-strand distance = 9.0 Å) and form two hydrogen bonds, whereas in RNA the predominant form is the ‘stretched’ U–U pair with only one H-bond and 10.4 Å between the strands. Stacking interactions in the two comparable oligomers are slightly more extensive in the RNA duplex.The rules of sequence complementapan class="Chemical">rity are strongly enforced in interactions between PNA and nucleic acids. None of the available PNA–DNA structures contain mismatches and our attempts to match RNA with PNA also resulted in fully complementary duplexes. On the other hand, we demonstrate that mismatches are tolerated within a PNA–PNA duplex. Perhaps mismatches in PNA homoduplexes could form as easily as in RNA structures. The relative instability of heteroduplexes containing mismatches can be explained by the additional need of the two strands to overcome their different conformational preferences (Figure 2).
Flipping of the peptide bond in PNA occurs readily and has no effect on the overall structure. Consequently, water molecules interacting with the carbonyl oxygen atom or the amine group have no effect on the structure either. The othercarbonyl group, in the carboxymethyl moiety, has a fixed orientation, despite the rotational freedom about the χ1, χ2 and χ3 torsion angles (Figure 1 and Supplementary Table S1). The orientation of the CO groups toward the C-terminus confers polarity on the whole PNA chain. In addition, the carbonyl oxygen atoms are hydrogen bonded with aromatic CH groups of neighboring nucleobases (C8H8 forguanine and adeninerings, C6H6 forcytosine and thymine). The donor–acceptor distances of 3.2 Å (hydrogen–acceptor 2.3 Å) attest to relatively strong interactions for an H-bond involving a C atom. Polarization of the CH bonds could be facilitated by the aromatic environment of the bases, making carbon a betterdonor.In addition to Watson–Crick base pairing, crystal structures reveal interesting PNA–RNA interactions between the backbones of neighboring duplexes. Electronegative groups (O− from phosphate or hydroxyl O from ribose) approach the carbonyl C atoms of the peptide bond. There is no evidence that this interaction destabilizes the peptide bond but the geometry is similar to that found during the nucleophilic attack of a serine protease on a peptide bond. Such an arrangement indicates at least that the C-atom is electron-depleted.Finally, PNA's role has been proposed as a primordial carrier of genetic information (42). Two conditions were set to support this speculation: that components of PNA be found in primitive microorganisms and that sequence-specific PNA–PNA and PNA–RNA interactions be demonstrated. The first condition has been met when N-(2-aminoethyl)glycine was found in cyanobacteria (43), while examples of meaningful interactions between the oligomers are presented in this paper. The sensitive element of PNA is the carbonyl carbon of the peptide bond and the catalytically active group of RNA is the 2′ hydroxyl, and we see them interacting in the crystal structure.
Authors: Eric F Pettersen; Thomas D Goddard; Conrad C Huang; Gregory S Couch; Daniel M Greenblatt; Elaine C Meng; Thomas E Ferrin Journal: J Comput Chem Date: 2004-10 Impact factor: 3.376
Authors: J D Brook; M E McCurrach; H G Harley; A J Buckler; D Church; H Aburatani; K Hunter; V P Stanton; J P Thirion; T Hudson Journal: Cell Date: 1992-02-21 Impact factor: 41.582