Literature DB >> 25940631

Loop flexibility in human telomeric quadruplex small-molecule complexes.

Gavin W Collie1, Nancy H Campbell1, Stephen Neidle2.   

Abstract

Quadruplex nucleic acids can be formed at the ends of eukaryotic chromosomes. Their formation and stabilisation by appropriate small molecules can be used as a means of inhibiting the telomere maintenance functions of telomerase in human cancer cells. The crystal structures have been determined for a number of complexes between these small molecules and human telomeric DNA and RNA quadruplexes. The detailed structural characteristics of these complexes have been surveyed here and the variations in conformation for the TTA and UUA loops have been explored. Loop conformations have been classified in terms of a number of discrete types and their distribution among the crystal structures. Sugar conformation and backbone angles have also been examined and trends highlighted. One particular loop class has been found to be most prevalent. Implications for in particular, rational drug design, are discussed.
© The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 25940631      PMCID: PMC4446451          DOI: 10.1093/nar/gkv427

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


BACKGROUND—QUADRUPLEX FORM AND FUNCTION

Sequences containing short repetitive G-tracts occur in a number of genomic contexts. They are often able to form higher-order structures, termed quadruplexes (1–3), especially under the influence of appropriate small molecules, proteins or negative supercoiling (4). Historically, the first such sequence category to be so characterised has been in eukaryotic telomeres, which comprise tandem repeats of a G-tract-containing DNA sequence together with associated telomeric proteins. The hexanucleotide repeat in human telomeric DNA is 5′-TTAGGG (5). Almost all of a telomeric DNA sequence (length varies from ∼5 to ∼10 kb) is in duplex form, with the exception of the ∼150–200 nucleotides at the 3′-terminus which are single-stranded (6). This single-stranded DNA (the ‘overhang’) can be folded to form either bimolecular (two-repeat) or unimolecular (four-repeat) quadruplexes, once the associated single-stranded binding proteins (principally several copies of hPOT1), have been competed away (7,8). The concept that quadruplex-binding small molecules can stabilise telomeric quadruplexes (9), has been widely used to discover small molecules with potential anti-cancer activity via inhibition of the action of the telomerase enzyme complex (10–14). Telomerase is a key hallmark of cancer and is highly over-expressed in many cancer cell types whereas it is only expressed at low levels in normal somatic cells, suggesting that it is a plausible anti-cancer target (15,16). The reverse transcriptase action of telomerase catalyses the synthesis of 5′-TTAGGG repeats onto the overhang, to counteract the loss of repeats during replication. This overhang is required to be in a single-stranded form to hybridise with the RNA domain of the telomerase complex and for the catalytic cycle to proceed. Small-molecule induction of the overhang into a four-stranded quadruplex structure, which can be augmented by a quadruplex-binding small molecule (9), results in the overhang becoming inaccessible to the telomerase RNA template, thus effectively inhibiting telomerase catalytic function (17).

SMALL MOLECULE BINDING TO QUADRUPLEX DNA AND RNA

The demonstration of this concept, that telomerase inhibition can be achieved via quadruplex stabilisation with anthraquinone derivatives (9,18) and subsequently with many other small molecules (10–13) has led to the widespread development of this approach to selectively targeting human cancers. Information on the large and structurally diverse superfamily of quadruplex-binding small molecules is available from the G4LDB database www.g4ldb.org (19). Several compounds including the acridine derivatives BRACO-19 (20), RHPS4 (21), the cyclic natural product telomestatin (22) and its analogues, and some tetra-substituted naphthalene diimide (ND) derivatives (23), have been shown to selectively affect telomere maintenance in cancer cells. Mechanistic studies have demonstrated that the cellular responses to these agents, which occur considerably more rapidly than would be expected on the basis of a classic telomerase inhibition and telomere attrition model, involve the activation of a DNA damage response, presumably by sensing the presence of a quadruplex nucleic acid signal which is not associated with and protected by, telomeric proteins (20,21,24–30). This response is an important factor in the rapidity and selectivity of the growth-inhibitory action of these quadruplex-binding ligands in tumour cells and tumour xenografts (19,20,24). The frequency of occurrence of quadruplex-forming sequences other than at telomeres in the human genome has been evaluated by informatics approaches (31–33). These studies have shown that such sequences are especially prevalent in promoter sequences (34) and in 5′-UTRs (35–37). Their over-representation in a number of oncogene promoter sequences (33) has led to the hypothesis that promoter quadruplexes can be targets for therapeutic intervention using small molecules which stabilise a particular quadruplex structure within a promoter sequence (38,39). This in principle results in inhibition of the transcription of that particular gene. The concept has been evaluated with small molecules targeting in particular the c-MYC and c-KIT promoter quadruplexes (see for example 10–13,39), although the challenge remains of devising small molecules capable of selectively targeting a particular quadruplex in the absence of off-target effects on other quadruplexes. Importantly, the concept of quadruplexes as targets in human cancer has been validated by their direct visualisation in human cells and tissues (40–42). Extensive structural data on native human telomeric quadruplexes (HTQs) are now available from crystallographic and nuclear magnetic resonance (NMR) studies (43–53), whereas there is currently structural information available on only a small number of non-telomeric quadruplexes, of which the c-KIT, c-MYC and BCL-2 promoter quadruplexes are the best-studied (54–64). Detailed NMR structures have been determined for three small-molecule complexes with c-MYC quadruplexes: bound to the porphyrin ligand TMPyP4 (PDB ID 2A5R) (65), bound to the bis-quinolinium compound Phen-DC(3) (PDB ID 2MGN) (66) and with a drug-like quindoline compound (PDB ID 2L7V) (67). The available structural data on human telomeric quadruplex (HTQ) complexes with small molecules are dominated by results from X-ray crystallographic studies, with 16 structures deposited to date in the PDB (68–76). Of the few NMR studies on human telomeric quadruplex-small molecule complexes that have been reported, only one is of an intramolecular complex, bound to a derivative of the drug telomestatin (PDB ID 2MB3) (77). The present study focusses on the crystal structures and in particular, analyses aspects of their conformational variability in order to establish unifying trends and features. There are currently no crystal structures available for small-molecule complexes with promoter or 5′-UTR quadruplexes and only a very few NMR structures (involving promoter quadruplexes), but some of the underlying principles outlined below will also apply to these complexes. One of the small-molecule complexes in the database is with a bimolecular human telomeric RNA (TERRA) quadruplex (71), and this is complemented by the availability of a crystal structure of the native RNA quadruplex (78). The overwhelming majority of quadruplex-binding small molecules reported to date possess planar groupings and their mode of binding predominantly involves 3′ or 5′ end-stacking onto terminal G-quartets (Figure 1). Intercalative binding has not been observed experimentally for these ligands and is widely considered not to be consistent with biophysical data on quadruplex-small molecule complexes (11–13). A small number of quadruplex-binding ligands, mostly based on a polyamide motif, have been proposed to bind in quadruplex grooves rather than on G-quartet surfaces. There is no detailed experimental structural information on their interaction with human telomeric bimolecular or unimolecular quadruplexes and so groove-binding structures will not be considered any further here.
Figure 1.

(A) View of the human telomeric quadruplex native and ligand complex crystal structures, with nucleic acids drawn in stick form and ligands in surface representation. Metal ions (potassiums) are shown as purple spheres. Individual loops are highlighted with sticks in single colours. The colour coding corresponds to the more detailed view of each loop type shown in Figure 2. (B) Structures of individual ligands, shown with the PDB IDs for the structures in which they occur.

(A) View of the human telomeric quadruplex native and ligand complex crystal structures, with nucleic acids drawn in stick form and ligands in surface representation. Metal ions (potassiums) are shown as purple spheres. Individual loops are highlighted with sticks in single colours. The colour coding corresponds to the more detailed view of each loop type shown in Figure 2. (B) Structures of individual ligands, shown with the PDB IDs for the structures in which they occur.
Figure 2.

(A) Views of the various TTA loop types (coloured as in Figure 1), grouped into 12 distinct categories. (B) (i) The two sub-types of type-1 loops. Sub-type 1 (left) is from PDB structure 1KF1, sub-type 2 (right) is from PDB structure 3T5E. The structures shown here have been structurally aligned and are presented in the same orientation. (ii) An example of the structural diversity of non-type-1 loops: overlay of a type-4 loop (pink, from PDB structure 3CCO), a type-10 loop (green, from PDB structure 3SC8) and a type-1 loop (red, from PDB structure 1KF1). The non-type-1 loops are dramatically different to the ‘native’ type-1 TTA loop—as well as to one another. Note the retention of base-stacking interactions in the type-10 loop arrangement, which is a common feature of non-type-1 loops.

As a consequence of the rise of the quadruplex targeting concept, a large number of diverse quadruplex-binding small molecules have been devised and evaluated. Most of these have been discovered by biophysical screens of compound libraries (11–13). Rather fewer studies have employed rational design and optimisation of ligands using structural data from X-ray crystallography and NMR studies. In silico docking methods, mostly using native quadruplex structures as starting-points, have been used in a few instances to screen large compound libraries, (79–81). Experimental structures are crucial starting-points for in silico studies so it is hoped that the analyses and correlations presented here will help provide sound starting-points for such future studies.

THE FLEXIBILITY OF HUMAN TELOMERIC QUADRUPLEXES

It is now well-established that intramolecular quadruplexes comprising four repeats of the human telomeric sequence 5′-TTAGGG, can form a variety of topological isomers in solution, differing in the orientation of the backbone and the nature of the loops formed by the ‘spacer’ TTA triplet sequences (which can adopt lateral, diagonal or parallel arrangements dependent on the orientation of the attached strand). A number of these quadruplex topological isomers have been identified by NMR methods (43–50); the parallel form with all loops of the propeller type (also termed strand-reversal) is the sole form found in the X-ray crystallographic studies (52). The observation of a particular quadruplex topological isomer in solution is dependent on a number of factors, primarily quadruplex concentration, the presence or otherwise of a molecular crowding environment, the nature of the counter-ion and the flanking sequences at the 5′ and 3′ ends (see for example 51,82–85). There continues to be controversy as to which topological isomer is the most biologically relevant, although it is plausible that both parallel and anti-parallel ones can co-exist in the high local concentration of cellular conditions. Small molecule binding can induce the stabilisation of a particular form (for example the induction of the parallel topology by N-methylmesoporphyrin IX (86)); although there is to date very incomplete understanding of the molecular basis for the preference for a particular topology; accessible surface area of a terminal G-quartet has been suggested as being an important factor (87). This diversity in native HTQ topologies has not been reflected in the folds observed in the crystal structures of small-molecule complexes with HTQs. All have been found to have a parallel topology, regardless of whether they are bimolecular or unimolecular complexes (Table 1). They crystallise in ten different crystallographic space groups, strongly suggesting that crystal packing forces are not the determinant of the single observed (parallel) topology. The present survey has examined in particular the principal nucleic acid conformational variable regions in these structures, the TTA loops. The structural cores of all quadruplexes comprise two or more stacked G-quartets. In the case of HTQ crystal structures the core comprises three stacked G-quartets, which has been observed to be exceptionally stable. Structural alignment of the three stacked G-quartets of all HTQ crystal structures analysed in this work using the native HTQ structure (PDB ID 1KF1) as a reference gives an average r.m.s.d of just 0.86 Å (see Supplementary Table S1 and Figure S1). This stability of the HTQ G-quartets has also been confirmed by molecular dynamics simulations (88–91). Accordingly, the geometry of the G-quartet core has not been surveyed here.
Table 1.

Native DNA and RNA human telomeric quadruplexes and small-molecule complex crystal structures showing PDB codes, basic crystallographic data and TTA/UUA loop types

PDB IDSequenceDescription of structureResolution (Å)Space groupR-freeR-factorNo. of crystallographically unique loopsLoop typeRef.
1K8Pd[UBrAG3UBrTAG3T]Native DNA2.40P312 10.2800.1932aType-1(52)
1KF1d[AG3(T2AG3)3]Native DNA2.10P60.2630.2313Type-1 (x3)(52)
2HRId[TAG3T2AG3]Porphyrin TMPyP4 complex2.09C22210.2570.2082Type-2(68)
Type-3
3CE5d[TAG3T2AG3T]Acridine complex2.50I40.2130.1842Type-1(70)
Type-7
3CCOd[TAG3T2AG3T]NDb complex2.20P64220.3030.2571Type-4(69)
3CDMd[TAG3(T2AG3)3]ND complex2.10P210.2950.2346Type-1 (x4)(69)
Type-5
Type-6
3IBKr(UBrAG3U2AG3U)Native RNA2.20P31210.2310.2162cType-1(78)
3MIJr(UAG3U2AG3U)RNA acridine complex2.60P230.2480.2361Type-8(71)
3QCRd[TAG3T2AG3T]Acridine complex.3.20P62220.3780.3461dNA(71)
3SC8d[AG3(T2AG3)3]ND complex2.30P31210.2880.2603Type-1(72)
Type-10
Type-11
3T5Ed[AG3(T2AG3)3]ND complex2.10P60.2820.2453Type-1 (x3)(72)
3QSCd[AG3T2AG3T2]Salphen complex2.40C2220.2340.2141Type-9(75)
3QSFd[AG3T2AG3T2]Salphen complex2.40C2220.3200.2401Type-9(75)
3UYHd[AG3(T2AG3)3]ND complex1.95P31210.2750.2313Type-1(76)
Type-10
Type-11
4DA3d[G3(T2AG3)3]ND complex2.40P31210.2820.2423Type-1(76)
Type-10
Type-12
4DAQd[G3(T2AG3)3]ND complex2.75P31210.2750.2123Type-1(76)
Type-10
Type-12
3R6Rd[AG3(T2AG3)3]Berberine complex2.30P60.2520.2173Type-1 (x3)(73)
4G0Fd[AG3(T2AG3)3]Mesoporphyrin complex2.15P60.2620.2393Type-1 (x3)(74)
4FXMd[AG3(T2AG3)3]Mesoporphyrin complex1.65P212120.2620.2243Type-1 (x3)(74)
Total no. of experimentally determined TTA or UUA loops: 43
Total no. of distinct TTA loop types: 12
Total no. of type-1 TTA loops occurrences: 26
Total number of crystal structures: 19
Total no. of distinct space groups: 10
Total no. of HTQ-ligand complexes: 16

a+cChain B loop not resolved in the electron density; bND, Naphthalene diimide; dLoop poorly resolved in the electron density.

a+cChain B loop not resolved in the electron density; bND, Naphthalene diimide; dLoop poorly resolved in the electron density.

Methodology used in the survey

Atomic coordinates for the telomeric quadruplex structures were extracted from the Protein Data Bank (www.rcsb.org), with geometric and conformational analyses performed using the 3DNA program (92) at www.x3dna.org and visualised with the programs CHIMERA (93) at www.cgl.ucsf.edu/chimera and PyMOL (94) at www.pymol.org. Each loop was analysed individually and no averaging of conformational features was done. Disordered and poorly resolved loops were excluded from the analyses. Sugar puckers were assigned directly from pseudorotation parameters calculated by 3DNA from crystallographic coordinates.

STRUCTURAL ANALYSIS AND BACKBONE DIHEDRAL ANGLE COMPARISON OF HTQ TTA LOOPS

There are currently 19 X-ray crystal structures of native and small-molecule-complexed human telomeric DNA and RNA G-quadruplexes available in the Protein Data Bank. All of these contain d(TTA) (or r(UUA)) propeller loops (Figure 1a, Table 1). This data set comprises 17 DNA and two RNA (i.e. TERRA/telRNA) structures. The structures of individual ligands are shown in Figure 1b. These bimolecular and unimolecular quadruplex native and ligand-bound structures represent a total of 43 crystallographically-determined propeller-type loops (this number excludes three poorly-resolved loops: see Table 1). The range of resolution in these analyses is 1.65–3.2 Å, with an average of 2.30 Å. The loops have been categorised into 12 distinct groups (termed type-1, type-2…etc) on the basis of their overall three-dimensional shape (see Table 2 and Figure 2a, b).
Table 2.

Qualitative TTA/UUA loop descriptors in terms of base stacking

Loop typeStacked basesMutually perpendicular basesOrientation of 3rd base
1T1, A3T2, A3T2 close to A3
2T1, T2-A3 away from T1, T2
3T2, A3T1, A3T1 close to A3
4-T2, A3T1 away from T2, A3
5T2, A3-T1 close to T2, A3
6T2, A3T1T1 away from T2, A3
7T2, A3T1 close to T2
8U2, A3U1U1 away from U2, A3
9No stacking-All away from each other
10T2, A3-T1 away from T2, A3
11No stackingT1, T2A3 close to T1
12T2, A3-T1 away from T2, A3
(A) Views of the various TTA loop types (coloured as in Figure 1), grouped into 12 distinct categories. (B) (i) The two sub-types of type-1 loops. Sub-type 1 (left) is from PDB structure 1KF1, sub-type 2 (right) is from PDB structure 3T5E. The structures shown here have been structurally aligned and are presented in the same orientation. (ii) An example of the structural diversity of non-type-1 loops: overlay of a type-4 loop (pink, from PDB structure 3CCO), a type-10 loop (green, from PDB structure 3SC8) and a type-1 loop (red, from PDB structure 1KF1). The non-type-1 loops are dramatically different to the ‘native’ type-1 TTA loop—as well as to one another. Note the retention of base-stacking interactions in the type-10 loop arrangement, which is a common feature of non-type-1 loops.

Type-1 propeller loops

By far the most commonly observed loop type is the intercalated ‘TAT’ loop found in the native telomeric DNA structures (PDB IDs 1KF1 and 1K8P), termed type-1 loops. This loop arrangement always involves the adenine residue π-π stacking above the first thymine, with the second (central) thymine adopting one of two similar orientations: either positioned perpendicular to the G-quartet planes or stacked (or nearly stacked) on the external face of the adenine (Figure 2b (i)). Type-1 loops have been found to occur in 13 crystal structures to date (including both native and ligand-complexes), with a total of 26 occurrences overall (therefore comprising over 60% of all crystallographically-observed TTA loops). This strongly suggests that the type-1 propeller loop is an energetically favourable and stable arrangement, which is independent of crystal packing mode—Table 1 also shows that these structures occur in a variety of crystallographic space groups (five distinct space groups in total). It is perhaps significant that, although ligand binding events often result in significant rearrangement of the type-1 loop geometry (described below), ten distinct quadruplex-ligand complexes exist in which the type-1 loop is preserved. This implies further that the type-1 loop arrangement is a structurally robust motif. Comparison of the backbone and glycosidic dihedral angles for the type-1 loops of each of the 13 structures which contain such a loop reveals a high degree of conservation in backbone geometry (Figure 3a). The only significant deviation can be seen for a 23-mer-naphthalene diimide derivative complex structure (PDB ID 3CDM), which has changes in the ε and ζ angles for the loop adenine residue (Figure 3a).
Figure 3.

Plots of nucleotide conformational angles in the TTA loops. (A) Backbone and glycosidic torsion angles. These are shown for all structures containing at least one type-1 TTA (or UUA) loop. For structures which contain more than one type-1 loop only the first TTA loop is represented. (B) Comparison of backbone and glycosidic torsions angles. These are shown of the three type-1 propeller loops of the native human telomeric unimolecular quadruplex (PDB ID 1KF1) (each loop coloured separately; red, green or blue). (C) Comparison of backbone and glycosidic torsions angles. These are for the three type-1 propeller loops of a naphthalene diimide telomeric quadruplex complex (PDB ID 3T5E) (each loop coloured separately; red, green or blue). (D) Comparison of the backbone and glycosidic torsions angles. These are for the three type-1 propeller loops of the mesoporphyrin complex with a human telomeric quadruplex (PDB ID 4FXM) (each loop coloured separately; red, green or blue). (E) Comparison of the backbone and glycosidic torsions angles. These are for the three type-1 propeller loops in the berberine complex with a human telomeric quadruplex (PDB ID 3R6R) (each loop coloured separately; red, green or blue). (F) Dihedral angle plots for each of the TTA residues of each of the 12 loop types (each trace represents a distinct loop type). The loop 1 of 1KF1 (solid black line) can be used as a common reference. Dihedral angles are derived from loops from PDB structures 1KF1 (1), 2HRI (2 and 3), 3CCO (4), 3CDM (5 and 6), 3CE5 (7), 3MIJ (8), 3QSC (9), 3SC8 (10 and 11) and 4DA3 (12). (G) Type-9 loop backbone and glycosidic dihedral angles. These are from bimolecular human quadruplexes (PDB IDs 3QSF and 3QSC) complexed with salphen ligands. (H) Comparison of backbone and glycosidic dihedral angles in the TTA loops of the naphthalene diimide (ND) complex structures. Type-1 TTA loops are shown as blue dashed lines. Non-type-1 loops are shown as grey dashed lines. Structures represented are: PDB IDs 3SC8, 3T5E, 3UYH, 4DA3 and 4DAQ. (I) Comparison of TTA loop backbone and glycosidic dihedrals of naphthalene diimide complex crystal structures, for type-1 loops only.

Plots of nucleotide conformational angles in the TTA loops. (A) Backbone and glycosidic torsion angles. These are shown for all structures containing at least one type-1 TTA (or UUA) loop. For structures which contain more than one type-1 loop only the first TTA loop is represented. (B) Comparison of backbone and glycosidic torsions angles. These are shown of the three type-1 propeller loops of the native human telomeric unimolecular quadruplex (PDB ID 1KF1) (each loop coloured separately; red, green or blue). (C) Comparison of backbone and glycosidic torsions angles. These are for the three type-1 propeller loops of a naphthalene diimide telomeric quadruplex complex (PDB ID 3T5E) (each loop coloured separately; red, green or blue). (D) Comparison of the backbone and glycosidic torsions angles. These are for the three type-1 propeller loops of the mesoporphyrin complex with a human telomeric quadruplex (PDB ID 4FXM) (each loop coloured separately; red, green or blue). (E) Comparison of the backbone and glycosidic torsions angles. These are for the three type-1 propeller loops in the berberine complex with a human telomeric quadruplex (PDB ID 3R6R) (each loop coloured separately; red, green or blue). (F) Dihedral angle plots for each of the TTA residues of each of the 12 loop types (each trace represents a distinct loop type). The loop 1 of 1KF1 (solid black line) can be used as a common reference. Dihedral angles are derived from loops from PDB structures 1KF1 (1), 2HRI (2 and 3), 3CCO (4), 3CDM (5 and 6), 3CE5 (7), 3MIJ (8), 3QSC (9), 3SC8 (10 and 11) and 4DA3 (12). (G) Type-9 loop backbone and glycosidic dihedral angles. These are from bimolecular human quadruplexes (PDB IDs 3QSF and 3QSC) complexed with salphen ligands. (H) Comparison of backbone and glycosidic dihedral angles in the TTA loops of the naphthalene diimide (ND) complex structures. Type-1 TTA loops are shown as blue dashed lines. Non-type-1 loops are shown as grey dashed lines. Structures represented are: PDB IDs 3SC8, 3T5E, 3UYH, 4DA3 and 4DAQ. (I) Comparison of TTA loop backbone and glycosidic dihedrals of naphthalene diimide complex crystal structures, for type-1 loops only. Backbone dihedral comparison for the structures which contain multiple type-1 loops reveals a similar level of angle conservation. For example, the native 22-mer structure (PDB ID 1KF1) and the naphthalene diimide derivative-complexed 22-mer (PDB ID 3T5E, with identical sequence) both contain three type-1 loops, which display a significant level of intramolecular correlation when comparing backbone dihedrals (Figure 3b, c, respectively). A similar level of correlation in dihedral angles is also seen within the 22-mer complexes involving mesoporphyrin (PDB ID 4FXM) and berberine (PDB ID 3R6R) (Figure 3d, e, respectively).

Non type-1 propeller loops

Of the 43 crystallographically observed TTA (or UUA) propeller loops, less than 40% (17 in total) adopt conformations other than the intercalated type-1 loop arrangement described above. These non-type-1 loops adopt a wide variety of dissimilar and diverse structures in the crystal state, which can be divided into 11 distinct groups (type-2 to type-12) based on the orientation of the three loop bases (Figure 2a and Tables 1 and 2). The structural diversity of the 11 non-type-1 loops indicates that TTA propeller loops possess a significant degree of structural flexibility, however, this does not necessarily signify that the crystallographically observed loop types are unstable. Indeed, the majority of these non-type-1 loop arrangements share the common feature of exploiting intra-loop base π-stacking interactions (Figure 2a and b(ii) and Table 2)—a feature which would be expected to confer a degree of stability upon these loop structures. Comparison of backbone and glycosidic dihedral angles of the 11 non-type-1 loops reveals a broad distribution in backbone geometry for the different loop arrangements (Figure 3f), which is to be expected, considering the dramatic structural variety of non-type-1 loops (Figure 2). However, there do appear to be several favourable groupings of torsion angles—such as a preference for anti glycosidic bond angles, particularly for the second loop thymine—in addition to some correlation with the type-1 backbone dihedral angle distribution. As with type-1 loops, backbone dihedral angle comparison of equivalent loop types between different structures reveals a high level of similarity, as seen when comparing the type-9 loops observed in the two bimolecular salphen-complexed quadruplexes 3QSF and 3QSC (Figure 3g). Although type-1 loops are indeed observed in a number of HTQ-ligand complexes (as stated above), all non-type-1 loops occur in ligand-bound HTQ structures, thus, non-type-1 loops are almost certainly a consequence of ligand-binding events. If one considers the type-1 loop to represent the ‘native’ TTA loop conformation, the observation of a further 11 distinct loop conformations—many of which are dramatically different to the type-1 arrangement (see for example Figure 2b(ii))—indicates clearly that TTA propeller loops of human telomeric G-quadruplexes possess a significant degree of structural polymorphism. This observation would be expected to have important consequences with respect to the rational design of small molecules targeted towards HTQs, however, the almost paradoxical observation of both type-1 and non-type-1 loops within a single ligand-bound HTQ structure (e.g. PDB IDs 3CDM, 3CE5, 3SC8, 3UYH, 4DA3, 4DAQ) precludes a clear conclusion: small molecule compounds are able to elicit both dramatic and negligible effects on TTA loop geometry.

SUGAR PUCKER ANALYSIS AND COMPARISON OF HTQ TTA LOOPS

Sugar pucker distribution of type-1 propeller loops

Comparison of the deoxyfuranose and furanose sugar puckers in the nucleotides of all type-1 propeller loops reveals an exceptionally high degree of similarity in pucker distribution. Of the 26 type-1 loops observed in crystal structures, 17 (65.4%) adopt a sugar pucker distribution of C1’-exo, C3’-endo, C2’-endo respectively for the three T-T-A nucleotides in a loop (Table 3). Interestingly, this consensus pattern of pucker arrangement—a similar pucker range to those found in DNA duplex crystal structures—is also observed for RNA G-quadruplex type-1 loops. Analysis of the frequency of pucker-types for each loop residue (i.e. T, T, A/U, U, A) reveals a high level of correspondence, with the first thymine sugar adopting a C1’-exo pucker in 69.2% of observed instances, the second thymine adopting a C3’-endo pucker conformation in 100% of the observed instances and the adenine sugar adopting a C2’-endo pucker in 88.5% of all type-1 loops (Table 4). The high level of correlation in the distribution of sugar pucker geometry and backbone dihedral angles of type-1 loops strongly implies that the folding of TTA triplets into the type-1 arrangement is a highly precise, non-random process.
Table 3.

Pucker distributions for TTA/UUAs of all type-1 loops

1KF1-a1KF1-b1KF1-c1K8P3IBK3CDM-a3CDM-b3CDM-c3CDM-d3CE53SC83UYH3T5E-a
TC1’-exoC1’-exoC1’-exoC1’-exoC1’-exoC1’-exoC4’-exoC1’-exoC1’-exoC1’-exoC1’-exoC1’-exoC2’-endo
TC3’-endoC3’-endoC3’-endoC3’-endoC3’-endoC3’-endoC3’-endoC3’-endoC3’-endoC3’-endoC3’-endoC3’-endoC3’-endo
AC2’-endoC2’-endoC2’-endoC2’-endoC2’-endoC2’-endoC2’-endoC2’-endoC1’-exoC2’-endoC2’-endoC2’-endoC2’-endo
3T5E-b3T5E-c4DA34DAQ3R6R-a3R6R-b3R6R-c4FXM-a4FXM-b4FXM-c4G0F-a4G0F-b4G0F-c
TC1’-exoC2’-endoC1’-exoC1’-exoC1’-exoC1’-exoC4’-exoC2’-endoC1’-exoC1’-exoC2’-endoC2’-endoC2’-endo
TC3’-endoC3’-endoC3’-endoC3’-endoC3’-endoC3’-endoC3’-endoC3’-endoC3’-endoC3’-endoC3’-endoC3’-endoC3’-endo
AC2’-endoC2’-endoC2’-endoC2’-endoC2’-endoC2’-endoC2’-endoC2’-endoC2’-endoC2’-endoC1’-exoC1’-exoC2’-endo

Yellow indicates a sugar which deviates from the consensus pucker conformation. Lower-case letters denote crystallographically unique type-1 loops within a single crystal structure.

Table 4.

Percentage of each residue in type-1 loops which adopt sugar pucker consensus ‘C1’-exo, C3’-endo, C2’-endo’

Pucker type% Observed
T18/26 C1’-exo69.2%
T26/26 C3’-endo100.0%
A23/26 C2’-endo88.5%

C3’-endo pucker for the second loop thymine is 100% conserved.

Yellow indicates a sugar which deviates from the consensus pucker conformation. Lower-case letters denote crystallographically unique type-1 loops within a single crystal structure. C3’-endo pucker for the second loop thymine is 100% conserved.

Sugar pucker distribution of non-type-1 propeller loops

As with backbone dihedral angle distribution, the sugar puckering of non-type-1 loops is also highly variable when comparing all 11 non-type-1 loops (Table 5). Although there is no common consensus pucker arrangement for the non-type-1 loops, there is a clear shift from the ‘C1’-exo, C3’-endo, C2’-endo’ consensus of type-1 loops to a C2’-endo conformation for all loop residues. For example, the first thymine of the TTA loop adopts a C2’-endo pucker in 52.9% of all non-type-1 propeller loops, compared to 23.1% in all type-1 loops (6/26 occurrences) (Table 6). This presumably reflects the energetic preference for C2’-endo puckered sugars in DNA, and suggests the less favourable C1’-exo and C3’-endo of the first two residues of the type-1 loops may be compensated for by the favourable energetics of base-base π-stacking interactions.
Table 5.

Sugar pucker distributions for each non-type-1 loop

2RHI 2*2RHI 33CCO 43CDM 53CDM 63CE5 73MIJ 83QSC 93QSF 93SC8 103SC8 113UYH 103UYH 114DA3 104DA3 124DAQ 104DAQ 12
TC3’-endoC2’-endoC2’-endoC1’-endoC2’-endoC2’-endoC3’-exoC2’-endoC2’-endoC2’-endoC3’-endoC2’-endoO4’-endoC2’-endoC3’-exoC1’-exoC4’- exo
TC1’-exoC1’-exoC2’-endoC2’-endoC3’-endoC2’-endoC3’-endoC2’-endoC2’-endoC3’-exoC2’-exoC2’-endoC3’-endoC2’-endoC2’-endoC2’-endoC2’- endo
AC2’-endoO4’-endoC4’-exoC2’-endoC2’-endoC2’-endoC2’-endoC2’-endoC2’-endoC1’-exoC4’-exoC1’-exoC2’-endoC1’-exoC1’-exoC1’-exoC1’- exo

*Numbers below PDB codes denote loop type.

Table 6.

Occurrence of each pucker conformation in non-type-1 loops

C1’-endoC2’-endoC3’-endoC1’-exoC2’-exoC3’-exoC4’-exoO4’-endo% C2’-endo for non-type-1 loops% C2’-endo for type-1 loops
T1921021152.9% (9/17)23.1% (6/26)
T01032110058.8% (10/17)0.0% (0/26)
A0806002147.1% (8/17)88.5% (23/26)

There is a pronounced shift towards C2’-endo puckering for all residues of the TTA loops for non-type-1 loops when compared to type-1 loops (highlighted in bold).

*Numbers below PDB codes denote loop type. There is a pronounced shift towards C2’-endo puckering for all residues of the TTA loops for non-type-1 loops when compared to type-1 loops (highlighted in bold).

ANALYSIS AND CONCLUSIONS

The solution NMR evidence for type-1 loops, which together with the diversity of crystal packing modes in the crystal structures, suggests that crystal packing forces are not contributing to the loop types. The diversity of space groups and resulting patterns of molecular packing in these crystals suggests that the intermolecular packing forces are weak and therefore different packing arrangements can readily occur. This is consistent with the assumption that what is observed in the crystalline state are low-energy conformations of these complexes. We suggest that here, the preferred conformations of the relatively flexible loops are always among those most observed in these crystal structures. Although the type-1 loop is present in five distinct space groups, many of the non-type-1 loops are observed less frequently, and thus there is not currently sufficient data to make any definitive statements about the relationship between space group and loop type. It should be noted, however, that the relationship between space group and (loop) conformation is not necessarily a one way process—a particular space group may affect the loop geometry, but equally, a loop type can influence the space group into which the molecule/DNA/RNA packs, i.e. loop-type may define the crystal packing, rather than the packing defining the loop-type. Although there is no direct evidence that non-type-1 loops are produced by ligand binding, it is clear that ligand binding is involved in some way in the formation of non-type-1 loop arrangements. Since type-1 loops are also observed in the native DNA and RNA structures, we suggest that some ligands simply do not perturb this loop geometry, whereas others do. The overall question of the relationship of these crystal structures to the structures and complexes present in solution can only be addressed once much more extensive data are obtained. However, there is certainly sufficient data for some tentative conclusions to be drawn. One important issue concerns the loops defined here—in particular, whether or not these structures are present in solution. Importantly, there is indeed evidence for the existence of type-1 loops in solution. The NMR structure of d[TAGGG(TTAGGG)]3 in crowded conditions (PDB ID 2LD8) clearly shows stacked type-1 loops for all three TTA loops of all 10 structures present in the deposited ensemble (51). Thus, the type-1 loop—which is observed frequently crystallographically—appears to be highly stable and is very much present in solution. A structural alignment of the crystal and NMR structures, 1KF1 with 2LD8, (excluding the 5′ flanking residues) gives an average RMSD of 1.981 Å (see Supplementary Table S2)—this is an excellent correlation between solution and solid state. The overall visual alignment of these two structures (1KF1 and 2LD8) may be observed by scrolling through the NMR ensemble and crystal structures and is very close. Figure 2 and Table 2 also show it is common for the loops to have at least one unstacked base, which is highly likely to be adopting multiple conformations in solution. The patterns of base-base stacking seen in almost all the loop types, on the other hand, are more likely to have significant populations in solution. An individual crystal structure provides quasi-static information about molecular structures—information embedded in temperature (B) factors in part reflects crystal lattice dynamics. Data on an ensemble of quadruplex crystal structures in which the principal variables are the bound small molecules, can, on the other hand, provide a detailed view of conformational trends in variable regions, i.e. the loops. The over-riding conclusion from this survey of available crystal structures is that these HTQ-ligand complexes have a significant degree of loop conformational variability (albeit with certain trends of conformational preference), which adds another layer of complexity to HTQ structures, and which needs to be taken into account when considering them as drug targets for in silico screening and molecular design studies. This feature of loops is likely to be common to all quadruplexes with loops, whether they contain single-nucleotide or much longer loops. One key question posed by this survey is whether a particular loop conformational class is related to the nature of the bound ligand. The type-1 loop occurs at least once in each of all six naphthalene diimide complexes, in the berberine complex and in both mesoporphyrin complexes, whereas it does not occur in the salphen, or in the other porphyrin (TMPyP4) complex. Figure 3h, i shows that for the naphthalene diimide complexes, there is close correspondence between the detailed conformations of type-1 loops at the individual torsion angle level, but this is not the case for the other type-1 loops, where there is definite (albeit sometimes small) conformational variation between them. The type-1 loop also occurs in the structure of the DNA quadruplex acridine derivative (BRACO-19) complex, but not in the RNA one—one has to bear in mind that the acridines in these two structures are structurally very distinct from one another. The native RNA quadruplex shares with the two native DNA quadruplexes the preference for type-1 loops. The loop difference between the native and acridine-bound RNA quadruplexes is due mainly to the presence of the acridine, which exploits (and presumably stabilises) an extended binding surface created by the addition of loop adenine residues to the 5′ G-quartet—a (re)arrangement very much dependent on hydrogen bonds involving the C2’ hydroxyl groups of the ribose sugars. Overall the type-1 loop is the dominant one in the majority of these crystal structures. Overall we conclude that loop types and conformations are most conserved within a closely-related group of small molecule complexes, as in the series of ND-quadruplex structures. In these, the differences between the structures are in the nature of the ND end-groups—all the ND compounds are tetra-substituted but with different cationic termini. There is some degree of structure conservation of type-1 loops beyond the ND complexes, especially when the ligands are not over-sized, as is the case for the acridine BRACO-19 and the berberine derivative. The implications for rational drug design of quadruplex-binding ligands are clear: in silico modelling based on existing crystal structures is most reliable when modelling closely similar analogues, but even then the variability in some loops is unpredictable and can best be determined by further experimental structures rather than for example, molecular dynamics with restricted simulation times. Taking one of the existing crystal structures (Table 1) as a starting-point will therefore be useful in providing qualitative guidance on optimising existing or related ligand hits, but extension for screening of structurally diverse libraries and scaffolds needs to be approached with caution.

SUPPLEMENTARY DATA

Supplementary Data are available at NAR Online.
  93 in total

1.  The G-quadruplex ligand telomestatin inhibits POT1 binding to telomeric sequences in vitro and induces GFP-POT1 dissociation from telomeres in human cells.

Authors:  Dennis Gomez; Marie-Françoise O'Donohue; Thomas Wenner; Céline Douarre; Jérome Macadré; Pascale Koebel; Marie-Josèphe Giraud-Panis; Hervé Kaplan; Alain Kolkes; Kazuo Shin-ya; Jean-François Riou
Journal:  Cancer Res       Date:  2006-07-15       Impact factor: 12.701

2.  Structure of the human telomere in K+ solution: an intramolecular (3 + 1) G-quadruplex scaffold.

Authors:  Kim Ngoc Luu; Anh Tuân Phan; Vitaly Kuryavyi; Laurent Lacroix; Dinshaw J Patel
Journal:  J Am Chem Soc       Date:  2006-08-02       Impact factor: 15.419

3.  Structural basis for binding of porphyrin to human telomeres.

Authors:  Gary N Parkinson; Ragini Ghosh; Stephen Neidle
Journal:  Biochemistry       Date:  2007-02-03       Impact factor: 3.162

4.  Small-molecule interaction with a five-guanine-tract G-quadruplex structure from the human MYC promoter.

Authors:  Anh Tuân Phan; Vitaly Kuryavyi; Hai Yan Gaw; Dinshaw J Patel
Journal:  Nat Chem Biol       Date:  2005-07-17       Impact factor: 15.040

5.  Telomerase inhibition with a novel G-quadruplex-interactive agent, telomestatin: in vitro and in vivo studies in acute leukemia.

Authors:  T Tauchi; K Shin-ya; G Sashida; M Sumi; S Okabe; J H Ohyashiki; K Ohyashiki
Journal:  Oncogene       Date:  2006-05-01       Impact factor: 9.867

6.  Human telomeric sequence forms a hybrid-type intramolecular G-quadruplex structure with mixed parallel/antiparallel strands in potassium solution.

Authors:  Attila Ambrus; Ding Chen; Jixun Dai; Tiffanie Bialis; Roger A Jones; Danzhou Yang
Journal:  Nucleic Acids Res       Date:  2006-05-19       Impact factor: 16.971

7.  G-quadruplexes in promoters throughout the human genome.

Authors:  Julian L Huppert; Shankar Balasubramanian
Journal:  Nucleic Acids Res       Date:  2006-12-14       Impact factor: 16.971

Review 8.  Quadruplex DNA: sequence, topology and structure.

Authors:  Sarah Burge; Gary N Parkinson; Pascale Hazel; Alan K Todd; Stephen Neidle
Journal:  Nucleic Acids Res       Date:  2006-09-29       Impact factor: 16.971

9.  Different loop arrangements of intramolecular human telomeric (3+1) G-quadruplexes in K+ solution.

Authors:  Anh Tuân Phan; Kim Ngoc Luu; Dinshaw J Patel
Journal:  Nucleic Acids Res       Date:  2006-10-12       Impact factor: 16.971

10.  NMR solution structure of the major G-quadruplex structure formed in the human BCL2 promoter region.

Authors:  Jixun Dai; Ding Chen; Roger A Jones; Laurence H Hurley; Danzhou Yang
Journal:  Nucleic Acids Res       Date:  2006-09-22       Impact factor: 16.971

View more
  11 in total

1.  The Hairpin Form of r(G4C2)exp in c9ALS/FTD Is Repeat-Associated Non-ATG Translated and a Target for Bioactive Small Molecules.

Authors:  Zi-Fu Wang; Andrei Ursu; Jessica L Childs-Disney; Rea Guertler; Wang-Yong Yang; Viachaslau Bernat; Suzanne G Rzuczek; Rita Fuerst; Yong-Jie Zhang; Tania F Gendron; Ilyas Yildirim; Brendan G Dwyer; Joseph E Rice; Leonard Petrucelli; Matthew D Disney
Journal:  Cell Chem Biol       Date:  2018-11-29       Impact factor: 8.116

2.  Pyridine Derivative of the Natural Alkaloid Berberine as Human Telomeric G4-DNA Binder: A Solution and Solid-State Study.

Authors:  Francesco Papi; Carla Bazzicalupi; Marta Ferraroni; Giulia Ciolli; Paolo Lombardi; Asma Yasmeen Khan; Gopinatha Suresh Kumar; Paola Gratteri
Journal:  ACS Med Chem Lett       Date:  2020-02-07       Impact factor: 4.345

3.  Loop permutation affects the topology and stability of G-quadruplexes.

Authors:  Mingpan Cheng; Yu Cheng; Jingya Hao; Guoqing Jia; Jun Zhou; Jean-Louis Mergny; Can Li
Journal:  Nucleic Acids Res       Date:  2018-10-12       Impact factor: 16.971

4.  Exploring the Dynamics of Propeller Loops in Human Telomeric DNA Quadruplexes Using Atomistic Simulations.

Authors:  Barira Islam; Petr Stadlbauer; Alejandro Gil-Ley; Guillermo Pérez-Hernández; Shozeb Haider; Stephen Neidle; Giovanni Bussi; Pavel Banas; Michal Otyepka; Jiri Sponer
Journal:  J Chem Theory Comput       Date:  2017-05-18       Impact factor: 6.006

5.  Fluorescence-based tools to probe G-quadruplexes in cell-free and cellular environments.

Authors:  Sudeshna Manna; Seergazhi G Srivatsan
Journal:  RSC Adv       Date:  2018-07-17       Impact factor: 4.036

6.  Water spines and networks in G-quadruplex structures.

Authors:  Kevin Li; Liliya Yatsunyk; Stephen Neidle
Journal:  Nucleic Acids Res       Date:  2021-01-11       Impact factor: 16.971

7.  Probing G-quadruplex topologies and recognition concurrently in real time and 3D using a dual-app nucleoside probe.

Authors:  Ashok Nuthanakanti; Ishtiyaq Ahmed; Saddam Y Khatik; Kayarat Saikrishnan; Seergazhi G Srivatsan
Journal:  Nucleic Acids Res       Date:  2019-07-09       Impact factor: 16.971

8.  Probing Human Telomeric DNA and RNA Topology and Ligand Binding in a Cellular Model by Using Responsive Fluorescent Nucleoside Probes.

Authors:  Sudeshna Manna; Cornelia H Panse; Vyankat A Sontakke; Sarangamath Sangamesh; Seergazhi G Srivatsan
Journal:  Chembiochem       Date:  2017-07-10       Impact factor: 3.164

9.  Structural dynamics of propeller loop: towards folding of RNA G-quadruplex.

Authors:  Marek Havrila; Petr Stadlbauer; Petra Kührová; Pavel Banáš; Jean-Louis Mergny; Michal Otyepka; Jirí Šponer
Journal:  Nucleic Acids Res       Date:  2018-09-28       Impact factor: 16.971

10.  Structured Waters Mediate Small Molecule Binding to G-Quadruplex Nucleic Acids.

Authors:  Stephen Neidle
Journal:  Pharmaceuticals (Basel)       Date:  2021-12-22
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.