Literature DB >> 31620420

Novel Tautomerisation Mechanisms of the Biologically Important Conformers of the Reverse Löwdin, Hoogsteen, and Reverse Hoogsteen G*·C* DNA Base Pairs via Proton Transfer: A Quantum-Mechanical Survey.

Ol'ha O Brovarets'1,2, Timothy A Oliynyk1, Dmytro M Hovorun1,3,4.   

Abstract

For the first time, in this study with the use of QM/QTAIM methods we have exhaustively investigated the tautomerization of the biologically-important conformers of the G*·C* DNA base pair-reverse Löwdin G*·C*(rWC), Hoogsteen G*'·C*(H), and reverse Hoogsteen G*'·C*(rH) DNA base pairs-via the single (SPT) or double (DPT) proton transfer along the neighboring intermolecular H-bonds. These tautomeric reactions finally lead to the formation of the novel G· C O 2 * (rWC), G N 2 * · C(rWC), G*'N2·C(rWC), G N 7 * · C(H), and G*'N7·C(rH) DNA base mispairs. Gibbs free energies of activation for these reactions are within the range 3.64-31.65 kcal·mol-1 in vacuum under normal conditions. All TSs are planar structures (Cs symmetry) with a single exception-the essentially non-planar transition state TSG*·C*(rWC)↔G+·C-(rWC) (C1 symmetry). Analysis of the kinetic parameters of the considered tautomerization reactions indicates that in reality only the reverse Hoogsteen G*'·C*(rH) base pair undergoes tautomerization. However, the population of its tautomerised state G*'N7·C(rH) amounts to an insignificant value-2.3·10-17. So, the G*·C*(rWC), G*'·C*(H), and G*'·C*(rH) base pairs possess a permanent tautomeric status, which does not depend on proton mobility along the neighboring H-bonds. The investigated tautomerization processes were analyzed in details by applying the author's unique methodology-sweeps of the main physical and chemical parameters along the intrinsic reaction coordinate (IRC). In general, the obtained data demonstrate the tautomeric mobility and diversity of the G*·C* DNA base pair.
Copyright © 2019 Brovarets', Oliynyk and Hovorun.

Entities:  

Keywords:  Hoogsteen; conformer; double proton transfer; quantum-mechanical calculation; reverse Hoogsteen; reverse Löwdin base pair; single proton transfer; transition state

Year:  2019        PMID: 31620420      PMCID: PMC6759773          DOI: 10.3389/fchem.2019.00597

Source DB:  PubMed          Journal:  Front Chem        ISSN: 2296-2646            Impact factor:   5.221


Introduction

The study of the tautomerization mechanisms of the hydrogen (H) bonded nucleotide base pair is an important topic of modern quantum biophysics, biochemistry, molecular, and structural biology (Sinden, 1994; Sponer and Lankas, 2006; Alkorta et al., 2018). For over 65 years, this area of research has been under the intense scrutiny of both theoretics and experimentators, since the establishment of the spatial organization of DNA and formulation of the so-called “tautomeric hypothesis of the origin of spontaneous point mutations (transitions and transversions)” (Watson and Crick, 1953a,b; Erdmann et al., 2014) for this biologically important macromolecule—carrier of the genetic information, which is transmitted from generation to generation. Lately, this tautomeric hypothesis has been experiencing an era of renaissance (Brovarets' and Hovorun, 2018). Thus, for the first time, within the framework of this hypothesis the new structural mechanisms of the tautomerization of pairs of nucleotide bases have been discovered, in which the transition of bases within the base pair into the mutagenic tautomeric form is accompanied by a significant change in the geometry of the base pair itself (Brovarets' and Hovorun, 2009, 2015a,b,c,d,e, 2016, 2018). However, at the studying of the nature of the mutagenic tautomerization of DNA bases, the researchers limited themselves to the A·T and G·C Watson-Crick base pairs (Löwdin, 1963, 1966; Florian et al., 1994; Gorb et al., 2004; Bertran et al., 2006; Brovarets' and Hovorun, 2014a,b). Now this problem is considered more complex with the involvement of several biologically important conformers of these pairs (Hoogsteen, 1963; Pous et al., 2008; Alvey et al., 2014; Brovarets' and Hovorun, 2014a,b; Nikolova et al., 2014; Acosta-Reyes et al., 2015; Poltev et al., 2016; Zhou, 2016; Szabat and Kierzek, 2017; Ye et al., 2017). These observations do not only allow to penetrate deeper into the essence of the phenomenon being studied, but also to answer, in particular, on a biologically important question—“Why Nature has exactly chosen the Watson-Crick DNA base pairs as elementary structural units for the construction of genetic material?” Nowadays, there is an explicit bias to the A·T DNA base pair at the investigations of this type. This is due to a large number of circumstances, which will be outlined and discussed below. Thus, it is widely known that the classical A·T(WC) Watson-Crick DNA base pair (Brovarets' and Hovorun, 2014b) may acquire different biologically significant conformations with various organization of the three intermolecular H-bonds—reverse Watson-Crick A·T(rWC), Hoogsteen A·T(H), and reverse Hoogsteen A·T(rH), which have been comprehensively studied in the literature (Hoogsteen, 1963; Pous et al., 2008; Alvey et al., 2014; Brovarets' and Hovorun, 2014b; Nikolova et al., 2014; Acosta-Reyes et al., 2015; Poltev et al., 2016; Zhou, 2016; Szabat and Kierzek, 2017; Ye et al., 2017). Thus, in particular, in previous works (Hoogsteen, 1963; Pous et al., 2008; Brovarets' and Hovorun, 2010, 2014b, 2015f; Brovarets', 2013a,b; Alvey et al., 2014; Nikolova et al., 2014; Acosta-Reyes et al., 2015; Poltev et al., 2016; Zhou, 2016; Szabat and Kierzek, 2017; Ye et al., 2017; Brovarets' et al., 2018a,b,c,d,e,f; Brovarets' and Tsiupa, 2019) by the methods of quantum chemistry we have investigated in details the potential (electronic) energy surface of each of the four biologically important A·T DNA base pairs—Watson-Crick A·T(WC), reverse Watson-Crick A·T(rWC), Hoogsteen A·T(H) and reverse Hoogsteen A·T(rH), leading to the novel conformational or tautomeric states of these base pairs. It was theoretically demonstrated that these A·T(WC/rWC/H/rH) base pairs possess unique ability to perform conformationally-tautomeric transition into the planar wobble A*·T(w), A·(w), and A·T*(w) base mispairs, non-planar wobble A·T(wWC), A·T(wrWC), A·T(wH), and A·T(wrH) base mispairs and incorrect A·T*(), A·(), A·T*(), and A·() base mispairs containing mutagenic tautomers of the DNA bases, and also their interconversions between each other through the non-planar transition states via the structural or conformational rearrangements and intramolecular proton transfer along the intermolecular H-bonds. In contrast to this, the classical G·C Watson-Crick DNA base pair (Brovarets', 2013b; Brovarets' and Hovorun, 2014a, 2015f) could not acquire different conformations in the main tautomeric state due to the obstacles presented by its geometrical structure. This, however, can be overcome through the G·C(WC) → G*·C*(WC) tautomerisation via the double proton transfer (DPT), according to Löwdin's mechanism (Löwdin, 1963, 1966; Brovarets' and Hovorun, 2014a, 2018). This Löwdin's reaction can proceed over the barrier of tautomerization or under the barrier via the tunneling (Parker and Van Everv, 1971; Boutis, 1992; Al-Khalili and McFadden, 2014; Brovarets' and Hovorun, 2015g; Godbeer et al., 2015; Turaeva and Brown-Kennerly, 2015; Meisner and Kastner, 2016; Roßbach and Ochsenfeld, 2017; McFadden and Al-Khalili, 2018; Pusuluk et al., 2018; Smedarchina et al., 2018; Shekaari and Jafari, 2019; Srivastava, 2019). At this, the so-called Löwdin G*·C*(WC) base pair with geometry close to Watson-Crick, which is created in such a way, involving mutagenic tautomers of the DNA bases, can acquire similarly to the A·T(WC) base pair different conformations (Brovarets' and Hovorun, 2010, 2015f; Brovarets', 2013a,b; Brovarets' et al., 2018a,b,c,d,e,f; Brovarets' and Tsiupa, 2019)—reverse Löwdin G*·C*(rWC), Hoogsteen G*′·C*(H) and reverse Hoogsteen G*′·C*(rH) (Brovarets', 2013b) (here and below the superscript “*” denotes the rare tautomeric form of the DNA base (Glushenkov and Hovorun, 2016) and “′”—trans-orientation of the OH group). This demonstrates quite unexpected role of the Löwdin's tautomerisation for the conformational variety. Currently, there is no mention of reverse Löwdin G*·C*(rWC), Hoogsteen G*′·C*(H) and reverse Hoogsteen G*′·C*(rH) conformers or their tautomerisation via the DPT along the intermolecular H-bonds, despite a great number of investigations devoted to this important phenomenon. In our previous studies we have comperehensively investigated the tautomerisation via the DPT of the canonical A·T(WC) (Brovarets' and Hovorun, 2014b, 2015g) and G·C(WC) (Brovarets' and Hovorun, 2014a) Watson-Crick DNA base pairs, and also of the incorrect DNA base pairs—wobble G·T base pair (Brovarets' et al., 2015), short C·T (Brovarets' and Hovorun, 2013a), C*·C (Brovarets' and Hovorun, 2013b), T*·T (Brovarets' et al., 2014a), H·C (Brovarets' and Hovorun, 2013c; Brovarets' et al., 2013a) and H*·T (Brovarets' and Hovorun, 2013c; Brovarets' et al., 2013a); long A·A* (Brovarets' and Hovorun, 2013d), A·G (Brovarets' et al., 2014b), G·G* (Brovarets' and Hovorun, 2014c), H·H (Brovarets' and Hovorun, 2013c; Brovarets' et al., 2013a), H*·H (Brovarets' and Hovorun, 2013c; Brovarets' et al., 2013b) and H·A (Brovarets' and Hovorun, 2013c; Brovarets' et al., 2014c); Watson-Crick-like A·C* (Brovarets' and Hovorun, 2015h), G*·T (Brovarets' and Hovorun, 2015i), G·Asyn (Brovarets' and Hovorun, 2014d), A*· (Brovarets' and Hovorun, 2014d), A*·Asyn (Brovarets' et al., 2014d), G· (Brovarets' and Hovorun, 2014e), T·2AP*(w) (Brovarets' et al., 2017; Brovarets' and Hovorun, 2019a), and G·2AP*(w) (Brovarets' et al., 2017; Brovarets' and Hovorun, 2019a) base mispairs and protein-DNA complexes (Brovarets' et al., 2012), which we have summarized in our review (Brovarets' and Hovorun, 2019b), devoted to the microstructural mechanisms of the tautomerization by the proton transfer along the neighboring intermolecular H-bonds in 22 biologically important pairs of nucleotide bases in the framework of the author's method, which enable to trace the evolution of the physico-chemical parameters along the intrinsic reaction coordinate (IRC). In this study, we aim to reapply the approach, which we launched in our previous works (Brovarets' et al., 2018a,b,c,d,e,f; Brovarets' and Tsiupa, 2019) in order to investigate in details the tautomerisation of the reverse Löwdin G*·C*(rWC), Hoogsteen G*′·C*(H), and reverse Hoogsteen G*′·C*(rH) base pairs via PT along the neighboring intermolecular H-bonds. As a result of the previous investigations, it was established that proton mobility along the intermolecular H-bonds does not change the tautomeric status of the investigated base pairs. Along with this biologically important conclusion, for the first time we have obtained a number of important physical and chemical characteristics. As such, it was documented that tautomerisation of the reverse Löwdin G*·C*(rWC) DNA base pair along the middle H-bond induces analogous SPT along its upper and lower H-bonds. Moreover, for the first time we have described the formation of a dynamically stable base pair with participation of the yilidic form of the purine base, formed through asynchronous DPT and participation of the CH group as proton donor. Digging deeper into the mechanisms of tautomerisation of the reverse Löwdin G*·C*(rWC), Hoogsteen G*′·C*(H), and reverse Hoogsteen G*′·C*(rH) base pairs, we have carefully obtained sweeps of the physical and chemical parameters that characterize proton mobility along the IRC. Firstly, we have established a monotonic dependence of the base pair's dipole moment along the IRC. Second, it was shown that the SPT processes are characterized by the presence of 6 key points.

Computational Methods

Density Functional Theory Geometry and Vibrational Frequencies Calculations

All calculations of the geometries and harmonic vibrational frequencies of the considered base mispairs and transition states of their conversion have been conducted using Gaussian'09 package (Frisch et al., 2010) at the DFT B3LYP/6-311++G(d,p) level of theory (Lee et al., 1988; Parr and Yang, 1989; Tirado-Rives and Jorgensen, 2008), that has been already applied for analogous systems and approved to give accurate geometrical structures, normal mode frequencies, barrier heights, and characteristics of intermolecular H-bonds (Matta, 2010; Arabi and Matta, 2018; Gatti et al., 2018). We have used a scaling factor equal to 0.9668 in order to correct harmonic frequencies for the investigated base pairs (Brovarets' and Hovorun, 2010, 2015f; Brovarets', 2013a,b; Palafox, 2014; El-Sayed et al., 2015; Brovarets' et al., 2018a,b,c,d,e,f; Brovarets' and Tsiupa, 2019). We have associated structures, which were localized on the potential energy landscape by means of Synchronous Transit-guided Quasi-Newton method (Peng et al., 1996), to the minima or transition state (TS) by the absence or presence of the imaginary frequency in the vibrational spectra of the complexes, respectively. We used standard TS theory (Atkins, 1998) in order to estimate the forward and reverse barriers of the investigated tautomerisation reaction.

IRC Calculations

Reaction pathways have been monitored by following IRC in the forward and reverse directions from each TS using Hessian-based predictor-corrector algorithm for integration (Hratchian and Schlegel, 2005). In such a way we ensure that it was received proper reaction pathway from reactants to products. Further, we have obtained the sweeps of the energetic, polar, and geometric characteristics of the H-bonds and base pairs along the IRC by calculating them at each point of the IRC (Brovarets' et al., 2017, 2018g,h).

Single Point Energy Calculations

In order to take into account electronic correlation effects, we followed geometry optimizations with single point energy calculations using MP2 level of theory (Frisch et al., 1990) and 6-311++G(2df,pd) Pople's basis set of valence triple-ζ quality (Hariharan and Pople, 1973; Krishnan et al., 1980) and aug-cc-pVDZ Dunning's cc-type basis set (Kendall et al., 1992), augmented with polarization and/or diffuse function. The Gibbs free energy G for all structures was calculated by the formula: where Eel–electronic energy, while Ecorr–thermal correction.

Interaction Energies Calculations

We have obtained electronic interaction energies Eint at the MP2/6-311++G(2df,pd) level of theory as the difference between the total electronic energy of the base mispair and the electronic energies of the separate monomers. Gibbs free energy of interaction has been obtained using similar approach. At this, we also corrected the interaction energy for the basis set superposition error (BSSE) (Boys and Bernardi, 1970; Gutowski et al., 1986) according to the counterpoise procedure (Sordo et al., 1988; Sordo, 2001).

Estimation of Kinetic Parameters

The time τ99.9% spent for reaching the 99.9% of the equilibrium concentration between the initial and terminal base pairs in the system of reversible first-order forward (k) and reverse (k) reactions was estimated by formula (Atkins, 1998): The lifetime τ of the base pairs was calculated using the formula 1/k, where the values of the reverse k and forward k rate constants for the tautomerisation reactions were calculated as (Atkins, 1998): where quantum tunneling effects are accounted by Wigner's tunneling correction (Wigner, 1932; Brovarets' and Hovorun, 2014d), that has been successfully used for the DPT reactions (Brovarets' and Hovorun, 2013a,b,c,d, 2014c,d,e, 2015h,i; Brovarets' et al., 2013a,b, 2014a,b,c,d, 2015, 2017): where k—Boltzmann's constant, h—Planck's constant, ΔΔG—Gibbs free energy for the forward (f) and reverse (r) tautomerisation reactions, ν—value of the imaginary frequency at the TS of the tautomerisation reaction.

QTAIM Analysis

Bader's quantum theory of Atoms in Molecules (QTAIM) was used to analyse the electron density distribution (Bader, 1990). The topology of the electron density was analyzed using program package AIMAll (Keith, 2010) with all default options and wave functions obtained at the level of theory used for geometry optimisation. The presence of the (3,−1) bond critical point (BCP), bond path between hydrogen donor and acceptor and positive value of the Laplacian at this BCP (Δρ >0) were considered altogether as criteria for the H-bond formation (Matta and Hernández-Trujillo, 2003; Matta et al., 2006a,b; Matta, 2014; Lecomte et al., 2015; Brovarets' and Pérez-Sánchez, 2016, 2017; Brovarets' et al., 2016).

Energies of the Intermolecular H-Bonds

We calculated the energies of the intermolecular AH···B H-bonds in the base mispairs and TSs and of the sweeps of the H-bond energies by the empirical Espinosa-Molins-Lecomte (EML) formula (Espinosa et al., 1998; Matta et al., 2006b; Mata et al., 2011; Lecomte et al., 2015; Alkorta et al., 2016, 2017) based on the electron density distribution at the (3,−1) BCPs of the H-bonds: where V(r)—value of a local potential energy at the (3,−1) BCP. EML fomula has been also used for the estimation of the energy of the non-standard H-bonds CH···O in the stationary points of the base pairs on the hypersurface of their electronic energy. We evaluated the energies of the classical NH···N/O and OH···O/N intermolecular AH···B H-bonds by the empirical Iogansen's formula (Iogansen, 1999): where Δν–magnitude of the frequency shift of the stretching mode of the AH H-bonded group involved in the AH···B H-bond relatively the unbound group. We applied partial deuteration in order to minimize the effect of vibrational resonances (Brovarets' and Hovorun, 2014d,e, 2015h,i; Brovarets' et al., 2014d). The energies of the NH···N and OH···O H-bonds in the TSs containing loosened covalent bridges were calculated by the Nikolaienko-Bulavin-Hovorun formula (Nikolaienko et al., 2012): where ρ–the electron density at the (3,−1) BCP of the H-bond. The atom numbering scheme for the DNA bases is conventional (Saenger, 1984).

Results and Their Discussion

In this paper we have investigated in details the tautomerisation processes via the single (SPT) or double (DPT) proton transfer of the G*·C*(rWC), G*′·C*(H), and G*′·C*(rH) base pairs along the neighboring intermolecular H-bonds as their intrinsically inherent property (Figure 1).
Figure 1

Discovered new reaction pathways of the tautomerization in the reverse Watson-Crick G*·C*(rWC) – I. G*·C*(rWC)↔G+·C−(rWC)↔G·C*O2(rWC), II. G*·C*(rWC)↔G+·C−(rWC)↔G*N2·C(rWC), III. G*·C*(rWC)↔G*′N2·C(rWC), Hoogsteen G*′·C*(H) – IV. G*′·C*(H)↔G*N7·C(H)—and reverse Hoogsteen G*′·C*(rH) – V. G*′·C*(rH)↔G*′N7·C(rH) – DNA base pairs through the single or double proton transfer. Electronic ΔEint and Gibbs free ΔGint energies of the interaction [MP2/6-311++G(2df,pd)//B3LYP/6-311++G(d,p) level of theory, in kcal·mol−1], relative Gibbs free energies ΔG, and electronic energies ΔE [MP2/aug-cc-pVDZ//B3LYP/6-311++G(d,p) level of theory in the continuum with ε = 1 at T = 298.15 in kcal·mol−1] are presented below complexes in brackets. Dotted lines indicate AH···B H-bonds – their lengths H···B are presented in angstroms; carbon atoms are in light-blue, nitrogen—in dark-blue, hydrogen—in gray and oxygen – in red. ν—imaginary frequencies at the TSs of the tautomeric/conformational transitions.

Discovered new reaction pathways of the tautomerization in the reverse Watson-Crick G*·C*(rWC) – I. G*·C*(rWC)↔G+·C−(rWC)↔G·C*O2(rWC), II. G*·C*(rWC)↔G+·C−(rWC)↔G*N2·C(rWC), III. G*·C*(rWC)↔G*′N2·C(rWC), Hoogsteen G*′·C*(H) – IV. G*′·C*(H)↔G*N7·C(H)—and reverse Hoogsteen G*′·C*(rH) – V. G*′·C*(rH)↔G*′N7·C(rH) – DNA base pairs through the single or double proton transfer. Electronic ΔEint and Gibbs free ΔGint energies of the interaction [MP2/6-311++G(2df,pd)//B3LYP/6-311++G(d,p) level of theory, in kcal·mol−1], relative Gibbs free energies ΔG, and electronic energies ΔE [MP2/aug-cc-pVDZ//B3LYP/6-311++G(d,p) level of theory in the continuum with ε = 1 at T = 298.15 in kcal·mol−1] are presented below complexes in brackets. Dotted lines indicate AH···B H-bonds – their lengths H···B are presented in angstroms; carbon atoms are in light-blue, nitrogen—in dark-blue, hydrogen—in gray and oxygen – in red. ν—imaginary frequencies at the TSs of the tautomeric/conformational transitions. This paper is organized in the following way—firstly, we would discuss the tautomerisation process separately for each base pair and then we would present in details sweeps of the most important physico-chemical parameters along the IRC altogether for all investigated base pairs (Figures 1–12, Tables 1–8).
Figure 12

Profiles of the energy of the intermolecular H-bonds EHB estimated by the EML formula at the (3,−1) BCPs along the IRC of the investigated tautomerisations via the SPT or DPT obtained at the B3LYP/6-311++G(d,p) level of theory (see Table 3).

Table 1

Energetic (in kcal·mol−1) and kinetic (in s) characteristics of the tautomerisation of the G*·C*(rWC), G*′·C*(H), and G*′·C*(rH) DNA base pairs via the SPT or DPT obtained at the MP2/aug-cc-pVDZ//B3LYP/6-311++G(d,p) level of QM theory in the continuum with ε = 1 under normal conditions (see Figure 1).

Tautomeric transitionνiaΔGbΔEcΔΔGTSdΔΔETSeΔΔGfΔΔEgτ99.9%hτi
kcal·mol−1cm−1
G*·C*(rWC)↔G+·C(rWC)1132.14.445.684.386.72−0.061.04363.84.70·10−136.80·10−14
G+·C(rWC)↔G·C*O2(rWC)383.9−0.93−1.88−0.800.030.131.91668.12.10·10−131.77·10−13
G+·C(rWC)↔G*N2·C(rWC)1104.55.954.564.835.87−1.121.30454.77.96·10−141.15·10−14
G*·C*(rWC)↔G*′N2·C(rWC)1243.424.7424.5827.4329.292.694.711647.64.39·10−116.35·10−12
G*N2·C(rWC)↔G*′N2·C(rWC)663.514.3514.3421.2522.576.908.232878.99.24·10−81.34·10−8
G*′·C*(H)↔G*N7·C(H)675.73.203.074.016.650.813.581252.33.07·10−124.46·10−13
G*′·C*(rH)↔G*′N7·C(rH)1326.722.6922.3525.3028.762.616.412242.23.56·10−115.15·10−12

The imaginary frequency at the TS of the tautomeric transition, cm.

The Gibbs free energy of the initial relatively the terminal base pair of the tautomerisation reaction (T = 298.15 K).

The electronic energy of the initial relatively the terminal base pair of the tautomerisation reaction.

The Gibbs free energy barrier for the forward tautomerisation reaction.

The electronic energy barrier for the forward tautomerisation reaction.

The Gibbs free energy barrier for the reverse tautomerisation reaction.

The electronic energy barrier for the reverse tautomerisation reaction.

The time necessary to reach 99.9% of the equilibrium concentration between the reactant and the product of the tautomerisation reaction.

The lifetime of the product of the tautomerisation reaction.

Table 8

Patterns of the specific intermolecular interactions including AH···B H-bonds and loosened A-H-B covalent bridges that sequentially replace each other along the IRC of the investigated tautomerisations via the SPT or DPT obtained at the B3LYP/6-311++G(d,p) level of theory (see Figure 12).

PatternsIRC range, BohrSpecific intermolecular interactions, forming patterns
I. G*·C*(rWC)↔G+·C(rWC)↔G·C*O2(rWC)
I[−4.49 ÷−0.27)(G)O6H···O2(C), (G)N1···HN3(C), (G)N2H···N4(C)
II[−0.27 ÷ 0.12)(G)O6H···O2(C), (G)N1-H/H-N3(C), (G)N2H···N4(C)
III[0.12 ÷ 2.13)(G)O6H···O2(C), (G)N1H···N3(C), (G)N2H···N4(C)
IV[2.13 ÷ 2.51)(G)O6-H/H-O2(C), (G)N1H···N3(C), (G)N2H···N4(C)
V[2.51 ÷ 4.75](G)O6···HO2(C), (G)N1H···N3(C), (G)N2H···N4(C)
II. G*·C*(rWC)↔G+·C(rWC)↔G*N2·C(rWC)
I[−4.49÷−0.27)(G)O6H···O2(C), (G)N1···HN3(C), (G)N2H···N4(C)
II[−0.27 ÷ 0.12)(G)O6H···O2(C), (G)N1-H/H-N3(C), (G)N2H···N4(C)
III[0.12 ÷ 4.52)(G)O6H···O2(C), (G)N1H···N3(C), (G)N2H···N4(C)
IV[4.52 ÷ 4.91)(G)O6H···O2(C), (G)N1H···N3(C), (G)N2-H/H-N4(C)
V[4.91 ÷ 6.05](G)O6H···O2(C), (G)N1H···N3(C), (G)N2···HN4(C)
III. G*·C*(rWC)↔G*′N2·C(rwWC)
I[−10.37 ÷−4.79)(G)O6H···O2(C), (G)N1···HN3(C), (G)N2H···N4(C)
II[−4.79 ÷−3.23)(G)O6H···O2(C), (G)N1···HN3(C), (G)N2···N4(C)
III[−3.23 ÷−2.71)(G)O6H···O2(C), (G)N1-H/H-N3(C), (G)N2H···N4(C)
IV[−2.71 ÷−0.16)(G)O6H···O2(C), (G)N1H···N3(C), (G)N2H···N4(C)
V[−0.16 ÷ 0.21)(G)O6H···O2(C), (G)N1H···N3(C), (G)N2-H/H-N4(C)
VI[0.21 ÷ 4.12](G)O6H···O2(C), (G)N1H···N3(C), (G)N2···HN4(C)
IV. G*′·C*(H)↔G*N7·C(H)
I[−6.04 ÷−1.17)(G)O6H···N4(C), (G)N7···HN3(C), (G)C8H···O2(C)
II[−1.17 ÷−0.74)(G)O6-H/H-N4(C), (G)N7···HN3(C), (G)C8H···O2(C)
III[−0.74 ÷−0.10)(G)O6···HN4(C), (G)N7···HN3(C), (G)C8H···O2(C)
IV[−0.10 ÷ 0.37)(G)O6···HN4(C), (G)N7-H/H-N3(C), (G)C8H···O2(C)
V[0.37 ÷ 3.26](G)O6···HN4(C), (G)N7H···N3(C), (G)C8H···O2(C)
V. G*′·C*(rH)↔G*′N7·C(rH)
I[−13.94 ÷−1.33)(G)O6H···O2(C), (G)N7···HN3(C), (G)C8H···N4(C)
II[−1.33 ÷−0.71)(G)O6H···O2(C), (G)N7-H/H-N3(C), (G)C8H···N4(C)
III[−0.71 ÷−0.20)(G)O6H···O2(C), (G)N7H···N3(C), (G)C8H···N4(C)
IV[−0.20 ÷ 0.31)(G)O6H···O2(C), (G)N7H···N3(C), (G)C8-H/H-N4(C)
V[0.31 ÷ 9.18](G)O6H···O2(C), (G)N7H···N3(C), (G)C8···HN4(C)
Geometric structures of the 11 key points describing the evolution of the I. G*·C*(rWC)↔G+·C−(rWC)↔G·C*O2(rWC) tautomerisation via the sequential SPT along the IRC obtained at the B3LYP/6-311++G(d,p) level of theory in vacuo. Coordinates of the 11 key points, their relative electronic energies ΔE (in kcal·mol−1 obtained at the B3LYP/6-311++G(d,p) level of theory in vacuum at T = 298.15) and imaginary frequencies νi (cm−1) at the TSs of their interconversions are presented below them in brackets (see Table 3). For more detailed designations see Figure 1.
Table 3

Electron-topological and structural characteristics of the specific intermolecular bonds revealed in the 11 key points and the polarity of the latters along the IRC of the I. G*·C*(rWC)↔G+·C−(rWC)↔G·C*O2(rWC) tautomerisation obtained at the B3LYP/6-311++G(d,p) level of theory in vacuum (see Figure 2).

ComplexAH···B H-bond/ A-H/H-B covalent bondρaΔρb100·εcdA···BddH···Be∠AH···Bfμg
Key point 1 (−4.49 Bohr): G*·C*(rWC)O6H···O20.0380.1243.312.7311.753171.16.54
N1···HN30.0350.0866.322.9321.894173.6
N2H···N40.0260.0778.043.0362.016178.0
Key point 2 (−0.31 Bohr): ΔρN1···H = 0O6H···O20.0700.1662.752.5281.520174.65.19
N1···HN30.1100.0004.312.6211.427176.7
N2H···N40.0420.1057.072.8311.801175.2
Key point 3 (−0.08 Bohr): ρN1−H = ρH−N3O6H···O20.0730.1602.762.5261.509174.94.23
N1-H/H-N30.148−0.1933.732.6201.310177.0
N2H···N40.0430.1037.062.8291.795175.2
Key point 4 (0.00 Bohr): TSG*·C*(rWC)↔GC−(rWC)O6H···O20.0740.1582.762.5251.505174.93.92
N1-HN30.166−0.3063.582.6211.268177.1
N1H-N30.133−0.1114.282.6211.354177.1
N2H···N40.0430.1027.062.8291.793175.2
Key point 5 (0.16 Bohr): ΔρH···N3 = 0O6H···O20.0760.1522.772.5231.494175.03.45
N1H···N30.1070.0044.752.6251.439177.1
N2H···N40.0440.1007.052.8271.788175.3
Key point 6 (1.50 Bohr): G+·C(rWC)O6H···O20.0960.1262.502.4591.384173.13.15
N1H···N30.0670.0895.792.7311.649176.3
N2H···N40.0420.0957.242.8611.816177.0
Key point 7 (2.13 Bohr): ΔρH···O2 = 0O6H···O20.1300.0062.132.4211.298171.93.35
N1H···N30.0600.0965.972.7381.672174.7
N2H···N40.0370.0917.532.8971.858176.6
Key point 8 (2.20 Bohr): TSG+·C-(rWC)G·C*O2(rWC)O6H-O20.138−0.0452.052.4161.275171.93.45
O6-HO20.199−0.6441.342.4161.147171.9
N1H···N30.0590.0975.982.7401.676174.6
N2H···N40.0370.0917.542.8991.860176.6
Key point 9 (2.32 Bohr): ρO6−H = ρH−O2O6-H/H-O20.172−0.3341.432.4091.196172.03.77
N1H···N30.0580.1006.002.7431.686174.4
N2H···N40.0370.0917.562.9021.865176.6
Key point 10 (2.51 Bohr): ΔρO6···H = 0O6···HO20.134−0.0061.632.4111.286172.04.48
N1H···N30.0560.1046.012.7471.698174.3
N2H···N40.0360.0927.572.9041.870176.6
Key point 11 (4.75 Bohr): G·C*O2(rWC)O6···HO20.0740.1502.002.5321.500173.75.43
N1H···N30.0410.0996.252.8561.825171.8
N2H···N40.0290.0827.832.9871.961174.7

The electron density at the (3,−1) BCP, a.u.

The Laplacian of the electron density at the (3,−1) BCP, a.u.

The ellipticity at the (3–1) BCP.

The distance between the A (H-bond donor) and B (H-bond acceptor) atoms of the H-bonds, Å.

The distance between the H and B atoms of the H-bonds, Å.

The H-bond angle, degree.

The dipole moment of the complex, D.

Profiles of: (A) the relative electronic energy ΔE, (B) the first derivative of the electronic energy with respect to the IRC (dE/dIRC), (C) the dipole moment μ, (D) the NBO charges qNBO, (E) the distance R(H1-H9) between the H1 and H9 glycosidic hydrogens, (F) the α1 (∠N1H1(C)H9(G)) and α2 (∠N9H9(G)H1(C)) glycosidic angles, (G) the electron density ρ; (H) the Laplacian of the electron density Δρ, (I) the ellipticity ε at the (3,−1) BCPs, (J) the distance dA··· between the electronegative A and B atoms; (K) the distance dAH/HB between the hydrogen and electronegative A or B atoms and (L) the angle ∠AH···B of the covalent and hydrogen bonds along the IRC of the investigated I. G*·C*(rWC)↔G+·C−(rWC)↔G·C*O2(rWC) tautomerisation via the sequential SPT obtained at the B3LYP/6-311++G(d,p) level of theory in vacuum. Geometric structures of the 11 key points describing the evolution of the II. G*·C*(rWC)↔G+·C−(rWC)↔G*N2·C(rWC) tautomerisation via the sequential SPT along the IRC obtained at the B3LYP/6-311++G(d,p) level of theory in vacuo (see Table 4). For more detailed designations see Figure 2.
Table 4

Electron-topological and structural characteristics of the specific intermolecular bonds revealed in the 11 key points and the polarity of the latters along the IRC of the II. G*·C*(rWC)↔G+·C−(rWC)↔G*N2·C(rWC) tautomerisation obtained at the B3LYP/6-311++G(d,p) level of theory in vacuum (see Figure 4).

ComplexAH···B H-bond/A-H/H-B covalent bondρaΔρb100·εcdA···BddH···Be∠AH···Bfμg
Key point 1 (−4.49 Bohr): G*·C*(rWC)O6H···O20.0380.1243.312.7311.753171.16.54
N1···HN30.0350.0866.322.9321.894173.6
N2H···N40.0260.0778.043.0362.016178.0
Key point 2 (−0.31 Bohr): ΔρN1···H = 0O6H···O20.0700.1662.752.5281.520174.65.19
N1···HN30.1100.0004.312.6211.427176.7
N2H···N40.0420.1057.072.8311.801175.2
Key point 3 (−0.08 Bohr): ρN1−H = ρH−N3O6H···O20.0730.1602.762.5261.509174.94.23
N1-H/H-N30.148−0.1933.732.6201.310177.0
N2H···N40.0430.1037.062.8291.795175.2
Key point 4 (0.00 Bohr): TSG*·C*(rWC)↔GC−(rWC)O6H···O20.0740.1582.762.5251.505174.93.92
N1-HN30.166−0.3063.582.6211.268177.1
N1H-N30.133−0.1114.282.6211.354177.1
N2H···N40.0430.1027.062.8291.793175.2
Key point 5 (0.16 Bohr): ΔρH···N3 = 0O6H···O20.0760.1522.772.5231.494175.03.45
N1H···N30.1070.0044.752.6251.439177.1
N2H···N40.0440.1007.052.8271.788175.3
Key point 6 (1.50 Bohr): G+·C(rWC)O6H···O20.0960.1262.502.4491.384173.13.15
N1H···N30.0670.0895.792.7311.649176.3
N2H···N40.0420.0957.242.8611.814177.0
Key point 7 (4.51 Bohr): ΔρN2···H = 0O6H···O20.0570.1363.192.6201.603173.83.91
N1H···N30.0610.0966.062.7311.663179.8
N2H···N40.112−0.0014.862.5921.409180.0
Key point 8 (4.73 Bohr): ρN2−H = ρH−N4O6H···O20.0560.1383.202.6211.609173.74.72
N1H···N30.0590.1026.092.7331.673179.9
N2-H/H-N40.152−0.2114.152.5901.294179.4
Key point 9 (4.79 Bohr): TSG+·C−(rWC)↔G*NC(rWC)O6H···O20.0560.1393.212.6211.611173.64.99
N1H···N30.0590.1036.102.7331.676179.9
N2-HN40.139−0.1304.592.5911.329179.3
N2H-N40.165−0.3004.012.5911.262179.3
Key point 10 (4.96 Bohr): ΔρH···N4 = 0O6H···O20.0550.1403.212.6221.615173.65.74
N1H···N30.0580.1066.122.7351.683179.9
N2···HN40.1110.0005.102.5941.412179.0
Key point 11 (6.05 Bohr): G*N2·C(rWC)O6H···O20.0470.1343.322.6661.670172.77.15
N1H···N30.0420.1016.532.8511.815179.0
N2···HN40.0480.0947.082.8161.756176.4

For footnote definitions see .

Figure 2

Geometric structures of the 11 key points describing the evolution of the I. G*·C*(rWC)↔G+·C−(rWC)↔G·C*O2(rWC) tautomerisation via the sequential SPT along the IRC obtained at the B3LYP/6-311++G(d,p) level of theory in vacuo. Coordinates of the 11 key points, their relative electronic energies ΔE (in kcal·mol−1 obtained at the B3LYP/6-311++G(d,p) level of theory in vacuum at T = 298.15) and imaginary frequencies νi (cm−1) at the TSs of their interconversions are presented below them in brackets (see Table 3). For more detailed designations see Figure 1.

Profiles of physico-chemical parameters of the investigated II. G*·C*(rWC)↔G+·C−(rWC)↔G*N2·C(rWC) tautomerisation via the sequential SPT obtained at the B3LYP/6-311++G(d,p) level of theory in vacuum. For more detailed designations see Figure 3.
Figure 3

Profiles of: (A) the relative electronic energy ΔE, (B) the first derivative of the electronic energy with respect to the IRC (dE/dIRC), (C) the dipole moment μ, (D) the NBO charges qNBO, (E) the distance R(H1-H9) between the H1 and H9 glycosidic hydrogens, (F) the α1 (∠N1H1(C)H9(G)) and α2 (∠N9H9(G)H1(C)) glycosidic angles, (G) the electron density ρ; (H) the Laplacian of the electron density Δρ, (I) the ellipticity ε at the (3,−1) BCPs, (J) the distance dA··· between the electronegative A and B atoms; (K) the distance dAH/HB between the hydrogen and electronegative A or B atoms and (L) the angle ∠AH···B of the covalent and hydrogen bonds along the IRC of the investigated I. G*·C*(rWC)↔G+·C−(rWC)↔G·C*O2(rWC) tautomerisation via the sequential SPT obtained at the B3LYP/6-311++G(d,p) level of theory in vacuum.

Geometric structures of the 9 key points describing the evolution of the III. G*·C*(rWC)↔G*′N2·C(rWC) tautomerisation via the DPT along the IRC obtained at the B3LYP/6-311++G(d,p) level of theory in vacuo (see Table 5). For more detailed designations see Figure 2.
Table 5

Electron-topological and structural characteristics of the specific intermolecular bonds revealed in the 9 key points and the polarity of the latters along the IRC of the III. G*·C*(rWC)↔G*′N2·C(rWC) tautomerisation obtained at the B3LYP/6-311++G(d,p) level of theory in vacuum (see Figure 6).

ComplexAH···B H-bond/ A-H/H-B covalent bondρaΔρb100·εcdA···BddH···Be∠AH···Bfμg
Key point 1 (−10.37 Bohr): G*·C*(rWC)O6H···O20.0380.1243.312.7311.753171.16.54
N1···HN30.0350.0866.322.9321.894173.6
N2H···N40.0260.0778.043.0362.016178.0
Key point 2 (−3.23 Bohr): ΔρN1···H = 0O6H···O20.0550.1473.482.6081.612169.84.99
N1···HN30.115−0.0104.652.6021.404173.1
N2···N40.0180.06565.912.818
Key point 3 (−3.02 Bohr): ρN1−H = ρH−N3O6H···O20.0570.1443.482.6071.604170.34.17
N1-H/H-N30.150−0.1874.062.6031.303172.4
N2H···N40.0190.06653.462.8152.293110.5
Key point 4 (−2.61 Bohr): ΔρH···N3 = 0O6H···O20.0580.1393.502.6071.594171.03.31
N1H···N30.1070.0025.192.6171.445169.8
N2H···N40.0220.07521.372.8012.169118.3
Key point 5 (−0.21 Bohr): ΔρH···N4 = 0O6H···O20.0500.1293.842.6621.657170.83.53
N1H···N30.0610.1007.182.6911.670158.7
N2H···N40.1110.0073.502.5761.418163.2
Key point 6 (0.00 Bohr): TSG*·C*(rWC)G*N2·C(WC)O6H···O20.0490.1313.842.6631.662170.64.55
N1H···N30.0590.1057.292.6921.682158.2
N2-HN40.155−0.2143.172.5751.287164.5
N2H-N40.147−0.1613.532.5751.312164.5
Key point 7 (0.02 Bohr): ρN2−H = ρH−N4O6H···O20.0490.1313.802.6641.663170.64.85
N1H···N30.0580.1067.302.6931.684158.1
N2-H/H-N40.157−0.2253.502.5751.286164.7
Key point 8 (0.26 Bohr): ΔρN2···H = 0O6H···O20.0480.1333.832.6651.668170.46.05
N1H···N30.0570.1117.402.6941.694157.7
N2···HN40.1080.0053.212.5801.419165.0
Key point 9 (4.12 Bohr): G*N2·C(rWC)O6H···O20.0370.1173.872.7501.768170.48.85
N1H···N30.0330.0958.502.8761.921153.8
N2···HN40.0240.07111.363.0142.023160.2

For footnote definitions see .

Profiles of the physico-chemical parameters of the investigated III. G*·C*(rWC)↔G*′N2·C(rWC) tautomerisation via the DPT obtained at the B3LYP/6-311++G(d,p) level of theory in vacuum. For more detailed designations see Figure 3. Geometric structures of the 9 key points describing the evolution of the IV. G*′·C*(H)↔G*N7·C(H) tautomerisation via the DPT along the IRC obtained at the B3LYP/6-311++G(d,p) level of theory in vacuo (see Table 6). For more detailed designations see Figure 2.
Table 6

Electron-topological and structural characteristics of the specific intermolecular bonds revealed in the 9 key points and the polarity of the latters along the IRC of the IV. G*′·C*(H)↔G*N7·C(H) tautomerisation obtained at the B3LYP/6-311++G(d,p) level of theory in vacuum (see Figure 8).

ComplexAH···B H-bond/ A-H/H-B covalent bondρaΔρb100·εcdA···BddH···Be∠AH···Bfμg
Key point 1 (−6.04 Bohr): G*′·C*(H)O6H···N40.0470.1024.882.7101.742160.75.16
N7···HN30.0320.1015.682.8341.791180.0
C8H···O20.0030.0121.733.6753.029118.9
Key point 2 (−1.17 Bohr):ΔρO6···H = 0O6H···N40.1130.0073.732.4841.397164.96.89
N7···HN30.0770.0984.562.6341.558175.4
C8H···O20.0060.02215.153.3442.716116.8
Key point 3 (−0.96 Bohr): ρO6−H = ρH−N4O6-H/H-N40.165−0.2621.522.4281.214166.38.27
N7···HN30.0800.0894.452.6311.544175.6
C8H···O20.0060.02215.273.3432.716116.7
Key point 4 (−0.74 Bohr): ΔρH···N4 = 0O6···HN40.1260.0231.772.4941.313166.69.45
N7···HN30.0850.0764.302.6261.523175.7
C8H···O20.0060.02214.993.3382.710116.8
Key point 5 (−0.10 Bohr): ΔρN7···H = 0O6···HN40.0910.1362.152.5341.432167.79.96
N7···HN30.1110.0043.722.5891.420175.7
C8H···O20.0070.02412.563.3052.668117.3
Key point 6 (0.00 Bohr): TSG*·C*(H)G*N7·C(H)O6···HN40.0880.1432.192.5391.446167.79.63
N7-HN30.124−0.0533.482.5831.374176.0
N7H-N30.191−0.4933.582.5831.210176.0
C8H···O20.0070.02412.243.3032.665117.4
Key point 7 (0.17 Bohr): ρN7−H = ρH−N3O6···HN40.0840.1522.232.5431.463167.78.84
N7-H/H-N30.156−0.2313.092.5801.288176.2
C8H···O20.0070.02411.613.3022.662117.5
Key point 8 (0.41 Bohr): ΔρH···N3 = 0O6···HN40.0790.1612.282.5481.482167.67.73
N7H···N30.1120.0064.552.5831.414176.3
C8H···O20.0070.02410.783.3012.657117.8
Key point 9 (3.26 Bohr): G*N7·C(H)O6···HN40.0450.1372.532.7281.689172.27.32
N7H···N30.0540.0995.952.7581.700177.2
C8H···O20.0060.02216.543.3242.711115.8

For footnote definitions see .

Profiles of the physico-chemical parameters of the investigated IV. G*′·C*(H)↔G*N7·C(H) tautomerisation via the DPT obtained at the B3LYP/6-311++G(d,p) level of theory in vacuum. For more detailed designations see Figure 3. Geometric structures of the 9 key points describing the evolution of the V. G*′·C*(rH)↔G*′N7·C(rH) tautomerisation via the DPT along the IRC obtained at the B3LYP/6-311++G(d,p) level of theory in vacuo (see Table 7). For more detailed designations see Figure 2.
Table 7

Electron-topological and structural characteristics of the specific intermolecular bonds revealed in the 9 key points and the polarity of the latters along the IRC of the V. G*′·C*(rH)↔G*′N7·C(rH) tautomerisation obtained at the B3LYP/6-311++G(d,p) level of theory in vacuum (see Figure 10).

ComplexAH···B H-bond/ A-H/H-B covalent bondρaΔρb100·εcdA···BddH···Be∠AH···Bfμg
Key point 1 (−13.94 Bohr): G*′·C*(rH)O6H···O20.0340.1202.252.7331.785160.65.43
N7···HN30.0420.1012.732.8381.797176.3
C8H···N40.0040.0133.583.6813.015120.4
Key point 2 (−1.33 Bohr): ΔρN7···H = 0O6H···O20.0090.0272.753.3652.407167.65.11
N7···HN30.121−0.0073.602.5451.377167.2
C8H···N40.0440.1243.782.7401.814139.8
Key point 3 (−1.12 Bohr): ρN67−H = ρH−N3O6H···O20.0090.0272.823.3662.406167.94.70
N7-H/H-N30.161−0.2564.132.5491.283166.5
C8H···N40.0470.1244.092.7301.783141.6
Key point 4 (−0.71 Bohr): ΔρH···N3 = 0O6H···O20.0090.0272.773.3672.405168.14.47
N7H···N30.1100.0034.752.5641.430163.8
C8H···N40.0610.1164.872.7001.668148.7
Key point 5 (−0.20 Bohr): ΔρH···N4 = 0O6H···O20.0090.0272.673.3682.408167.94.93
N7H···N30.0900.0735.102.5741.506161.6
C8H···N40.111−0.0104.532.6791.425157.2
Key point 6 (−0.04 Bohr): ρC8−H = ρH−N4O6H···O20.0090.0272.663.3682.409167.95.38
N7H···N30.0870.0845.162.5761.520161.2
C8-H/H-N40.137−0.2022.322.6771.341158.5
Key point 7 (0.00 Bohr): TSG*·C*(rH)G*N7·C(rH)O6H···O20.0090.0272.663.3682.409167.95.51
N7H···N30.0860.0875.182.5761.524161.1
C8-HN40.131−0.1742.362.6771.320158.8
C8H-N40.145−0.1694.222.6771.404158.8
Key point 8 (0.36 Bohr): ΔρC8···H = 0O6H···O20.0090.0272.623.3692.411167.76.81
N7H···N30.0790.1075.312.5811.554160.0
C8···HN40.0840.0033.082.6811.581160.7
Key point 9 (9.18 Bohr): G*′N7·C(rH)O6H···O20.0220.0803.152.9181.968162.99.01
N7H···N30.0490.1146.602.7581.743167.8
C8···HN40.0160.0386.193.3022.316160.1

For footnote definitions see .

Profiles of the physico-chemical parameters of the investigated V. G*′·C*(rH)↔G*′N7·C(rH) tautomerisation via the DPT obtained at the B3LYP/6-311++G(d,p) level of theory in vacuum. For more detailed designations see Figure 3. Profiles of the energy of the intermolecular H-bonds EHB estimated by the EML formula at the (3,−1) BCPs along the IRC of the investigated tautomerisations via the SPT or DPT obtained at the B3LYP/6-311++G(d,p) level of theory (see Table 3). Energetic (in kcal·mol−1) and kinetic (in s) characteristics of the tautomerisation of the G*·C*(rWC), G*′·C*(H), and G*′·C*(rH) DNA base pairs via the SPT or DPT obtained at the MP2/aug-cc-pVDZ//B3LYP/6-311++G(d,p) level of QM theory in the continuum with ε = 1 under normal conditions (see Figure 1). The imaginary frequency at the TS of the tautomeric transition, cm. The Gibbs free energy of the initial relatively the terminal base pair of the tautomerisation reaction (T = 298.15 K). The electronic energy of the initial relatively the terminal base pair of the tautomerisation reaction. The Gibbs free energy barrier for the forward tautomerisation reaction. The electronic energy barrier for the forward tautomerisation reaction. The Gibbs free energy barrier for the reverse tautomerisation reaction. The electronic energy barrier for the reverse tautomerisation reaction. The time necessary to reach 99.9% of the equilibrium concentration between the reactant and the product of the tautomerisation reaction. The lifetime of the product of the tautomerisation reaction. Electron-topological, geometrical, and energetic characteristics of the intermolecular H-bonds in the investigated DNA base pairs in rare tautomeric forms and TSs of their tautomerization via the SPT or DPT obtained at the B3LYP/6-311++G(d,p) level of QM theory (ε = 1) (see Figure 1). The electron density at the (3,−1) BCP of the H-bond, a.u. The Laplacian of the electron density at the (3,−1) BCP of the H-bond, a.u. The ellipticity at the (3,−1) BCP of the H-bond. The distance between the A and B atoms of the AH···B H-bond, Å. The distance between the H and B atoms of the AH···B H-bond, Å. The H-bond angle, degree. The redshift of the stretching vibrational mode υ(AH) of the AH H-bonded group, cm. Energy of the H-bonds, calculated by Iogansen's (Iogansen, . The dipole moment of the complex, D. Electron-topological and structural characteristics of the specific intermolecular bonds revealed in the 11 key points and the polarity of the latters along the IRC of the I. G*·C*(rWC)↔G+·C−(rWC)↔G·C*O2(rWC) tautomerisation obtained at the B3LYP/6-311++G(d,p) level of theory in vacuum (see Figure 2). The electron density at the (3,−1) BCP, a.u. The Laplacian of the electron density at the (3,−1) BCP, a.u. The ellipticity at the (3–1) BCP. The distance between the A (H-bond donor) and B (H-bond acceptor) atoms of the H-bonds, Å. The distance between the H and B atoms of the H-bonds, Å. The H-bond angle, degree. The dipole moment of the complex, D. Electron-topological and structural characteristics of the specific intermolecular bonds revealed in the 11 key points and the polarity of the latters along the IRC of the II. G*·C*(rWC)↔G+·C−(rWC)↔G*N2·C(rWC) tautomerisation obtained at the B3LYP/6-311++G(d,p) level of theory in vacuum (see Figure 4).
Figure 4

Geometric structures of the 11 key points describing the evolution of the II. G*·C*(rWC)↔G+·C−(rWC)↔G*N2·C(rWC) tautomerisation via the sequential SPT along the IRC obtained at the B3LYP/6-311++G(d,p) level of theory in vacuo (see Table 4). For more detailed designations see Figure 2.

For footnote definitions see . Electron-topological and structural characteristics of the specific intermolecular bonds revealed in the 9 key points and the polarity of the latters along the IRC of the III. G*·C*(rWC)↔G*′N2·C(rWC) tautomerisation obtained at the B3LYP/6-311++G(d,p) level of theory in vacuum (see Figure 6).
Figure 6

Geometric structures of the 9 key points describing the evolution of the III. G*·C*(rWC)↔G*′N2·C(rWC) tautomerisation via the DPT along the IRC obtained at the B3LYP/6-311++G(d,p) level of theory in vacuo (see Table 5). For more detailed designations see Figure 2.

For footnote definitions see . Electron-topological and structural characteristics of the specific intermolecular bonds revealed in the 9 key points and the polarity of the latters along the IRC of the IV. G*′·C*(H)↔G*N7·C(H) tautomerisation obtained at the B3LYP/6-311++G(d,p) level of theory in vacuum (see Figure 8).
Figure 8

Geometric structures of the 9 key points describing the evolution of the IV. G*′·C*(H)↔G*N7·C(H) tautomerisation via the DPT along the IRC obtained at the B3LYP/6-311++G(d,p) level of theory in vacuo (see Table 6). For more detailed designations see Figure 2.

For footnote definitions see . Electron-topological and structural characteristics of the specific intermolecular bonds revealed in the 9 key points and the polarity of the latters along the IRC of the V. G*′·C*(rH)↔G*′N7·C(rH) tautomerisation obtained at the B3LYP/6-311++G(d,p) level of theory in vacuum (see Figure 10).
Figure 10

Geometric structures of the 9 key points describing the evolution of the V. G*′·C*(rH)↔G*′N7·C(rH) tautomerisation via the DPT along the IRC obtained at the B3LYP/6-311++G(d,p) level of theory in vacuo (see Table 7). For more detailed designations see Figure 2.

For footnote definitions see . Patterns of the specific intermolecular interactions including AH···B H-bonds and loosened A-H-B covalent bridges that sequentially replace each other along the IRC of the investigated tautomerisations via the SPT or DPT obtained at the B3LYP/6-311++G(d,p) level of theory (see Figure 12).

Tautomerisation of the Reverse Löwdin G*·C*(rWC) Base Pair via the SPT: I. G*·C*(rWC)↔G+·C−(rWC)↔G·(rWC) and II. G*·C*(rWC)↔G+·C−(rWC)↔C(rWC)

For the first time we have discovered three local minima on the hypersurface of the electronic energy of the G*·C*(rWC) base pair corresponding to the high-energy tautomerised G·(rWC), G+·C−(rWC), and C(rWC) base pairs (Figure 1, Table 1). All of them are stabilized by the participation of three intermolecular H-bonds, among which the upper O6H…O2/O2HO6 H-bonds are the strongest (Table 2).
Table 2

Electron-topological, geometrical, and energetic characteristics of the intermolecular H-bonds in the investigated DNA base pairs in rare tautomeric forms and TSs of their tautomerization via the SPT or DPT obtained at the B3LYP/6-311++G(d,p) level of QM theory (ε = 1) (see Figure 1).

ComplexAH···B H–bondρaΔρb100·εcdA···BddH···Be∠AH···BfΔυgEAH···Bhμi
G*·C*(rWC)↔G+·C(rWC)
G*·C*(rWC) (Brovarets', 2013b)O6H···O20.0380.1243.312.7311.753171.1359.45.906.54
N3H···N10.0350.0866.322.9321.894173.6538.27.37
N2H···N40.0260.0778.043.0362.016178.0248.94.77
TSG*·C*(rWC)G+·C-(rWC)O6H···O20.0740.1582.772.5251.505174.914.60**3.92
N2H···N40.0430.1027.062.8291.793175.27.64**
G+·C(rWC)O6+H···O20.1030.1132.402.4581.384172.41764.113.703.20
N1+H···N30.0640.0915.912.7261.649176.01092.310.70
N2+H···N40.0410.0947.362.8601.816177.4665.68.25
I. G+·C(rWC)↔G·C*O2(rWC)
TSG+·C-(rWC)G·C*O2(rWC)N1H···N30.0590.0975.982.7401.676174.611.01**3.45
N2H···N40.0370.0917.542.8991.860176.66.30**
G·C*O2(rWC)O2H···O60.0740.1502.002.5321.500173.71237.611.425.43
N1H···N30.0410.0996.252.8561.825171.8465.86.81
N2H···N40.0290.0827.832.9871.961174.7368.55.98
II. G+·C(rWC)↔G*N2·C(rWC)
TSG+·C-(rWC)G*N2·C(rWC)O6H···O20.0560.1393.202.6211.611173.610.29**4.99
N1H···N30.0590.1036.102.7331.676179.911.25**
G*N2·C(rWC)O6H···O20.0470.1343.322.6661.670172.7684.48.387.15
N1H···N30.0420.1016.532.8511.815179.0455.46.73
N4H···N20.0480.0947.082.8161.756176.4846.19.37
III. G*·C*(rWC)↔G*′N2·C(rWC)
TSG*·C*(rWC)G*N2·C(rWC)O6H···O20.0490.1313.842.6631.662170.610.29**4.55
N1H···N30.0590.1057.292.6921.682158.211.25**
G*′N2·C(rWC)O6H···O20.0370.1173.872.7501.768170.4503.56.988.85
N1H···N30.0330.0958.502.8761.921153.8243.04.61
N4H···N20.0240.07111.363.0142.023160.2409.36.22
TSG*N2·C(rWC)G*N2·C(rWC)O6H···O20.0480.1323.592.6651.666172.2766.78.745.92
N1H···N30.0450.1057.022.8051.787165.5504.76.98
N4H···N20.0580.0825.232.7591.681173.41291.911.47
IV. G*′·C*(H)↔G*N7·C(H)
G*′·C*(H) (Brovarets', 2013b)O6H···N40.0470.1024.882.7101.742160.7691.68.425.16
N3H···N70.0320.1015.682.8341.791180.0572.67.62
C8H···O20.0030.0121.733.6753.029118.9−4.10.55*
TSG*·C*(H)G*N7·C(H)N4H···O60.0880.1432.192.5391.446167.717.77**9.63
C8H···O20.0070.02412.243.3032.665117.41.27*
G*N7·C(H)N4H···O60.0450.1372.532.7281.689172.213.28.037.32
N7H···N30.0540.0995.952.7581.700177.215.59.26
C8H···O20.0060.02216.543.3242.711115.81.21.15*
V. G*′·C*(rH)↔G*′N7·C(rH)
G*′·C*(rH) (Brovarets', 2013b)O6H···O20.0340.1202.252.7331.785160.6319.05.515.43
N3H···N70.0420.1012.732.8381.797176.3535.67.35
C8H···N40.0040.0133.583.6813.015120.4−2.10.63*
TSG*·C*(rH)G*N7·C(rH)O6H···O20.0090.0272.663.3682.409167.91.36**5.52
N7H···N30.0860.0875.182.5761.524161.117.32**
G*′N7·C(rH)O6H···O20.0220.0803.152.9181.968162.9112.62.819.01
N7H···N30.0490.1146.602.7581.743167.8619.67.94
N4H···C80.0160.0386.193.3022.316160.1339.75.71

The electron density at the (3,−1) BCP of the H-bond, a.u.

The Laplacian of the electron density at the (3,−1) BCP of the H-bond, a.u.

The ellipticity at the (3,−1) BCP of the H-bond.

The distance between the A and B atoms of the AH···B H-bond, Å.

The distance between the H and B atoms of the AH···B H-bond, Å.

The H-bond angle, degree.

The redshift of the stretching vibrational mode υ(AH) of the AH H-bonded group, cm.

Energy of the H-bonds, calculated by Iogansen's (Iogansen, .

The dipole moment of the complex, D.

In fact, the tautomerisation of the G*·C*(rWC) base pair with relative Gibbs free energy ΔG = 0.00 kcal·mol−1 starts from the single transfer of the proton localized at the N3 nitrogen atom of the C base to the N1 nitrogen atom of the G base along the intermolecular H-bond (C)N3H…N1(G). This G*·C*(rWC)↔G+·C−(rWC) tautomerization process occurs via the TSG*·C*(rWC)↔G+·C (Cs symmetry) (ΔG = 4.38 kcal·mol−1) containing N1-H-N3 covalent bridge and further proceeds through the intermediate—tight ion pair G+·C−(rWC) (ΔG = 4.44 kcal·mol−1) (Cs symmetry), which is the point of bifurcation. By the way, it should be noted that this is the first case of the reliable fixation of the ionic pair of bases, formed as a result of the SPT along the intermediate molecular H-bond, which is involved in its stabilization. Similar attempts to localize such structures for the A·T(WC) and G·C(WC) DNA base pairs didn't lead to result. Further the G*·C*(rWC)↔G+·C−(rWC) tautomerisation reaction proceeds according two scenarios via the proton transfer along: - the (G)O6H….O2(C) H-bond through the TSG (ΔG = 3.64 kcal·mol−1; Cs symmetry) with O6-H-O2 covalent bridge leading to the G·(rWC) product (Cs symmetry); - the (G)N2H….N4(C) H-bond through the TSG (ΔG = 9.27 kcal·mol−1; Cs symmetry) N2-H-N4 covalent bridge leading to the ·C(rWC) product (Cs symmetry). It attracts attention that electronic ΔEint and Gibbs free ΔGint energies of the interaction for the tautomerised G·(rWC) (ΔEint = −39.67/ΔGint = −26.53) and ·C(rWC) (ΔEint = −41.10/ΔGint = −26.10) base mispairs exceed the values for the initial G*·C*(rWC) base mispair (ΔEint = −20.21/ΔGint = −7.40) and also canonical G·C(WC) base pair (ΔEint = −29.28/ΔGint = −15.97 kcal·mol−1) (Brovarets' and Hovorun, 2014a). Moreover, it was revealed that all three formed high-energy complexes—the G+·C−(rWC) ion pair and G·(rWC), ·C(rWC) tautomers of the Löwdin G*·C*(rWC) base pair are dynamically unstable structures, since for them the zero energy of the vibrations, which frequency become imaginary at the TS of tautomerisation, significantly exceeds the reverse electronic barrier (363.8, 668.1, and 454.7 cm−1) (Table 1). Moreover, Gibbs free energies of the reverse barrier for the G*·C*(rWC)↔G+·C−(rWC) and G+·C−(rWC)↔·C(rWC) tautomeric transformations are negative (−0.06 and −1.12 kcal·mol−1, accordingly) and for the G+·C−(rWC)↔G·(rWC)—it is less (0.13 kcal·mol−1) than kT (0.62 kcal·mol−1 under normal conditions) (Table 1). So, in fact the Löwdin G*·C*(rWC) base pair does not tautomerise to the novel G·(rWC) and ·C(rWC) base mispairs via the SPT along the intermolecular H-bonds. However, despite this verdict, obtained data can be useful as an analogy or even as a heuristic push at the investigation of the tautomerisation mechanisms of the H-bonded complexes of any nature.

Tautomerisation of the Reverse Löwdin G*·C*(rWC) Base Pair via the DPT: III. G*·C*(rWC)↔G*′N2·C(rWC)

We have also detected the unusual tautomerisation of the reverse Lowdin's G*·C*(rWC) DNA base mispair via the asynchronous [with a level of asynchrony 3.49 Bohr (Brovarets' and Hovorun, 2019b)] concerted DPT to the G*′N2·C(rWC) DNA base mispair with trans-oriented N2H imino group of the G DNA base, in which participates the protons at the N3(C) and N2(G) nitrogen atoms moving in opposite directions. Unusual nature of this process consists in the fact that the transitions of the protons from N3(C) to N1(G) and from N2(G) to N4(C) along the intermolecular H-bonds provokes the rotation of the NH2 amino group of the G base into the trans-position relatively the neighboring double C2N3(G) bond. As a results, this G*·C*(rWC)↔G*′N2·C(rWC) tautomerisation reaction proceeds through the substantially non-planar intermediate – G+′N2·C−(rWC) ion pair with non-planar NH2 amino group (C1 symmetry), the substantially non-planar TSG*·C*(rWC)↔G*′ (C1 symmetry), and substantially non-planar product of this reaction—the G*′N2·C(rWC) DNA base mispair (C1 symmetry). This DNA base mispair is stabilized by the participation of the three intermolecular H-bonds—(G)O6H…O2(C), (G)N1H…N3(C), and (C)N4H…N2(G) and represents itself quite stable structure (ΔEint = −33.41/ΔGint = −18.99 kcal·mol−1). Its characteristic structural feature is the substantial non-flatness (∠N1C6C2N3 = 23.3°; ∠C2N2N4C4 = 53.8°) and the exit from the plane of the purine ring of the O6H (∠HO6C6N1 = 10.8°), N1H (∠HN1C2N3 = -157.4°), and N2H (∠HN2C2N3 = -167.6°) external groups with the trans-orientation relative to the neighboring bond C2N3. Moreover, we have revealed that the C(rWC) and G*′N2·C(rWC) DNA base pairs interconvert via the conformational rotation of the N2H imino group around the C2N2 bond through the Gibbs freee energy barrier 31.65 kcal·mol−1 (Figure 1, Tables 1–2).

Tautomerisation of the Hoogsteen G*′·C*(H) Base Pair via the Classical DPT: IV. G*′·C*(H)↔C(H)

The tautomerisation of the Hoogsteen G*′·C*(H) base pair (ΔG = 0.00 kcal·mol−1; Cs symmetry) is possible via one-single way—through the asynchronous DPT (the values of the asynchronity consists 1.58 Bohr) (for more details see further discussion and Figure 8) along two intermolecular antiparallel lower (G)O6H…N4(C) and (C)N3H…N7(G) H-bonds through the TSG*′·C*(H)↔G*N7·C(H) (ΔG = 4.01 kcal·mol−1; Cs symmetry) connected by the N7-H-N3 covalent bridge with further formation of the dynamically-unstable C(H) base mispair with small lifetime τ = 4.46·10−13 s (ΔG = 3.20 kcal·mol−1; Cs symmetry). The G*′·C*(H)↔C(H) tautomerisation starts from the initial transfer of the hydrogen atom localized at the O6 oxygen atom of the G*′ DNA base to the N4 nitrogen atom of the C* DNA base within the G*′·C*(H) DNA base pair and then through the G+·C−(H) Hoogsteen base pair via the proton transfer from the N3 nitrogen atom to the N7 nitrogen atom leading to the formation of the C(H) DNA base mispair by the participation of the rare tautomer and canonical C DNA base. Notably, that initial and final structures involved in this tautomerisation process are stabilized by the participation of three intermolecular H-bonds (Table 2), one of which (G)C8H…O2(C) is non-standard with energy 1.15 kcal·mol−1. Notably, electronic (ΔEint = −35.66) and Gibbs free (ΔGint = −22.05 kcal·mol−1) energies of the interaction for the terminal C(H) base mispair exceed the values for the initial base mispair (ΔEint = −21.24/ΔGint = −8.91 kcal·mol−1). At this, total energies of the H-bonds make a great contribution to the electronic interaction energy−78.1% for the G*′·C*(H) DNA base mispair and 51.7% for the C(H) DNA base mispair. All low-frequency intermolecular vibrations of the C(H) base mispair, which periods are in the range 8.06·10−13-1.16·10−12 s, can't develop during its lifetime. This situation is typical for the structures, which are deprived of dynamic stability (Brovarets' and Hovorun, 2019b). So, in this case in fact the G*′·C*(H) base pair does not tautomerise via the DPT similarly to the previous G*·C*(rWC) base pair.

Tautomerisation of the Reverse Hoogsteen G*′·C*(rH) Base Pair via the DPT by the Participation of the C8H(G*′) Group: V. G*′·C*(rH)↔G*′N7·C(rH)

The G*′·C*(rH) base pair differs from two previous ones, since it tautomerises via the asynchronous DPT (with the value of asynchronity 1.69 Bohr) along the intermolecular antiparallel (C)N3H…N7(G) and (G)C8H…N4(C) H-bonds with further formation of the yilidic form G*′N7 of the G DNA base (Govorun et al., 1995a,b; Kondratyuk et al., 2000), which is characterized by the transferred proton of the C8H group to the neighboring N7 nitrogen atom. The G*′·C*(rH)↔G*′N7·C(rH) tautomerisation proceeds via the initial transfer of the proton localized at the N3 nitrogen atom of C base to the N7 nitrogen atom of G base through the formation of the G+′N7·C− ion pair followed by further proton transfer localized at the C8 carbon atom of G*′ base to the N4 nitrogen atom of C* base. Notably, that TS of this process—TSG*′·C*(rH)↔G*′–has planar structure (Cs symmetry) and contains C8-H-N4 covalent bridge, which angle is 158.8°. This process is become possible due to the fact that G base, from one side, is CH-acid (Kondratyuk et al., 2000) and from the other—it is able to transfer into the zwitterionic tautomer—so-called yilidic form (Govorun et al., 1995a,b; Kondratyuk et al., 2000). Analysis of the obtained data (Table 1) evidences that G*′N7·C(rH) tautomer is dynamically stable structure with quite long lifetime (τ = 5.15·10−12 s). Characteristically, that all 6 low-frequency intermolecular vibrations of the G*′N7·C(rH) base mispair, which period are in the range 6.83·10−13-1.51·10−12 s, can develop during this lifetime. This is the first case (Brovarets' et al., 2018f), when the product of the tautomerization of the H-bonded base pair by the participation of the yilidic purine base is dynamically stable structure. However, formed G*′N7·C(rH) base pair has low population−2.3·10−17 under normal conditions, which complicates the understanding of its biological role. At the same time, such structures represent a considerable interest from a theoretical point of view, in particular, they contain unusual (C)N4H…C8−(G) H-bond (Table 2), which supplements existing data about the nature of the H-bonding in the pairs of DNA/RNA bases (Brovarets' et al., 2014e).

Profiles of the Physico-Chemical Parameters of the Investigated SPT and DPT Tautomerisations

We have investigated in details the mechanisms of the abovementioned processes of the tautomerisation of the reverse Lowdin's G*·C*(rWC), Hoogsteen G*′·C*(H), and reverse Hoogsteen G*′·C*(rH) base pairs via the PT along the intermolecular H-bonds. Tautomerisations proceed in a synchronous concerted manner via the stepwise SPT in the case of the I. G*·C*(rWC)↔G+·C−(rWC)↔G·(rWC) and II. G*·C*(rWC)↔G+·C−(rWC)↔C(rWC) reactions, while in an asynchronous concerted manner via the DPT in the case of the III. G*·C*(rWC)↔G*′N2·C(rWC), IV. G*′·C*(H)↔·C(H) and V. G*′·C*(rH)↔G*′N7·C(rH) reactions (Figures 2, 4, 6, 8, 10, Tables 3–7). We have established following regularities of the general character for the obtained sweeps of the most important physico-chemical parameters along the IRC. The widths of the reaction zone of the investigated reactions starting from the reagent and till the product are almost the same for the I. G*·C*(rWC)↔G·(rWC) (9.24); II. G*·C*(rWC)↔·C(rWC) (10.54) and IV. G*′·C*(H)↔·C(H) (9.30 Bohr), while these widths are significantly larger for the III. G*·C*(rWC)↔G*′N2·C(rWC) (14.49) and V. G*′·C*(rH)↔G*′N7·C(rH) (23.12 Bohr) reactions (Figures 2, 4, 6, 8, 10). All tautomerisation processes are dipole-active, since they are followed by the significant change of the dipole moment μ of the tautomerising base pair (Figures 3C, 5C, 7C, 9C, 11C). These dependencies are U-like with maximal and minimal values located in the transition state zone for the I. G*·C*(rWC)↔G·(rWC) SPT (values of μ change in the range: 3.15–6.54); II. G*·C*(rWC)↔·C(rWC) SPT (3.15–7.15); III. G*·C*(rWC)↔G*′N2·C(rWC) DPT (2.81–8.84) and V. G*′·C*(rH)↔G*′N7·C(rH) DPT (4.47–9.01 D) reactions, while it is reverse with maximal values at the transition state zone for the IV. G*′·C*(H)↔·C(H) reaction (5.16–10.12 D) DPT (Figures 3C, 5C, 7C, 9C, 11C).
Figure 5

Profiles of physico-chemical parameters of the investigated II. G*·C*(rWC)↔G+·C−(rWC)↔G*N2·C(rWC) tautomerisation via the sequential SPT obtained at the B3LYP/6-311++G(d,p) level of theory in vacuum. For more detailed designations see Figure 3.

Figure 7

Profiles of the physico-chemical parameters of the investigated III. G*·C*(rWC)↔G*′N2·C(rWC) tautomerisation via the DPT obtained at the B3LYP/6-311++G(d,p) level of theory in vacuum. For more detailed designations see Figure 3.

Figure 9

Profiles of the physico-chemical parameters of the investigated IV. G*′·C*(H)↔G*N7·C(H) tautomerisation via the DPT obtained at the B3LYP/6-311++G(d,p) level of theory in vacuum. For more detailed designations see Figure 3.

Figure 11

Profiles of the physico-chemical parameters of the investigated V. G*′·C*(rH)↔G*′N7·C(rH) tautomerisation via the DPT obtained at the B3LYP/6-311++G(d,p) level of theory in vacuum. For more detailed designations see Figure 3.

Tautomerisation process does not change the configuration of the base pair—complexes slightly compress on several dozens of Angstrom at the zone of the TSs of tautomerisation. TSs demonstrate significant deviations from a plane in the case, when steric conflicts arise between the interacting exocyclic groups (Figures 2, 4, 6, 8, 10). According to the authors' conception (Brovarets' and Hovorun, 2013a,b,c,d, 2014c,d, 2015h,i; Brovarets' et al., 2013a,b, 2014a,b,c, 2015), it was introduced key points, namely 11 key points were obtained in the cases of the I. G*·C*(rWC)↔G·(rWC) and II. G*·C*(rWC)↔·C(rWC) SPT along the IRC in contrast to the processes of the III. G*·C*(rWC)↔G*′N2·C(rWC), IV. G*′·C*(H)↔·C(H) and V. G*′·C*(rH)↔G*′N7·C(rH) DPT, for which 9 key points have been localized. At this, it was obtained typical crossings of the profiles for the electron density ρ, the Laplacian of the electron density Δρ and the distance dAH/HB between the hydrogen and electronegative A or B atoms for the H-bonds involved in the tautomerisation, notifying the equalization of these parameters. They occur at the 3rd and 9th key points in the case of the reaction I, 3rd and 8th—for the reaction II, 3rd and 7th—for the reactions III and IV and 3rd and 6th—for the reaction V (Figures 3G,H,K, 5G,H,K, 7G,H,K, 9G,H,K, 11G,H,K). One and the same regularity is observed for the dE/dIRC function in all cases of tautomerisations—with two local maxima and two local minima achieved at the TS zone (Figures 3B, 5B, 7B, 9B, 11B). Also it was observed five patterns for the energy EHB of the intermolecular H-bonds, estimated by the EML method (Espinosa et al., 1998; Matta et al., 2006b; Mata et al., 2011; Lecomte et al., 2015; Alkorta et al., 2016, 2017), along the IRC for the I, II, IV and V reactions and four patterns—for the III reaction (Figure 12, Table 8). These sweeps allow to estimate numerically the cooperativity of the neighboring H-bonds according to the methodology, proposed by us earlier (Brovarets' and Hovorun, 2019b). It was established the general pattern—the anti-parallel H-bonds amplify each other and parallel—weaken each other (Turaeva and Brown-Kennerly, 2015). Moreover, some of the dependencies of the energy EHB of the intermolecular H-bonds exist during entire IRC, such as (G)N2H…N4(C) for the I. G*·C*(rWC)↔G·(rWC) reaction, (G)O6H…O2(C) for the II. G*·C*(rWC)↔·C(rWC) reaction, (G)O6H…O2(C) for the III. G*·C*(rWC)↔G*′N2·C(rwWC) reaction, (G)C8H…O2(C) (its energy remains almost unchangeable during the IRC) for the G*′·C*(H)↔·C(H) reaction and (G)O6H…O2(C) for the G*′·C*(rH)↔G*′N7·C(rH) reaction. In the case of the III. G*·C*(rWC)↔G*′N2·C(rwWC) reaction, some of the H-bonds transform into the van der Waals contact and then to another H-bonds during the IRC: (G)N2HN4(C) H-bond → (G)N2…N4(C) vdW contact → (G)N2HN4(C) H-bond for the III. G*·C*(rWC)↔G*′N2·C(rwWC) reaction (Figure 12). For more details according the patterns refer to Table 8. Finally, we would like to note some general regularities, which are characteristic for all without exclusion processes of tautomerisation. Thus, in the vast majority of cases base pairs are plane symmetric structures during the entire PT and DPT tautomerization processes along the IRC, despite the ability of the DNA bases for the out-of-plane bending (Govorun et al., 1992; Hovorun et al., 1999; Nikolaienko et al., 2011), excluding two mentioned above cases, when there are deviation from the planarity—III. G*·C*(rWC)↔G*′N2·C(rWC) (Figure 1). Interestingly, the total energy of the intermolecular H-bonds only partially contributes to the electron energy of the monomers interactions among all without any exceptions H-bonded structures investigated in this work (see Figures 1, 12). This result is in a good accordance with generalized literature data (Brovarets' and Hovorun, 2014e, 2019b).

Conclusions

For the first time the tautomerisation of the biologically-important conformers of the G*·C* DNA base pair - reverse Löwdin G*·C*(rWC), Hoogsteen G*′·C*(H) and reverse Hoogsteen G*′·C*(rH) pairs - was investigated and thoroughly analyzed. It was found out that the G*·C*(rWC)↔G+·C−(rWC)↔G·(rWC) and G*·C*(rWC)↔G+·C−(rWC)↔C(rWC) tautomerization processes occur via the two-stage sequential SPT via dynamically-unstable zwitterion-like G+·C−(rWC) intermediate, while the G*·C*(rWC)↔G*′N2·C(rwWC), G*′·C*(H)↔·C(H), and G*′·C*(rH)↔G*′N7·C(rH) tautomerization processes occur through the one-stage DPT. At this, proton transfer along the non-canonical (G*)CH…N(C*) H-bond is accompanied by the significant deviation of the C-H-N bridge at the TS. Obtained data evidence that among the G*·C*(rWC), G′·C*(H) and G*′·C*(rH) base pairs only the tautomerisation of the latest of them lead to the formation of the dynamically stable G*′N7·C(rH) base pair with lifetime 5.15 ps with a miserable population 2.3·10−17. Moreover, it was revealed that the I. G*·C*(rWC)↔G+·C−(rWC)↔G·(rWC) and II. G*·C*(rWC)↔G+·C−(rWC)↔C(rWC) tautomerization reactions proceed in a synchronous concerted manner via the stepwise SPT, while the III. G*·C*(rWC)↔G*′N2·C(rWC), IV. G*′·C*(H)↔C(H), and V. G*′·C*(rH)↔G*′N7·C(rH) reactions occur in an asynchronous concerted manner via the DPT. We have also established dependencies of the most important physico-chemical parameters along the IRC enabling to understand more precisely the inherent nature of the investigated processes.

Data Availability

The datasets generated for this study are available on request to the corresponding author.

Author Contributions

OB analysis and preparation of the current literature survey, discussion of the strategy of the current investigation, study conception and design, acquisition of data, drafting of manuscript analysis and interpretation of data, performance of calculations, discussion of the obtained data, preparation of the numerical data for Tables 1–8, graphical materials for Figures 1–12, and text of the manuscript. TO preparation of the numerical data for Tables 1–8 and graphical materials for Figures 1–12. DH study conception, critical revision of manuscript, proposition of the task of the investigation, discussion of the obtained data, and preparation of the text of the manuscript. All authors were involved in the proofreading of the final version of the manuscript.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
  66 in total

1.  Why the tautomerization of the G·C Watson-Crick base pair via the DPT does not cause point mutations during DNA replication? QM and QTAIM comprehensive analysis.

Authors:  Ol'ha O Brovarets'; Dmytro M Hovorun
Journal:  J Biomol Struct Dyn       Date:  2013-08-02

2.  Energetic, Topological and Electric Field Analyses of Cation-Cation Nucleic Acid Interactions in Watson-Crick Disposition.

Authors:  Ibon Alkorta; Ignasi Mata; Elies Molins; Enrique Espinosa
Journal:  Chemphyschem       Date:  2018-11-29       Impact factor: 3.102

Review 3.  The origins of quantum biology.

Authors:  Johnjoe McFadden; Jim Al-Khalili
Journal:  Proc Math Phys Eng Sci       Date:  2018-12-12       Impact factor: 2.704

4.  Widespread transient Hoogsteen base pairs in canonical duplex DNA with variable energetics.

Authors:  Heidi S Alvey; Federico L Gottardo; Evgenia N Nikolova; Hashim M Al-Hashimi
Journal:  Nat Commun       Date:  2014-09-04       Impact factor: 14.919

5.  Is the DPT tautomerization of the long A·G Watson-Crick DNA base mispair a source of the adenine and guanine mutagenic tautomers? A QM and QTAIM response to the biologically important question.

Authors:  Ol'ha O Brovarets'; Roman O Zhurakivsky; Dmytro M Hovorun
Journal:  J Comput Chem       Date:  2013-12-30       Impact factor: 3.376

6.  Modelling proton tunnelling in the adenine-thymine base pair.

Authors:  A D Godbeer; J S Al-Khalili; P D Stevenson
Journal:  Phys Chem Chem Phys       Date:  2015-05-21       Impact factor: 3.676

7.  DPT tautomerisation of the wobble guanine·thymine DNA base mispair is not mutagenic: QM and QTAIM arguments.

Authors:  Ol'ha O Brovarets'; Roman O Zhurakivsky; Dmytro M Hovorun
Journal:  J Biomol Struct Dyn       Date:  2014-03-21

8.  Key microstructural mechanisms of the 2-aminopurine mutagenicity: Results of extensive quantum-chemical research.

Authors:  Ol'ha O Brovarets'; Dmytro M Hovorun
Journal:  J Biomol Struct Dyn       Date:  2019-01-08

9.  Surprising Conformers of the Biologically Important A·T DNA Base Pairs: QM/QTAIM Proofs.

Authors:  Ol'ha O Brovarets'; Kostiantyn S Tsiupa; Dmytro M Hovorun
Journal:  Front Chem       Date:  2018-02-27       Impact factor: 5.221

10.  An interacting quantum atom study of model SN 2 reactions (X- ···CH3 X, X = F, Cl, Br, and I).

Authors:  Ibon Alkorta; Joseph C R Thacker; Paul L A Popelier
Journal:  J Comput Chem       Date:  2017-11-10       Impact factor: 3.376

View more
  3 in total

1.  A Quantum-Mechanical Looking Behind the Scene of the Classic G·C Nucleobase Pairs Tautomerization.

Authors:  Ol'ha O Brovarets'; Alona Muradova; Dmytro M Hovorun
Journal:  Front Chem       Date:  2020-11-26       Impact factor: 5.221

2.  Novel mechanisms of the conformational transformations of the biologically important G·C nucleobase pairs in Watson-Crick, Hoogsteen and wobble configurations via the mutual rotations of the bases around the intermolecular H-bonds: a QM/QTAIM study.

Authors:  Ol'ha O Brovarets'; Alona Muradova; Dmytro M Hovorun
Journal:  RSC Adv       Date:  2021-07-27       Impact factor: 4.036

3.  The Research of G-Motif Construction and Chirality in Deoxyguanosine Monophosphate Nucleotide Complexes.

Authors:  Yanhong Zhu; Zhongkui Li; Pengfei Wang; Qi-Ming Qiu; Hongwei Ma; Hui Li
Journal:  Front Chem       Date:  2021-06-30       Impact factor: 5.221

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.