Sebastian Mai1, Brennan Ashwood2, Philipp Marquetand1, Carlos E Crespo-Hernández2, Leticia González1. 1. Institute of Theoretical Chemistry, Faculty of Chemistry, University of Vienna , Währinger Strasse 17, 1090 Vienna, Austria. 2. Center for Chemical Dynamics and Department of Chemistry, Case Western Reserve University , 10900 Euclid Avenue, Cleveland, Ohio 44106, United States.
Abstract
The solvatochromic effects of six different solvents on the UV absorption spectrum of 2-thiocytosine have been studied by a combination of experimental and theoretical techniques. The steady-state absorption spectra show significant shifts of the absorption bands, where in more polar solvents the first absorption maximum shifts to higher transition energies and the second maximum to lower energies. The observed solvatochromic shifts have been rationalized using three popular solvatochromic scales and with high-level multireference quantum chemistry calculations including implicit and explicit solvent effects. It has been found that the dipole moments of the excited states account for some general shifts in the excitation energies, whereas the explicit solvent interactions explain the differences in the spectra recorded in the different solvents.
The solvatochromic effects of six different solvents on the UV absorption spectrum of 2-thiocytosine have been studied by a combination of experimental and theoretical techniques. The steady-state absorption spectra show significant shifts of the absorption bands, where in more polar solvents the first absorption maximum shifts to higher transition energies and the second maximum to lower energies. The observed solvatochromic shifts have been rationalized using three popular solvatochromic scales and with high-level multireference quantum chemistry calculations including implicit and explicit solvent effects. It has been found that the dipole moments of the excited states account for some general shifts in the excitation energies, whereas the explicit solvent interactions explain the differences in the spectra recorded in the different solvents.
Thiobases are a class
of nucleobase analogues, molecules that are
structurally similar to the canonical nucleobases found in DNA. They
differ from the canonical nucleobases only by the substitution of
carbonyl oxygen atoms by sulfur atoms, and this chemical similarity
makes the thiobases important nucleobase analogues. For instance,
thiobases are widely used in biological and medical research to investigate
the genetic code[1] and in chemo-[2−4] and antiviral therapies.[4,5] 2-Thiocytosine (2tCyt),
as well as other thiobases, is naturally found in bacterial t-RNA.[6−8] In addition, the photochemical properties of thiobases
are exploited for applications such as photolabeling,[9,10] photoaffinity probing of nucleic acid structures and DNA/RNA-protein
interactions,[10,11] or photochemotherapeutic applications.[12−15]Two major characteristics explain the photochemical behavior
of
thiobases.[16,17] First, the absorption spectra
of the thiobases are significantly red-shifted compared to the ones
of the canonical nucleobases. Second, UVA excitation of each thiobase
leads to near unity population of long-lived triplet states through
ultrafast intersystem crossing,[16] contrasting
the efficient relaxation mechanisms exhibited by the canonical nucleobases.[18,19] The remarkably different photochemical properties of the thiobases
compared to the nucleobases have recently been rationalized as due
to the differential stabilization of key regions of the excited-state
potential energy surfaces upon sulfur substitution.[17]The particular microscopic environment that the thiobases
encounter
in solution, or when incorporated into DNA or RNA, is expected to
significantly affect the energies of the electronic states and their
absorption properties. This sensitivity of the electronic states on
the surroundings makes thiobases interesting candidates for photolabeling
agents. In this regard, it is desirable to obtain a detailed understanding
of the solvatochromic properties of thiobases, i.e., how the solvent
hydrogen bonding, polarity, or polarizability actually contribute
to the shifts of the main absorption bands. This knowledge is also
important for interpreting the absorption shifts in a biological environment,
which is less polar than many solvents,[20] but where hydrogen bonds often play a crucial role.Of particular
relevance to this work, the absorption spectrum of
2tCyt is only known in aqueous solution,[16,17,21−24] and only recently its lowest-energy
absorption band was assigned to particular electronic states.[17,24] On the contrary, a notable number of publications report ab initio
calculations on the tautomeric ratio of 2tCyt in gas phase, solid
phase, and aqueous solution.[25−32]In this contribution, we specifically investigate the effect
of
polar aprotic and protic solvents on the absorption spectrum of 2tCyt.
We present steady-state absorption spectra in six different solvents:
ethyl acetate (EtOAc), acetonitrile (ACN), dimethyl sulfoxide (DMSO),
ethanol (EtOH), methanol (MeOH), and aqueous phosphate-buffered saline
solution (H2O). This set of solvents excludes low-polarity
solvents, due to the low solubility of 2tCyt in such solvents. Moreover,
other solvents in which 2tCyt is soluble absorb in the same spectral
range as 2tCyt, and therefore were also excluded from the study. The
energetic positions of the absorption maxima are subjected to solvatochromic
analyses[33−35] to decompose the solvent shifts into contributions
from different solvent effects. The experimental results are complemented
with high-level multistate complete active space second order perturbation
theory (MS-CASPT2) calculations, which provide a rigorous assignment
of the absorption bands and rationalize the influence of the aprotic
and protic solvents on the vertical electronic transitions. The knowledge
gained is fundamental to understand the excited-state dynamics and
photochemistry of 2tCyt when incorporated into DNA or RNA, and to
evaluate its prospective use as a photolabeling agent.
Methods
Experimental
Methods
2-Thiocytosine (Sigma-Aldrich,
97%), sodium dihydrogen phosphate (Sigma-Aldrich, 99.0%), and sodium
hydrogenphosphate (Sigma-Aldrich, 99.0%) were used as received. Phosphate
buffered saline solution was prepared using 0.15 g of sodium dihydrogenphosphate and 0.27 g of sodium hydrogen phosphate dissolved in 200
mL of ultrapure water to give pH 7.4 aqueous phosphate buffer with
a total phosphate concentration of 16 mM. All other solvents were
used as received: acetonitrile (Fisher Scientific, 99.9%), methanol
(Fisher Scientific, 99.9%), ethanol (Fisher Scientific, 99.8%), dimethyl
sulfoxide (Sigma-Aldrich, 99.9%), and ethyl acetate (Fisher Scientific,
99.9%).The steady-state absorption spectra were recorded using
a Cary 100 Bio UV–vis Spectrophotometer (Varian) in 1 cm optical
path quartz cells (Starna Cells, Inc.) and were corrected for the
background signal of the solvent.
Computational Methods
The ab initio calculations performed
correspond to three main tasks: (i) the evaluation of the most stable
tautomer, (ii) high-level reference computations of the vertical excitation
energies in vacuum, and (iii) vertical excitation calculations considering
implicit and explicit solvent effects of the six solvents used in
the experiments.For the first task, namely the evaluation of
the most stable tautomer of 2tCyt in vacuum and in solution, we performed
RI-MP2/cc-pVTZ[36−38] optimizations/frequency calculations and RI-CCSD(T)/cc-pVTZ
single point calculations. At this stage, we considered only implicit
solvent effects, by means of the COSMO (conductor-like screening model)[39] solvation model, as implemented in Orca 3.0.[40]For the second task, i.e., the high-level
reference calculation
in vacuum, we performed MS-CASPT2(14,10)/ANO-RCC-VQZP computations
including 9 electronic states,[41−43] based on state-averaged (SA9)
complete active space self-consistent field (SA-CASSCF) wave functions.
The active space, shown in Figure , contains the full π system (8 orbitals) plus
the lone pairs of sulfur and the aromatic nitrogen atom (nS and nN in Figure ). The Douglas–Kroll–Hess
(DKH) Hamiltonian,[44] Cholesky decomposition,[45] and the default IPEA shift[46,47] of 0.25 a.u. were employed, but no level shift was applied. These
calculations were performed with MOLCAS 8.0.[48] The geometry for this reference gas-phase calculation was calculated
with RI-MP2/cc-pVQZ using Orca 3.0.[36−38]
Figure 1
Active space orbitals
of 2tCyt with naming convention.
Active space orbitals
of 2tCyt with naming convention.For the third task, explicit and implicit solvent effects
were
included in the vertical excitation computations. This was accomplished
by placing three to five solvent molecules close to 2tCyt (microsolvation)
in positions that facilitate hydrogen bonding, mimicking the hydrogen
bond network reported by Podolyan et al.[31] For EtOAc, ACN, and DMSO, solvent placement considered only the
possible acceptor–donorhydrogen bonds, leading to the inclusion
of three solvent molecules close to the N1–H and
NH2 groups. For EtOH, MeOH, and H2O, the positions
of the solvent molecules also considered donor–acceptor hydrogen
bonding, leading to two more solvent molecules near the N3 and S atoms. These structures (i.e., 2tCyt plus solvent molecules)
were then optimized at the BP86/aug-cc-pVDZ[37,38,49] level of theory, using COSMO to account
for the effect of the bulk solvent. These optimizations were performed
with Orca 3.0,[40] except for the structure
microsolvated with ACN, which was optimized with Turbomole 7.0,[50] whose optimizer better handled the linear ACN
molecules. Due to the choice of the positions of the solvent molecules,
these optimized geometries favor hydrogen bonding over dipolar or
dispersion interactions, which is justified in view that 2tCyt is
readily soluble in protic solvents.Using the optimized, microsolvated
geometries, we performed SA9-CASSCF(14,10)/cc-pVDZ[37,38] calculations followed by MS-CASPT2(14,10)[41] calculations for the vertical excitation energies. The effect of
the bulk solvent was added by means of PCM (polarizable continuum
model)[51−53] in the dielectric version implemented in MOLCAS 8.0
(see Table S1 in the Supporting Information (SI) for the employed solvent parameters).
The PCM charges were first equilibrated with the ground state density
in a single-state CASSCF computation, whereas in the SA calculations
only the dynamical part of the reaction field was adapted to the electron
density (nonequilibrium calculation). Instead of employing the SA
density,[52] we performed 9 independent SA-CASSCF
+ MS-CASPT2 calculations (i = 1, ..., 9), such that
the density of state i was used in calculation i (RASSI state overlaps were used to check that no root
flipping occurred). Finally, only the energies and oscillator strengths
(from the perturbatively modified CASCI wave functions[41]) of state i were taken from
calculation i, while the other states were neglected.
In this way, each state’s density is in equilibrium with the
dynamical part of the reaction field, and it is not necessary to apply
a state-specific correction as, e.g., in ref (52).Since the microsolvated
structures involve relatively many atoms
and PCM-MS-CASPT2 is quite expensive, it was necessary to decrease
the size of the basis set used in these calculations to double-ζ.
As discussed in the SI (Figure S1), the smaller basis set leads to a decrease of all
excitation energies by about 0.17 eV, but otherwise produces energies
in close agreement with the quadruple-ζ basis set, with a standard
deviation of only 0.08 eV. Although this standard deviation is larger
than some shifts between similar solvents (e.g., EtOH and MeOH, see
below), it can be assumed that the shifts due to the basis set are
similar in all solvents, leading to error cancellation in the solvent
effects. For these reasons, we are confident that the reduced basis
set size does not affect the conclusions from our solvation computations.
In the calculations, the DKH Hamiltonian, Cholesky decomposition,
and the active space shown in Figure were employed. The IPEA shift was set to zero, as
this is expected to provide better results in combination with a double-ζ
basis set.[47] To avoid intruder states,
an imaginary level shift[54] of 0.3 a.u.
was needed. MOLCAS 8.0[48] was employed for
the MS-CASPT2 calculations.The so-obtained excitation energies
are here directly compared
to the experimental absorption maxima, or the energetic positions
of fitted Gaussians (see below). This is a simplification, because
the vibrational broadening can induce systematic shifts of the absorption
maximum relative to the vertical excitation energy, which were reported
to be on the order of 0.1–0.3 eV.[55] However, under the assumption that in the different solvents these
systematic shifts are similar, our main conclusions concerning the
solvatochromic effects should still hold.
Experimental Results
Stationary
Absorption Spectrum
Figure shows the absorptivity spectra of 2tCyt
in the six solvents considered, with the wavelengths and absorptivities
given in Table . The
spectra show an intense absorption maximum at 269–286 nm (4.3–4.6
eV), with a residual absorption tail extending to approximately 330–350
nm (3.7–3.9 eV), depending on the solvent used. A second absorption
band with a maximum at 233–242 nm (5.1–5.3 eV) can be
observed in ACN, EtOH, MeOH, and H2O. In addition, the
protic solvents (EtOH, MeOH, H2O) give rise to a shoulder
in the 210–218 nm (5.6–5.9 eV) range. These observations
generally agree with the absorption spectra previously reported in
H2O and ACN.[21,23,24]
Figure 2
Absorptivity
spectra of the lowest-energy absorption bands of 2tCyt
in the considered solvents, with aprotic solvents in (a) and protic
solvents in (b). The spectra of EtOAc and DMSO was not recorded at
wavelengths shorter than ca. 250 nm due to strong solvent absorption.
Table 1
Experimental Wavelengths
λ (nm),
Energies E (eV), and Absorptivities (M–1cm–1) of Absorption Maxima and Shoulders in Figure
first
maximum
second maximum
shoulder
solvent
λ
E
λ
E
λ
E
EtOAc
286
4.33
16870
ACN
283
4.38
16700
233
5.32
11910
DMSO
285
4.35
18030
EtOH
280
4.43
17900
242
5.12
14220
∼210
5.90
6500
MeOH
278
4.46
17770
242
5.12
14840
∼212
5.84
7050
H2O
269
4.61
17950
242
5.12
18030
∼218
5.68
9000
Absorptivity
spectra of the lowest-energy absorption bands of 2tCyt
in the considered solvents, with aprotic solvents in (a) and protic
solvents in (b). The spectra of EtOAc and DMSO was not recorded at
wavelengths shorter than ca. 250 nm due to strong solvent absorption.A number of solvatochromic trends can be observed
in the spectra,
especially when going from polar aprotic solvents to polar protic
solvents. On the one hand, the lowest-energy maximum is blue-shifted
by 17 nm in going from EtOAc (286 nm) to H2O (269 nm),
while the molar extinction coefficient remains approximately constant.
On the other hand, the high-energy maximum is red-shifted by 9 nm,
from 233 nm in ACN to 242 nm in H2O, with significant narrowing
and an increase in the molar extinction coefficient. In addition,
a shoulder develops in the protic solvents, located at approximately
210 nm in the alcohols and at 218 nm in H2O. Finally, the
absorption tail at low energies blue-shifts in protic solvents, especially
in H2O, as has been observed in other thiobases.[16]Figure presents
least-squares fits of a sum of five Gaussians to the experimental
absorption spectra. This number of Gaussians was found to satisfactorily
describe the four observed spectral features (the absorption tail,
two maxima, one shoulder) and additionally the absorption at around
200 nm, which is the edge of the recorded spectral range. For EtOAc
and DMSO, only three Gaussians were necessary, because the spectral
features below 250 nm are concealed by the absorption of these solvents.
The relevant equation, the fit parameters obtained (intensities, central
energies, full width at half-maximum, and corresponding fitting errors
of the five Gaussians, Table S2), and the
residuals from the fits (Figure S2) are given in the SI.
Figure 3
Absorptivity spectra of 2tCyt in various solvents (black lines)
and decompositions of the spectra into a sum of Gaussians g1 to g5.
Absorptivity spectra of 2tCyt in various solvents (black lines)
and decompositions of the spectra into a sum of Gaussians g1 to g5.As shown in Figure , in all solvents the absorption spectrum of 2tCyt
is dominated by
the lowest-energy absorption band, which is well described by a single,
intense Gaussian (g2). The energetic position
of this Gaussian notably blue-shifts when going from the least polar
solvent, EtOAc, to the most polar solvent, H2O. In general,
it appears that the absorption tail (described by g1) follows the shifts of the lowest-energy absorption
maximum, i.e., the tail blue-shifts when going from the less polar
solvents to H2O; however, the behavior of g1 is slightly more complex than the one of g2. In particular, the spectrum in ACN shows the most pronounced
absorption tail, extending to 360 nm (chosen cutoff wavelength where
the absorptivity is still 200 M–1cm–1), whereas the tails in EtOAc, DMSO, MeOH, and EtOH extend to 350
nm and that in H2O to only 330 nm. According to the Gaussian
fits, the absorption tails in ACN and DMSO show the strongest absorption
(largest intensities for g1), whereas
the tail is weaker in the other four solvents. We note here that this
complex behavior might be partially because it is difficult to unambiguously
fit g1—in some cases, an almost
equally good fit could be obtained by simultaneously blue-shifting,
increase of width, and increase of intensity, so that the shape of
the absorption tail is approximately conserved.The decomposition
of the high-energy absorption band in Figure for ACN, EtOH, MeOH,
and H2O shows that the high-energy absorption band and
its shoulder can be described well with two Gaussians (g3 and g4), although the fitting
uncertainties were slightly larger than for g1 and g2. The position and intensity
ratio of g3 and g4 depends on the solvent, especially in EtOH and MeOH, where
the g4 component becomes very small.
Solvatochromic Analyses
We performed solvatochromic
analyses using three widely employed linear solvent effect scales,
proposed by Catalán,[33] Kamlet and
Taft,[34] and Reichardt,[35] respectively. Because the Catalán method is the
most recent one and—through its four parameters—allows
decomposing the solvent shifts into different contributions, we focus
our discussion mostly on this solvatochromic method. The employed
numerical data (absorption energies, solvent parameters, see Tables S3–S6) and the results of the three
analyses are given in the SI.The
solvatochromic analyses were carried out for the energies of the first
absorption maximum and absorption tail in the different solvents.
We focus on these two spectral features because the underlying transitions
play key roles in the photophysics of 2tCyt, but also because in DMSO
and EtOAc the high-energy band is overshadowed by the strong absorption
of the solvents below 250 nm and thus we cannot determine the energetic
position of this band.In the Catalán analysis, the solvent-dependent
energy shifts
of the absorption bands are linearly related to four solvatochromic
parameters, which Catalán calls the solvent acidity SA, solvent
basicity SB, solvent dipolarity SdP, and solvent polarizability SP.[33] Out of the 15 linear regression models possible
with four parameters, the statistically most significant (p < 0.001) models for the absorption maximum energies
are “only SA”: Emax = (4.
34 ± 0.01) + (0.24 ± 0.02) SA with R2 = 0.96, and “SA+SB”: Emax = (4. 39 ± 0.02) + (0.21 ± 0.02) SA + (−0.09
± 0.03) SB with R2 = 0.99. Including
the SdP and SP parameters in the models did not lead to statistically
significant improvements of the fits. These results show that the
solvent acidity and basicity (SA and SB) alone can successfully describe
the solvatochromic shifts of the lowest-energy absorption maximum
observed in Figure , whereas solvent dipolarity and polarizability do not have a notable
effect. However, it should be remarked that the maximum position could
also be explained solely with the solvent acidity.For the red
absorption tail, the linear regression models based
on the Catalán parameters do not lead to conclusive results.
The best-performing model, “only SA”, gives R2 = 0.81. Models with more parameters were not
found to be statistically significant. As the magnitudes of the absorption
coefficients for the red tail are small and also the solvent data
set is limited, we conclude that it is unwarranted to perform further
solvatochromic analyses for the red absorption tail.The Kamlet–Taft
analysis (see SI) showed that the most
relevant solvent parameter for the absorption
maximum is the hydrogen bond donor acidity α,[56] in agreement with the Catalán analysis. The two
other Kamlet–Taft parameters, solvent polarizability and hydrogen
bond acceptor basicity, seem to only play a minor role for the solvent
shifts of the absorption maximum. The Reichardt ETN parameter
(see SI), which is mostly related to solvent
polarity and acidity,[57] was found to correlate
well with the absorption maximum energies, however, being a single-parameter
model it does not allow identifying the most influential solvent property.
Computational Results
Next, we present computational results
to identify the lowest-energy
tautomer in each of the different solvents, to assign the deconvoluted
absorption spectra in Figure , and to rationalize the observed solvatochromic behavior.
Tautomer
Ratios
Table shows the relative energies of different tautomers
(displayed in Figure ) of 2tCyt in vacuum and in the six solvents used in this work. Interestingly,
the amino-thiol form with its two rotamers (B and C) is the most stable
tautomer in the gas phase, where other tautomers are at least 5 kcal/mol
higher in energy. Conversely, the 1H-amino-thion tautomer is the most
stable form in all the solvents considered here and is predicted to
be the only tautomer in ACN, DMSO, EtOH, MeOH, and water. However,
it is possible that the amino-thiol form may be present in a few percent
when using EtOAc as solvent. The vast abundance of the 1H-amino-thion
form is due to its large permanent dipole moment (7.3 D in vacuum,
according to CCSD(T)/cc-pVTZ), which leads to a large stabilization
in both the polar protic and polar aprotic solvents studied. These
results agree satisfactorily with the findings reported in the literature,[25,26,29,31] which consistently predict the 1H-amino-thion tautomer to be the
most stable one in water. Based on our findings, we focus our attention
on how the vertical excitation energies for the 1H-amino-thion tautomer
(simply referred to as 2tCyt hereafter) are affected by these solvents.
Table 2
Relative Energies (kcal/mol) and Permanent
Dipole Moments (Debye) of 2tCyt Tautomers (Shown in Figure ) in Different Solventsa
A
B
C
D
E
vacuum
5.0
0.0
0.2
5.6
12.1
EtOAc
0.0
2.1
2.1
5.1
6.2
ACN
0.0
4.6
4.6
6.8
4.9
DMSO
0.0
4.8
4.7
6.9
4.9
EtOH
0.0
4.3
4.3
6.6
4.9
MeOH
0.0
4.5
4.5
6.8
5.0
H2O
0.0
5.0
4.9
7.1
4.8
μvac
7.3
3.6
4.3
5.0
8.3
Level of theory: CCSD(T)/cc-pVTZ//MP2/cc-pVTZ
+ COSMO.
Figure 4
Structures of the investigated
tautomers of 2tCyt, with ring atom
numbering.
Level of theory: CCSD(T)/cc-pVTZ//MP2/cc-pVTZ
+ COSMO.Structures of the investigated
tautomers of 2tCyt, with ring atom
numbering.
Vertical Excitation Energies
in Vacuum
Table compiles the gas-phase vertical
excitation energies, oscillator strengths, and dipole moments for
the 1H-amino-thion tautomer of 2tCyt obtained at the MS-CASPT2/quadruple-ζ
level of theory. The four lowest excited singlet states are excitations
from the thiocarbonyl group—two states with nSπ* character (S1 and S3) and two states with πSπ*
character (S2 and S4); refer to Figure for the involved orbitals. Higher-lying states originate
from excitation from the nitrogen lone pair (nNπ*) and from excitation within the pyrimidine π
system (ππ*) according to the MS-CASPT2 calculations.
Based on the calculated oscillator strengths, the most intense transitions
are the πSπ* (S4) and ππ* (S6, S8) states, with energies of 4.44 eV (279 nm), 5.60 eV
(221 nm), and 6.10 eV (203 nm), respectively. The lowest πSπ* state (S2), at 3.74 eV
(331 nm), shows weaker absorption than the other ππ* states,
with an oscillator strength that is an order of magnitude smaller
than that of the S4 state. The nSπ* and nNπ* states are dark (i.e., oscillator strength smaller than
0.01) according to the calculations.
Table 3
Vertical
Excitation Energies, Oscillator
Strengths, and Wave Function Characters of the Lowest Eight Singlet
States of 2tCyta
state
E (eV)
fosc
char.
μ (D)b
S0
0.00
GS
7.5
S1
3.65
0.001
nSπ*
2.9
S2
3.74
0.055
πSπ*
3.2
S3
3.98
0.000
nSπ*
1.7
S4
4.44
0.657
πSπ*
6.3
S5
5.24
0.003
nNπ*
4.8
S6
5.60
0.292
ππ*
7.7
S7
5.78
0.005
nNπ*
9.1
S8
6.10
0.139
ππ*
9.7
MS(9)-CASPT2(14,10)/ANO-RCC-VQZP//RI-MP2/cc-pVQZ
(in vacuum).
Dipole moments
from SS-CASPT2.
MS(9)-CASPT2(14,10)/ANO-RCC-VQZP//RI-MP2/cc-pVQZ
(in vacuum).Dipole moments
from SS-CASPT2.Table also reports
the norms of the permanent dipole moment vectors of the excited states,
which are expected to influence the energy shifts in solution. Based
on these data, the states S1 to S5 are all less polar than the ground state of
2tCyt. On the contrary, the higher state S6 has approximately the same dipole moment as the ground state, and
the S7 and S8 states are more polar than the ground state.
Solvation Effects on the
Computed Vertical Excitations
We now present the results
of the MS-CASPT2 calculations incorporating
solvent effects. The optimized microsolvated geometries are shown
in Figure , giving
the positions of the explicit solvent molecules. Based on the optimized
structures, a significant solvent effect on the ground-state equilibrium
geometry of 2tCyt was observed. Most importantly, the S=C2–N3=C4–N moiety
undergoes bond length alteration in polar solvents: the C2=S (+0.05 Å) and N3=C4 (+0.03
Å) bonds become longer, whereas the N1–C2 and C4–Namino bonds become shorter
(−0.03 Å); see Table S7 in
the SI for more information. The effects
of these geometric changes on the excitation energies will be discussed
below.
Figure 5
Optimized geometries of 2tCyt microsolvated with different number
of solvent molecules, as depicted. Optimized with BP86/aug-cc-pVDZ+COSMO.
For coordinates, see the SI.
Optimized geometries of 2tCyt microsolvated with different number
of solvent molecules, as depicted. Optimized with BP86/aug-cc-pVDZ+COSMO.
For coordinates, see the SI.Table presents
the results of the computed vertical excitations incorporating both
explicit and implicit solvent effects. The table also includes (in
column “vac”) calculations in vacuum with otherwise
identical settings for comparison (note that these values are thus
different from those in Table ; see Figure S3 for a comparison
of the two vacuum computations). A number of trends can be observed
in these results. The three lowest excited states S1 (nSπ*), S2 (πSπ*), and S3 (nSπ*) are significantly
blue-shifted in solution, but their oscillator strengths remain small
and are weakly affected by solvation. The oscillator strength of the S2 (πSπ*) state is modulated
between 0.01 and 0.03, but without any particular trend. These modulations
are due to slightly different mixing of the S2 with the very bright S4, which
leads to different extent of intensity borrowing. The bright S4 (πSπ*) state is blue-shifted
from 4.3 eV (288 nm) in vacuum to 4.6 eV (269 nm) in water and its
oscillator strength is reduced from 0.7 to 0.4. The nNπ* states (S5 and S7) are also slightly blue-shifted but remain
dark in all solvents. Finally, the bright ππ* states (S6 and S8) red-shift
and become brighter when going from polar aprotic to polar protic
solvents.
Table 4
Vertical Excitation Energies and Oscillator
Strengths of 2tCyt in Various Solvents at the MS-CASPT2 Level of Theorya
state
char.
vac
EtOAc
ACN
DMSO
EtOH
MeOH
H2O
energies (eV)
S1
nSπ*
3.46
3.93
4.01
4.01
4.33
4.36
4.50
strong up
S2
πSπ*
3.47
3.93
3.99
4.04
4.21
4.24
4.23
strong up
S3
nSπ*
3.77
4.32
4.42
4.51
4.94
4.97
5.12
strong up
S4
πSπ*
4.31
4.42
4.49
4.46
4.58
4.59
4.60
up
S5
nNπ*
5.07
5.29
5.37
5.42
5.76
5.78
5.72
up
S6
ππ*
5.17
4.84
4.88
4.72
4.81
4.82
4.81
down
S7
nNπ*
5.66
5.51
5.59
5.52
5.80
5.81
5.79
up
S8
ππ*
5.70
5.25
5.30
5.15
5.07
5.08
5.14
down
oscillator strengths
S1
nSπ*
0.00
0.00
0.00
0.00
0.00
0.00
0.02
S2
πSπ*
0.02
0.01
0.02
0.02
0.03
0.03
0.01
S3
nSπ*
0.00
0.00
0.00
0.01
0.02
0.01
0.02
S4
πSπ*
0.69
0.59
0.58
0.56
0.48
0.47
0.40
darker
S5
nNπ*
0.00
0.00
0.00
0.00
0.00
0.00
0.02
S6
ππ*
0.24
0.27
0.31
0.23
0.24
0.23
0.31
brighter
S7
nNπ*
0.00
0.01
0.01
0.01
0.01
0.01
0.01
S8
ππ*
0.16
0.33
0.28
0.39
0.46
0.50
0.45
brighter
Note that for all columns the
same MS-CASPT2 settings (MS-CASPT2(14,10)/cc-pVDZ+PCM // BP86/aug-cc-pvdz+COSMO)
were used and therefore the values given here for “vac”
differ from the ones in Table .
Note that for all columns the
same MS-CASPT2 settings (MS-CASPT2(14,10)/cc-pVDZ+PCM // BP86/aug-cc-pvdz+COSMO)
were used and therefore the values given here for “vac”
differ from the ones in Table .
Discussion
In this section, we discuss the solvatochromic effects on the lowest-energy
absorption maximum and the assignment of the experimental transitions
in the absorption spectra of 2tCyt based on the results obtained from
the vertical excitation energies calculated at the MS-CASPT2 level
of theory.
Experimental Absorption Spectra
The results of the
three solvatochromic analyses, using the Catalán,[33] Kamlet–Taft,[34] and Reichardt[35] solvent scales, indicate
that mainly the solvent acidity determines the energetic position
of the first absorption maximum. This finding explains the large differences
between aprotic and protic solvents displayed in Figure . On the contrary, the solvent
polarizability or hydrogen bond acceptor strength do not seem to affect
the absorption energy significantly.
Theoretical Absorption
Spectra in Vacuum
We now discuss
the high-level reference excited-state calculations of 2tCyt in vacuum
and compare the results to the experimental absorption spectra. From
the onset, the calculated tautomer energies in Table clearly show that the 1H-amino-thion form
should be the most abundant tautomer present in all the solvents investigated.
This finding considerably simplifies the assignment of the absorption
bands in the spectrum of 2tCyt. In vacuum, the tautomeric equilibrium
shifts to the amino-thiol form, but since we did not record an absorption
spectrum in the gas phase, this tautomer is not relevant to our analysis.
Still, the vacuum calculations for the 1H-amino-thion tautomer alone
should provide a general representation of the set of relevant excited
states and their properties, while simultaneously helping to scrutinize
the accuracy of the calculations that take into consideration explicit
and implicit solvent effects.The vacuum MS-CASPT2 calculations
(Table ) show that
only some particular states have significant oscillator strengths
and contribute to the lowest-energy absorption bands in Figure . These states either show
dominant involvement of the π orbital
on the sulfur atom (S2, S4) or of the π orbitals localized on the pyrimidine
ring (S6, S8). Together, these states should give rise to two absorption maxima,
one near 4.4 eV (S4) with a tail (S2), and another near 5.6–6.1 eV (S6, S8) in the gas
phase.The calculations also yielded the norms of the permanent
dipole
moment vectors of the excited states (see Table ), which are expected to influence the energy
shifts when going from vacuum to solution. In particular, states with
a dipole moment larger than the one of the ground state (about 7.5
D) should red-shift in polar solvents, whereas states with smaller
dipole moments will blue-shift. For 2tCyt, the states S1 to S5 are all less polar
than the ground state, implying that these states should be shifted
to higher energies in polar solvents. Interestingly, this also includes
the bright ππ* (S4) state, even though states of ππ* character
are usually red-shifted in polar solvents. On the contrary, the higher
states S6 to S8 are more polar than the ground state and can be expected to red-shift
upon solvation.Despite the fact that the reference calculation
does not include
solvent effects, the transition energies (4.4 eV, 5.6 eV, 6.1 eV)
compare—perhaps coincidentally—well with the absorption
spectra recorded in EtOAc, ACN, or DMSO, which present a band centered
at 4.3–4.4 eV with a tail and a broad band at 5.1–5.4
eV. The protic solvents perturb the electronic transitions to a larger
extent than ACN, and hence the calculations performed in vacuum do
not compare well with the experimental absorption spectra in these
solvents. This observation is consistent with the solvatochromic analyses,
which showed that solvent acidity plays a large role in the solvent
shifts.
Theoretical Absorption Spectra in Solution
In order
to better understand the influence of solvation on the absorption
spectra, here we disentangle the different effects of the solvent
on the vertical excitation energies of 2tCyt presented in Table . We consider the
following effects: (i) change of molecular geometry, (ii) change of
dielectric constant ϵ, (iii) change of index of refraction n (which is related to the dielectric constant at high frequencies
ϵ∞ = n2), and
(iv) explicit solute–solvent interactions through hydrogen
bonds. Figure shows
the results of the decomposition analysis. The four panels of the
figure are organized such that the four effects are switched on one
by one, in the above order, to yield the full solvation calculations
in the last panel.
Figure 6
Decomposition of the solvatochromic shifts. In (a), the
excitation
energies and oscillator strengths (see color bar at the top) are shown
for a linear interpolation in internal coordinates (LIIC) scan from
the vacuum geometry to the geometry in H2O (taken from Figure f). In (b), at the
latter geometry the dielectric constant ϵ is varied from 1 (vacuum)
to 80 (H2O), with the refractive index kept at n = 1. In (c), for each individual solvent the calculations
considered the actual optimized geometries, ϵ, and n, but no explicit solvent molecules (data in Table S7). In (d), additionally explicit solvation (as described
above) is considered (data in Table ).
Decomposition of the solvatochromic shifts. In (a), the
excitation
energies and oscillator strengths (see color bar at the top) are shown
for a linear interpolation in internal coordinates (LIIC) scan from
the vacuum geometry to the geometry in H2O (taken from Figure f). In (b), at the
latter geometry the dielectric constant ϵ is varied from 1 (vacuum)
to 80 (H2O), with the refractive index kept at n = 1. In (c), for each individual solvent the calculations
considered the actual optimized geometries, ϵ, and n, but no explicit solvent molecules (data in Table S7). In (d), additionally explicit solvation (as described
above) is considered (data in Table ).In Figure a, we
investigate the effect of the structural changes from the equilibrium
geometry in vacuum to the equilibrium geometry in aqueous solution
(the optimized geometry from Figure f). The geometry in water was employed because the
optimized geometries in the other solvents are located somewhere between
the vacuum geometry and the water one. The changes observed are mainly
related to bond length alteration of the S=C2–N3=C4–N moiety (see Figure for atom numbering). The panel
shows that the geometric change of 2tCyt alone leads to a strong red-shift
of the S1, S2, and S3 states, a weak red-shift of
the S4 and S5 states, virtually no shift of the S6 and S7 states, and a blue-shift of the S8 state. These shifts are fully opposite to
what is observed in the experiment and in the full solvation calculations
(as shown in Table ). Thus, the geometric relaxation of 2tCyt due to solvation alone
cannot explain the trends in the experimental spectra. Instead, it
is counteracted by the interactions of the excited-state dipole moments
with the solvent, as well as by specific solute–solvent interactions.In Figure b, we
modify the dielectric constant ϵ used in the PCM model from
1 (vacuum) to 80 (water). Because for this panel we use the geometry
optimized in water, the leftmost points of panel (b) are identical
to the rightmost points of panel (a), with both panels forming a continuous
scan. The increase of ϵ leads to the expected result that less
polar states (especially S1, S2, and S3) are shifted to
higher energies, whereas the polar 1ππ* states
are shifted to lower energies. This shows that part of the solvatochromic
shifts observed experimentally originate from the interaction of the
solute dipole moment with the solvent. Following the dependence of the electrostatic potential
in PCM,[51,58] most of this shifting effect already occurs
at small ϵ between 1 and 10, whereas a further increase of ϵ
virtually does not affect the energies. As most of the solvents considered
here have an ϵ above 20, this figure shows that the solvatochromic
effects on the excited states do not stem from the interaction between
solvent dielectricity and state polarity.In Figure c, we
consider three solvent effects: geometry change, the dielectric constant,
and additionally the refractive index. For each of the six solvents,
in this panel we employ the actual optimized geometry in each solvent
(not the one in water, as in panels (a) and (b)), the actual value
of ϵ and the actual refractive index n. However,
explicit solvent molecules are not included in panel (c). Interestingly,
it can be seen that the excitation energies and oscillator strengths
are fairly similar for all six solvents, with the exception of the
less polar EtOAc solvent. As the experimental excitation energies
and intensities vary considerably more than the results in panel (c),
it can be concluded that the geometric relaxation and excited-state
polarities are not the main causes for the differences observed between
the solvents.Finally, in panel (d) of Figure we consider for each solvent all implicit
and explicit
solvent effects (geometry, ϵ, n, and hydrogen
bonds). It can be seen that the inclusion of the explicit solvent
effects through microsolvation leads to significant differences in
the excitation energies in the different solvents. In particular,
the unpolar states are shifted to higher energies by large amounts
in the polar protic solvents, whereas the bright states are less affected
and move closer together. These findings agree with the results of
the solvatochromic analysis, which also showed that the explicit solvent
effects (hydrogen bonding) dominate the solvent absorption shifts
in 2tCyt.
Solvatochromic Effects in 2tCyt
The experimental and
computational results can now be put together to directly assign the
features in the absorption spectra to the different electronic states.
The most probable candidate for the electronic state responsible for
the low-energy absorption maximum is the bright S4 (πSπ*) state, based on the fact
that its energy, oscillator strength, and solvent shifts all match
with this absorption maximum. The only disagreement was found in the
solvent-dependence of the band’s intensity, with the experiment
showing similar absorptivities in all solvents, whereas the calculations
predict a lowering of the oscillator strength of S4 in polar solvents. Interestingly, the S4 is an example of a bright ππ* state that blue-shifts in polar solvents, which is an unusual behavior
for a ππ* state of a nucleobase analogue.
A reason for this behavior is the fact that the S4 dipole moment is smaller than the ground state dipole
moment of 2tCyt. Additionally, based on the results in Figure explicit solvent interactions
are likely also involved in this blue-shift.Likewise, we assign
the second absorption maximum around 5.1–5.4 eV to the two ππ* states localized on the pyrimidine ring
(S6 and S8), again based on a good match of the excitation energies, intensities,
and solvent shifts. These two states show the typical behavior of
a red-shift upon solvation in polar solvents, due to their very large
dipole moments. Here, the calculations agree with experiment in that
the band’s intensity increases with increasing solvent polarity.
Although the assignment of the weak high-energy shoulder cannot be
done unambiguously, it may be associated with the upper ππ* state (S8), since both move to lower
energies in polar solvents.The low-energy residual absorption
tail can be at least partially
attributed to the lowest ππ*
(S2) and nπ* (S1) states,
which in protic solvents is strongly blue-shifted and therefore becomes
buried under the strong absorption of S4, in line with the experimental observations. However, given the
uncertainties in fitting the low-energy residual absorption next to
the much brighter S4 absorption and the
phenomenological nature of the employed Gaussian deconvolution, we
refrain from a more thorough analysis of the absorption tail. An alternative
explanation for the absorption tail would be that it is due to the
vibrational progression of the S4 absorption,
and that the different solvents lead to a different extent of this
progression.Generally, the excitation energy shifts from the
vacuum calculation
to the calculations including solvents can be explained partially
by the dipole moment of the excited states, with less polar states
being shifted to higher energies and more polar states to lower energies
(compare the dipole moments in Table ). The change of the ground state geometry in solution
partially counteracts these shifts, but does not strongly affect the
observations. More importantly, the specific solvent–solute
interactions through hydrogen bonds appear to be responsible for the
different shifts among the investigated solvents.
Comparison
with Cytosine
2tCyt exhibits solvatochromic
trends roughly similar to cytosine (Cyt), which was theoretically
investigated recently.[52] For both nucleobases,
the two lowest nπ* states are destabilized
by about 1 eV in going from vacuum to water. The third nπ* state is also predicted to significantly blue-shift in Cyt, but
the destabilization is less drastic for 2tCyt. For the fourth nπ* state, the blue-shift is small (0.1–0.3
eV) in both nucleobases. These similarities in the solvatochromic
trends also extend to the ππ* states.
For Cyt, both the first and second ππ* states are slightly (0.1 eV) destabilized in going from vacuum
to water. For 2tCyt, the lowest ππ* is
more blue-shifted (0.8 eV) while the second ππ* state is only slightly destabilized (0.3 eV). The third and fourth ππ* states of Cyt and 2tCyt are red-shifted
in going from vacuum to water, where the shifts are larger for 2tCyt
than for Cyt.Differences in the solvatochromic properties arise
in the oscillator strengths of the ππ* states. In 2tCyt, the lowest ππ* state
is relatively dark (0.01–0.03) with no observable solvent trend,
while that of Cyt is bright and brightens in going from vacuum (0.14)
to water (0.28). The second ππ* state
is bright for both molecules and darkens in going from vacuum to water.
The third ππ* state of Cyt brightens
while that of 2tCyt darkens. The fourth ππ* state brightens for both 2tCyt and Cyt. These different behaviors
are probably due to different wave function characters of the Cyt
vs 2tCyt states, and especially the participation of the two lowest
π* orbitals.Based on these results, it appears that 2tCyt
and Cyt exhibit similar
solvatochromic trends, where the lower-lying excited states tend to
blue-shift (e.g., the lowest ππ* and
the two lowest nπ* states) and only the higher ππ* states red-shift in polar protic solvents.
Another molecule related to 2tCyt and recently investigated theoretically
is 5-methylcytidine, for which the first ππ* and nπ* states were shown[59] to blue-shift when going from THF/ACN to water, which is
in line with the behavior of 2tCyt and Cyt.
Conclusions
In this contribution, we investigated the solvatochromic effects
on the absorption spectrum of 2-thiocytosine (2tCyt) in six different
solvents: ethyl acetate, dimethyl sulfoxide, acetonitrile, ethanol,
methanol, and water. The absorption spectra generally show an intense
absorption band at 269–286 nm, with an absorption tail extending
to 330–360 nm, depending on solvent. In acetonitrile, ethanol,
methanol, and water, a second absorption maximum of comparable intensity
was observed at 233–242 nm, with a weak shoulder discernible
at 210–218 nm.In order to identify the excited states
responsible for the absorption
bands and to explain the experimentally observed solvatochromic shifts,
we used three popular solvatochromic scales and performed multistate
complete-active-space perturbation theory computations for the only
relevant tautomer of 2tCyt, the 1-H-amino-thion form. These calculations
included both explicit and implicit solvent effects on the electronic
structure of 2tCyt, by considering up to five solvent molecules of
the first solvation shell and by means of a polarizable continuum
model, respectively. The computations showed that, generally, the
polarity of the solvent induces shifts in the excitation energies,
with the unpolar states S1 to S5 being shifted to higher energies, and the
more polar states S6 to S8 shifted to slightly lower energies. More importantly,
the specific solute–solvent interactions through hydrogen bonds
are mostly responsible for the differences in the absorption spectra
in the different solvents. This finding is consistent with Catalán
and Kamlet–Taft solvatochromic analyses of the lowest-energy
absorption band, which showed that the solvatochromic parameter most
relevant for this band is the hydrogen bond donor acidity of the solvent.
Collectively, these findings imply that the absorption spectrum of
2tCyt responds sensitively to its microscopic environment, a finding
that should aid in prospective applications of 2tCyt as a DNA/RNA
photolabeling agent.
Authors: R L SINSHEIMER; J F SCOTT; J R LOOFBOUROW; B BEST; R HASTINGS; E VOLLMER; M WESTERGAARD Journal: J Biol Chem Date: 1950-11 Impact factor: 5.157
Authors: Sebastian Sinnecker; Arivazhagan Rajendran; Andreas Klamt; Michael Diedenhofen; Frank Neese Journal: J Phys Chem A Date: 2006-02-16 Impact factor: 2.781
Authors: Bob-Dan Lechner; Paul Smith; Beth McGill; Skye Marshall; Jemma L Trick; Andrei P Chumakov; Charles Peter Winlove; Oleg V Konovalov; Christian D Lorenz; Peter G Petrov Journal: Membranes (Basel) Date: 2022-08-24