Literature DB >> 33324814

Development of a Cyclic Periodic Wave Function Approach for the Study of Infinitely Periodic Solid-State Systems.

Abstract

The ab initio cyclic periodic wave function (CPWF) approach is developed for the treatment of infinitely periodic systems. Using the full infinite Hamiltonian operator, as well as symmetrically identical basis set wave functions that preserve the translational symmetry of the electron density of the system, this approach can be applied at the Hartree-Fock level, or correlation can be directly included by the usual modes. In this approach, all many-body interactions are included, and no edge effects occur. Initial test calculations of the CPWF method at the ab initio Hartree-Fock level are performed on the chains of hydrogen fluoride molecules.

Entities: Chemical Disease Species

Year: 2020 PMID： 33324814 PMCID： PMC7726750 DOI： 10.1021/acsomega.0c04094

Source DB: PubMed Journal: ACS Omega ISSN： 2470-1343

Introduction

In the past, many models and approaches have been developed for the determination of the bulk chemical properties in solid-state systems, including some approaches which are used to study infinitely periodic molecular solids. The simplest and most popular models for studying infinitely periodic molecular solids are cluster models[1−5] and pair- potential, or N-body potential, methods.[6] The most common approaches to their studies are through the use of Bloch orbitals[7] and density function theory (DFT).[8,9] Since the 1970s, cluster models have been applied in many kinds of solids.[1−5,10] The assumption in cluster models is that the electronic structure of the infinite crystal can be reasonably simulated by performing high-level calculations on a system consisting of a few representative molecules that are geometrically selected to imitate the overall crystal structure, as shown in Figure . By systematically increasing the number of molecules in the clusters, it is hoped that the properties of the clusters will smoothly converge to those of the crystal, if a large enough cluster is chosen. In a review of cluster models, Fink et al.[11] pointed out that cluster termination (or edge) effects are the major deficiency of the approach, when a finite cluster is used to simulate an extended or infinite system. Adachi[12] concluded that systematically increasing the cluster size in the solid will eventually yield a large enough cluster to provide an adequate model for the bulk crystal. However, this approach requires great computational effort to obtain a reasonable representation of the system studied and reduce the edge effects. Zunger proposed the cyclic cluster model (CCM) in 1970s,[13−15] which places the atoms themselves at cyclic periodic positions in space. Jug et al. successfully implemented the CCM at different levels of calculations to various crystalline systems.[16−20] Janetzko et al.[21] applied the CCM to different covalent periodic systems, treated by the Kohn–Sham auxiliary density function theory, which successfully retained the symmetry of the corresponding periodic systems.

Figure 1

Three different cluster models labeled with solid line―, dashed line---, and dotted line.... respectively, for an imaginary periodic system.

Three different cluster models labeled with solid line―, dashed line---, and dotted line.... respectively, for an imaginary periodic system. In pair-potential or N-body potential methods, an important assumption[6] is that the intermolecular energies of N molecules can be sufficiently well approximated by the sum of interactions between N(N – 1)/2 individual pairs. This approach works very well when the interaction energies of three-body or higher-body interactions are negligible compared to the two-body interactions.[22−24] Raynor[4] showed that large oscillations in error occurred in studies of molecular hydrogen solid at high pressure as higher many-body terms were added. In addition, studies on solid molecular hydrogen showed that anisotropies become more important either at short intermolecular distance,[25] or at high pressure.[22] Although those anisotropies make the pair-potential methods more difficult to use, pair potentials are still being applied to model some solid-state systems.[26−30] More recently, Bučko et al.[31,32] successfully used a pair interactions model with a DFT approach to treat molecular crystals, which can be applied at any level of approximations. For the study of weak interactions in large molecular systems, it is necessary to simultaneously use quite large basis sets and reasonably high-order correlation corrections,[33−35] which is still very computationally demanding. To solve some of the problems involved with these systems, we have developed a very promising new ab initio method, called the cyclic periodic wave function (CPWF) method, which uses the complete infinite Hamiltonian for the infinite crystal and preserves the translational symmetry of the system in the wave function. In this paper, we will show the results of some initial test calculations of our method at the ab initio Hartree–Fock level for one-dimensional (1-D) systems. We will test and apply our CPWF approach at the AM1 and PM3 semiempirical levels of approximation to a series of two-dimensional (2-D) and three-dimensional (3-D) systems to demonstrate the viability and power of our approach, and the results will be presented in two subsequent papers.

Computational Methods

The novel ab initio cyclic periodic wave function (CPWF) approach uses the full infinite Hamiltonian operator for the crystal. First, the infinitely periodic system is divided into chemically equivalent repeat units, as demonstrated by Figure b. Figure a,b shows two different methods for choosing repeat units in an infinitely periodic line of molecules. Figure a places the cell boundaries in positions that truncate molecules, while Figure b places them between molecules. The cell boundaries in Figure b are such that the coefficients for the molecules at odd positions remain identical to those at even positions, no matter how many cell interactions are included in the calculation. The full infinite Hamiltonian operator for the system is then be written aswhere I and J denote individual repeat units, i and j denote electron labels, and A and B denote the atomic centers within repeat units I and J.

Figure 2

Two alternative unit cell definitions for a Bloch orbital model of an infinite zig-zag line of molecules. (a) Cell boundaries truncate molecules; (b) cell boundaries occur between molecules.

Two alternative unit cell definitions for a Bloch orbital model of an infinite zig-zag line of molecules. (a) Cell boundaries truncate molecules; (b) cell boundaries occur between molecules. To solve the Schrödinger equation for the infinitely repeating system, we will assume that each repeat unit in the perfect crystal is electronically identical to all others, differing only perhaps by one or more symmetry transformations. Since each repeat unit therefore has a chemical environment that is identical to that of every other repeat unit in the system, the electron density distribution near repeat unit I must be kept identical to that near repeat unit J. Thus, we will begin by defining the ground-state Slater determinant of repeat unit molecular orbitals to have the following formwhere the molecular orbitals (MOs) associated with repeat unit I must have the identical mathematical form as those associated with repeat unit J—i.e., the MOs on repeat unit I are simple translations, and perhaps also rotations or reflections, of those on repeat unit J. Thus, if we can determine the “best” form for the MOs on one repeat unit, we will have determined the best form for all repeat units in the infinite system. The assumption is that a single Slater determinant ground-state wave function can be readily relaxed by incorporating Møller–Plesset perturbation theory corrections to recover the correlation energy.[34,35] To create infinitely periodic molecular orbitals which preserve the translational symmetry in the system, we follow a similar prescription to that used by us in studies of infinitely periodic molecular and ionic solids,[4,36−40] but where the Hartree–Fock–Roothaan coefficients between neighboring repeat units are directly evaluated. The basis set used in CPWF is constructed from a set of cyclic periodic wave functions, instead of traditional Bloch orbitals. The principal difference stems from using the individual repeat units themselves in the complex exponential expansion, rather than the unit cell, as is shown in the equationNote that eq is dependent upon the assignment of position indices (J, J, J). Each repeat unit is first assigned a unique set of three independent position indices (J, J, J), from which the geographic position for the center of each unit can be determined. In eq , ϕ( is an atomic orbital located on the monomer at position (j, j, j). Note that this atomic orbital may be a linear combination of degenerate atomic orbitals, with the particular linear combination being determined by the orientation of the monomer at position (j, j, j), relative to its orientation at the (0,0,0) position. For example, suppose that the third and fourth atomic orbitals in the basis set on the monomer at position (0,0,0) are a p and a p orbital, respectively, and further suppose that the monomer in the (1,0,0) position is related to that at the (0,0,0) position by a clockwise rotation of 90° about the z-axis. Then, in order for the third and fourth atomic orbitals at (1,0,0) be symmetrically identical to the third and fourth ones at (0,0,0), they should be defined as followsNext, the repeat length N in each direction must be selected. Choosing N = 4 ensures that all of the interactions for repeat units within ±1 in that index direction are treated exactly, (these repeat units are referred to as nearest neighbors); choosing N = 6 ensures that all of the interactions for repeat units within ±2 in that index direction are treated exactly (these repeat units are referred to as next-nearest neighbors); and so on. Equation then ensures that the cyclic periodic basis set function, ψ, is N-fold periodic in the x-index, N-fold periodic in the y-index, and N-fold periodic in the z-index. Next, the periodic basis set is symmetrically orthogonalized to give a new orthonormal basis setwhere j references different atomic wave functions for the atoms inside cell (J, J, J), and k references atomic wave functions in cell (K, K, K). The elements of the inverse half-root overlap matrix, S–1/2, in eq are generated in the usual way,[36,41] i.e., from the eigenvalues and eigenvectors of the overlap matrix formed from the cyclic periodic wave functions defined in eq . The limits for J, J, and J are defined relative to K, K, and K, as followswith similar relations for the y- and z-summations. Finally, it is these functions that form the orthonormalized basis set that is used to determine the Hartree–Fock–Roothaan coefficients for the molecular orbitals. Thus, the ith MO for the repeat unit at position (I, I, I) are given bywhere K, K, and K are defined relative to I, I, and I, as followswith similar relations for the y- and z-summations. Because the full infinite Hamiltonian is used and the infinite translational symmetry is preserved in both the basis functions and the Hamiltonian, the infinite Fock matrix becomes block diagonal. Thus, we need only diagonalize the block representing the range of (K, K, K) = (0,0,0) to (N – 1, N – 1, N – 1), since the coefficients derived from it will be identical to those needed for all other blocks. Furthermore, even this block is further diagonalized to produce N × N × N individual blocks containing complex elements, as a consequence of the fact that all of the matrix elements between periodic basis functions at different index positions are zero:whereNow, redefining the second sets of summation indices as followswhere Δ varies from −((1/2)N – 1) to + ((1/2)N – 1), with similar ranges for Δ and Δ. Note that any matrix element involving an interaction that is further away than N/2 – 1 in any position index will be 0, we get the following form for eq Since f( ≡ f(0,0,0)→(Δ, and because each term in the first set of summations reduces to the following,with similar relations for the y- and z-summations. We find that the only nonzero terms in eq will be those involving center J = center I, leading to the following formwhere the f(0,0,0)→(Δ elements are the Fock matrix elements for the interaction of atomic orbital i on the monomer located at (0,0,0) with atomic orbital j on the monomer located at (Δ,Δ,Δ). Thus, the Hartree–Fock problem becomes a matter of finding the eigenfunctions of N × N × N individual matrices, each of which is an M × M matrix, where M is the number of atomic orbitals per monomer. However, the more efficient eigenvalue–eigenfunction routines are designed for matrices with real components only. By taking appropriate linear combinations of the cyclic periodic wave functions in eq , these matrices can be transformed into fully real ones. The linear combinations needed are illustrated below in the equationwith similar transformations for K and K. The largest block that can occur after these transformations becomes one that involves matrices with dimensions of 8 M × 8 M. However, for most systems, further symmetry relations aid in reducing the blocks still further, and the largest blocks that have occurred in any 3-D system that we have studied thus far had dimensions of 4 M. Thus, a Hartree–Fock calculation on the full infinite solid-state system is no more computationally intense than doing a few calculations on the isolated monomer, with a basis set 4 times that used for the lone monomer. Because the translational symmetry of the infinitely periodic crystal has been preserved in the chosen form for the wave functions, the final total electron density at each repeat unit will be identical to that at every other repeat unit. Furthermore, the total energy of the system becomes partitioned into identical components from each repeat unit. Thus, each repeat unit contributes an identical energy to the total infinite system. In the familiar terms of standard closed-shell Hartree–Fock calculations, the energy of the Ith repeat unit is given bywhere H and F are evaluated using the wave functions in eq , and density matrix elements P are as followsand where the D coefficients are those for the orthogonalized components to the Hartree–Fock coefficients,The summation of J = “nearest”, or “next nearest” in eq , means summations relative to I. For example, if I = 000, then J nearest would involve all terms with J = −1 or +1, J = −1 or +1, and J = −1 or +1, etc. The EMad term given in eq is the Madelung correction term that arises as a consequence of applying our method for correcting the imbalanced attractive and repulsive contributions that arise when truncated summations occur.[40] Since we include all intercharge terms in the infinite system into this term, it also acts to recover some of the electrostatic interactions that are missing as a result of using truncated sums. Our method converges more quickly than does the more commonly used Delhalle method.[42,43] Specifically, our method subtracts the average of the first terms in the multipole expansions of the Coulomb and nuclear attraction integrals from the integrals themselves, as they are added into the H and F matrices. The infinite sum of all such multipole expansions is then added to both the Fock matrix elements and the overall energy. These terms have the following formswhere qA is the Mulliken charge on atom A and whereOur process has two beneficial effects—it prevents the occurrence of unbalanced contributions from attractive or repulsive terms, and it accounts at least partially for any long-range charge–charge interactions that were not directly included in the calculation. Thus, even though finite values are used for the repeat lengths, N, N, and N, which limits the number of intermolecular interactions that are included exactly into the matrix elements, the effects from neighbors further out are at least partially recovered. Thus, our method is far less sensitive to the sizes chosen for the repeat lengths, N, N, and N, than would otherwise be expected. Finally, using cyclic periodic wave functions, it is a straightforward process to determine basis set superposition errors (BSSE) using the Boys–Bernardi counterpoise method.[44] Since for each choice of the repeat length parameters (N, N, N), only certain specific monomers are explicitly included in the matrix elements, the calculation of the counterpoise correction to the energy of the isolated monomer in each case should contain exactly the same interactions in their matrices. Thus, for example, if N is 4, then all atomic orbitals on positions with I = −1, 0, and 1 would be used in the basis set expansion relative to an isolated monomer at the position (0,0,0) (i.e., with I = I = I = 0), but integrals involving |ΔI| > 1 would be set equal to 0. These integrals would then be partially recovered, in exactly the same way as for the infinite system, by adding in the Madelung correction terms for these interactions. Thus, the BSSE-corrected interaction energy per monomer would be determined as followswhere EI is the energy per repeat unit for the infinite system, as given in eq , ECPI is the counterpoise correction to the isolated repeat unit’s energy, calculated as described above, and ΔEdefI is the deformation energy, i.e., the energy necessary to deform the isolated repeat unit from its equilibrium geometry to its new geometry in the infinite system. Finally, it should be noted that although the basis set wave functions are cyclically periodic, the positions of the repeat units are not. Thus, the physical positions of each repeat unit will be translationally periodic, i.e., the molecules (and their associated atomic orbitals) are placed at their expected physical positions within the crystal.

Results and Discussion

To gain some understanding of how sensitive the interaction energies in hydrogen-bonded systems will be to the choice of the repeat length parameter, we decided to investigate two models involving the strongest hydrogen bond—that between hydrogen and fluorine in hydrogen fluoride (HF), and perform ab initio calculations on chains of hydrogen fluoride molecules. The first model is an infinite linear chain of hydrogen fluoride molecules shown in Figure a, and the second one is the single infinite HF catemer chain shown in Figure b. Our goal is to examine how different choices of the repeat length parameter, N, affect the predicted hydrogen-bond energy in a one-dimensional system.

Figure 3

Models of HF molecules. (a) Simple linear chain model of HF molecules; (b) single infinite HF catemer chain model.

Models of HF molecules. (a) Simple linear chain model of HF molecules; (b) single infinite HF catemer chain model. It has been clearly demonstrated that the accuracy of hydrogen-bond interactions is strongly influenced by the quality of the basis set used and by the degree of correlation included.[4,39,45,46] Slater-type orbitals are known to have better long-range properties than Gaussian-type orbitals, and the trends in energy as a function of repeat length N were more important to us than the attainment of high accuracy. There are some very successful software packages that have been developed using DFT so far, such as the GAUSSIAN program package,[47] the Vienna ab initio package (VASP)[48−52] for the study of molecular solid systems,[53,54] and the most recent CRYSTAL[55,56] and CRYSCOR[57,58] programs. In our case, a software package is available that applies our cyclic periodic wave function method using Slater orbitals, so we decided to employ a simple minimum basis set of Slater orbitals for our test calculations.

Effect of Repeat Index Variation on the Linear Chains of HF Molecules

The first set of calculations was performed on a very simple model: an infinite linear chain of hydrogen fluoride molecules, as illustrated in Figure a. Our purpose in choosing this model was twofold: (i) to investigate the effect of different choices for the repeat length parameter, N, on the calculated hydrogen-bond energies, and (ii) to compare the efficiency of our approach when compared to cluster-type calculations. For these initial calculations, we kept the molecular H–F bond distance fixed at 1.80 bohr (0.950 Å), which was the optimal distance found for the isolated HF molecule. First, we investigated the effect on the H-bond energy of varying the F···F intermolecular distance, using three different values for N:N = 4, which includes nearest-neighbor interactions exactly; N = 6, which also includes next-nearest neighbors; and N = 8, which includes next-next-nearest neighbors, as well. In addition to varying the values of N, we also investigated the importance of the Madelung correction term EMad in eq . Note that all of the curves which included this term provide a significant lowering of the overall energy per monomer, as shown in Figure . Since this term requires very little computation time, all further discussion will be for those curves which explicitly include the calculation of EMad. As shown in Figure , the results for N = 6 and 8 are nearly indistinguishable from one another, predicting nearly identical values for the optimal F···F intermolecular distance and for the H-bond energy. Although the N = 4 calculations predict noticeably weaker H-bond energies than the other two, they recover most of the intermolecular H-bond energy (about 80%). The N = 4 results also predict a slightly longer F···F intermolecular distance, but it differs by only about 0.1 Å from the other two. Thus, even for a system with interaction energies nearly as strong as covalent ones, the energy converges very rapidly as the length parameter, N, is increased.

Figure 4

Effect of N, and the Madelung correction terms on the calculated H-bond energies, ΔHHBond vs the intermolecular distance RF···F. All dashed lines are without Madelung correction, and all solid lines are with Madelung correction. Lines with diamond symbols are for N = 4, lines with circle symbols are for N = 6, and lines with square symbols are for N = 8, respectively. Next, we used the same infinite linear chain model to compare the trends in the repeat parameter, N, to the trends found using a truncated chain approach. For the cluster calculations, we calculated the H-bond energy in two different ways. The first corresponds to the difference between the average energy per monomer in the truncated chain and the energy of a lone monomerwhere E is the total energy of the N-mer and E1 is the energy of the isolated molecule. The second method views an N-mer as being composed of (N – 1)-mer plus a monomer and one hydrogen bond. The average contribution per hydrogen bond then becomeswith E and E1 are defined as above. Figure shows the effect of cluster sizes on the predicted hydrogen-bond energy convergence for the cluster models. Also shown are the results of our cyclic periodic wave function method, with N = 4, 6, and 8. Since N = 4 corresponds to exact inclusion of nearest-neighbor interactions, its value is plotted as corresponding to the amount of computational efforts on a two-membered cluster. Similarly, our results for N = 6 and 8 are plotted as corresponding to three- and four-membered clusters, respectively. What is clear from Figure is that our cyclic periodic wave function method converges far more rapidly to the infinite limit than do the cluster models. Also, it is clear that ΔE2 converges more rapidly than ΔE1 for the cluster models. However, regardless of the method used to determine it, the convergence of the cluster models is extremely slow when compared to the convergence of the cyclic periodic wave function method.

Figure 5

Comparison of the computational efforts for the H-bond energy convergence vs the cluster length. For the truncated chain approaches, the x-axis represents the number of molecules in the truncated chain. For the cyclic periodic approach, the x-axis represents the number of nearest neighbors explicitly included in the matrix elements. The line with square symbols shows the values of ΔE1 for the truncated chain models, eq , the line with diamond symbols shows the values of ΔE2 for the truncated chain models, eq , and the line with circle symbols shows the calculated H-bond energies from the cyclic periodic model. The dashed line indicates the infinite limit for all methods. The importance of rapid convergence becomes even more apparent when we consider the effect on computation time. Two factors strongly affect the amount of time needed to perform a Hartree–Fock calculation on a system: the number of unique Coulomb-type integrals that must be evaluated and the size of the Fock matrix that must be diagonalized. As a measure of the first factor, we have plotted both the ΔE2 data and the cyclic periodic wave function data as a function of the number of unique Coulomb integrals that had to be calculated. These plots are shown in Figure . Here, we can clearly see the benefits of using a model that preserves the symmetry of the infinite system. For the cluster models, a relatively large increase in computation time produces very little improvement in convergence. This is because in the cluster models, each molecule in the cluster occupies a different chemical environment. This asymmetry greatly increases the complexity of the calculation.

Figure 6

Comparison of the calculated energy error as a function of the number of unique Coulomb integrals that must be evaluated. The line with diamond symbols is for values of ΔE2 for the truncated chain models; the line with circle symbols is for the calculated H-bond energies from the cyclic periodic model. The dashed line shows the infinite limit for all methods. It is easy to compare these approaches with regard to the second effect, Fock matrix size. For a cluster model, the Fock matrix will have a dimension of NM × NM, where M is the number of basis functions used on each monomer and N is the number of monomers in the cluster. Since the time needed to diagonalize matrices is on the order of n3 for an n × n matrix, these times will rise very rapidly with cluster size. In our approach, we will need to perform several diagonalizations of smaller Fock matrices. For a one-dimensional chain of identical molecules (linear or zig-zag), we will need to diagonalize the following: two sets of M × M matrices and [(1/2)N – 1] sets of 2 M × 2 M matrices. Thus, increasing the repeat length does not affect the sizes of the matrices that are diagonalized—it only affects how many such matrices need to be diagonalized. As a result, the time dependence for the cluster models will be proportional to (NM)3, but it will be proportional to only 16[2N – 3]M3 for the cyclic periodic wave function approach. This means that performing a Hartree–Fock calculation on an infinite system is only moderately more time consuming than performing the calculation on an isolated molecule.

Results for Fully Optimized Catemers at SCF and Møllet–Plesset Levels

Since our method is wave function-based, we are able to directly determine correlation corrections using Møllet–Plesset perturbation theory. These corrections are demonstrated using our second model system, which replicates more closely the geometry of HF in the solid state. In the solid state, HF crystallizes in staggered chains,[59] as illustrated in Figure b. Using our cyclic periodic wave function approach, we studied the effect of varying the repeat length parameter, N, on the H-bond energy in a single infinite HF catemer chain, calculated with and without correlation. We once again employed a minimum basis set of Slater orbitals for these calculations. Since the experimental FHF angle is nearly linear at 176°,[59] we assumed that this angle remained exactly linear and optimized the H–F and F···F bond lengths, as well as the FFF angles, at the Hartree–Fock level. We then used our best geometries while calculating the Møllet–Plesset perturbation theory corrections through MP4 for N = 4 and 6, and through MP3 for N = 8. Further corrections for N = 8 did not seem justified, since agreement was so close between the other N = 8 results with the N = 6 ones. Our results for these studies are summarized in Table , where we report both the total energies for each calculation as well as the predicted hydrogen-bond energy at each level of calculation. It should be noted that although our bond angles are in excellent agreement with the experimental values of 116 and 120°,[59] our final bond lengths are in rather poor agreement. In the crystal, the F–H bond length is found to be approximately 0.95 Å and the H···F distances are found to be around 1.53 Å.[59] Our results appear to be converging on nearly equal distances of 1.10 Å for both interactions. This is most likely an artifact of using a very incomplete basis set. The shortening of the distances is probably due to the system attempting to bring all of the basis functions closer together so that the electronic structure around the intramolecular bonds can be better described. However, it is the trends as a function of N that are our real concern with this model, and for these, we see that all three sets predict very similar geometries. For the energies, we have tabulated in parentheses the % errors relative to the MP4 results for N = 6. Here, we see a very interesting trend as we add correlation. First, we see that the N = 6 and 8 results are nearly identical to one another with the N = 4 results predicting slightly weaker hydrogen bonds. Most interesting is the trend as higher-order perturbations are included. The MP2 level seems to overcorrect toward predicting more stable H-bonds relative to the Hartree–Fock results, with the latter actually showing smaller errors than the MP2 values. The MP3 level values are all less than 1 kcal different from the final MP4 values. Thus, the greatest errors are seen at the MP2 level. Clearly, care must be taken when applying the Møllet–Plesset perturbation theory corrections. For systems where relatively strong interactions occur between monomers, if any correlation corrections are to be made using the Møllet–Plesset perturbation theory, caution must be exhibited before accepting corrections calculated only at the MP2 level.

Table 1

Calculated Geometries and Energies of HF Catemer

	N_x = 1	N_x = 4	N_x = 6	N_x = 8
geometries
R_FH (Å)	0.942	1.016	1.080	1.080
R_H···F (Å)		1.185	1.080	1.085
∠F···F···F (degree)		110.4	115.3	114.0
energies (hartrees)
E_SCF	–99.531767	–99.554606	–99.557580	–99.558156
ε_MP2	–0.018435	–0.023157	–0.021758	–0.021057
ε_MP3	–0.005930	–0.003219	–0.002376	–0.002205
ε_MP4 - singles	–0.001996	–0.000426	–0.000170	–0.000065
ε_MP4 - doubles	–0.000116	0.000065	0.000329	0.000338
ε_MP4 - triples	0	–0.000514	–0.000263	–0.000142
ε_MP4 - quadruples	0	–0.000325	–0.000281
ε_MP4 - total	–0.002112	–0.001200	–0.000385
ΔE_INT (kcal)
SCFa		14.33 (−4%)	16.20 (+8%)	16.56 (+11%)
MP2a		17.29 (+15%)	18.28 (+22%)	18.20 (+22%)
MP3a		15.59 (+4%)	16.05 (+7%)	15.87 (+6%)
MP4a^,b		15.02 (0%)	14.97 (0%)	(14.46) (−3%)

% errors are given in parentheses after the ΔEINT values, relative to the MP4(N = 6) value of 14.97 kcal.

MP4 values were not calculated for N = 8, and thus that total is placed within parentheses.

% errors are given in parentheses after the ΔEINT values, relative to the MP4(N = 6) value of 14.97 kcal. MP4 values were not calculated for N = 8, and thus that total is placed within parentheses. Since our method is based upon a closed form for the wave functions, it is a straightforward process to add in correlation through the use of the Møller–Plesset perturbation theory. For the FH···F interactions in our model studies, we found that, although the MP2 level corrections led to a correction to the hydrogen-bond energies of around 1 kcal/mol, truncating the process at the MP2 level leads to considerably larger errors than those seen in the Hartree–Fock results. The optimal level appears to be at the MP3 level for these systems, since at all levels of calculation, these results were all well within 1 kcal of the MP4 results.

Conclusions

There are several advantages to using our cyclic periodic wave function (CPWF) approach for the treatment of infinitely periodic systems. First, this is a wave function-based method. Thus, it can be applied at any level of approximation—from semiempirical to full ab initio, with or without the inclusion of correlation and with any desired basis set. Second, it is a chemically intuitive approach. The repeat units are chosen to be either the individual molecules within molecular solids or the individual monomer units within polymers,[60,61] and are placed at their exact translationally periodic positions, as found in the infinitely periodic crystal, without any built-in periodic boundary conditions (PBCs). For chemists, it is a more intuitive way to study large organic crystals. In this way, the CPWF approach differs from the cyclic cluster model (CCM), which places the atoms or molecules at cyclic periodic positions in space, and our approach is also different from all of the methods that apply PBCs. The zeroth-order wave function corresponds to a single isolated molecule or monomer unit. Increasing the periods for the repeat lengths (N, N, N), then corresponds to the inclusion of exact interactions with all nearest neighbors, when N = 4, and with all next-nearest neighbors as well, when N = 6, etc., and the ground-state wave function would ultimately converge to the exact Hartree–Fock ground state in the limit of N = ∞. Longer-range interactions are at least partially recovered by the use of our Madelung correction scheme,[40] which corrects both the individual Fock matrix elements as well as the total energy by approximately recovering all interactions between farther neighbors.[40] We have found that using N = 4 suffices for most solid-state systems,[60,61] as long as the intermonomer interactions are weaker than covalent interactions. Even when covalent interactions occur between monomers, we have found that N = 4 often suffices, though using N = 6 in the direction of the covalent interaction will provide greater accuracy for some systems. Without the inclusion of the Madelung corrections, higher values for the repeat length would be necessary to obtain equivalent accuracy. Next, the electron density is treated as being infinitely periodic. Thus, bulk properties of solid-state systems or infinite polymers are directly calculable from the contribution of any single repeat unit in the system. Since every repeat unit will contribute an identical electron density as every other repeat unit, the chemical equivalence of the repeat units is maintained. Note that this is not the case for calculations that use a cluster approach. In cluster models, there will be many different chemical environments within the cluster, making it difficult, if not impossible, to determine contributions from any particular unit within the cluster. The other advantage is that the number of integrals which must be calculated and the dimensions of matrices which must be diagonalized are tremendously reduced from those necessary using either cluster approaches or N-body approaches. Because our wave functions use individual molecules or monomers as the repeat unit, instead of the unit cell of the crystal,[60,61] and because our wave functions are cyclically periodic, instead of translationally periodic, our method is origin independent and does not depend in any way upon how the repeat unit is chosen. The CPWF approach builds the symmetry relations between identical molecules or monomers directly into the wave functions, ensuring that the electron density at each individual molecule within the infinite system is identical. An extension of this approach allows accurate inclusion of delocalization, delocalized systems can also be treated with the same method. Although the final equations were not derived here for DFT calculations, our wave function in eq can be used for DFT studies, as well as for ab initio and semiempirical calculations. Finally, the CPWF approach is applied at the AM1 and PM3 semiempirical levels of approximation to study different three-dimensional infinitely periodic solid-state systems stabilized by different kind of weak interactions between repeat units, all intra- and intermonomer geometry parameters are optimizable in this CPWF method.[60,61] Thus, our approach leads to complete flexibility in the determination of the three-dimensional geometry of the solid-state system.

11 in total

1. H-bonding cooperativity and energetics of alpha-helix formation of five 17-amino acid peptides.

Authors: Robert Wieczorek; J J Dannenberg
Journal: J Am Chem Soc Date: 2003-07-09 Impact factor: 15.419

2. Ab initio molecular-dynamics simulation of the liquid-metal-amorphous-semiconductor transition in germanium.

Authors:
Journal: Phys Rev B Condens Matter Date: 1994-05-15

3. CRYSCOR: a program for the post-Hartree-Fock treatment of periodic systems.

Authors: Cesare Pisani; Martin Schütz; Silvia Casassa; Denis Usvyat; Lorenzo Maschio; Marco Lorenz; Alessandro Erba
Journal: Phys Chem Chem Phys Date: 2012-02-15 Impact factor: 3.676

4. Improved description of the structure of molecular and layered crystals: ab initio DFT calculations with van der Waals corrections.

Authors: Tomáš Bučko; Jürgen Hafner; Sébastien Lebègue; János G Ángyán
Journal: J Phys Chem A Date: 2010-11-04 Impact factor: 2.781