Literature DB >> 34898215

Stochastic Generalized Active Space Self-Consistent Field: Theory and Application.

Oskar Weser¹, Kai Guther^1,2, Khaldoon Ghanem¹, Giovanni Li Manni¹.

Abstract

An algorithm to perform stochastic generalized active space calculations, Stochastic-GAS, is presented, that uses the Slater determinant based FCIQMC algorithm as configuration interaction eigensolver. Stochastic-GAS allows the construction and stochastic optimization of preselected truncated configuration interaction wave functions, either to reduce the computational costs of large active space wave function optimizations, or to probe the role of specific electron correlation pathways. As for the conventional GAS procedure, the preselection of the truncated wave function is based on the selection of multiple active subspaces while imposing restrictions on the interspace excitations. Both local and cumulative minimum and maximum occupation number constraints are supported by Stochastic-GAS. The occupation number constraints are efficiently encoded in precomputed probability distributions, using the precomputed heat bath algorithm, which removes nearly all runtime overhead of GAS. This strategy effectively allows the FCIQMC dynamics to a priori exclude electronic configurations that are not allowed by GAS restrictions. Stochastic-GAS reduced density matrices are stochastically sampled, allowing orbital relaxations via Stochastic-GASSCF, and direct evaluation of properties that can be extracted from density matrices, such as the spin expectation value. Three test case applications have been chosen to demonstrate the flexibility of Stochastic-GAS: (a) the Stochastic-GASSCF [5·(6, 6)] optimization of a stack of five benzene molecules, that shows the applicability of Stochastic-GAS toward fragment-based chemical systems; (b) an uncontracted stochastic MRCISD calculation that correlates 96 electrons and 159 molecular orbitals, and uses a large (32, 34) active space reference wave function for an Fe(II)-porphyrin model system, showing how GAS can be applied to systematically recover dynamic electron correlation, and how in the specific case of the Fe(II)-porphyrin dynamic correlation further differentially stabilizes the 3Eg over the 5A1g spin state; (c) the study of an Fe4S4 cluster's spin-ladder energetics via highly truncated stochastic-GAS [4·(5, 5)] wave functions, where we show how GAS can be applied to understand the competing spin-exchange and charge-transfer correlating mechanisms in stabilizing different spin-states.

Entities: Chemical

Year: 2021 PMID： 34898215 PMCID： PMC8757470 DOI： 10.1021/acs.jctc.1c00936

Source DB: PubMed Journal: J Chem Theory Comput ISSN： 1549-9618 Impact factor: 6.006

Introduction

Multiconfigurational Self Consistent Field (MCSCF) methods are well-established approaches in quantum chemistry to investigate the electronic structures of systems featuring strong electron correlation effects, and are characterized by highly multireference wave functions. MCSCF wave functions are written as linear combinations of electronic configurations, which can for example be Slater determinants (SDs) or spin-adapted configuration state functions (CSFs). The many-body wave function is then optimized to minimize the CI energy, while the molecular orbitals are self-consistently optimized under the mean field of the CI wave function. MCSCF approaches represent a highly flexible strategy that can easily be adapted to a large variety of challenging chemical systems. The Complete Active Space Self Consistent Field (CASSCF) method, is a popular MCSCF approach.[1−4] In CASSCF a number of important orbitals, n, usually around the frontier orbitals, and their N electrons are selected to form the active space. Doubly occupied and empty orbitals not included in the active space form the inactive and the virtual spaces, respectively. All possible electronic configurations are generated, compatibly with spin and space symmetry, by distributing the N active electrons among the n active orbitals, a CAS(N, n). While the CI coefficients are obtained via exact or approximated schemes for the CI Hamiltonian diagonalization, the orbitals are variationally optimized via inactive–active, active–virtual, and inactive–virtual orbital rotations. CAS is conceptually simple because only one active space has to be selected. However, the size of the CAS wave function exponentially grows with the size of the active space, and the computational costs of conventional diagonalization techniques[5−7] quickly reach their practical limits for CAS(18, 18) wave functions. For most chemical systems, the full CI expansion in the active space is unnecessarily large, since CAS wave functions are generally sparse, mostly containing “deadwood”, that is electronic configurations with vanishingly small CI amplitudes.[8−10] Various methods exist that attempt to exclude deadwood from the CI optimization step, either via a sparse wave function representation, via a user preselection of truncated CI expansions, via an on-the-fly selection of the important electronic configurations, or by exploiting redundancies within the many-body wave function. One example is the Full Configuration Interaction Quantum Monte Carlo (FCIQMC) algorithm[108−14] that takes advantage of the sparsity of the wave functions, and deadwood is not (or rarely) processed and stored along the FCIQMC optimization procedure. FCIQMC is a projective method that stochastically propagates the imaginary-time Schrödinger’s equation to solve the CI-problem. Apart from being a sparse CI-eigensolver it can be near-linearly parallelized to benefit from modern hardware. The use of FCIQMC as the CASSCF CI-eigensolver within the Super-CI framework, termed Stochastic-CASSCF,[15] was developed in our group and has been applied with great success to circumvent the active space size limits of conventional CASSCF.[15−18,97] Examples of methodologies where truncated CI spaces are preselected include the generalized valence bond approach,[19] constrained-CASSCF (CCASSCF),[20] quasi-CASSCF (QCASSCF),[21] restricted-CI (RCI),[22,23] restricted active space self-consistent field (RASSCF),[24,25] the occupation restricted multiple active spaces self-consistent field (ORMAS-SCF) method,[26] and the generalized active space self-consistent field (GASSCF) approach.[24,27−30] In CCASSCF the active space is partitioned into several subspaces with a fixed number of particles per subspace; the method is formulated in a basis of CSFs. QCASSCF works like CCASSCF but is formulated in a basis of Slater determinants. RAS wave functions are defined using three active subspaces, commonly labeled RAS1, RAS2, and RAS3, with RAS1 containing doubly occupied orbitals, RAS3 containing empty orbitals, and RAS2 containing orbitals with occupation numbers ranging from 0 to 2. The maximum number of holes in RAS1 and the maximum number of particles in RAS3 are used as restrictions to define the configuration interaction space. In the ORMAS-SCF method, implemented in the GAMESS-US chemistry software package,[31] several active spaces are chosen; all intraspace excitations are allowed while the number of interspace excitations are restricted by local minimum and maximum occupation numbers per active subspace. The corresponding CI problem is solved in the Slater determinant basis, relying on the Slater–Condon rules. The similar concept of generalized active space (GAS) was introduced by Jeppe Olsen already in 1988. In 2011, the GAS approach was coupled to the Super-CI algorithm within the (Open)Molcas chemistry software package[32,33] for the variational orbital relaxation, leading to GASSCF.[27] As in ORMAS, the truncated GAS wave functions are built by selecting a number of active subspaces, and imposing constraints at the level of the interspace excitations. However, GASSCF differs from ORMAS-SCF in a number of aspects; most notably, in GAS interspace excitation constraints are enforced via cumulative minimum and maximum occupation numbers, instead of the local constraints of the ORMAS scheme, and a spin-adapted basis of CSFs is used in the GAS method, relying on the Graphical Unitary Group Approach (GUGA).[34] GAS-like truncated CI wave functions have also been implemented in the Molpro package.[35] GAS restrictions can be used to exclude deadwood configurations and to reduce the computational costs while retaining highly accurate multireference predictions. This strategy was adopted in the 2011 work and applied to the dissociation curve of the Gd2 dimer and to the study of the relative stability of two energetically low-lying spin states of the Oxo-Mn(salen) complex.[27] The GAS strategy can also be applied to investigate the role of specific electron correlation mechanisms, by removing electronic configurations that are relevant to describe those correlation pathways. This strategy was undertaken in our group to quantify the effect of the correlation enhanced π-backdonation in Fe(II)-porphyrins,[18] to understand correlation effects in corner-sharing cuprates, and to investigate the effect of a novel combined approach based on localization, site ordering permutations and GUGA.[36,97] Selected CI methods are another class of MC techniques that attempt to circumvent the exponential scaling limitation by selecting the important electronic configurations on the fly using automated heuristics. These methodologies heavily rely on the Slater–Condon rules and are generally bound to a Slater determinant basis.[37−45] Another notable strategy that reduces the exponential scaling limitation is the Density Matrix Renormalization Group (DMRG) theory.[46−56] In this work we introduce a flexible Stochastic-GAS method, that stochastically optimizes truncated GAS wave functions expanded in the Slater determinant many-body basis, based on the FCIQMC algorithm. In one of our earlier works, we introduced a prototype Stochastic-GAS implementation that supported only disconnected GAS subspaces, in that similar to the QCAS strategy, and successfully applied it to an Fe(II)-porphyrin model system,[18] to probe the effect of the correlation enhanced σ-donation/π-backdonations on the basis of a large CAS(32, 34) active space.[16] The GAS algorithm here described also supports interspace excitations that can be restricted by both cumulative and local minimum and maximum occupation numbers constraints, as in the conventional GASSCF method,[27] and in ORMAS-SCF,[26] respectively. In Stochastic-GAS, occupation number constraints (local or cumulative) are embedded within the precomputed heat bath (PCHB) excitation generation.[57] Our algorithm does not incur runtime overhead to adhere to the GAS constraints, instead they are automatically accounted for by precalculated heat bath probability distributions. Moreover, the Stochastic-GAS dynamics automatically benefits from another recent development in FCIQMC, the adaptive shift with an offset,[58,59] that greatly improves the convergence with respect to walker numbers. Stochastic CAS, QCAS, RAS, and equivalently uncontracted multireference configuration interaction (MRCI) wave functions are special cases of the GAS strategy; thus, they are promptly available by an appropriate choice of the GAS subspaces and corresponding constraints. Our efficient implementation of the Stochastic-GAS method, using hybrid parallelization, the GAS-PCHB excitation generator, and the adaptive shift has allowed, for example, uncontracted stochastic-MRCISD calculations with up to 96 electrons and 159 orbitals and a large (32, 34) active space reference wave function. Within the Stochastic-GAS method, one- and two-body reduced density matrices (RDMs) can be stochastically sampled as for stochastic FCI or CAS wave functions.[15,60−62] Those can be subsequently utilized to calculate orbital gradients, Hessians, or within the Super-CI theory[2,27] to variationally relax the molecular orbitals. This gives rise to Stochastic-GASSCF. As shown in the following, RDMs can also be utilized to calculate properties, such as the spin expectation value. The Stochastic-GAS method has been implemented and has been made available in the open source NECI program.[14] The Stochastic-GASSCF variant is available via the interface of the NECI code with the OpenMolcas chemistry software package.[33] The remainder of the article is organized as it follows: In section we summarize the key concepts of GAS and the original PCHB algorithms. In section we introduce the novel GAS-PCHB method, and discuss in some details its performance. In section we discuss three test case applications, that show how Stochastic-GAS can be applied to various chemical situations, and to understand the role of different forms of electron correlation mechanisms. The first example is a stack of five benzene molecules, at varying intermolecular distances, which illustrates the applicability of Stochastic-GASSCF to fragment-based chemical systems. The second example uses Stochastic-GASCI to perform a very large uncontracted stochastic MRCISD calculation that correlates 96 electrons and 159 orbitals, and uses a large CAS(32, 34) active space reference wave function, for an Fe(II)–porphyrin model system, and demonstrates how our algorithm can be used to account for dynamic correlation effects. With this example, we also demonstrate that dynamic correlation effects outside the CAS(32, 34) further stabilize the 3Eg over the 5A1g spin state. In a third test case application, the Stochastic-GASCI strategy has been utilized to investigate the low-energy spin ladder of an Fe4S4 cubane cluster. We show how the GAS strategy can be applied to understand the two competing spin-exchange and charge-transfer correlating mechanisms in stabilizing different spin-states. In section 5 we summarize the findings of this paper and section 6 contains an appendix with mathematical details.

Theoretical Background

Generalized Active Space (GAS) Wave Functions

The generalized active space approach arises from the necessity to build truncated CI wave functions that span a preselected portion of the corresponding complete active space (CAS). As for CAS, GAS-CI wave functions are preselected by the user, through chemical (and/or physical) considerations, and a careful choice of active orbitals and electrons. The active orbitals are subsequently partitioned in a number of active subspaces. The nature, size, and number of these subspaces largely depend on the investigated systems, and generally are chosen according to the type of electron correlation that one wants to target or exclude from the CI space. The examples discussed in the Application section or in ref (27) can be used as guidelines to the strategic choice of GAS subspaces. Within each subspace a full-CI expansion is generated (complete set of intraspace excitations), while the number of interspace excitations is restricted.[18,24,27,28] GAS spaces are defined disconnected if no interspace excitations are permitted, while they are defined connected if interspace excitations are allowed. In the same GAS wave function both connected and disconnected spaces can exist. Figure depicts a possible specification of GAS constraints.

Figure 1

Pictorial representation of a GAS wave function with five GAS subspaces. GAS1 and GAS2 are connected to each other but disconnected from the rest. GAS3, GAS4, and GAS5 are also connected to each other but disconnected from GAS1 and GAS2. The number of interspace excitations are limited by constraining the particle number per GAS space. In the original work on the GAS approach[27,63−65] the term supergroup was utilized to refer to a given distribution of particles (α-, β-, or in general electrons) among GAS subspaces, while fulfilling the GAS constraints. We will interpret supergroups as a special case of compositions, a term borrowed from number theory.[66] A composition is a solution to the following integer equationWe consider two compositions to be different, if their order differs, that is, 2 + 1 = 3 and 1 + 2 = 3 are two different compositions. If we identify the number of summands k with the number of GAS spaces, N with the total number of particles, and x with the number of particles in the ith GAS space, we can easily interpret a given composition as distribution of particles over GAS spaces. We can constrain the allowed compositions, hence the allowed interspace excitations, by defining local or cumulative minimum and maximum occupation numbers per GAS space. We write Nmin, Nmax for local constraints and Ñmin, Ñmax for cumulative constraints. GAS allowed compositions are then those for whichis fulfilled and will be called supergroups, as mentioned before. Two electronic configurations with the same distribution of electrons per GAS subspace belong to the same supergroup. The occupation number constraints cannot be chosen freely. For example, if the total minimum exceeds the total number of electrons, ∑Nmin > N or Ñmin > N, no valid composition (and supergroup) exists. In chemical applications, the Pauli-principle enforces that the number of spin orbitals in a GAS space is larger than or equal to the minimum particle number. These “constraints of the constraints” have been discussed in the literature.[26] We trivially note that if minima and maxima equal each other in every GAS space, it is not possible to excite a particle from one GAS space to the other, and the spaces are disconnected. We also note that in the case of cumulative constraints it is possible to enforce tight inequalities for the last space Ñmin = Ñmax = N which tie the total number of particles and the GAS constraints together and allow shortcuts in an algorithm using cumulative constraints. It is not always possible to convert between the two types of constraints; there are constraints which can be expressed only using local constraints, and vice versa. This aspect has already been discussed in the manuscript introducing the GASSCF method.[27] An example is given in the Appendix (example 6.5). If the constraints can be converted into each other it is done by the following relationships (Lemma 6.6, see Appendix):In the Appendix (subsection ) we give the necessary proofs. It can be proven easily that CAS and RAS wave functions are special cases of GAS, with one and three active spaces, respectively. We trivially note that in CAS, as only one active space is necessary, a single supergroup is generated. We also note that GAS wave functions with purely disconnected spaces have also one supergroup, with a constant number of electrons per GAS space. On the contrary, the electronic configurations generated in RAS wave functions can already be distributed among a number of supergroups, depending of the level of excitation from RAS1 and into RAS3 spaces. The conventional GASSCF, that we implemented in 2011 and made available within the Molcas(32) and the OpenMolcas(33) chemistry software packages, is based on cumulative GAS constraints.[27] The ORMAS-SCF method uses local occupation number constraints.[26] The novel Stochastic-GAS algorithm introduced in this work allows both local and cumulative constraints. The MRCI method accounts for dynamic correlation effects on top of a multiconfigurational wave function chosen as reference, generally of CAS type.[67−69] Since the uncontracted MRCI approach can be expressed via RAS specifications, an efficient GAS algorithm could promptly allow uncontracted MRCI calculations. This is generally prohibitively expensive, considering the unfavorable exponential scaling of RAS wave functions with respect to the size of RAS1 and RAS3 spaces.[32] However, it is feasible using our Stochastic-GAS algorithm, and an example is offered in section . In the GASSCF method orbitals are variationally optimized via a self-consistent field (SCF) procedure, under the mean field generated by the GAS wave function.[27] As in CASSCF, all intraspace orbital rotations, such as GAS1 ↔ GAS1 or GAS2 ↔ GAS2, are redundant and already described by the intraspace excitations in the CI expansion; thus, these excitations are excluded from the orbital optimization. Interspace orbital rotations such as GAS1 ↔ GAS2, however, are only partially redundant and have to be considered in the GASSCF orbital optimization step, in addition to inactive–active, inactive–virtual, and active–virtual rotations. As some of these rotations are already represented by the GAS wave functions, linear dependencies are introduced, that often have a negative impact on the rate of convergence of the GASSCF procedure.[65] Also of interest is the structure of the 1-RDM for GAS wave functions. For disconnected GAS spaces the 1-RDM is block diagonal, because off-diagonal elements, which couple orbitals belonging to different GAS subspaces, vanish. Thus, the diagonalization of the 1-RDM for disconnected GAS, which leads to the natural orbitals, represents an invariant orbital transformation. For connected GAS subspaces, the off-diagonal elements between orbitals belonging to different GAS spaces in general do not vanish, and diagonalization of the one-body density matrix becomes a noninvariant rotation, that mixes orbitals from different GAS subspaces. Thus, natural orbital occupation numbers are only well-defined for disconnected GAS spaces. For connected spaces we can define “pseudonatural orbitals” which are obtained from the block diagonalization of the 1-RDM, each block referring to orbitals of one GAS subspace. Pseudonatural orbitals and natural orbitals are identical for disconnected spaces. Although GAS wave functions with purely disconnected spaces are highly constrained, they are of great theoretical and practical interest. From a practical standpoint, they do not suffer from the redundancy problems mentioned above, and they have well-defined natural orbital occupation numbers. An algorithm that assumes purely disconnected spaces is also much easier to derive and implement. A first prototype of the stochastic-GASSCF method with disconnected spaces was reported in our earlier work.[18]

Precomputed Heat Bath (PCHB)

In this section, we discuss the Precomputed Heat Bath (PCHB) excitation generation using the Heat Bath sampling algorithm developed by Holmes et al.[57] in the context of FCIQMC,[57] and adopted in the present work for the stochastic-GAS algorithm. For the reader who is unfamiliar with FCIQMC we give a brief summary in the Appendix (subsection ). We first introduce the (on-the-fly) Heat Bath excitation generator which calculates matrix elements to all connected determinants on-the-fly and suggests a new determinant with proportional probability. From the generation probability standpoint this is the ideal excitation generator, but the wall clock time per excitation becomes quickly large because an on-the-fly calculation of matrix elements incurs large overhead and the setup of the nonuniform probability distributions scales with the number of orbitals n and number of particles N as . The PCHB excitation generator adopted for the Stochastic-GAS algorithm is based on the Slater–Condon rules for double excitations. If we evaluate the matrix element between two determinants that differ only by a double excitation, we obtainThus, the matrix element only depends on the two-electron integrals (g) involving the differing orbitals, hence it only depends on the excitation, but not on the starting determinant, D. This allows the following approximate heat bath excitation generation: starting from a determinant D, two particles I, J are selected; next, two indices A, B are drawn for the holes from a precalculated probability distribution with probability given bywhere H are the matrix elements for a double excitation from I, J to A, B. If the picked hole indices A, B are already occupied in D the excitation is discarded. The numerator of eq involves no approximation. However, compared to the on-the-fly heat bath method, the denominator contains some nonzero elements which would vanish in the nonapproximated heat-bath algorithm if X or Y were already occupied in D. Note that p(AB; IJ) is not yet the complete pgen value needed for eq . It has to be multiplied with the probability to perform a double excitation and to draw the particles I and J. If the probability distribution from eq is implemented using the alias-method, the time for the excitation generation scales as with the number of orbitals and particles.[70] As we will discuss later in depth, this is a typical trade of “space for time”. If we write |M| for the number of elements in a set M, we need |{(I, J)|I < J; I, J < n}| probability distributions with |{(A, B)|A < B; A, B < n}| entries; hence, the memory demand scales with , with n being the number of orbitals. The Hamiltonian matrix element between two determinants D and D that differ by one single excitation is given byThe value of this element depends on the specific occupied orbitals in D. Hence it is not possible to define configuration-independent probability distributions as for the double excitation case. Thus, in general it is not efficient to use precomputed probability distributions for single excitations as it cannot be done in time. In the context of single-reference methods it is possible to introduce additional approximations and define precomputed probability distributions even for single excitations.[71] However, for the more general case of multireference wave functions, which represent our main target, such approximations cannot be applied, and single excitations are picked uniformly.

The GAS-PCHB Algorithm

In this section, we describe how the PCHB excitation generation and the concept of supergroups in GAS can be combined to derive an efficient algorithm, that we call GAS-PCHB, for performing Stochastic-GASCI and Stochastic-GASSCF calculations within the FCIQMC framework. The simplest stochastic implementation of GAS constraints consists in performing excitations using the conventional FCIQMC excitation generators and to discard GAS forbidden excitations a posteriori. The discarding GAS implementation can be easily combined with any already available FCIQMC excitation generator, including PCHB, and represents the natural choice for benchmarking more sophisticated GAS excitation generators, such as the GAS-PCHB algorithm, that a priori suggests only GAS allowed determinants. We have also implemented a discarding-GAS algorithm and found that when GAS constraints simply aim at removing deadwood configurations, the discarding-GAS performs surprisingly well, and it is rather challenging to develop GAS excitation generators that aim at excluding configurations a priori, without incurring overhead that makes the discarding implementation faster in practice. We succeeded in this task via the GAS-PCHB algorithm.

The Algorithm

In FCIQMC the spawning step is responsible for the stochastic propagation of walkers into the CI space, starting from occupied determinants. Thus, if we assume that our starting determinant is allowed by GAS constraints, only the spawning step has to be modified to ensure that all spawned determinants are GAS allowed. The algorithmic details to realize a GAS-PCHB excitation generator are described in this section. Within the GAS approach, a given (A, B ← I, J) excitation can lead to a GAS allowed or forbidden determinant D depending on the starting determinant D. Hence for GAS, it is not possible to generate probability distributions that only depend on the orbital indices, p(AB; IJ) as for eq . The concept of supergroups and compositions (section ) are introduced in our GAS-PCHB excitation generator to circumvent the dependency of the probability distributions on the individual Slater determinants. The supergroup of a given determinant can be determined by counting the particles per GAS space (a operation, where N is the number of correlated particles). Counting how many particles an excitation transfers between GAS spaces is also a trivial operation. Hence for a given supergroup (and all determinants belonging to it), an excitation is GAS allowed if the composition after excitation is still inside the chosen GAS constraints. Whether an excitation is GAS allowed or forbidden only depends on the supergroup of the starting determinant D. This condition applies for local and cumulative constraints alike. If we define isg to be a labeling index for the supergroups, we can introduce a modified Hamiltonian, H̃(isg), for each supergroup whose entries are set to zero for GAS forbidden excitations and to the original Hamiltonian otherwise. Thus, in the case of double excitations, we can writeEquation is similar to eq , in that the right-hand side of the equation does not depend on the determinant D, but only on its supergroup isg(D). Similar to the FCI-PCHB probability distributions (eq ), we can define GAS-PCHB probability distributions asAlthough the new dependency on isg(D) increases the number of probability distributions that have to be generated and stored, the direct dependency on the individual Slater determinants is circumvented, making GAS-PCHB a practical tool of general applicability. In the next section the scaling of the algorithm will be discussed together with some examples that show the practical limitations bound to the dependency of p(AB; IJ; isg) on the number of supergroups. However, we can anticipate that since there are much fewer supergroups than determinants, the different probability distributions can in most of the practical cases be precomputed and stored. As for the FCI-PCHB case, it is not possible for single excitations to use precomputed probability distributions to perform importance sampling according to the matrix element. Nevertheless, it is possible to perform uniform selection of holes for single excitations which at least automatically adhere to GAS constraints by usingwhere Ñ is an appropriate normalization factor, to ensure ∑p(X; I; isg) = 1 for a given particle I and given supergroup isg. Such a distribution can be very efficiently implemented by using bitmasks. For an efficient GAS-PCHB excitation generator, a fast function to determine the supergroup index of any given Slater determinant is key. A fast on-the-fly algorithm to calculate isg(D) is given in the appendix (subsection ). The time to calculate isg(D) can be additionally reduced, by evaluating isg(D) only once for a given determinant and then reusing this value for all walkers on this determinant. The reused supergroup index does not require additional communication, because of an implementation detail in the annihilation step. All walkers on the same determinant are collected to the same process, to facilitate the annihilation of newly spawned walkers from different parent determinants. This implies that in the subsequent spawning step all walkers belonging to a given determinant will reside on one process. Every walker that attempts to spawn from this determinant can look up the index without any communication across processes. In the special case of disconnected spaces or GAS constraints that are equivalent to CAS there is exactly one supergroup (section ). Hence the index isg equals one for every determinant and does not have to be calculated at all in this case. Algorithm 1 summarizes the main steps of the GAS-PCHB excitation generator. The adaptation of the semistochastic method (introduced in subsection ) to the stochastic GAS-PCHB procedure requires only minor changes, conceptually and code-wise. In the case of GAS, the deterministic core-space Hamiltonian for performing semistochastic FCIQMC dynamics is simply defined asand since Hcore has to be constructed only once and the full information about GAS constraints is contained in the zeroed off-diagonal elements it is very easy to implement the semistochastic method for GAS constraints. The sampling of reduced density matrices (RDM) does not require any code adaptation, since GAS forbidden CI coefficients are simply zero and they are not accumulated during the RDM sampling steps.[60−62] Therefore, the stochastic GAS-PCHB excitation generator allows us to formulate a Stochastic-GASSCF procedure, and gives us access to properties encoded into the RDMs, such as the spin expectation value.

Performance and Scaling

To evaluate the performance of the GAS-PCHB excitation generator, invariant and noninvariant GAS constraints are to be distinguished. Invariant GAS constraints are those that exclude deadwood configurations. Noninvariant GAS constraints, instead exclude configurations that would have nonzero coefficients in the corresponding CAS-CI expansion, and once removed the resulting total energy increases. The conventional GAS-CI algorithm,[27] greatly benefits both from invariant and noninvariant GAS constraints, because the largest bottleneck of the method is the memory required to store the dense CI vector. A truncated Hamiltonian matrix and corresponding CI eigenvector greatly reduce this demand, independently of the nature of the truncated configurations. Conversely, FCIQMC is a method that benefits from sparsity in the wave function, and unpopulated determinants do not occupy memory and are rarely selected at the spawning step. Hence, invariant GAS constraints do not improve the course of the dynamics, nor do they reduce the corresponding computational costs (spawning process and storage). On the contrary, noninvariant GAS constraints reduce the CI space to which walkers are allowed to propagate. Consequently, these GAS constraints can effectively reduce the computational costs for FCIQMC. If we compare GAS-PCHB with discarding-GAS and assume that the supergroup index isg is known, and that the list of probability distributions for all supergroups are already available, the drawing of orbital pairs AB from p(AB; IJ; isg) is practically as fast as drawing AB from the corresponding FCI distribution p(AB; IJ). Computational overhead for the GAS-PCHB algorithm arises from the generation of the probability distributions (only at the beginning of the simulation) and from the evaluation of the supergroup index for a given determinant at runtime. In the worst case, if every determinant is occupied by exactly one walker, the supergroup has to be calculated for every walker and the time per excitation increases slightly. Such a dense CI wave function is rarely encountered in practical applications. In actual chemical problems determinants are occupied by multiple walkers and the supergroup index is calculated only once for each newly occupied determinant. Thus, in practical GAS calculations, the evaluation of the supergroup index represents a negligible additional step and the time per excitation can be considered identical for GAS-PCHB and discarded-GAS. For GAS schemes, where only disconnected spaces are considered, this negligible overhead vanishes completely, since for disconnected GAS schemes only one supergroup exists; the supergroup index isg equals one for every determinant and does not have to be calculated (see section ). This implies that FCI-PCHB can be implemented as a special case of GAS-PCHB. Since discarded excitations increase autocorrelation of the projected energy, the standard error σ of a discarding-GAS excitation generator will usually be larger than for the a priori selection provided by the GAS-PCHB scheme.[72] Also the pgen(i, j) for discarding GAS algorithms is generally lower than for corresponding a priori GAS algorithms; this has the effect of leading to a smaller imaginary time-step for the discarding-GAS algorithm. Both effects deteriorate the efficiency of a discarding implementation with respect to the GAS-PCHB algorithm.[57] Because the amount of GAS discarded excitations strongly depends on the system, it is difficult to give general efficiency ratios between GAS-PCHB and discarding-GAS. But since the time per excitation is in general the same for both methods, GAS-PCHB is usually more efficient than discarding-GAS. PCHB (in Stochastic-GAS and Stochastic-CAS) is a typical trade of “space for time”. The memory demand for GAS-PCHB probability distributions increases with , where n is the number of spatial molecular orbitals and nsg the number of supergroups that are generated for a given GAS specification. It is rather difficult to write a closed expression for the scaling of nsg with respect to the number of particles, N, GAS spaces, nGAS, and GAS constraints. It has to be stressed that the number of supergroups is independent from the number of orbitals and in the best case of purely disconnected spaces there is only one supergroup, regardless of N and nGAS. In the worst case of no minimum or maximum restrictions, the scaling of nsg is combinatorial and given by the number of compositions (Lemma 6.4) asIn practical applications, as the ones discussed in the next section, interspace excitations lie between these extremes, and in general closer to the lower extreme. As an example, we consider a system of five stacked benzene molecules, with an active space that includes the six π-orbitals of each benzene, and distributed into separate GAS subspaces. This system is discussed in greater detail in subsection . If we use cumulative constraints we can definefor the ith GAS space, to control the number of allowed supergroups depending on the interspace excitations nexc. Table shows a steep scaling of the memory with nexc. Our GAS-PCHB implementation uses hybrid parallelization, and precomputes the probability distributions in shared memory on every node. Thus, in the case of the stack of five benzene molecules, ≈35 GB per node is required for three interspace excitations (Table ). The memory demand would have been larger with pure message-passing parallelization, where each process requires its own copy of the probability distributions. In that scenario, a 40 processes node would require 1.5 TB of memory for the same system. Since drawing a number is a read-only operation, no complicated locking mechanisms or atomic operations are required, after the distributions have been initialized. The memory demand is further reduced by a factor of if distribution entries are indexed over spatial orbitals instead of spin orbitals.[14]

Table 1

Memory Demand of Probability Distributions for a Hypothetical [5·(6, 6)] GAS Calculation with Different Number of Interspace Excitations nexc Using Cumulative Constraints (eq )

algorithm	n_exc	n_sg	memory/GB
FCI-PCHB	0	1	0.01
GAS-PCHB	0	1	0.01
GAS-PCHB	1	81	1.17
GAS-PCHB	2	625	9.06
GAS-PCHB	3	2401	34.81

Moreover, as shown in the next section, in practical calculations double interspace excitations are usually enough to recover the Full CI energy for chemically motivated GAS constraints. If the number of interspace excitations is low, the memory demand remains contained and a larger number of particles and GAS subspaces are accessible. For example, in Table , we show the memory requirements for a hypothetical [n·(6, 6)] GAS calculation, with varying number of GAS subspaces, n, and using a fixed number of interspace excitations, nexc = 2. On today’s scientific computing hardware, up to eight of such (6, 6) GAS subspaces can be correlated (≈1 TB). In this context, we note again that the number of supergroups is independent from the number of orbitals, and only depends on the number of GAS spaces and the level of interspace excitations.

Table 2

Memory Demand of Probability Distributions for a Hypothetical [n·(6, 6)] GAS Calculation with Varying Number of GAS Subspaces, And a Constant Number of Interspace Excitations Set to nexc = 2. The GAS Constraints Are Cumulative as Given by eq

n_benzene	n_sg	memory/GB
1	1	0.01
2	5	0.07
3	25	0.36
4	125	1.81
5	625	9.06
6	3125	45.31
7	15625	226.55
8	78125	1132.74

Application

In this section three applications of Stochastic-GASCI and Stochastic-GASSCF are presented that show how GAS in its stochastic form can be utilized for modeling the electronic structure of a variety of chemical systems. The first example is a stack of five benzene molecules, which illustrates how a limited number of interspace excitations in GASSCF already recovers the full CI energy if the main correlation effects happen inside each GAS space. The second example uses Stochastic-GASCI to perform a very large uncontracted-MRCI calculation for a Fe(II)–porphyrin model system, with a (32, 34) active space as reference wave function, and correlating a total of 96 electrons and 159 orbitals. This example demonstrates how the new method can be efficiently used to account for dynamic correlation in a systematic way. As a last example, we use Stochastic-GASCI to investigate the spin ladder of an all-ferric Fe4(III)S4 cluster, and discuss the role of the leading forms of electron correlation by selectively switching them off via GAS constraints.

Benzene Stack

In this section we discuss the application of Stochastic-GAS-CI and Stochastic-GASSCF to a stack of five benzene molecules separated by a varying distance, d, ranging from 3.0 to 20.0 Å (Figure ).

Figure 2

Geometry of the benzene stack. The interfragment distance, d, has been changed from a value of 20 Å (very weak, mean-field-only interactions between fragments) to a value of 3 Å, where many-body correlation effects take place. The geometry of the benzene unit was taken from the Computational Chemistry Comparison and Benchmark DataBase.[73] A conventional CASSCF(6, 6) calculation was performed on this structure using OpenMolcas.[33] The resulting MO coefficient matrix was repeated five times along the diagonal to form a block-diagonal coefficient matrix, used as MO basis for the GAS-CI calculations and as starting MOs for the Stochastic-GASSCF optimizations. Since the molecular orbitals of this block-diagonal matrix are not orthonormal, a Gram-Schmidt orthonormalization was performed prior to the stochastic-GASSCF optimization. For this system, π–π* correlation within each individual benzene (intrafragment) is expected to be dominating, while electron correlation across the fragments is expected to be weaker, and its role becoming increasingly important as the benzene fragments get closer to each other. A GAS [5·(6, 6)] active space has been chosen, which consists of the 30 π-orbitals, six on each benzene, and their 30 electrons. The six π-orbitals of each benzene have been grouped into separate GAS subspaces. We used cumulativeand local GAS constraintsfor the ith GAS space. The number of interspace excitations, nexc, starting from a value of zero (disconnected spaces), was gradually enlarged until convergence in total energy was reached. Local and cumulative constraints are exactly equivalent for disconnected spaces (nexc = 0), and yield very similar results for single excitations (nexc = 1). Looking at the sizes of the Hilbert spaces, as we will do later in depth, the highest discrepancy between local and cumulative constraints is expected for nexc = 1. Since the deviation was negligible already in this case we tested only cumulative constraints for nexc > 1. The case of disconnected spaces, nexc = 0, is equivalent to a system of neutral fragments, whose π-electrons are internally correlated and interact among each other only via the mean field generated by the local (6, 6) active space expansion. For nexc ≥ 1, charge-transfer configurations are added to the wave function and many-body correlation effects are explicitly accounted for. Hence, convergence with respect to nexc was reached earlier for larger distances between neighboring benzene molecules. The number of supergroups and memory requirements for the different CAS and GAS calculations with cumulative constraints are summarized in Table . The CAS(30, 30) space with S = 0 consists of 2.41 × 1016 SDs, while the GAS space with disconnected spaces consists of 1.32 × 1014 SDs which is 0.5% of the CAS size. The connected GAS spaces contain different supergroups for local and cumulative constraints, and the allowed configurations and Hilbert space sizes differ slightly at the same level of interspace excitations. For example, the supergroup [6, 5, 8, 5, 6] would be allowed by the cumulative constraints given in eq for single interspace excitations nexc = 1, but is forbidden by the local constraints in eq . On the other hand, the supergroup [5, 5, 7, 7, 6] and other multiple single excitations from neighboring fragments would be allowed by local constraints (eq ) but forbidden by cumulative ones (eq ). The [5·(6, 6)] GAS space with nexc = 1 and local constraints consists of 51 supergroups and 4.25 × 1015 SDs, or 18% of the CAS size, and requires 0.74 GB to store the PCHB probability distribution in memory. Conversely, the cumulative constraints lead to 81 supergroups and 5.22 × 1015 SDs, or 22% of the CAS size, and require 1.17 GB of memory to store the corresponding PCHB probability distributions. For a higher number of interspace excitations the difference of Hilbert space sizes between local and cumulative constraints decreases further. Figure shows the energy difference between GASSCF and CASSCF, (EGASSCF – ECASSCF), for a different number of interspace excitations, nexc, and different distances between neighboring benzene fragments.

Figure 3

(EGASSCF – ECASSCF) energy difference [kJ/mol] for a number of interspace excitations, nexc, ranging from 0 to 3, and different distances, d, between the benzene fragments. The chemical accuracy of 1 kcal mol–1 is marked with the black-dashed horizontal line. A table of all energies is given in the Supporting Information. The Stochastic-GASSCF energy converges very fast to the Stochastic-CASSCF value, as the nexc value is increased. Already with nexc = 1 the error is below the chemical accuracy of 1 kcal mol–1 for all distances and both local and cumulative GAS constraints. As expected, the CI-truncation error is dependent on the distance. The closer the benzenes are to each other, the more charge transfer configurations are required for an accurate description of the correlation effects. At an interfragment distance of 4.5 Å and above, disconnected spaces suffice to have deviations smaller than 1 kcal mol–1 from the corresponding Stochastic-CASSCF calculation. Notice that this is a typical distance for π-stacking, leading to the conclusion that highly truncated MC wave functions, such as GAS wave functions with disconnected spaces, can be of high value for realistic model systems of weakly interacting fragments. It has to be emphasized that the mean field orbital relaxation effect of fragments onto each other is still accounted for by the SCF procedure, even for disconnected spaces, as opposite to methods where only the CI problem is solved in a fixed MO basis. The stack of benzene molecules represents a good ground for comparison between Stochastic GASSCF, Active Space Decomposition Density Matrix Renormalization Group (ASD-DMRG),[74,75] and Non-Orthogonal CI with a Reduced Common Molecular Orbital Basis (NOCI-RCMO),[76,77] as the latter approaches have also been tested on the same or similar model systems in earlier works. Both NOCI-RCMO and ASD-DMRG are tailored toward clusters of molecules with weak interspace interactions, and share the assumption that the main correlation effects happen within the fragments. In ASD-DMRG, the CI problem is solved conventionally on each fragment. The compound wave function is then constructed as a linear combination of direct products of fragment states. As in DMRG, a matrix-product ansatz is used for the coefficients.[74] The dimension of these matrices, commonly called the bond dimension, M, is the main factor controlling accuracy and cost of such calculations. Although the M value cannot be as intuitively interpreted as the number of interspace excitations in GASSCF, it is also a measure for correlation between fragments. If M = 1 the matrix-product reduces to a plain product ansatz of noninteracting systems, while if M is the dimension of the full Hilbert-Space, the CI-expansion can be exactly recovered. The M value in realistic systems lies somewhere in between, as in those cases M cannot be made large enough to reconstruct exactly the entire Hilbert space. The most notable difference between ASD-DMRG and conventional DMRG, is the low value of M at the order of 102 that is required by ASD-DMRG to reach convergence for fragment-like systems. Conventional ab initio DMRG, where the sites are not optimized fragments but spatial molecular orbitals requires M values that are approximately 2 orders of magnitude higher. In ref (74), the authors of ASD-DMRG state that “If a poor initial guess for the chain includes only neutral fragments and the total charge is constrained to be neutral, the algorithm will keep only neutral fragment states although charge transfer configurations may be important in the exact ground state” and overcome this limitation via a perturtative correction. Within the GAS approach, charge transfer configurations are added by tuning the number of interspace excitations, nexc. It is thus, possible to precisely identify these configurations and quantify their importance, as shown in Figure . The NOCI-RCMO method uses orthonormal molecular orbitals for each state on each fragment but allows nonorthogonality between orbitals in different states or different fragments.[76,77] From the nonorthogonal and partly redundant orbitals, a common orbital basis is constructed on each fragment by removing linear dependencies among the orbitals in different states, depending on a cutoff value τMO for the diagonalized overlap matrix. The common orbital bases on each fragment are then collected together to form a large nonorthogonal MO basis for the cluster. The similarity with the GAS truncation arises at the evaluation of matrix elements which requires several determinant pairs due to the nonorthogonality. Determinants are neglected if their CI-coefficient are smaller than another threshold, τdet. The application of NOCI-RCMO to similar aromatic systems as our benzene stack shows that τdet can become as large as 1 × 10–6 for fragment distances of 5 Å without affecting the total energy value.[76] Unlike GASSCF, τdet does not a priori exclude higher-order charge-transfer configurations. However, if a system is made of weakly interacting fragments and fragment MOs are utilized, charge-transfer configurations will have (vanishingly) small CI-coefficients, and will be excluded at run time by the chosen τdet threshold. A high value of τdet has then a similar meaning as a low number of allowed interspace excitations in GASSCF. It is important to highlight that both ASD-DMRG and NOCI-RCMO are tailored toward systems of weakly interacting fragment molecules, while GASSCF is a method of general applicability, that can be used on compounds of weakly interacting fragments, as well as on strongly correlated and covalently bonded systems as shown in the later sections. Regardless of the chosen method for optimizing the CI problem, the orbital representation is also very important. In the particular case of the benzene stack, choosing fragment-localized orbitals enhances the locality of electron correlation within each fragment, and the sparsity of the many-body eigenvectors. Truncations (via GAS, NOCI, or ASD-DMRG) that take advantage of the sparse structure of the wave functions have negligible impact on the accurate description of correlation effects and on the predicted total and relative energies. In the following, the error introduced by not optimizing the molecular orbitals is discussed. Since the CI energy of the first SCF iteration is the GASCI energy on the initial, unoptimized orbitals, it is possible to compare the GASCI total energies with the GASSCF energies. The energy difference between GASCI and GASSCF (EGASCI – EGASSCF) for different numbers of interspace excitations, nexc, and different distances between the benzene fragments is shown in Figure .

Figure 4

Energy difference between GASSCF and GASCI, (EGASCI – EGASSCF), for different numbers of interspace excitations, nexc, and different distances between the benzene fragments. The chemical accuracy of 1 kcal mol–1 is marked by the horizontal dashed-black line. The discrepancy between GASCI and GASSCF, due to missing variational relaxation of the orbitals, is higher than the difference between GASSCF with disconnected spaces and CASSCF, indicating that mean-field effects can be substantially larger than correlation effects bound to the charge-transfer correlation mechanism. Interestingly, the error is nearly independent from the nexc chosen. Of particular interest is also the speed of convergence of the FCIQMC dynamics (in the CAS form) depending on the orbital basis. Only noninvariant orbital rotations are performed in the CASSCF procedure (inactive ↔ active, inactive ↔ virtual and active ↔ virtual rotations). Thus, the active orbitals are not rotated among each other by the SCF procedure, and the main correlation features are to a large extent retained along the CASSCF optimization. At convergence, the optimized CASSCF active orbitals are in general transformed into natural orbitals, by diagonalization of the one-body RDM in the active space. For GASSCF, the diagonalization of the 1-RDM in the full active space is not an invariant rotation. Instead, invariant is the rotation to pseudonatural orbitals, defined as those which diagonalize each GAS subspace separately. While pseudonatural orbitals do not disrupt the fragment-localized structure of the MO basis, the natural orbitals are in general delocalized across the entire system. The locality of correlation (within the fragment) is lost when the more delocalized natual orbitals are utilized, and the many-body wave function generally becomes more dense. FCIQMC dynamics are sensitive to the MO basis adopted. This argument has been discussed for exchange-coupled transition metal clusters.[78] The benzene stack example shows how the MO representation can affect FCIQMC dynamics for weakly interacting closed-shell systems. The FCIQMC (in CAS form) projected energy, Eproj, as a function of the walker population, and using both natural orbitals and fragment-localized orbitals is depicted in Figure .

Figure 5

FCIQMC projected energy, Eproj, against the number of iterations using the more delocalized natural orbitals and the fragment-localized orbitals, and increasingly larger number of walkers. The system is the benzene stack (Figure ) with an interfragment distance of 6.0 Å. The fragment-localized orbitals utilized here are the CASSCF orbitals. The natural orbitals are obtained from the diagonalization of the corresponding one-body RDM. The FCIQMC dynamics (in CAS form) converge faster when fragment-localized molecular orbitals are utilized, rather than for example the corresponding more delocalized natural orbitals. This result represents the numerical evidence that fragment-localized orbitals produce sparser wave functions which are simpler to describe by a finite distribution of stochastic walkers. Within GASSCF, where the pseudonatural orbitals are used, the convergence speed does not deteriorate, owing to the fact that pseudonatural orbitals preserve the fragment-localized nature of the molecular orbitals. We also note that the time-step can be chosen larger for fragment-localized orbitals () than for delocalized ones () which allows faster propagation along the imaginary time while retaining a stable dynamics. FCIQMC benefits from a sparse representation of the wave function. While the exact CI-energy is invariant under unitary MO transformations, methods that approximate the full-CI energy are not invariant under the same MO transformations. This is due to the different degree of sparsity of the Hamiltonian matrix and the corresponding CI wave function with respect to the orbital transformations. If orbitals that are delocalized over the entire compound system are utilized as one-electron basis, a simple π–π* excitation on one fragment can only be represented by a large linear combination of excitations across most (if not all) delocalized MOs, artificially coupling them to each other and, thus, producing unnecessarily complicated CI expansions, featuring long-range entanglements. On the contrary, fragment-localized orbitals keep the leading forms of electron correlation confined within each fragment, increasing the sparsity of the wave function. We also note that for systems made of weakly interacting fragments, such as the benzene stack, and using a GAS strategy that reflects the fragment nature of the compound, the Stochastic-GAS calculations are practically size-extensive if localized orbitals are used, because we can safely assume that only interspace excitations into neighboring fragments are relevant.

Fe(II)–Porphyrin Model System

Iron-porphyrins are the central building block for a variety of enzymes in biochemistry. Owing to the low barrier between Fe2+ and Fe3+ and nearly degenerate low-energy electronic states, they catalyze important redox reactions and can serve as charge or molecular carriers.[79−82] The relative stability of the low-energy spin states depends on ligand field and many-body correlation effects experienced by the metal center that, in turn, depend on chemical functionalization and geometry of the conjugated macrocycle. For this reason, a reliable theoretical prediction of the energetically low-lying spin states is challenging and necessary to facilitate the understanding of nature’s efficient enzymatic reactions. The theoretical prediction of the relative stability of the energetically close 5A1g and 3Eg states in the square planar Fe(II)–porphyrin system is a notoriously difficult task, and there have been a number of theoretical investigations on this topic.[16−18,43,44,83−90] It has been shown that the triplet is characterized by more complex electron correlation mechanisms than the quintet spin state and only if these correlation effects are precisely taken into account, is the triplet predicted to be the ground state.[16−18] A (14, 16) active space erroneously predicts a quintet ground state, even when coupled to the post-CASSCF perturbative CASPT2 correction.[16] If the active space is substantially enlarged, CAS(32, 34),[16] consisting of the entire ligand-based π system, the σ donating orbitals, and the valence and double-shell d orbitals, complex mechanisms such as 3d–3d′−π* excitations are observed in the wave function[18] that can be ascribed to correlation-induced delocalization of a metal charge to the macrocycle, a correlated π-backdonation. Only then is a triplet ground state predicted.[16] In a joint FCIQMC and DMRG study, we have also analyzed the CI-expansion of the wave function and the orbital entanglement to visualize the complex correlation mechanisms taking place in this system.[18] In the same work, a prototype Stochastic GAS implementation for disconnected spaces was used to quantitatively probe the importance of π-backdonations and was found to have an effect of 2.5 kcal mol–1 on the spin gap.[18] Even if the (32, 34) active space describes qualitatively well the necessary correlation mechanisms, dynamic correlation effects exist that go beyond the (32, 34) active space. Semicore correlation has been accounted for by further enlarging the active space, CAS(40, 38), including the 3s and 3p orbitals, and ultimately increasing the spin gap to 4.4 kcal mol–1, at the Stochastic-CASSCF level of theory.[17] Coupled cluster calculations with up to quadruple excitations (CCSDTQ) inside the Stochastic-CASSCF(40, 38) orbital space yielded a spin gap of 4.8 kcal mol–1.[17] Several methods exist to treat efficiently dynamic correlation effects on top of CASSCF wave functions. However, the list dramatically reduces when a large CASSCF wave function is used as reference. The multiconfiguration pair–density functional theory, MCPDFT, is one of the few methods that can be effectively coupled to very large CAS reference wave functions. MCPDFT calculations on top of DMRG-CASSCF(32, 34) active space were performed by Zhou et al.[89] and further stabilized the triplet over the quintet with an estimated spin gap of 16.1 kcal mol–1.[89] Although this method can be coupled to large CAS wave functions, it is not possible to systematically improve it. Moreover, although the delocalization error(91) does not affect the SCF procedure in MC-PDFT, as this is carried exclusively using the preceding CASSCF procedure, it is possible that the delocalization error bias, dependent on the chosen exchange and correlation translated functional, still exists that overstabilizes the triplet spin-state. Another example is the tailored coupled cluster approach (TCC) that performs Full CI within the active space and uses the resulting CI coefficients as fixed amplitudes in the subsequent coupled cluster equations, which are then solved to account for the remaining dynamic correlation.[92] The tailored distinguishable cluster method[109] with singles and doubles (TDCSD) and F12 correction[93,94] gave a spin gap of 5.8 kcal mol–1.[95] For the current application, the Stochastic-GAS approach has been applied to build and stochastically solve a large RAS-CI wave function. The converged CASSCF(32, 34) MOs have been used as a one-electron basis.[16,18] The 34 active orbitals have been included in the RAS2 space. The RAS1 space was selected by identifying plateaus in the orbital energy of the inactive orbitals for each irreducible representation and including orbitals above these plateaus. In total 32 doubly occupied orbitals were chosen for the RAS1 space, including the four 3s and 3p semicore orbitals from the metal center and the 28 additional σ-orbitals from the macrocycle. Since there were no well-defined plateaus in the orbital energies of the virtual orbitals, the RAS3 space was simply defined by an energy threshold of 0.85 Eh. The threshold was chosen such that the resulting memory demand could be still fullfilled by the smallest node used for these calculations. All virtual orbitals below this threshold were included into the RAS3 space. A total of 93 empty orbitals were selected for the RAS3 space. Up to double excitations out of RAS1 and into RAS3 were allowed, leading to a total of nine supergroups and a memory requirement of 97.64 GB for the GAS-PCHB probability distributions. In total, a RAS(96, 2, 2; 32, 34, 93) active space was selected, where the notation RAS(n, l, m; i, j, k) is used, where n represents the number of active electrons, l is the maximum number of holes allowed in RAS1, and m is the maximum number of electrons allowed in RAS3. Active orbitals are labeled by i, j, k and refer to those placed in RAS1, RAS2, and RAS3, respectively. This scheme correlates 96 electrons into 159 orbitals. The Stochastic-GAS scheme is conceptually equivalent to a stochastic uncontracted Multi-Reference Configuration Interaction approach with single and double excitations from the occupied space (Stochastic-MRCISD). Clearly, no conventional uncontracted or contracted MRCI procedure can be carried that uses the large CAS(32, 34) reference wave function. In that respect the present calculation is unprecedented and it is only possible using our Stochastic-GAS strategy. The spin gap, ΔE = E(5A1g) – E(3Eg), predicted by our large Stochastic-GAS approach, is 7.0 kcal mol–1, a value that is considerably larger than any systematically improvable result previously reported. The Stochastic-GAS spin gap is reported in Table together with the results obtained with other methods on the same model system. Computational details related to the Fe(II)–porphyrin applications can be found in the Supporting Information.

Table 3

Spin Gap ΔE = E(3A1g) – E(3Eg) between the Quintet and Triplet State of Fe(II)–Porphyrin for Different Methods from the Literaturea

algorithm	ΔE/(kcal mol^–1)
CASSCF(14, 16)/CASPT2[16]	–0.5
Stochastic-CASSCF(32, 34)[16,18]	3.5
DMRG(M = 1 × 10⁴) CASSCF (32, 34)[18]	3.5
Stochastic-CASSCF(40, 38)[17]	4.4
Stochastic-CASSCF(40, 38)/CCSDTQ[17]	4.8
Stochastic-CASSCF(40, 38)/CCSDTQ + F12[17]	5.7
Stochastic-CASSCF(32, 34) + TDCSD[95]	2.6
Stochastic-CASSCF(32, 34) + TDCSD_F12[95]	5.8
DMRG(M = 300) CASSCF(34, 35) + MCPDFT(tPBE)[89]	16.1

Stochastic-CASSCF(32, 34) + RASCI(96, 2, 2; 32, 34, 93)	7.0(1)

The results are sorted by increasing spin gap, except for the MRCISD result in the last row which is from this work.

The results are sorted by increasing spin gap, except for the MRCISD result in the last row which is from this work. The spin gap increases from 3.5 kcal mol–1 to 7.0 kcal mol–1, in going from CASSCF(32, 34) to the large Stochastic-RASCI calculations. The doubled spin gap prediction clearly shows the importance of dynamic correction effects on top of an already large active space, that describes most of the valence correlation mechanisms. This unprecedented result should also be compared to the CASSCF(14, 16)/CASPT2 approach used earlier.[16] While the CASPT2 also aims at recovering dynamic correlation outside the active space, the chosen active space was too small, and important high-order excitations effects (such as the correlation induced π-backdonation discussed in ref (16 and 18)) were missed by the second order perturbative correction. It is also important to emphasize that our approach can be further systematically improved by increasing the excitation level from RAS1 and to RAS3, a study that goes beyond the scope of the present work.

Fe4S4 Cubane Spin Structure

In this section, the Stochastic-GAS paradigm is used to investigate correlation effects in spin ladders of exchange-coupled polynuclear transition metal clusters, here exemplified by an all-ferric Fe4(III)S4 cubane complex. The GAS strategy is first applied to an N4 tetrahedron model system as a proof of concept. The smaller N4 model is chosen to mimic the weak magnetic interactions across the four magnetic centers of the transition metal cubane. In the all-ferric Fe4(III)S4 cubane, each magnetic center is in a local spin sloc = 5/2 with five unpaired electrons, for a total of 20 unpaired electrons. The (20, 20) active space is the smallest that can be chosen to describe spin interactions in this system, which is already too large for conventional multiconfigurational techniques. The N4 model is characterized by three unpaired electrons per site, a local spin sloc = 3/2, and a total of 12 valence electrons. Conventional CAS(12, 12) calculations are routinely feasible and fast and will be used as a reference for comparisons with GAS(12, 12) calculations, in Slater determinant and spin-adapted bases. Considering that the current implementation of the Stochastic-GAS operates on the basis of Slater determinants (SDs), the N4 will also be used to address the question of whether spin-pure solutions can be obtained from our SD-based Stochastic-GAS method. SDs are not necessarily eigenfunctions of the spin operator Ŝ2, but they are always eigenfunctions of the spin-projection operator, Ŝ. Since Ŝ2 and Ŝ commute, a basis of joint eigenfunctions exists. If the respective quantum numbers of Ŝ2 and Ŝ are s and m, for a common eigenfunction of Ŝ2 and Ŝ we know thatimplying that eigenfunctions of Ŝ with eigenvalue m cannot form a basis for an eigensolution of Ŝ2 with |m| > s, but they can form a basis for any eigensolution of Ŝ2 with |m| ≤ s. Starting from an SD as reference, the FCIQMC dynamics preserves the spin-projection, m, and converges to the lowest spin state with s ≥ |m|. It follows that for antiferromagnetically coupled systems, it is possible to target spin pure states by adjusting the spin-projection of our starting guess. However, for ferromagnets, where higher spin means lower energy, any spin-unconstrained optimization will inevitably lead to the high-spin ground state, independently of the initial choice of m. The analysis of the following spin-systems is carried with this limitation in mind. The distorted N4 tetrahedron model system is discussed first. The N atoms are at the equivalent positions of the four metal centers of the Fe4S4 system. Two N–N bond distances are 2.85 Å, and four are 2.75 Å. The selected active space consists of the 12 2p orbitals. A conventional CASSCF(12, 12) was performed. The optimized natural orbitals were subsequently localized with the Pipek-Mezey method and used as starting orbitals for subsequent GAS-CI calculations. No SCF orbital optimizations were carried for this system, as our main focus is the rationalization of electron correlation mechanisms that are missed with respect to the corresponding CAS, when interspace excitations are sevelery constrained by GAS. This comparison is only possible if the same active orbitals are utilized for CAS and GAS approaches, except for the invariant rotations among the active orbitals. The GAS active space was defined as [4 × (3, 3)] with the three 2p orbitals of each nitrogen atom grouped in a separate GAS subspace. The charge-transfer excitations between the magnetic centers were controlled using cumulative constraintsand excitation levels nexc between 0 and 2 were considered. Interestingly, convergence with respect to nexc is reached for two interspace excitations, and GAS schemes with larger nexc values were not necessary. For nexc = 1, the number of supergroups is 19 and a memory of 56.2 MB has been allocated to store the PCHB probability distributions. For nexc = 2 the number of supergroups is 85 and 250 MB are required. GAS calculations were performed in both spin pure and SD bases and compared to each other. The Supporting Information contains further computational details. The CAS and GAS spin ladders of N4, for the different choices of nexc, and using both spin-pure and SD-based GAS are depicted in Figure . In addition, the spin quantum number s is computed from the ⟨ψ|Ŝ2|ψ⟩ expectation value for each chosen m value in the SD based calculations and is also reported in Figure . In the Supporting Information we derive the working equations for how Ŝ2 is evaluated using the RDMs.

Figure 6

CAS and GAS energies relative to the CAS(12, 12) s = 6 state for all lowest spin states of N4 and calculated spin quantum number, s, for the SD-based CAS and GAS wave functions. Both spin-pure and SD representations were used for GAS. The x-axis refers to the total spin, s, for spin-adapted calculations and to the total spin-projection, m, for SD-based calculations. The GAS results in an SD basis for disconnected GAS subspaces are omitted (see main document for details). The highest spin state (s = 6) of the N4 system can be represented by a single SD (or CSF) with all orbitals occupied by exclusively α (or exclusively β) electrons (|m| = s = 6). The energy of this state is unaffected by GAS constraints, as particles cannot be excited among GAS subspaces anyway, due to the Pauli exclusion principle. For |m| < 6 numerous forms of electron correlation can potentially take place. Three main excitation types are recognized: (a) on-site excitations (with electron pairing) may lead to non-Hund contributions into the multiconfigurational wave function, (b) exchange interactions, that introduce long-range correlation effects, and (c) charge-transfer excitations across the sites that reduce on-site electron repulsion. As shown in Figure , for spin-adapted and SD-based CAS, the lower spin states of the N4 model are energetically more stable than the higher spin states (an antiferromagnet). For GAS wave functions with connected spaces (to the limit of single interspace excitations) the same result is obtained. The agreement between CAS and GAS wave functions with as little as single interspace excitations, nexc = 1, is impressive, even though the GAS space is considerably smaller than the CAS space. The CAS(12, 12) space consists of 853776 SDs, while the GAS[4·(3, 3)] space with single interspace excitations nexc = 1 consists of 468942 SDs which is 55% of the CAS space size. The largest energy difference between CAS and GAS with single interspace excitations is obtained for the s = 0 spin state and is only 0.36 kJ mol–1, a negligible quantity. The N4 cluster is antiferromagnetically ordered for CAS and connected GAS spaces, hence we conclude from the previous discussion that we can target spin pure states with selected m values and s = |m|. The results in Figure confirm precisely this aspect. CAS and connected GAS energies, obtained using the SD representation, are undistinguishable from the corresponding energies obtained in a spin-adapted basis. Also, the calculated spin quantum number from the expectation value of the Ŝ2 operator (Figure ) confirms that for this system all states are pure spin eigensolutions, despite the fact that an SD basis has been utilized. For disconnected GAS spaces (nexc = 0), the spin ladder is inverted to ferromagnetic order. This can be explained by considering the two main competing correlation mechanisms, spin exchange and charge-transfer. The exchange energy favors parallel alignment of spins across the sites, while charge-transfer correlation across magnetic centers allows for correlation induced differential stabilization of the lower spin states. In the absence of charge-transfer excitations (nexc = 0), only exchange interactions remain that stabilize the high spin states, leading to a ferromagnetically ordered system. Thus, for disconnected spaces, independently of the chosen m value for the SD based Stochastic-GAS dynamics, the final state is the one with the highest spin, s = 6. For this reason, neither a spin ladder nor spin expectation values have been reported in Figure for disconnected GAS calculations in an SD basis. A spin-adapted Stochastic-GAS implementation is currently under development, that relies on the GUGA technique to build and couple CSFs via the Hamiltonian operator. The development of the spin-adapted Stochastic-GAS is precisely motivated by the above-discussed limitation. It is also important to mention that spin purification techniques exist for ensuring that the eigenvectors of a SD-based CI expansion have the desired ⟨Ŝ2⟩.[96] For the Fe4S4 system, the active space consisted of the 20 3d orbitals of the Fe3+ ions and their 20 electrons. The structure of the cluster can be found in the computational details of the Supporting Information. This (20, 20) active space exceeds the limits of conventional CAS; thus, only the stochastic-CAS and GAS strategies will be presented in this section. The spin-pure CASSCF(20, 20) localized orbitals from ref (97) were used, and only CASCI and GASCI calculations were performed here. The GAS active space was defined as [4·(5, 5)] with each of the localized Fe(III) orbitals being in a separate GAS space. While for the N4 system, cumulative GAS constraints have been used (for direct comparison with the conventional GAS method where only cumulative constraints are available), the excitation level between the irons was controlled by local constraintsAs shown for the benzene stack, differences between local and cumulative GAS constraints exist, but their practical effect on the energetics is in general negligible. GASCI calculations with an excitation level nexc of 1 and 2 were performed. Higher excitation levels were not necessary as convergence is reached already for nexc = 2. As in the case of N4 it was not possible to use disconnected spaces (nexc = 0) due to the ferromagnetic ordering. The spin ladder of Fe4S4 calculated with the SD-based Stochastic-CASCI and Stochastic-GASCI methods is depicted in Figure , together with the deviation from the spin-adapted Stochastic-CASSCF results from ref (97).

Figure 7

Spin ladder of Fe4S4 calculated with Stochastic-CAS in SD and spin-adapted basis (GUGA), and Stochastic-GASCI using the SD representation. The Stochastic-CAS and Stochastic-GAS calculations used a defined m value to target the desired state, while the spin-adapted Stochastic-CAS used a well-defined s value for each state. The upper figure displays the spin ladder energetics relative to the high-spin state (s = 10). The lower figure displays the energy difference, ΔE, between the SD based calculations and the spin-adapted Stochastic-CASSCF results from ref (97). The local interspace excitations for GAS are controlled with nexc according to eq . Error bars were obtained using blocking analysis.[72] The spin-adapted Stochastic-CASSCF calculations used N = 1 × 106 walkers,[97] while the SD-based calculations used N = 5 × 107 walkers. The analogy between the Fe4S4 and the N4 spin-ladders is to be noted. The highest-spin s = |m| = 10 configuration for Fe4S4 is an eigenfunction of both Ŝ2 and Ŝ, and the CI-expansion consists of precisely one configuration. For this case, neither difference exists between SD and CSF bases, nor between the GAS and the CAS expansions. Moreover, as for the N4 system, also the Fe4S4 system is an antiferromagnet and it is thus possible to target spin eigenstates using the SD representation, without requiring spin-adaptation or spin-purification strategies. We will first discuss the difference between the SD-based Stochastic-CASCI and the spin-adapted Stochastic-CASCI spin-ladders. Figure shows that the CASCI spin ladder in the SD basis is in good agreement with the spin-adapted CASCI results. Some marginal deviations appear for s ≤ 3. The largest deviation occurs for s = 2, and it is less than 1 kcal mol–1. For higher spin-states the deviation becomes vanishingly small. This difference has been attributed in an earlier work[78] to the slow convergence of the FCIQMC dynamics in SD basis with respect to the number of walkers (initiator bias). The number of possible SDs for a given number of electrons N, orbitals n, and spin-projection m is given bywhich reaches its maximum for m = 0, if n = N as in our case. If the initiator bias is directly bound to the size of the Hilbert space, it should be highest for the lowest spin-state. Why, then, is the slowest convergence observed for the s = |m| = 2 state? The answer is offered by the CI expansion in CSF basis. The Fe4S4 cluster behaves to the leading terms as a spin-system, with exchange-interactions representing the main form of spin interactions across the sites.[98] CSFs with singly occupied orbitals represent the leading configurations of such a spin-system, and their number is promptly given by the van Vleck-Sherman formula:[99]where n and s refer to the number of singly occupied orbitals and the target spin, respectively. For the (20, 20) active space, ΩCSF attains its maximum at s = 2. The multireference character in a CSF basis is amplified when transformed to the corresponding SD basis, and more walkers are required within the FCIQMC algorithm for a correct stochastic representation of the more complex wave function. Therefore, while it is possible to calculate pure spin states for such systems using an SD basis, the number of walkers required for converging the FCIQMC dynamics is larger than in a CSF basis. While the reference calculation in a spin-pure basis was converged with N = 1 × 106 walkers,[97] the SD-based calculations used N = 5 × 107 walkers and was not fully converged in all spin states, as shown by the difference in Figure . We would like to stress that the SD-based dynamics were not at the computational limits and increasing the walker number to achieve convergence would have been possible. This was not done as it was not in the scope of the present investigation. The main goal was to compare the truncation effects of GAS compared to CAS. We now turn our attention toward the error that is introduced from a limited number of GAS interspace excitations. As already observed for the N4 system, disconnected GAS model calculations lead to the unphysical inverted and ferro-magnetically ordered spin-ladder. Interestingly, already a GAS with single interspace excitations is enough to obtain a spin ladder in excellent agreement with the corresponding CAS wave function and energetics. The largest deviation from the SD-based Stochastic-CASCI calculation is only 1.2 kJ mol–1, and the deviation from the spin-adapted Stochastic-CAS is around 4 kJ mol–1 (see lower part of Figure ). If double excitations among the iron atoms are allowed, the Stochastic-GASCI recovers the CASCI energy with a maximum deviation of 0.3 kJ mol–1 from the SD-based Stochastic-CAS energy. With these examples we demonstrated how the GAS strategy can be used to probe the effect of charge transfer excitations and to reduce the size of the Hilbert space by limiting the number of interspace excitations.

Conclusion

In this work, the Stochastic-GASSCF method was introduced based on a new PCHB excitation generator in FCIQMC for the stochastic sampling of the GASCI space, the stochastic sampling of RDMs, and the Super-CI method for the variational orbital relaxation. In Stochastic-GAS both local and cumulative particle number constraints can be imposed, a feature that is unique to Stochastic-GAS, and allows exploration of both the cumulative constraints of the conventional GAS,[27] as implemented in OpenMolcas,[33] and the local constraints, as implemented in the ORMAS method, and made available in GAMESS-US.[31] GAS allowed electronic configurations are classified into supergroups based on the number of electrons per GAS subspace. The concept of supergroups is at the core of the new GAS-PCHB excitation generation, since the GAS constraints only require knowledge about the supergroup of a configuration. Since the GAS constraints are ingrained in precalculated probability distributions per supergroup, our algorithm adds practically no runtime overhead to an unconstrained Full CI PCHB calculation. On the other hand, the higher number of probability distributions increases the memory demand compared to Full CI PCHB. Three different potential showcase applications have been discussed that demonstrate how Stochastic-GAS can be used to reduce computational costs of FCIQMC by operating on preselected truncated CI spaces or to enhance our understanding of the role of different electron correlation pathways. The first example was a stack of five benzene molecules. We separated the system into five GAS spaces to enable full correlation inside each molecule, but allowed only a limited number of excitations between the fragments. Depending on the distance between the fragments, a different number of interspace excitations was necessary to recover the Full CI energy, but already with one interspace excitation the error was well below 1 kJ mol–1 for distances of 3 Å or larger. By this application, we show that Stochastic-GAS can easily be tailored toward fragment-based compounds, and thus operate in a mode that is conceptually similar to other techniques, such as NOCI-RCMO and ASD-DMRG. However, while NOCI-RCMO and ASD-DMRG are exclusively tailored toward fragments, GAS can be applied to a wider range of chemical situations. With suitably chosen orbitals and GAS subspaces the new method is practically size-extensive. In the second example we used Stochastic-GAS to perform a very large uncontracted stochastic-MRCISD calculation (using the RAS strategy) on an Fe(II)–porphyrin model system, for which a total of 96 electrons and 159 orbitals have been correlated over a (32, 34) RAS2 space. While the RAS2 space accounts for relatively strong forms of electron correlation, as we have discussed already in earlier works,[16−18] single and double excitations out of the occupied space (RAS1) and into the virtual orbitals (RAS3) account for dynamic correlation. By considering both static and dynamic electron correlation, we could greatly improve the theoretical estimate for the spin gap between the 5A1g and 3Eg state to be around 7.0(1) kcal mol–1, substantially larger than our previous estimate exclusively based on the Stochastic-CASSCF(32, 34) energetics. This application shows how Stochastic-GAS can be utilized to account for dynamic correlation in a systematically improvable way, as opposed to other methodologies such as DFT[100,101] or MC-PDFT[102] that rely on the accuracy of (translated) functionals to describe exchange and correlation effects. In the last example, we calculated the spin ladder of an Fe4S4 cubane cluster and used Stochastic-GAS to understand the role of charge-transfer excitations in differentially stabilizing the low-energy spin-states. We showed that although the current Stochastic-GAS is based on an SD many-body basis, it is possible to efficiently use the method to selectively target pure-spin states of antiferromagnets. A limitation still exists in using the SD-based approach; for some spin states, a relatively slower convergence is observed with respect to the walker number, compared to the spin-adapted implementation. This limitation is independent of the GAS constraints, but depends on the system. This limitation mostly characterizes systems featuring a large number of unpaired electrons, where denser wave functions exist and spin interactions are harder to describe using a SD basis. This limitation is to a large extent removed using spin-adaptation. A spin-adapted Stochastic-GAS strategy is under development and will be presented in a separate work. Via Stochastic-GAS it is demonstrated that the exchange interactions stabilize the high spin states (inverted spin-ladder for disconnected GAS subspaces with ferromagnetically ordered spin states), while low-spin states are stabilized via charge-transfer excitations, that are included as soon as connected-GAS spaces are considered. For the Fe4S4 cubane system, GAS calculations with single interspace excitations already recover the CAS energy with an error ≤1.2 kJ mol–1 and two interspace excitations decrease this error to ≤0.3 kJ mol–1. As a final remark for possible future applications, we note that the Stochastic-GAS strategy can be utilized also for core-excitations, necessary in simulating X-ray diffraction spectroscopy.[103,104] As for the conventional GAS strategy, core orbitals can be included in one of the GAS subspaces and constrain them to have a minimum number of holes. The advantage of Stochastic-GAS over conventional GAS is that in the former the GAS subspaces can be made substantially larger. Thus, dynamic correlation effects for the core-excited states can be accounted for already at the level of Stochastic-GAS. Future development of methods in our group will concentrate on GAS in a spin-pure basis using GUGA.

Appendix

FCIQMC

In this section a brief overview of the FCIQMC algorithm[11,14,105] is provided, the elements of which are crucial to the understanding of the stochastic-GAS algorithm. Starting with the imaginary-time (τ = it) Schrödinger equation, , and assuming that an initial state, D0 (referred to as the reference determinant), has non-zero overlap with the ground state, Ψ0, our system will evolve to the ground state in the long-term limitIf we assume a finite many-body basis, for example Slater determinants (SDs), D, and linearize the propagator via a first-order Taylor expansion, we obtainwhereand S is a shift parameter specifically introduced within FCIQMC for walker population control. The value of S has to equal the correlation energy at stationary conditions.[11] In principle, eq can be solved deterministically. However, the evaluation of the large number of Hamiltonian matrix elements and the operation count bound to the CI update and the storage of the updated CI vector, makes this equation prohibitive to solve deterministically, in practical cases where large active spaces are utilized. In FCIQMC the imaginary-time evolution of the CI wave function is represented via the propagation of signed stochastic walkers across the configurational space. At each time-step, Δτ, the propagation process is divided into four steps: excitation generation, spawning, death, and annihilation. New walkers spawn stochastically using eq . For a given time-step Δτ we accept new spawns from the parent determinant D to the child determinant D with an acceptance probabilityNote that pacc(i, j) may become larger than one which means that a given walker can spawn more than one new walker. For a stable FCIQMC dynamics, it is desirable to have spawn events with a constant probability, hence to keep Δτ |K| nearly constant. This is achieved by suggesting new determinants D with a non-uniform generation probability pgen(i, j) ∝ |K|, that is, to suggest strongly connected determinants more often. Thus, a modified equation for the acceptance probability is consideredwhere a suitable choice of pgen(i, j) ∝ |K| leads to a more stable acceptance probability. The suggestion of new determinants is called the excitation generation step, and it is at the heart of an efficient implementation of FCIQMC. The excitation generation and spawning steps are “embarassingly parallel” processes. After excitation generation and spawning, the diagonal death step from eq is performed, and all walkers on determinant D are stochastically killed with a probability proportional to (K. Depending on the signs of the parent determinants, D, and of the corresponding K it is possible that spawns to D with different signs arise from different determinants D, which is a manifestation of the sign-problem within the FCIQMC algorithm.[11,106] To partially control the sign problem, spawns of opposite sign to the same determinant are summed at each time-step. This process represents the annihilation step. At the beginning of the simulation the shift parameter, S, is kept constant (generally initialized to a small real number or equal to zero). This allows the walker population to grow until the target population is reached. Once the target population is reached, the spawn, annihilation, and death steps are repeated with a shift parameter that is varied such that the target population stays constant. The calculation is carried in stationary conditions to collect sufficient data points for a satisfactory statistical analyis.[72] The stochastic error of FCIQMC can be greatly reduced by using the semi-stochastic method. After stochastic propagation of eq , the ncore most occupied determinants, representing the core space of the evolving CI wave function, are identified, and the full Hamiltonian matrix, Hcore, for these configurations is constructed. The dynamics is propagated deterministically inside the core space and stochastically outside. The projected energy, Eproj is a common FCIQMC energy estimator which is obtained by projecting the sampled wave function, Ψ(τ), at any imaginary-time, τ, on the reference determinant, D0:When the wave function approaches the ground state, the above estimator converges to the ground state energy. To minimize the relative statistical noise of the denominator, one chooses a reference determinant that has a high CI coefficient. This is typically the Hartree-Fock (HF) determinant which usually has the highest CI coefficient. However, if another determinant is found during the simulation to have a higher coefficient, a change of reference may occur, that helps in stabilizing the projected energy estimate. Note that FCIQMC samples of the numerator and the denominator of the projected energy should be averaged separately before taking their ratio. The original FCIQMC algorithm suffers from a sign-problem in its application to most systems including ab inito ones.[11,106] When the number of walkers is below a certain threshold, called the annihilation plateau, the sampled wave function does not have a stable sign-structure and is dominated by sign-incoherent noise. The annihilation plateau depends on the system under study and is typically a non-negligible fraction of the overall size of the Hilbert space. This means that one needs a minimum number of walkers that scales exponentially with the number of electrons and the number of orbitals. The problem is largely overcome by applying the initiator approximation, i-FCIQMC, which obviates the annihilation plateau and allows a stable simulation using small numbers of walkers.[12,107] In i-FCIQMC, a walker is classified as an initiator if the determinant on which it resides has a population above a chosen threshold nadd (usually set to three). Only initiators are allowed to spawn onto empty determinants, while non-initiators can only spawn onto other occupied determinants. These constrained dynamics stop low-populated determinants from propagating unstable sign-structure further into the Hilbert space but introduce a bias, called the initiator bias, that can be systematically improved by increasing the number of walkers. In the limit of a very large number of walkers, all non-zero walkers become initiators and the exactness of the original method is restored. The convergence of the initiator method to the exact FCI limit with the number of walkers can be further accelerated with the help of the adaptive shift method.[58,59] The initiator bias is mainly attributed to the missing back-spawns onto the non-initiators resulting from their underpopulated local Hilbert space. This bias is ameliorated in the adaptive shift method by reducing the shift of non-initiators and thus boosting their lifetime to compensate for the missing back-spawns. In the adaptive shift method, each determinant D gets its own local shift S as a fraction of the total global shift Swhere S ≤ Δ ≤ 0 is an adjustable offset parameter to be discussed below and f are factors measuring how much a determinant is affected by the undersampling. These factors are computed during the simulation as a weighted ratio of the spawns accepted under the initiator constraintand the weights w are the first-order perturbation theory contribution of a walker on determinant D to determinant Dwith E being an estimate of the ground state energy such as the instantaneous projected energy. The offset parameter Δ provides a mean of controlling the amount of correction applied by the adaptive shift method. When Δ = 0, the correction is applied in its full strength (S = f · S), while for Δ = S the adaptive shift reduces to the conventional i-FCIQMC algorithm. Lowering the offset gives higher total energy estimates, using the same number of walkers. When the energy is plotted as a function of the number of walkers, there is a strict ordering between the energy curves for different offsets, with some converging from below (high offsets), while others converge from above (low offsets). By varying the offset, one can use this property to bracket the exact energy between the curves of different offsets (see Figure ). A good starting point for varying the offset is setting it to half the correlation energy Δ = Ẽcorrelation/2. This estimate of the correlation energy, Ẽcorrelation, can be approximated by the shift, S, of an earlier FCIQMC calculation or by other methods such as MP2 or coupled cluster calculations.

Figure 8

Normal initiator and adaptive shift results using different offsets for N2 in the cc-pVDZ basis set and stretched geometry: 4.2a0.[59]

Normal initiator and adaptive shift results using different offsets for N2 in the cc-pVDZ basis set and stretched geometry: 4.2a0.[59] For the test case applications investigated in this work the choice of Δ = S/2 was already satisfactory for largely curing the initiator bias.

Compositions, Supergroups, and Indexing

In this subsection we derive a fast on-the-fly algorithm for calculating the supergroup index of a given determinant. We will first discuss how to calculate the index for a given composition and apply this knowledge to supergroups. We repeat the definitions formally. Definition 6.1 (Compositions and Supergroups) For we call a solution of the following equationa composition of integers. We define p(k, n) to be the number of different compositions. Solutions with a different order of summands are considered to be different. If we identify n with the number of particles and k with the number of possible GAS spaces then a supergroup is a composition that is allowed by local or cumulative GAS constraints. We define p(k, n, Nmin, Nmax) and p(k, n, Ñmin, Ñmax) to be the number of supergroups for local and cumulative constraints, respectively. We trivially note that p(1, n) = p(k, 0) = 1 and in general we have the following lemma. Lemma 6.2 (Number of compositions) For the number of compositions is given byProof. If there are k summands, there are (k – 1) “+” symbols in the composition. If we have n symbols “∗” and denote numbers with a corresponding number of those symbols we can write compositions with arrangements of “+” and “∗” symbols. For example 8 = 3 + 0 + 3 + 2 can be denoted as 8 = ***++***+**. The number of compositions is then the number of different arrangements.□ Definition 6.3 (Composition and supergroup index). We assume lexicographical decreasing order of the compositions and assign the composition index based on this order. In the same way, we assume lexicographical decreasing order of the supergroups and assign the supergroup index based on this order. In Table , we show an example for the composition of three with three summands and the supergroups from example GAS constraints and the respective indices.

Table 4

Example for the Supergroups of a GAS Constraint with a Cumulative Minimum and Maximum of [0, 1, 3] and [2, 2, 3]a

i_sg	i_C	x₁	x₂	x₃
	1	3	0	0
	2	2	1	0
1	3	2	0	1
	4	1	2	0
2	5	1	1	1
3	6	1	0	2
	7	0	3	0
4	8	0	2	1
5	9	0	1	2
	10	0	0	3

The composition index is iC and the supergroup index isg. GAS allowed compositions, i.e., supergroups, are in bold font.

The composition index is iC and the supergroup index isg. GAS allowed compositions, i.e., supergroups, are in bold font. Lemma 6.4 (Determine composition index) For a composition the composition index iC is given bywhereProof. We know that all elements of the composition lie between 0 and n. If the first element C1 is equal to n, all other elements of the composition have to be zero and the composition index has to be one. (First row of Table ) If C1 is smaller than n, all compositions with a leading term L, C1 + 1 ≤ L ≤ n have a lower index. The number of all compositions with a first element of L is given by p(k – 1, n – L), because we write the sum of n – L with k – 1 summands. By summing the number of all compositions with a leading term L, C1 + 1 ≤ L ≤ n, we can now calculate the index of the first composition which has a leading term of C1 asIf we keep in mind that n – C1 = ∑C = |C2:|1 we can rewrite eq as If we look at the second element C2, we are either finished, because C1 + C2 equals n, or all compositions with a second element L, C2 + 1 ≤ L ≤ n – C1 are larger than C in lexicographical order. We can repeat the previous steps to calculate the number of compositions and continue this procedure for all elements of C to arrive at the final eq .□ We give an illustrative example and determine the index of the composition [1, 0, 2]. Since the leading term is a 1, we can “jump” over all compositions with a leading 3 or 2 and arrive at [1, 2, 0], which is the first composition with a leading 1. The number of terms with a leading 3 is given by p(2, 3–3) = p(2, 0). The number of terms with a leading 2 is given by p(2, 3–2) = p(2, 1). Then we can repeat the same logic to jump over [1, 1, 1] to arrive at [1, 0, 2]. In total this giveswhich agrees with Table . Since p(k, n) only requires the binomial coefficient which can be implemented as lookup table, the whole calculation may become a summation of values from a lookup table. If we had a general closed solution to calculate the number of supergroups p and p, we could apply the same logic of “jumping over” leading terms to calculate the supergroup index. Unfortunately only recursive solutions are known for p and p which do not lend themselves to an efficient implementation.[66] For this reason, we generate all possible supergroups in the beginning. This step does not have to be performant, one can, for example, generate all compositions and just test if they adhere to GAS constraints. After lexicographical sorting of the supergroups we calculate the composition index for each of them and store it. If we now want to calculate the supergroup index for a determinant, we count the number of particles per GAS space to get a supergroup, calculate the composition index of the supergroup using Lemma 6.4, and look up the position of this composition index in our stored list. We can calculate for example the supergroup index of [1, 0, 2] as

Conversion of Constraints

In this section, we want to show that cumulative and local GAS constraints are not always equivalent and that there are systems which can be expressed in only one of them. We also want to prove eq for the conversion between these constraints when it is possible. We write again N and nGAS for the number of particles and GAS spaces. If Nmin, Nmax denote local GAS constraints and Ñmin, Ñmax denote cumulative ones, then we write for the set of all supergroups under local constraints and for the set of all supergroups under cumulative constraints . We start with an example that shows that there is at least one cumulative constraint that cannot be expressed with local ones. Example 6.5. We define a system comprising four GAS spaces using cumulative constraints and eight particles and tight inequalities Ñ4min = Ñ4max = 8 in the last space in Table . We assume that local constraints Nmin, Nmax exist such that and lead this to a contradiction.

Table 5

An Example System with Cumulative Constraints Ñmin, Ñmax which Cannot Be Expressed Using Local Constraints

Ñ_i^min	Ñ_i^max
1	3
4	4
5	7
8	8

Proof. The compositions [1, 3, 1, 3] and [3, 1, 3, 1] are contained in . This implies that for all spaces Nmin ≤ 1 and 3 ≤ Nmax. We immediately concludeSince we see from eq that But the calculation shows that [1, 1, 3, 3] cannot be contained in the cumulative GAS constraints from Table .□ In a next step, we prove eq with the following Lemma. Lemma 6.6. If for a given number of particles N and GAS spaces nGAS local GAS constraints Nmin, Nmax and cumulative GAS constraints Ñmin, Ñmax exist such thatand if for each i > 1 there exists a composition such thatand if for each i > 1 there exists a composition such thatand if for each i > 1 there exists a composition such thatand if for each i > 1 there exists a composition such thatthen the following relationships must hold:Proof. The relationship is true for i = 1. We now prove for i > 1 that Nmin = Ñmin – Ñmax. Note that this is not a proof by induction, we just separated the two cases. By definition of Nmin, we have for the composition from condition 40 thatOn the other hand, we have for every composition Since we assume that y is also contained in the set of supergroups under local constraints (condition 39), we conclude for every Since we also assume, that at least one z admits the extremal value Nmin (condition 41) we concludeWith inequality 45 we arrive at Nmin = Ñmin – Ñmax. The proof of Nmax = Ñmax – Ñmin can be performed in exactly the same way.□ It might seem very difficult to use Lemma 6.6 in practice, because the equality of local and cumulative constraints is an assumed condition that has to be verified (eq ). But it is very useful not so much to convert between constraints that are known to be equivalent, but to prove that a given type of constraint has no equivalent. We could have proven example 6.5 by directly applying the conversion formulas (eq ) to the cumulative constraints Ñmin, Ñmax. The obtained local constraints Nmin, Nmax and the original cumulative constraints fulfill the conditions 39–43, but as in the original proof of example 6.5 the obtained local constraints have different supergroups With Lemma 6.6 we conclude that there are no local constraints that are equivalent to the cumulative constraints from Table .

64 in total

1. Density-matrix algorithms for quantum renormalization groups.

Authors:
Journal: Phys Rev B Condens Matter Date: 1993-10-01

2. The sign problem and population dynamics in the full configuration interaction quantum Monte Carlo method.

Authors: J S Spencer; N S Blunt; W M C Foulkes
Journal: J Chem Phys Date: 2012-02-07 Impact factor: 3.488

3. Molcas 8: New capabilities for multiconfigurational quantum chemical calculations across the periodic table.

Authors: Francesco Aquilante; Jochen Autschbach; Rebecca K Carlson; Liviu F Chibotaru; Mickaël G Delcey; Luca De Vico; Ignacio Fdez Galván; Nicolas Ferré; Luis Manuel Frutos; Laura Gagliardi; Marco Garavelli; Angelo Giussani; Chad E Hoyer; Giovanni Li Manni; Hans Lischka; Dongxia Ma; Per Åke Malmqvist; Thomas Müller; Artur Nenov; Massimo Olivucci; Thomas Bondo Pedersen; Daoling Peng; Felix Plasser; Ben Pritchard; Markus Reiher; Ivan Rivalta; Igor Schapiro; Javier Segarra-Martí; Michael Stenrup; Donald G Truhlar; Liviu Ungur; Alessio Valentini; Steven Vancoillie; Valera Veryazov; Victor P Vysotskiy; Oliver Weingart; Felipe Zapata; Roland Lindh
Journal: J Comput Chem Date: 2015-11-12 Impact factor: 3.376

4. NECI: N-Electron Configuration Interaction with an emphasis on state-of-the-art stochastic methods.

Authors: Kai Guther; Robert J Anderson; Nick S Blunt; Nikolay A Bogdanov; Deidre Cleland; Nike Dattani; Werner Dobrautz; Khaldoon Ghanem; Peter Jeszenszki; Niklas Liebermann; Giovanni Li Manni; Alexander Y Lozovoi; Hongjun Luo; Dongxia Ma; Florian Merz; Catherine Overy; Markus Rampp; Pradipta Kumar Samanta; Lauretta R Schwarz; James J Shepherd; Simon D Smart; Eugenio Vitale; Oskar Weser; George H Booth; Ali Alavi
Journal: J Chem Phys Date: 2020-07-21 Impact factor: 3.488

5. Modern Approaches to Exact Diagonalization and Selected Configuration Interaction with the Adaptive Sampling CI Method.

Authors: Norm M Tubman; C Daniel Freeman; Daniel S Levine; Diptarka Hait; Martin Head-Gordon; K Birgitta Whaley
Journal: J Chem Theory Comput Date: 2020-04-02 Impact factor: 6.006

6. X-ray linear and non-linear spectroscopy of the ESCA molecule.

Authors: Artur Nenov; Francesco Segatta; Adam Bruner; Shaul Mukamel; Marco Garavelli
Journal: J Chem Phys Date: 2019-09-21 Impact factor: 3.488

7. Combining the Complete Active Space Self-Consistent Field Method and the Full Configuration Interaction Quantum Monte Carlo within a Super-CI Framework, with Application to Challenging Metal-Porphyrins.

Authors: Giovanni Li Manni; Simon D Smart; Ali Alavi
Journal: J Chem Theory Comput Date: 2016-02-16 Impact factor: 6.006