1
|
Yin C, Becker SB, Thorpe JH, Matthews DA. Spatial Signatures of Electron Correlation in Least-Squares Tensor Hypercontraction. J Phys Chem A 2025; 129:788-802. [PMID: 39784297 DOI: 10.1021/acs.jpca.4c06666] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2025]
Abstract
Least-squares tensor hypercontraction (LS-THC) has received some attention in recent years as an approach to reduce the significant computational costs of wave function-based methods in quantum chemistry. However, previous work has demonstrated that LS-THC factorization performs disproportionately worse in the description of wave function components (e.g., cluster amplitudes T̂2) than Hamiltonian components (e.g., electron repulsion integrals (pq|rs)). This work develops novel theoretical methods to study the source of these errors in the context of the real-space T̂2 kernel, and reports, for the first time, the existence of a "correlation feature" in the errors of the LS-THC representation of the "exchange-like" correlation energy EX and T̂2 that is remarkably consistent across ten molecular species, three correlated wave functions, and four basis sets. This correlation feature portends the existence of a "pair point kernel" missing in the usual LS-THC representation of the wave function, which critically depends upon pairs of grid points situated close to atoms and with interpair distances between one and two Bohr radii. These findings point the way for future LS-THC developments to address these shortcomings.
Collapse
Affiliation(s)
- Chao Yin
- Department of Chemistry, Southern Methodist University, Dallas, Texas 75275, United States
| | - Sara Beth Becker
- Department of Chemistry, Southern Methodist University, Dallas, Texas 75275, United States
| | - James H Thorpe
- Department of Chemistry, Southern Methodist University, Dallas, Texas 75275, United States
| | - Devin A Matthews
- Department of Chemistry, Southern Methodist University, Dallas, Texas 75275, United States
| |
Collapse
|
2
|
Teke N, Melekamburath A, Gaudel B, Valeev EF. "Best" Iterative Coupled-Cluster Triples Model? More Evidence for 3CC. J Phys Chem A 2024; 128:9819-9828. [PMID: 39478319 PMCID: PMC11571224 DOI: 10.1021/acs.jpca.4c04667] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2024] [Revised: 10/14/2024] [Accepted: 10/15/2024] [Indexed: 11/15/2024]
Abstract
To follow up on the unexpectedly good performance of several coupled-cluster models with approximate inclusion of 3-body clusters [Rishi, V.; Valeev, E. F. J. Chem. Phys. 2019, 151, 064102.] we performed a more complete assessment of the 3CC method [Feller, D. . J. Chem. Phys. 2008, 129, 204105.] for accurate computational thermochemistry in the standard HEAT framework. New spin-integrated implementation of the 3CC method applicable to closed- and open-shell systems utilizes a new automated toolchain for derivation, optimization, and evaluation of operator algebra in many-body electronic structure. We found that with a double-ζ basis set the 3CC correlation energies and their atomization energy contributions are almost always more accurate (with respect to the CCSDTQ reference) than the CCSDT model as well as the standard CCSD(T) model. The mean absolute errors in cc-pVDZ {3CC, CCSDT, and CCSD(T)} electronic (per valence electron) and atomization energies relative to the CCSDTQ reference for the HEAT data set [Tajti, A. . J. Chem. Phys. 2004, 121, 11599-11613.], were {24, 70, 122} μEh/e and {0.46, 2.00, 2.58} kJ/mol, respectively. The mean absolute errors in the complete-basis-set limit {3CC, CCSDT, and CCSD(T)} atomization energies relative to the HEAT model reference, were {0.52, 2.00, and 1.07} kJ/mol, The significant and systematic reduction of the error by the 3CC method and its lower cost than CCSDT suggests it as a viable candidate for post-CCSD(T) thermochemistry applications, as well as the preferred alternative to CCSDT in general.
Collapse
Affiliation(s)
- Nakul
K. Teke
- Department of Chemistry, Virginia Tech, Blacksburg, Virginia 24061, United States
| | - Ajay Melekamburath
- Department of Chemistry, Virginia Tech, Blacksburg, Virginia 24061, United States
| | - Bimal Gaudel
- Department of Chemistry, Virginia Tech, Blacksburg, Virginia 24061, United States
| | - Edward F. Valeev
- Department of Chemistry, Virginia Tech, Blacksburg, Virginia 24061, United States
| |
Collapse
|
3
|
Pokhilko P, Yeh CN, Morales MA, Zgid D. Tensor hypercontraction for fully self-consistent imaginary-time GF2 and GWSOX methods: Theory, implementation, and role of the Green's function second-order exchange for intermolecular interactions. J Chem Phys 2024; 161:084108. [PMID: 39185845 DOI: 10.1063/5.0215954] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2024] [Accepted: 08/05/2024] [Indexed: 08/27/2024] Open
Abstract
We present an efficient MPI-parallel algorithm and its implementation for evaluating the self-consistent correlated second-order exchange term (SOX), which is employed as a correction to the fully self-consistent GW scheme called scGWSOX (GW plus the SOX term iterated to achieve full Green's function self-consistency). Due to the application of the tensor hypercontraction (THC) in our computational procedure, the scaling of the evaluation of scGWSOX is reduced from O(nτnAO5) to O(nτN2nAO2). This fully MPI-parallel and THC-adapted approach enabled us to conduct the largest fully self-consistent scGWSOX calculations with over 1100 atomic orbitals with only negligible errors attributed to THC fitting. Utilizing our THC implementation for scGW, scGF2, and scGWSOX, we evaluated energies of intermolecular interactions. This approach allowed us to circumvent issues related to reference dependence and ambiguity in energy evaluation, which are common challenges in non-self-consistent calculations. We demonstrate that scGW exhibits a slight overbinding tendency for large systems, contrary to the underbinding observed with non-self-consistent RPA. Conversely, scGWSOX exhibits a slight underbinding tendency for such systems. This behavior is both physical and systematic and is caused by exclusion-principle violating diagrams or corresponding corrections. Our analysis elucidates the role played by these different diagrams, which is crucial for the construction of rigorous, accurate, and systematic methods. Finally, we explicitly show that all perturbative fully self-consistent Green's function methods are size-extensive and size-consistent.
Collapse
Affiliation(s)
- Pavel Pokhilko
- Department of Chemistry, University of Michigan, Ann Arbor, Michigan 48109, USA
| | - Chia-Nan Yeh
- Center for Computational Quantum Physics, Flatiron Institute, New York, New York 10010, USA
| | - Miguel A Morales
- Center for Computational Quantum Physics, Flatiron Institute, New York, New York 10010, USA
| | - Dominika Zgid
- Department of Chemistry, University of Michigan, Ann Arbor, Michigan 48109, USA
- Department of Physics, University of Michigan, Ann Arbor, Michigan 48109, USA
| |
Collapse
|
4
|
Datar A, Matthews DA. Robust Tensor Hypercontraction of the Particle-Particle Ladder Term in Equation-of-Motion Coupled Cluster Theory. J Chem Theory Comput 2024; 20:708-720. [PMID: 38198505 DOI: 10.1021/acs.jctc.3c00892] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2024]
Abstract
One method of representing a high-rank tensor as a (hyper-)product of lower-rank tensors is the tensor hypercontraction (THC) method of Hohenstein et al. This strategy has been found to be useful for reducing the polynomial scaling of coupled-cluster methods by representation of a four-dimensional tensor of electron-repulsion integrals in terms of five two-dimensional matrices. Pierce et al. have already shown that the application of a robust form of THC to the particle-particle ladder (PPL) term reduces the cost of this term in couple-cluster singles and doubles (CCSD) from O ( N 6 ) to O ( N 5 ) with negligible errors in energy with respect to the density-fitted variant. In this work, we have implemented the least-squares variant of THC (LS-THC) which does not require a nonlinear tensor factorization, including the robust form (R-LS-THC), for the calculation of the excitation and electron attachment energies using equation-of-motion coupled cluster methods EOMEE-CCSD and EOMEA-CCSD, respectively. We have benchmarked the effect of the R-LS-THC-PPL approximation on excitation energies using the comprehensive QUEST database and the accuracy of electron attachment energies using the NAB22 database. We find that errors on the order of 1 meV are achievable with a reduction in total calculation time of approximately 5 ×.
Collapse
Affiliation(s)
- Avdhoot Datar
- Department of Chemistry, Southern Methodist University, Dallas, Texas 75275, United States
| | - Devin A Matthews
- Department of Chemistry, Southern Methodist University, Dallas, Texas 75275, United States
| |
Collapse
|
5
|
Asadchev A, Valeev EF. High-Performance Evaluation of High Angular Momentum 4-Center Gaussian Integrals on Modern Accelerated Processors. J Phys Chem A 2023; 127:10889-10895. [PMID: 38090753 DOI: 10.1021/acs.jpca.3c04574] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2023]
Abstract
We present a high-performance evaluation method for 4-center 2-particle integrals over Gaussian atomic orbitals with high angular momenta (l ≥ 4) and arbitrary contraction degrees on graphical processing units (GPUs) and other accelerators. The implementation uses the matrix form of McMurchie-Davidson recurrences. Evaluation of the four-center integrals over four l = 6 (i) Gaussian AOs in double precision (FP64) on an NVIDIA V100 GPU outperforms the reference implementation of the Obara-Saika recurrences (Libint) running on a single Intel Xeon core by more than a factor of 1000, easily exceeding the 73:1 ratio of the respective hardware peak FLOP rates while reaching almost 50% of the V100 peak. The approach can be extended to support AOs with even higher angular momenta; for lower angular momenta (l ≤ 3), additional improvements will be reported elsewhere. The implementation is part of an open-source LibintX library freely available at github.com:ValeevGroup/LibintX.
Collapse
Affiliation(s)
- Andrey Asadchev
- Department of Chemistry, Virginia Tech, Blacksburg, Virginia 24061, United States
| | - Edward F Valeev
- Department of Chemistry, Virginia Tech, Blacksburg, Virginia 24061, United States
| |
Collapse
|
6
|
Rettig A, Lee J, Head-Gordon M. Even Faster Exact Exchange for Solids via Tensor Hypercontraction. J Chem Theory Comput 2023; 19:5773-5784. [PMID: 37586065 DOI: 10.1021/acs.jctc.3c00407] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/18/2023]
Abstract
Hybrid density functional theory (DFT) remains intractable for large periodic systems due to the demanding computational cost of exact exchange. We apply the tensor hypercontraction (THC) (or interpolative separable density fitting) approximation to periodic hybrid DFT calculations with Gaussian-type orbitals using the Gaussian plane wave approach. This is done to lower the computational scaling with respect to the number of basis functions (N) and k-points (Nk) at a fixed system size. Additionally, we propose an algorithm to fit only occupied orbital products via THC (i.e., a set of points, NISDF) to further reduce computation time and memory usage. This algorithm has linear scaling cost with k-points, no explicit dependence of NISDF on basis set size, and overall cubic scaling with unit cell size. Significant speedups and reduced memory usage may be obtained for moderately sized k-point meshes, with additional gains for large k-point meshes. Adequate accuracy can be obtained using THC-oo-K for self-consistent calculations. We perform illustrative hybrid density function theory calculations on the benzene crystal in the basis set and thermodynamic limits to highlight the utility of this algorithm.
Collapse
Affiliation(s)
- Adam Rettig
- Department of Chemistry, University of California, Berkeley, California 94720, United States
- Chemical Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, United States
| | - Joonho Lee
- Department of Chemistry, Columbia University, New York, New York 10027, United States
| | - Martin Head-Gordon
- Department of Chemistry, University of California, Berkeley, California 94720, United States
- Chemical Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, United States
| |
Collapse
|
7
|
Pierce K, Valeev EF. Efficient Construction of Canonical Polyadic Approximations of Tensor Networks. J Chem Theory Comput 2023; 19:71-81. [PMID: 36484711 DOI: 10.1021/acs.jctc.2c00861] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
We consider the problem of constructing a canonical polyadic (CP) decomposition for a tensor network, rather than a single tensor. We illustrate how it is possible to reduce the complexity of constructing an approximate CP representation of the network by leveraging its structure in the course of the CP factor optimization. The utility of this technique is demonstrated for the order-4 Coulomb interaction tensor approximated by two order-3 tensors via an approximate generalized square-root (SQ) factorization, such as density fitting or (pivoted) Cholesky. The complexity of constructing a four-way CP decomposition is reduced from O(n4RCP) (for the nonapproximated Coulomb tensor) to O(n3RCP) (for the SQ-factorized Coulomb tensor), where n and RCP are the basis and CP ranks, respectively. This reduces the cost of constructing the CP approximation of two-body interaction tensors of relevance to accurate many-body electronic structure by up to 2 orders of magnitude for systems with up to 36 atoms studied here. The full four-way CP approximation of the Coulomb interaction tensor is shown to be more accurate than the known approaches which utilize CP-factorizations of the SQ factors (which are also constructed with an O(n3RCP) cost), such as the algebraic pseudospectral and tensor hypercontraction approaches. The CP-decomposed SQ factors can also serve as a robust initial guess for the four-way CP factors.
Collapse
Affiliation(s)
- Karl Pierce
- Department of Chemistry, Virginia Tech, Blacksburg, Virginia24061, United States
| | - Edward F Valeev
- Department of Chemistry, Virginia Tech, Blacksburg, Virginia24061, United States
| |
Collapse
|
8
|
Sharma S, White AF, Beylkin G. Fast Exchange with Gaussian Basis Set Using Robust Pseudospectral Method. J Chem Theory Comput 2022; 18:7306-7320. [PMID: 36417710 DOI: 10.1021/acs.jctc.2c00720] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
In this article, we present an algorithm to efficiently evaluate the exchange matrix in periodic systems when a Gaussian basis set with pseudopotentials is used. The usual algorithm for evaluating exchange matrix scales cubically with the system size because one has to perform O(N2) fast Fourier transform (FFT). Here, we introduce an algorithm that retains the cubic scaling but reduces the prefactor significantly by eliminating the need to do FFTs during each exchange build. This is accomplished by representing the products of Gaussian basis function using a linear combination of an auxiliary basis the number of which scales linearly with the size of the system. We store the potential due to these auxiliary functions in memory, which allows us to obtain the exchange matrix without the need to do FFT, albeit at the cost of additional memory requirement. Although the basic idea of using auxiliary functions is not new, our algorithm is cheaper due to a combination of three ingredients: (a) we use a robust pseudospectral method that allows us to use a relatively small number of auxiliary basis to obtain high accuracy; (b) we use occ-RI exchange, which eliminates the need to construct the full exchange matrix; and (c) we use the (interpolative separable density fitting) ISDF algorithm to construct these auxiliary basis sets that are used in the robust pseudospectral method. The resulting algorithm is accurate, and we note that the error in the final energy decreases exponentially rapidly with the number of auxiliary functions.
Collapse
Affiliation(s)
- Sandeep Sharma
- Department of Chemistry, University of Colorado Boulder, Boulder, Colorado80309, United States
| | - Alec F White
- Quantum Simulation Technologies, Inc., Boston, Massachusetts02135, United States
| | - Gregory Beylkin
- Department of Applied Mathematics, University of Colorado, Boulder, Colorado80309, United States
| |
Collapse
|
9
|
Hohenstein EG, Fales BS, Parrish RM, Martínez TJ. Rank-reduced coupled-cluster. III. Tensor hypercontraction of the doubles amplitudes. J Chem Phys 2022; 156:054102. [DOI: 10.1063/5.0077770] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Affiliation(s)
- Edward G. Hohenstein
- Department of Chemistry and The PULSE Institute, Stanford University, Stanford, California 94305, USA
- SLAC National Accelerator Laboratory, 2575 Sand Hill Road, Menlo Park, California 94025, USA
| | - B. Scott Fales
- Department of Chemistry and The PULSE Institute, Stanford University, Stanford, California 94305, USA
- SLAC National Accelerator Laboratory, 2575 Sand Hill Road, Menlo Park, California 94025, USA
| | | | - Todd J. Martínez
- Department of Chemistry and The PULSE Institute, Stanford University, Stanford, California 94305, USA
- SLAC National Accelerator Laboratory, 2575 Sand Hill Road, Menlo Park, California 94025, USA
| |
Collapse
|
10
|
Lesiuk M. Quintic-scaling rank-reduced coupled cluster theory with single and double excitations. J Chem Phys 2022; 156:064103. [DOI: 10.1063/5.0071916] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
|
11
|
Hohenstein EG, Martínez TJ. GPU acceleration of rank-reduced coupled-cluster singles and doubles. J Chem Phys 2021; 155:184110. [PMID: 34773962 DOI: 10.1063/5.0063467] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
We have developed a graphical processing unit (GPU) accelerated implementation of our recently introduced rank-reduced coupled-cluster singles and doubles (RR-CCSD) method. RR-CCSD introduces a low-rank approximation of the doubles amplitudes. This is combined with a low-rank approximation of the electron repulsion integrals via Cholesky decomposition. The result of these two low-rank approximations is the replacement of the usual fourth-order CCSD tensors with products of second- and third-order tensors. In our implementation, only a single fourth-order tensor must be constructed as an intermediate during the solution of the amplitude equations. Owing in large part to the compression of the doubles amplitudes, the GPU-accelerated implementation shows excellent parallel efficiency (95% on eight GPUs). Our implementation can solve the RR-CCSD equations for up to 400 electrons and 1550 basis functions-roughly 50% larger than the largest canonical CCSD computations that have been performed on any hardware. In addition to increased scalability, the RR-CCSD computations are faster than the corresponding CCSD computations for all but the smallest molecules. We test the accuracy of RR-CCSD for a variety of chemical systems including up to 1000 basis functions and determine that accuracy to better than 0.1% error in the correlation energy can be achieved with roughly 95% compression of the ov space for the largest systems considered. We also demonstrate that conformational energies can be predicted to be within 0.1 kcal mol-1 with efficient compression applied to the wavefunction. Finally, we find that low-rank approximations of the CCSD doubles amplitudes used in the similarity transformation of the Hamiltonian prior to a conventional equation-of-motion CCSD computation will not introduce significant errors (on the order of a few hundredths of an electronvolt) into the resulting excitation energies.
Collapse
Affiliation(s)
- Edward G Hohenstein
- Department of Chemistry and The PULSE Institute, Stanford University, Stanford, California 94305, USA
| | - Todd J Martínez
- Department of Chemistry and The PULSE Institute, Stanford University, Stanford, California 94305, USA
| |
Collapse
|