1
|
Laqua H, Dittmer LB, Head-Gordon M. The conundrum of diffuse basis sets: A blessing for accuracy yet a curse for sparsity. J Chem Phys 2025; 162:184107. [PMID: 40341928 DOI: 10.1063/5.0264572] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2025] [Accepted: 04/17/2025] [Indexed: 05/11/2025] Open
Abstract
Diffuse atomic orbital basis sets have proven to be essential to obtain accurate interaction energies, especially in regard to non-covalent interactions. However, they also have a detrimental impact on the sparsity of the one-particle density matrix (1-PDM), to a degree stronger than the spatial extent of the basis functions alone could explain. This is despite the fact that the matrix elements of the 1-PDM of insulators (systems with significant highest occupied molecular orbital-lowest unoccupied molecular orbital gaps) are expected to decay exponentially with increasing real-space distance from the diagonal. The observed low sparsity of the 1-PDM appears to be independent of representation and even persists after projecting the 1-PDM onto a real-space grid, leading to the conclusion that this "curse of sparsity" is solely a basis set artifact, which, counterintuitively, becomes worse for larger basis sets, seemingly contradicting the notion of a well-defined basis set limit. We show that this is a consequence of the low locality of the contra-variant basis functions as quantified by the inverse overlap matrix S-1 being significantly less sparse than its co-variant dual. Introducing the model system of an infinite non-interacting chain of helium atoms, we are able to quantify the exponential decay rate to be proportional to the diffuseness as well as local incompleteness of the basis set, meaning small and diffuse basis sets are affected the most. Finally, we propose one solution to the conundrum in the form of the complementary auxiliary basis set singles correction in combination with compact, low l-quantum-number basis sets, showing promising results for non-covalent interactions.
Collapse
Affiliation(s)
- Henryk Laqua
- Kenneth S. Pitzer Center for Theoretical Chemistry, Department of Chemistry, University of California, Berkeley, California 94720, USA
| | - Linus Bjarne Dittmer
- Kenneth S. Pitzer Center for Theoretical Chemistry, Department of Chemistry, University of California, Berkeley, California 94720, USA
- Interdisciplinary Center for Scientific Computing, Ruprecht-Karls University, Im Neuenheimer Feld 205, 69120 Heidelberg, Germany
| | - Martin Head-Gordon
- Kenneth S. Pitzer Center for Theoretical Chemistry, Department of Chemistry, University of California, Berkeley, California 94720, USA
| |
Collapse
|
2
|
Khadatkar S, Motamarri P. Subspace recursive Fermi-operator expansion strategies for large-scale DFT eigenvalue problems on HPC architectures. J Chem Phys 2023; 159:031102. [PMID: 37470423 DOI: 10.1063/5.0150287] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2023] [Accepted: 06/19/2023] [Indexed: 07/21/2023] Open
Abstract
Quantum mechanical calculations for material modeling using Kohn-Sham density functional theory (DFT) involve the solution of a nonlinear eigenvalue problem for N smallest eigenvector-eigenvalue pairs, with N proportional to the number of electrons in the material system. These calculations are computationally demanding and have asymptotic cubic scaling complexity with the number of electrons. Large-scale matrix eigenvalue problems arising from the discretization of the Kohn-Sham DFT equations employing a systematically convergent basis traditionally rely on iterative orthogonal projection methods, which are shown to be computationally efficient and scalable on massively parallel computing architectures. However, as the size of the material system increases, these methods are known to incur dominant computational costs through the Rayleigh-Ritz projection step of the discretized Kohn-Sham Hamiltonian matrix and the subsequent subspace diagonalization of the projected matrix. This work explores the potential of polynomial expansion approaches based on recursive Fermi-operator expansion as an alternative to the subspace diagonalization of the projected Hamiltonian matrix to reduce the computational cost. Subsequently, we perform a detailed comparison of various recursive polynomial expansion approaches to the traditional approach of explicit diagonalization on both multi-node central processing unit and graphics processing unit architectures and assess their relative performance in terms of accuracy, computational efficiency, scaling behavior, and energy efficiency.
Collapse
Affiliation(s)
- Sameer Khadatkar
- Department of Computational and Data Sciences, Indian Institute of Science, Bengaluru 560012, India
| | - Phani Motamarri
- Department of Computational and Data Sciences, Indian Institute of Science, Bengaluru 560012, India
| |
Collapse
|
3
|
Negre CFA, Wall ME, Niklasson AMN. Graph-based quantum response theory and shadow Born-Oppenheimer molecular dynamics. J Chem Phys 2023; 158:074108. [PMID: 36813723 DOI: 10.1063/5.0137119] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023] Open
Abstract
Graph-based linear scaling electronic structure theory for quantum-mechanical molecular dynamics simulations [A. M. N. Niklasson et al., J. Chem. Phys. 144, 234101 (2016)] is adapted to the most recent shadow potential formulations of extended Lagrangian Born-Oppenheimer molecular dynamics, including fractional molecular-orbital occupation numbers [A. M. N. Niklasson, J. Chem. Phys. 152, 104103 (2020) and A. M. N. Niklasson, Eur. Phys. J. B 94, 164 (2021)], which enables stable simulations of sensitive complex chemical systems with unsteady charge solutions. The proposed formulation includes a preconditioned Krylov subspace approximation for the integration of the extended electronic degrees of freedom, which requires quantum response calculations for electronic states with fractional occupation numbers. For the response calculations, we introduce a graph-based canonical quantum perturbation theory that can be performed with the same natural parallelism and linear scaling complexity as the graph-based electronic structure calculations for the unperturbed ground state. The proposed techniques are particularly well-suited for semi-empirical electronic structure theory, and the methods are demonstrated using self-consistent charge density-functional tight-binding theory both for the acceleration of self-consistent field calculations and for quantum-mechanical molecular dynamics simulations. Graph-based techniques combined with the semi-empirical theory enable stable simulations of large, complex chemical systems, including tens-of-thousands of atoms.
Collapse
Affiliation(s)
- Christian F A Negre
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, USA
| | - Michael E Wall
- Computer, Computational, and Statistical Sciences Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, USA
| | - Anders M N Niklasson
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, USA
| |
Collapse
|
4
|
Clark AE, Adams H, Hernandez R, Krylov AI, Niklasson AMN, Sarupria S, Wang Y, Wild SM, Yang Q. The Middle Science: Traversing Scale In Complex Many-Body Systems. ACS CENTRAL SCIENCE 2021; 7:1271-1287. [PMID: 34471670 PMCID: PMC8393217 DOI: 10.1021/acscentsci.1c00685] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
A roadmap is developed that integrates simulation methodology and data science methods to target new theories that traverse the multiple length- and time-scale features of many-body phenomena.
Collapse
Affiliation(s)
- Aurora E. Clark
- Department of Chemistry, Washington State University, Pullman, Washington 99163, United States
| | - Henry Adams
- Department of Mathematics, Colorado State
University, Fort Collins, Colorado 80523, United States
| | - Rigoberto Hernandez
- Departments
of Chemistry, Chemical and Biomolecular Engineering, and Materials
Science and Engineering, Johns Hopkins University, Baltimore, Maryland 21218, United States
| | - Anna I. Krylov
- Department of Chemistry, University of Southern California, Los Angeles, California 90089, United States
| | - Anders M. N. Niklasson
- Theoretical
Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
| | - Sapna Sarupria
- Department of Chemical and Biomolecular Engineering, Center for Optical
Materials Science and Engineering Technologies (COMSET), Clemson University, Clemson, South Carolina 29670, United States
- Department
of Chemistry, University of Minnesota, Minneapolis, Minnesota 55455, United States
| | - Yusu Wang
- Halıcıŏglu Data Science Institute, University of California, San Diego, La Jolla, California 92093, United States
| | - Stefan M. Wild
- Mathematics
and Computer Science Division, Argonne National
Laboratory, Lemont, Illinois 60439, United
States
| | - Qian Yang
- Computer Science and Engineering Department, University of Connecticut, Storrs, Connecticut 06269-4155, United States
| |
Collapse
|
5
|
Finkelstein J, Smith JS, Mniszewski SM, Barros K, Negre CFA, Rubensson EH, Niklasson AMN. Mixed Precision Fermi-Operator Expansion on Tensor Cores from a Machine Learning Perspective. J Chem Theory Comput 2021; 17:2256-2265. [PMID: 33797253 DOI: 10.1021/acs.jctc.1c00057] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
We present a second-order recursive Fermi-operator expansion scheme using mixed precision floating point operations to perform electronic structure calculations using tensor core units. A performance of over 100 teraFLOPs is achieved for half-precision floating point operations on Nvidia's A100 tensor core units. The second-order recursive Fermi-operator scheme is formulated in terms of a generalized, differentiable deep neural network structure, which solves the quantum mechanical electronic structure problem. We demonstrate how this network can be accelerated by optimizing the weight and bias values to substantially reduce the number of layers required for convergence. We also show how this machine learning approach can be used to optimize the coefficients of the recursive Fermi-operator expansion to accurately represent the fractional occupation numbers of the electronic states at finite temperatures.
Collapse
Affiliation(s)
- Joshua Finkelstein
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
| | - Justin S Smith
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
| | - Susan M Mniszewski
- Computer, Computational, and Statistical Sciences Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
| | - Kipton Barros
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
| | - Christian F A Negre
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
| | - Emanuel H Rubensson
- Division of Scientific Computing, Department of Information Technology, Uppsala University, P.O. Box 337, Uppsala SE-751 05, Sweden
| | - Anders M N Niklasson
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
| |
Collapse
|
6
|
Mniszewski SM, Perriot R, Rubensson EH, Negre CFA, Cawkwell MJ, Niklasson AMN. Linear Scaling Pseudo Fermi-Operator Expansion for Fractional Occupation. J Chem Theory Comput 2018; 15:190-200. [DOI: 10.1021/acs.jctc.8b00887] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Affiliation(s)
- Susan M. Mniszewski
- Computer, Computational, and Statistical Sciences Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
| | - Romain Perriot
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
| | - Emanuel H. Rubensson
- Division of Scientific Computing, Department of Information Technology, Uppsala University, Box 337, SE-751 05 Uppsala, Sweden
| | - Christian F. A. Negre
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
| | - Marc J. Cawkwell
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
| | - Anders M. N. Niklasson
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
| |
Collapse
|
7
|
Kruchinina A, Rudberg E, Rubensson EH. Parameterless Stopping Criteria for Recursive Density Matrix Expansions. J Chem Theory Comput 2016; 12:5788-5802. [PMID: 27783507 DOI: 10.1021/acs.jctc.6b00626] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Parameterless stopping criteria for recursive polynomial expansions to construct the density matrix in electronic structure calculations are proposed. Based on convergence-order estimation the new stopping criteria automatically and accurately detect when the calculation is dominated by numerical errors and continued iteration does not improve the result. Difficulties in selecting a stopping tolerance and appropriately balancing it in relation to parameters controlling the numerical accuracy are avoided. Thus, our parameterless stopping criteria stand in contrast to the standard approach to stop as soon as some error measure goes below a user-defined parameter or tolerance. We demonstrate that the stopping criteria work well both in dense and sparse matrix calculations and in large-scale self-consistent field calculations with the quantum chemistry program Ergo ( www.ergoscf.org ) .
Collapse
Affiliation(s)
- Anastasia Kruchinina
- Division of Scientific Computing, Department of Information Technology, Uppsala University , 751 05 Uppsala, Sweden
| | - Elias Rudberg
- Division of Scientific Computing, Department of Information Technology, Uppsala University , 751 05 Uppsala, Sweden
| | - Emanuel H Rubensson
- Division of Scientific Computing, Department of Information Technology, Uppsala University , 751 05 Uppsala, Sweden
| |
Collapse
|
8
|
Niklasson AMN, Mniszewski SM, Negre CFA, Cawkwell MJ, Swart PJ, Mohd-Yusof J, Germann TC, Wall ME, Bock N, Rubensson EH, Djidjev H. Graph-based linear scaling electronic structure theory. J Chem Phys 2016; 144:234101. [DOI: 10.1063/1.4952650] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Affiliation(s)
| | - Susan M. Mniszewski
- Computer, Computational, and Statistical Sciences Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, USA
| | - Christian F. A. Negre
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, USA
| | - Marc J. Cawkwell
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, USA
| | - Pieter J. Swart
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, USA
| | - Jamal Mohd-Yusof
- Computer, Computational, and Statistical Sciences Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, USA
| | - Timothy C. Germann
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, USA
| | - Michael E. Wall
- Computer, Computational, and Statistical Sciences Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, USA
| | - Nicolas Bock
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, USA
| | - Emanuel H. Rubensson
- Division of Scientific Computing, Department of Information Technology, Uppsala University, Box 337, SE-751 05 Uppsala, Sweden
| | - Hristo Djidjev
- Computer, Computational, and Statistical Sciences Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, USA
| |
Collapse
|
9
|
Riplinger C, Pinski P, Becker U, Valeev EF, Neese F. Sparse maps--A systematic infrastructure for reduced-scaling electronic structure methods. II. Linear scaling domain based pair natural orbital coupled cluster theory. J Chem Phys 2016; 144:024109. [PMID: 26772556 DOI: 10.1063/1.4939030] [Citation(s) in RCA: 724] [Impact Index Per Article: 80.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Domain based local pair natural orbital coupled cluster theory with single-, double-, and perturbative triple excitations (DLPNO-CCSD(T)) is a highly efficient local correlation method. It is known to be accurate and robust and can be used in a black box fashion in order to obtain coupled cluster quality total energies for large molecules with several hundred atoms. While previous implementations showed near linear scaling up to a few hundred atoms, several nonlinear scaling steps limited the applicability of the method for very large systems. In this work, these limitations are overcome and a linear scaling DLPNO-CCSD(T) method for closed shell systems is reported. The new implementation is based on the concept of sparse maps that was introduced in Part I of this series [P. Pinski, C. Riplinger, E. F. Valeev, and F. Neese, J. Chem. Phys. 143, 034108 (2015)]. Using the sparse map infrastructure, all essential computational steps (integral transformation and storage, initial guess, pair natural orbital construction, amplitude iterations, triples correction) are achieved in a linear scaling fashion. In addition, a number of additional algorithmic improvements are reported that lead to significant speedups of the method. The new, linear-scaling DLPNO-CCSD(T) implementation typically is 7 times faster than the previous implementation and consumes 4 times less disk space for large three-dimensional systems. For linear systems, the performance gains and memory savings are substantially larger. Calculations with more than 20 000 basis functions and 1000 atoms are reported in this work. In all cases, the time required for the coupled cluster step is comparable to or lower than for the preceding Hartree-Fock calculation, even if this is carried out with the efficient resolution-of-the-identity and chain-of-spheres approximations. The new implementation even reduces the error in absolute correlation energies by about a factor of two, compared to the already accurate previous implementation.
Collapse
Affiliation(s)
- Christoph Riplinger
- Max Planck Institute for Chemical Energy Conversion, Stiftstr. 34-36, D-45470 Mülheim an der Ruhr, Germany
| | - Peter Pinski
- Max Planck Institute for Chemical Energy Conversion, Stiftstr. 34-36, D-45470 Mülheim an der Ruhr, Germany
| | - Ute Becker
- Max Planck Institute for Chemical Energy Conversion, Stiftstr. 34-36, D-45470 Mülheim an der Ruhr, Germany
| | - Edward F Valeev
- Department of Chemistry, Virginia Tech, Blacksburg, Virginia 24061, USA
| | - Frank Neese
- Max Planck Institute for Chemical Energy Conversion, Stiftstr. 34-36, D-45470 Mülheim an der Ruhr, Germany
| |
Collapse
|
10
|
Mniszewski SM, Cawkwell MJ, Wall ME, Mohd-Yusof J, Bock N, Germann TC, Niklasson AMN. Efficient Parallel Linear Scaling Construction of the Density Matrix for Born–Oppenheimer Molecular Dynamics. J Chem Theory Comput 2015; 11:4644-54. [DOI: 10.1021/acs.jctc.5b00552] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- S. M. Mniszewski
- Computer, Computational, and Statistical Sciences Division and ‡Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
| | - M. J. Cawkwell
- Computer, Computational, and Statistical Sciences Division and ‡Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
| | - M. E. Wall
- Computer, Computational, and Statistical Sciences Division and ‡Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
| | - J. Mohd-Yusof
- Computer, Computational, and Statistical Sciences Division and ‡Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
| | - N. Bock
- Computer, Computational, and Statistical Sciences Division and ‡Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
| | - T. C. Germann
- Computer, Computational, and Statistical Sciences Division and ‡Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
| | - A. M. N. Niklasson
- Computer, Computational, and Statistical Sciences Division and ‡Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
| |
Collapse
|
11
|
|