1
|
In Silico Chemical Experiments in the Age of AI: From Quantum Chemistry to Machine Learning and Back. ADVANCED MATERIALS (DEERFIELD BEACH, FLA.) 2024:e2402369. [PMID: 38794859 DOI: 10.1002/adma.202402369] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/15/2024] [Revised: 04/28/2024] [Indexed: 05/26/2024]
Abstract
Computational chemistry is an indispensable tool for understanding molecules and predicting chemical properties. However, traditional computational methods face significant challenges due to the difficulty of solving the Schrödinger equations and the increasing computational cost with the size of the molecular system. In response, there has been a surge of interest in leveraging artificial intelligence (AI) and machine learning (ML) techniques to in silico experiments. Integrating AI and ML into computational chemistry increases the scalability and speed of the exploration of chemical space. However, challenges remain, particularly regarding the reproducibility and transferability of ML models. This review highlights the evolution of ML in learning from, complementing, or replacing traditional computational chemistry for energy and property predictions. Starting from models trained entirely on numerical data, a journey set forth toward the ideal model incorporating or learning the physical laws of quantum mechanics. This paper also reviews existing computational methods and ML models and their intertwining, outlines a roadmap for future research, and identifies areas for improvement and innovation. Ultimately, the goal is to develop AI architectures capable of predicting accurate and transferable solutions to the Schrödinger equation, thereby revolutionizing in silico experiments within chemistry and materials science.
Collapse
|
2
|
Artificial-Intelligence-Enhanced On-the-Fly Simulation of Nonlinear Time-Resolved Spectra. J Phys Chem Lett 2024; 15:2325-2331. [PMID: 38386692 DOI: 10.1021/acs.jpclett.4c00107] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/24/2024]
Abstract
Time-resolved spectroscopy is an important tool for unraveling the minute details of structural changes in molecules of biological and technological significance. The nonlinear femtosecond signals detected for such systems must be interpreted, but it is a challenging task for which theoretical simulations are often indispensable. Accurate simulations of transient absorption or two-dimensional electronic spectra are, however, computationally very expensive, prohibiting the wider adoption of existing first-principles methods. Here, we report an artificial-intelligence-enhanced protocol to drastically reduce the computational cost of simulating nonlinear time-resolved electronic spectra, which makes such simulations affordable for polyatomic molecules of increasing size. The protocol is based on the doorway-window approach for the on-the-fly surface-hopping simulations. We show its applicability for the prototypical molecule of pyrazine for which it produces spectra with high precision with respect to ab initio reference while cutting the computational cost by at least 95% compared to pure first-principles simulations.
Collapse
|
3
|
Estimating Nonradiative Excited-State Lifetimes in Photoactive Semiconducting Nanostructures. THE JOURNAL OF PHYSICAL CHEMISTRY. C, NANOMATERIALS AND INTERFACES 2024; 128:2713-2721. [PMID: 38379918 PMCID: PMC10875665 DOI: 10.1021/acs.jpcc.3c08053] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/08/2023] [Revised: 01/09/2024] [Accepted: 01/16/2024] [Indexed: 02/22/2024]
Abstract
The time evolution of the exciton generated by light adsorption in a photocatalyst is an important feature that can be approached from full nonadiabatic molecular dynamics simulations. Here, a crucial parameter is the nonradiative recombination rate between the hole and the electron that form the exciton. In the present work, we explore the performance of a Fermi's golden rule-based approach on predicting the recombination rate in a set of photoactive titania nanostructures, relying solely on the coupling of the ground and first excited state. In this scheme the analysis of the first excited state is carried out by invoking Kasha's rule thus avoiding computationally expensive nonadiabatic molecular dynamics simulations and resulting in an affordable estimate of the recombination rate. Our results show that, compared to previous ones from nonadiabatic molecular dynamics simulations, semiquantitative recombination rates can be predicted for the smaller titania nanostructures, and qualitative values are obtained from the larger ones. The present scheme is expected to be useful in the field of computational heterogeneous photocatalysis whenever a complex and computationally expensive full nonadiabatic molecular dynamics cannot be carried out.
Collapse
|
4
|
Theoretical trends in the dynamics simulations of molecular machines across multiple scales. Phys Chem Chem Phys 2024; 26:4828-4839. [PMID: 38235540 DOI: 10.1039/d3cp05201j] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2024]
Abstract
Over the past few decades, molecular machines have been extensively studied, since they are composed of single molecules for functional materials capable of responding to external stimuli, enabling motion at scales ranging from the microscopic to the macroscopic level within molecular aggregates. This advancement holds the potential to efficiently transform external resources into mechanical movement, achieved through precise control of conformational changes in stimuli-responsive materials. However, the underlying mechanism that links microscopic and macroscopic motions remains unclear, demanding computational development associated with simulating the construction of molecular machines from single molecules. This bottleneck has impeded the design of more efficient functional materials. Advancements in theoretical simulations have successfully been developed in various computational models to unveil the operational mechanisms of stimulus-responsive molecular machines, which could help us reduce the costs in experimental trial-and-error procedures. It opens doors to the computer-aided design of innovative functional materials. In this perspective, we have reviewed theoretical approaches employed in simulating dynamic processes involving conformational changes in molecular machines, spanning different scales and environmental conditions. In addition, we have highlighted current challenges and anticipated future trends in the collective control of aggregates within molecular machines. Our goal is to provide a comprehensive overview of recent theoretical advancements in the field of molecular machines, offering valuable insights for the design of novel smart materials.
Collapse
|
5
|
Fast and accurate excited states predictions: machine learning and diabatization. Phys Chem Chem Phys 2024; 26:4306-4319. [PMID: 38234256 PMCID: PMC10829538 DOI: 10.1039/d3cp05685f] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2023] [Accepted: 01/02/2024] [Indexed: 01/19/2024]
Abstract
The efficiency of machine learning algorithms for electronically excited states is far behind ground-state applications. One of the underlying problems is the insufficient smoothness of the fitted potential energy surfaces and other properties in the vicinity of state crossings and conical intersections, which is a prerequisite for an efficient regression. Smooth surfaces can be obtained by switching to the diabatic basis. However, diabatization itself is still an outstanding problem. We overcome these limitations by solving both problems at once. We use a machine learning approach combining clustering and regression techniques to correct for the deficiencies of property-based diabatization which, in return, provides us with smooth surfaces that can be easily fitted. Our approach extends the applicability of property-based diabatization to multidimensional systems. We utilize the proposed diabatization scheme to achieve higher prediction accuracy for adiabatic states and we show its performance by reconstructing global potential energy surfaces of excited states of nitrosyl fluoride and formaldehyde. While the proposed methodology is independent of the specific property-based diabatization and regression algorithm, we show its performance for kernel ridge regression and a very simple diabatization based on transition multipoles. Compared to most other algorithms based on machine learning, our approach needs only a small amount of training data.
Collapse
|
6
|
Mapping the Space of Photoswitchable Ligands and Photodruggable Proteins with Computational Modeling. J Chem Inf Model 2023; 63:5794-5802. [PMID: 37671878 DOI: 10.1021/acs.jcim.3c00484] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/07/2023]
Abstract
Light-activated drugs are a promising way to localize biological activity and minimize side effects. However, their development is complicated by the numerous photophysical and biological properties that must be simultaneously optimized. To accelerate the design of photoactive drugs, we describe a procedure that combines ligand-protein docking with chemical property prediction based on machine learning (ML). We apply this procedure to 58 proteins and 9000 photo-drug candidates based on azobenzene cis-trans isomerism. We find that most proteins display a preference for trans isomers over cis and that the binding affinities of nominally active/inactive pairs are in fact highly correlated. These findings have significant value for photopharmacology research, and reinforce the need for virtual screening to identify compounds with rare desirable properties. Further, we combine our procedure with quantum chemical validation to identify promising candidates for the photoactive inhibition of PARP1, an enzyme that is over-expressed in cancer cells. The top compounds are predicted to have long-lived active forms, differential bioactivity, and absorption in the near-infrared therapeutic window.
Collapse
|
7
|
Ultrafast Photocontrolled Rotation in a Molecular Motor Investigated by Machine Learning-Based Nonadiabatic Dynamics Simulations. J Phys Chem A 2023; 127:7682-7693. [PMID: 37672626 DOI: 10.1021/acs.jpca.3c01036] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/08/2023]
Abstract
The thermal helix inversion (THI) of the overcrowded alkene-based molecular motors determines the speed of the unidirectional rotation due to the high reaction barrier in the ground state, in comparison with the ultrafast photoreaction process. Recently, a phosphine-based motor has achieved all-photochemical rotation experimentally, promising to be controlled without a thermal step. However, the mechanism of this photochemical reaction has not yet been fully revealed. The comprehensive computational studies on photoisomerization still resort to nonadiabatic molecular dynamics (NAMD) simulations based on electronic structure calculations, which remains a high computational cost for large systems such as molecular motors. Machine learning (ML) has become an accelerating tool in NAMD simulations recently, where excited-state potential energy surfaces (PESs) are constructed analytically with high accuracy, providing an efficient approach for simulations in photochemistry. Herein the reaction pathway is explored by a spin-flip time-dependent density functional theory (SF-TDDFT) approach in combination with ML-based NAMD simulations. According to our computational simulations, we notice that one of the key factors of fulfilling all-photochemical rotation in the phosphine-based motor is that the excitation energies of four isomers are similar. Additionally, a shortcut photoinduced transformation between unstable isomers replaces the THI step, which shares the conical intersection (CI) with photoisomerization. In this study, we provide a practical approach to speed up the NAMD simulations in photochemical reactions for a large system that could be extended to other complex systems.
Collapse
|
8
|
Machine Learning Seams of Conical Intersection: A Characteristic Polynomial Approach. J Phys Chem Lett 2023; 14:7780-7786. [PMID: 37615964 PMCID: PMC10494228 DOI: 10.1021/acs.jpclett.3c01649] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Accepted: 08/15/2023] [Indexed: 08/25/2023]
Abstract
The machine learning of potential energy surfaces (PESs) has undergone rapid progress in recent years. The vast majority of this work, however, has been focused on the learning of ground state PESs. To reliably extend machine learning protocols to excited state PESs, the occurrence of seams of conical intersections between adiabatic electronic states must be correctly accounted for. This introduces a serious problem, for at such points, the adiabatic potentials are not differentiable to any order, complicating the application of standard machine learning methods. We show that this issue may be overcome by instead learning the coordinate-dependent coefficients of the characteristic polynomial of a simple decomposition of the potential matrix. We demonstrate that, through this approach, quantitatively accurate machine learning models of seams of conical intersection may be constructed.
Collapse
|
9
|
Unexplored Isomerization Pathways of Azobis(benzo-15-crown-5): Computational Studies on a Butterfly Crown Ether. J Phys Chem A 2023; 127:7080-7093. [PMID: 37526572 DOI: 10.1021/acs.jpca.3c02363] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/02/2023]
Abstract
Computational studies on trans → cis and cis → trans isomerizations of photoresponsive azobis(benzo-15-crown-5) have been reported in this work. The photoexcited ππ* state (S2) of the trans isomer relaxes through the planar S2 minimum and the planar S2/S1 conical intersection (both situated around 9 kcal/mol below the vertically excited S2 state) arising along the N═N stretching coordinate. The nπ* state (S1) of this isomer has both planar and rotated (clockwise and anticlockwise) minima, which may lead to a torsional conical intersection (S0/S1) geometry having a
Collapse
|
10
|
Rydberg state dynamics and fragmentation mechanism of N,N,N',N'-tetramethylmethylenediamine. J Chem Phys 2023; 159:044301. [PMID: 37486051 DOI: 10.1063/5.0159559] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2023] [Accepted: 07/05/2023] [Indexed: 07/25/2023] Open
Abstract
The non-adiabatic relaxation processes and the fragmentation dynamics of Rydberg-excited N,N,N',N'-tetramethylmethylenediamine (TMMDA) are investigated using femtosecond time-resolved photoelectron imaging and time-resolved mass spectroscopy. Excitation at 208 nm populates TMMDA in a charge-localized 3p state. Rapid internal conversion (IC) to 3s produces two charge-delocalized conformers with independent time constants and distinct population ratios. As the system explores the 3s potential surface, the structural evolution continues on a 1.55 ps timescale, followed by a slower (12.1 ps) relaxation to the ground state. A thorough comparison of the time-dependent mass and photoelectron spectra suggests that ionization out of the 3p state ends up with the parent ion, the vibrational energy of which is insufficient for the bond cleavage. On the contrary, by virtue of the additional energy acquired by IC from 3p, the internal energy deposited in 3s is available to break the C-N bond, leading to the fragment ion. The fragmentation is found to occur on the ion surface instead of the Rydberg surface.
Collapse
|
11
|
Constructing Diabatic Potential Energy Matrices with Neural Networks Based on Adiabatic Energies and Physical Considerations: Toward Quantum Dynamic Accuracy. J Chem Theory Comput 2023. [PMID: 37216273 DOI: 10.1021/acs.jctc.2c01074] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
A permutation invariant polynomial-neural network (PIP-NN) approach for constructing the global diabatic potential energy matrices (PEMs) of the coupled states of molecules is proposed. Specifically, the diabatization scheme is based merely on the adiabatic energy data of the system, which is ideally a most convenient way due to not requiring additional ab initio calculations for the data of the derivative coupling or any other physical properties of the molecule. Considering the permutation and coupling characteristics of the system, particularly in the presence of conical intersections, some vital treatments for the off-diagonal terms in diabatic PEM are essentially needed. Taking the photodissociation of H2O(X~/B~)/NH3(X~/A~) and nonadiabatic reaction Na(3p) + H2 → NaH(Σ+) + H for example, this PIP-NN method is shown to build up the global diabatic PEMs effectively and accurately. The root-mean-square errors of the adiabatic potential energies in the fitting for three different systems are all small (<10 meV). Further quantum dynamic calculations show that the absorption spectra and product branching ratios in both H2O(X~/B~) and NH3(X~/A~) nonadiabatic photodissociation are well reproduced on the new diabatic PEMs, and the nonadiabatic reaction probability of Na(3p) + H2 → NaH(Σ+) + H obtained on the new diabatic PEMs of the 12A1 and 12B2 states is in reasonably good agreement with previous theoretical result as well, validating this new PIP-NN method.
Collapse
|
12
|
Calculation of the ELF in the excited state with single-determinant methods. J Chem Phys 2023; 158:2887544. [PMID: 37125705 DOI: 10.1063/5.0142918] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2023] [Accepted: 04/17/2023] [Indexed: 05/02/2023] Open
Abstract
Since its first definition, back in 1990, the electron localization function (ELF) has settled as one of the most commonly employed techniques to characterize the nature of the chemical bond in real space. Although most of the work using the ELF has focused on the study of ground-state chemical reactivity, a growing interest has blossomed to apply these techniques to the nearly unexplored realm of excited states and photochemistry. Since accurate excited electronic states usually require to account appropriately for electron correlation, the standard single-determinant ELF formulation cannot be blindly applied to them, and it is necessary to turn to correlated ELF descriptions based on the two-particle density matrix (2-PDM). The latter requires costly wavefunction approaches, unaffordable for most of the systems of current photochemical interest. Here, we compare the exact, 2-PDM-based ELF results with those of approximate 2-PDM reconstructions taken from reduced density matrix functional theory. Our approach is put to the test in a wide variety of representative scenarios, such as those provided by the lowest-lying excited electronic states of simple diatomic and polyatomic molecules. Altogether, our results suggest that even approximate 2-PDMs are able to accurately reproduce, on a general basis, the topological and statistical features of the ELF scalar field, paving the way toward the application of cost-effective methodologies, such as time-dependent-Hartree-Fock or time-dependent density functional theory, in the accurate description of the chemical bonding in excited states of photochemical relevance.
Collapse
|
13
|
Thermal Half-Lives of Azobenzene Derivatives: Virtual Screening Based on Intersystem Crossing Using a Machine Learning Potential. ACS CENTRAL SCIENCE 2023; 9:166-176. [PMID: 36844486 PMCID: PMC9951306 DOI: 10.1021/acscentsci.2c00897] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/01/2022] [Indexed: 05/27/2023]
Abstract
Molecular photoswitches are the foundation of light-activated drugs. A key photoswitch is azobenzene, which exhibits trans-cis isomerism in response to light. The thermal half-life of the cis isomer is of crucial importance, since it controls the duration of the light-induced biological effect. Here we introduce a computational tool for predicting the thermal half-lives of azobenzene derivatives. Our automated approach uses a fast and accurate machine learning potential trained on quantum chemistry data. Building on well-established earlier evidence, we argue that thermal isomerization proceeds through rotation mediated by intersystem crossing, and incorporate this mechanism into our automated workflow. We use our approach to predict the thermal half-lives of 19,000 azobenzene derivatives. We explore trends and trade-offs between barriers and absorption wavelengths, and open-source our data and software to accelerate research in photopharmacology.
Collapse
|
14
|
Graph neural networks for materials science and chemistry. COMMUNICATIONS MATERIALS 2022; 3:93. [PMID: 36468086 PMCID: PMC9702700 DOI: 10.1038/s43246-022-00315-6] [Citation(s) in RCA: 38] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/10/2022] [Accepted: 11/07/2022] [Indexed: 05/14/2023]
Abstract
Machine learning plays an increasingly important role in many areas of chemistry and materials science, being used to predict materials properties, accelerate simulations, design new structures, and predict synthesis routes of new materials. Graph neural networks (GNNs) are one of the fastest growing classes of machine learning models. They are of particular relevance for chemistry and materials science, as they directly work on a graph or structural representation of molecules and materials and therefore have full access to all relevant information required to characterize materials. In this Review, we provide an overview of the basic principles of GNNs, widely used datasets, and state-of-the-art architectures, followed by a discussion of a wide range of recent applications of GNNs in chemistry and materials science, and concluding with a road-map for the further development and application of GNNs.
Collapse
|
15
|
Data-driven discovery of molecular photoswitches with multioutput Gaussian processes. Chem Sci 2022; 13:13541-13551. [PMID: 36507171 PMCID: PMC9682911 DOI: 10.1039/d2sc04306h] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2022] [Accepted: 09/16/2022] [Indexed: 11/11/2022] Open
Abstract
Photoswitchable molecules display two or more isomeric forms that may be accessed using light. Separating the electronic absorption bands of these isomers is key to selectively addressing a specific isomer and achieving high photostationary states whilst overall red-shifting the absorption bands serves to limit material damage due to UV-exposure and increases penetration depth in photopharmacological applications. Engineering these properties into a system through synthetic design however, remains a challenge. Here, we present a data-driven discovery pipeline for molecular photoswitches underpinned by dataset curation and multitask learning with Gaussian processes. In the prediction of electronic transition wavelengths, we demonstrate that a multioutput Gaussian process (MOGP) trained using labels from four photoswitch transition wavelengths yields the strongest predictive performance relative to single-task models as well as operationally outperforming time-dependent density functional theory (TD-DFT) in terms of the wall-clock time for prediction. We validate our proposed approach experimentally by screening a library of commercially available photoswitchable molecules. Through this screen, we identified several motifs that displayed separated electronic absorption bands of their isomers, exhibited red-shifted absorptions, and are suited for information transfer and photopharmacological applications. Our curated dataset, code, as well as all models are made available at https://github.com/Ryan-Rhys/The-Photoswitch-Dataset.
Collapse
|
16
|
Abstract
![]()
Newton-X is an open-source computational platform to
perform nonadiabatic
molecular dynamics based on surface hopping and spectrum simulations
using the nuclear ensemble approach. Both are among the most common
methodologies in computational chemistry for photophysical and photochemical
investigations. This paper describes the main features of these methods
and how they are implemented in Newton-X. It emphasizes the newest
developments, including zero-point-energy leakage correction, dynamics
on complex-valued potential energy surfaces, dynamics induced by incoherent
light, dynamics based on machine-learning potentials, exciton dynamics
of multiple chromophores, and supervised and unsupervised machine
learning techniques. Newton-X is interfaced with several third-party
quantum-chemistry programs, spanning a broad spectrum of electronic
structure methods.
Collapse
|
17
|
Abstract
ConspectusPhotochemical reactions are of great importance in chemistry, biology, and materials science because they take advantage of a renewable energy source, mild reaction conditions, and high atom economy. Light absorption can excite molecules to a higher energy electronic state of the same spin multiplicity. The following nonadiabatic processes induce molecular transformations that afford exotic molecular architectures and high-energy-isomers that are inaccessible by thermal means. Computational simulations now complement time-resolved instrumentation to reveal ultrafast excited-state mechanistic information for photochemical reactions that is essential in disentangling elusive spectroscopic features, excited-state lifetimes, and excited-state mechanistic critical points. Nonadiabatic molecular dynamics (NAMD), powered by surface hopping techniques, is among the most widely applied techniques to model the photochemical reactions of medium-sized molecules. However, the computational efficiency is limited because of the requisite thousands of multiconfigurational quantum-chemical calculations multiplied by hundreds of trajectories. Machine learning (ML) has emerged as a revolutionary force in computational chemistry to predict the outcome of the resource-intensive multiconfigurational calculations on the fly. An ML potential trained with a substantial set of quantum-chemical calculations can predict the energies and forces with errors under chemical accuracy at a negligible cost. The integration of ML potentials in NAMD dramatically extends the maximum simulation time scale by ∼10 000-fold to the nanosecond regime.In this Account, we present a comprehensive demonstration of ML photodynamics simulations and summarize our most recent applications in resolving complex photochemical reactions. First, we address three fundamental components of ML techniques for photodynamics simulations: the quantum-chemical data set, the ML potential, and NAMD. Second, we describe best practices in building training data and our procedure toward training the ML photodynamics model with our recent literature contributions. We introduce a convenient training data generation scheme combining Wigner sampling and geometrical interpolation. It trains reliable and effective ML potentials suitable for subsequent active learning to detect undersampled data. We demonstrate how active learning automatically discovers new mechanistic pathways and reproduces experimental results. We point out that atomic permutation is an essential data augmentation approach to improve the learnability of distance-based molecular descriptors for highly symmetric molecules. Third, we demonstrate the utility of ML-photodynamics by showing the results of ML photodynamics simulations of (1) photo-torquoselective 4π disrotatory electrocyclic ring closing of norbornyl cyclohexadiene, which reveals a thermal conversion from experimentally unobserved intermediates to the reactant in 1 ns; (2) [2 + 2] photocycloaddition of substituted [3]-syn-ladderdienes in competition with 4π and 6π electrocyclic ring-opening reactions, uncovering substituent effects to explain the reported increased quantum yield of substituted cubane precursors; and (3) photochemical 4π disrotatory electrocyclic reactions of fluorobenzenes in nanoseconds with XMS-CASPT2-level training data. We expect this Account to broaden understanding of ML photodynamics and inspire future developments and applications to increasingly large molecules within complex environments on long time scales.
Collapse
|