1
|
Davidson Y, Philipp A, Chakraborty S, Bronstein AM, Gershoni-Poranne R. How local is "local"? Deep learning reveals locality of the induced magnetic field of polycyclic aromatic hydrocarbons. J Chem Phys 2025; 162:144101. [PMID: 40197568 DOI: 10.1063/5.0257558] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2025] [Accepted: 03/19/2025] [Indexed: 04/10/2025] Open
Abstract
We investigate the locality of magnetic response in polycyclic aromatic molecules using a novel deep-learning approach. Our method employs graph neural networks (GNNs) with a graph-of-rings representation to predict nucleus independent chemical shifts (NICS) in the space around the molecule. We train a series of models, each time reducing the size of the largest molecules used in training. The accuracy of prediction remains high (MAE < 0.5 ppm), even when training the model only on molecules with up to four rings, thus providing strong evidence for the locality of magnetic response. To overcome the known problem of generalization of GNNs, we implement a k-hop expansion strategy and succeed in achieving accurate predictions for molecules with up to 15 rings (almost 4 times the size of the largest training example). Our findings have implications for understanding the magnetic response in complex molecules and demonstrate a promising approach to overcoming GNN scalability limitations. Furthermore, the trained models enable rapid characterization, without the need for more expensive DFT calculations.
Collapse
Affiliation(s)
- Yair Davidson
- Department of Computer Science, Technion-Israel Institute of Technology, Haifa 32000, Israel
| | - Aviad Philipp
- Department of Computer Science, Technion-Israel Institute of Technology, Haifa 32000, Israel
| | - Sabyasachi Chakraborty
- Schulich Faculty of Chemistry and the Resnick Sustainability Center for Catalysis, Technion-Israel Institute of Technology, Haifa 32000, Israel
| | - Alex M Bronstein
- Department of Computer Science, Technion-Israel Institute of Technology, Haifa 32000, Israel
- Institute of Science and Technology Austria, Klosterneuburg 3400, Austria
| | - Renana Gershoni-Poranne
- Schulich Faculty of Chemistry and the Resnick Sustainability Center for Catalysis, Technion-Israel Institute of Technology, Haifa 32000, Israel
| |
Collapse
|
2
|
Golub P, Yang C, Vlček V, Veis L. Quantum Chemical Density Matrix Renormalization Group Method Boosted by Machine Learning. J Phys Chem Lett 2025; 16:3295-3301. [PMID: 40126916 PMCID: PMC11973911 DOI: 10.1021/acs.jpclett.5c00207] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2025] [Revised: 03/07/2025] [Accepted: 03/19/2025] [Indexed: 03/26/2025]
Abstract
The use of machine learning (ML) to refine low-level theoretical calculations to achieve higher accuracy is a promising and actively evolving approach known as Δ-ML. The density matrix renormalization group (DMRG) is a powerful variational approach widely used for studying strongly correlated quantum systems. High computational efficiency can be achieved without compromising accuracy. Here, we demonstrate the potential of a simple ML model to significantly enhance the performance of the quantum chemical DMRG method.
Collapse
Affiliation(s)
- Pavlo Golub
- J.
Heyrovsky Institute of Physical Chemistry, v.v.i., Czech Academy of Sciences, Prague, 18223, Czech Republic
| | - Chao Yang
- Applied
Mathematics and Computational Research Division, Lawerence Berkeley National Laboratory, Berkeley, 94720, United States
| | - Vojtěch Vlček
- Department
of Chemistry and Biochemistry, University
of California, Santa Barbara, Santa Barbara, 93117, United States
- Department
of Materials, University of California,
Santa Barbara, Santa Barbara, 93117, United
States
| | - Libor Veis
- J.
Heyrovský Institute of Physical Chemistry, v.v.i., Czech Academy of Sciences, Prague, 18223, Czech Republic
| |
Collapse
|
3
|
Wahab A, Gershoni-Poranne R. Accelerated diradical character assessment in large datasets of polybenzenoid hydrocarbons using xTB fractional occupation. Phys Chem Chem Phys 2025; 27:5973-5983. [PMID: 39651645 DOI: 10.1039/d4cp04059g] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2024]
Abstract
Polybenzenoid hydrocarbons (PBHs) have garnered significant attention in the field of organic electronics due to their unique electronic properties. To facilitate the design and discovery of new functional organic materials based on these compounds, it is necessary to assess their diradical character. However, this usually requires expensive multireference calculations. In this study, we demonstrate rapid identification and quantification of open-shell character in PBHs using the fractional occupation number weighted electron density metric (NFOD) calculated with the semiempirical GFN2-xTB (xTB) method. We apply this approach to the entire chemical space of PBHs containing up to 10 rings, a total of over 19k molecules, and find that approximately 7% of the molecules are identified as having diradical character. Our findings reveal a strong correlation between xTB-calculated NFOD and the more computationally expensive Yamaguchi y and DFT-calculated NFOD, validating the use of this efficient method for large-scale screening. Additionally, we identify a linear relationship between size and NFOD value and implement a size-dependent threshold for open-shell character, which significantly improves the accuracy of diradical identification across the chemical space of PBHs. This size-aware approach reduces false positive identifications from 6.97% to 0.38% compared to using a single threshold value. Overall, this work demonstrates that xTB-calculated NFOD provides a rapid and cost-effective alternative for large-scale screening of open-shell character in PBHs.
Collapse
Affiliation(s)
- Alexandra Wahab
- Laboratory for Organic Chemistry, Department of Chemistry and Applied Biosciences, ETH Zurich, 8093 Zurich, Switzerland
| | - Renana Gershoni-Poranne
- Laboratory for Organic Chemistry, Department of Chemistry and Applied Biosciences, ETH Zurich, 8093 Zurich, Switzerland
- Schulich Faculty of Chemistry, Technion - Israel Institute of Technology, Haifa 32000, Israel.
| |
Collapse
|
4
|
Wahab A, Gershoni-Poranne R. COMPAS-3: a dataset of peri-condensed polybenzenoid hydrocarbons. Phys Chem Chem Phys 2024; 26:15344-15357. [PMID: 38758092 DOI: 10.1039/d4cp01027b] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/18/2024]
Abstract
We introduce the third installment of the COMPAS Project - a COMputational database of Polycyclic Aromatic Systems, focused on peri-condensed polybenzenoid hydrocarbons. In this installment, we develop two datasets containing the optimized ground-state structures and a selection of molecular properties of ∼39k and ∼9k peri-condensed polybenzenoid hydrocarbons (at the GFN2-xTB and CAM-B3LYP-D3BJ/cc-pvdz//CAM-B3LYP-D3BJ/def2-SVP levels, respectively). The manuscript details the enumeration and data generation processes and describes the information available within the datasets. An in-depth comparison between the two types of computation is performed, and it is found that the geometrical disagreement is maximal for slightly-distorted molecules. In addition, a data-driven analysis of the structure-property trends of peri-condensed PBHs is performed, highlighting the effect of the size of peri-condensed islands and linearly annulated rings on the HOMO-LUMO gap. The insights described herein are important for rational design of novel functional aromatic molecules for use in, e.g., organic electronics. The generated datasets provide a basis for additional data-driven machine- and deep-learning studies in chemistry.
Collapse
Affiliation(s)
- Alexandra Wahab
- The Laboratory for Organic Chemistry, Department of Chemistry and Applied Biosciences, ETH Zurich, 8093 Zurich, Switzerland
| | - Renana Gershoni-Poranne
- The Schulich Faculty of Chemistry and the Resnick Sustainability Center for Catalysis, Technion - Israel Institute of Technology, Haifa 32000, Israel.
| |
Collapse
|
5
|
Hattori I, Hagai M, Ito M, Sakai M, Narita H, Fujimoto KJ, Yanai T, Yamaguchi S. In Silico Screening and Experimental Verification of Near-Infrared-Emissive Two-Boron-Doped Polycyclic Aromatic Hydrocarbons. Angew Chem Int Ed Engl 2024; 63:e202403829. [PMID: 38556467 DOI: 10.1002/anie.202403829] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2024] [Revised: 03/26/2024] [Accepted: 03/27/2024] [Indexed: 04/02/2024]
Abstract
Embedding two boron atoms into a polycyclic aromatic hydrocarbon (PAH) leads to the formation of a neutral analogue that is isoelectronic to the corresponding dicationic PAH skeleton, which can significantly alter its electronic structure. Based on this concept, we explore herein the identification of near-infrared (NIR)-emissive PAHs with the aid of an in silico screening method. Using perylene as the PAH scaffold, we embedded two boron atoms and fused two thiophene rings to it. Based on this design concept, all possible structures (ca. 2500 entities) were generated using a comprehensive structure generator. Time-dependent DFT calculations were conducted on all these structures, and promising candidates were extracted based on the vertical excitation energy, transition dipole moment, and atomization energy per bond. One of the extracted dithieno-diboraperylene candidates was synthesized and indeed exhibited emission at 724 nm with a quantum yield of 0.40 in toluene, demonstrating the validity of this screening method. This modification was further applied to other PAHs, and a series of thienobora-modified PAHs was synthesized.
Collapse
Affiliation(s)
- Izumi Hattori
- Department of Chemistry, Graduate School of Science, and Integrated Research Consortium on Chemical Sciences (IRCCS), Nagoya University Furo, Chikusa, Nagoya, 464-8602, Japan
| | - Masaya Hagai
- Department of Chemistry, Graduate School of Science, and Integrated Research Consortium on Chemical Sciences (IRCCS), Nagoya University Furo, Chikusa, Nagoya, 464-8602, Japan
| | - Masato Ito
- Department of Chemistry, Graduate School of Science, and Integrated Research Consortium on Chemical Sciences (IRCCS), Nagoya University Furo, Chikusa, Nagoya, 464-8602, Japan
| | - Mika Sakai
- Department of Chemistry, Graduate School of Science, and Integrated Research Consortium on Chemical Sciences (IRCCS), Nagoya University Furo, Chikusa, Nagoya, 464-8602, Japan
| | - Hiroki Narita
- Department of Chemistry, Graduate School of Science, and Integrated Research Consortium on Chemical Sciences (IRCCS), Nagoya University Furo, Chikusa, Nagoya, 464-8602, Japan
| | - Kazuhiro J Fujimoto
- Department of Chemistry, Graduate School of Science, and Integrated Research Consortium on Chemical Sciences (IRCCS), Nagoya University Furo, Chikusa, Nagoya, 464-8602, Japan
- Institute of Transformative Bio-Molecules (WPI-ITbM), Nagoya University Furo, Chikusa, Nagoya, 464-8601, Japan
| | - Takeshi Yanai
- Department of Chemistry, Graduate School of Science, and Integrated Research Consortium on Chemical Sciences (IRCCS), Nagoya University Furo, Chikusa, Nagoya, 464-8602, Japan
- Institute of Transformative Bio-Molecules (WPI-ITbM), Nagoya University Furo, Chikusa, Nagoya, 464-8601, Japan
| | - Shigehiro Yamaguchi
- Department of Chemistry, Graduate School of Science, and Integrated Research Consortium on Chemical Sciences (IRCCS), Nagoya University Furo, Chikusa, Nagoya, 464-8602, Japan
- Institute of Transformative Bio-Molecules (WPI-ITbM), Nagoya University Furo, Chikusa, Nagoya, 464-8601, Japan
| |
Collapse
|
6
|
Scott JM, Dale SG, McBroom J, Gould T, Li Q. Size Isn't Everything: Geometric Tuning in Polycyclic Aromatic Hydrocarbons and Its Implications for Carbon Nanodots. J Phys Chem A 2024; 128:2003-2014. [PMID: 38470339 DOI: 10.1021/acs.jpca.3c07416] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/13/2024]
Abstract
Recent developments in light-emitting carbon nanodots and molecular organic semiconductors have seen renewed interest in the properties of polycyclic aromatic hydrocarbons (PAHs) as a family. The networks of delocalized π electrons in sp2-hybridized carbon grant PAHs light-emissive properties right across the visible spectrum. However, the mechanistic understanding of their emission energy has been limited due to the ground state-focused methods of determination. This computational chemistry work, therefore, seeks to validate existing rules and elucidate new features and characteristics of PAHs that influence their emissions. Predictions based on (time-dependent) density functional theory account for the full 3-dimensional electronic structure of ground and excited states and reveal that twisting and near-degeneracies strongly influence emission spectra and may therefore be used to tune the color of PAHs and, hence, carbon nanodots. We particularly note that the influence of twisting goes beyond torsional destabilization of the ground-state and geometric relaxation of the excited state, with a third contribution associated with the electric transition dipole. Symmetries and peri-condensation may also have an effect, but this could not be statistically confirmed. In pursuing this goal, we demonstrate that with minimal changes to molecular size, the entire visible spectrum may be spanned by geometric modification alone; we have also provided a first estimate of emission energy for 35 molecules currently lacking published emission spectra as well as clear guidelines for when more sophisticated computational techniques are required to predict the properties of PAHs accurately.
Collapse
Affiliation(s)
- James M Scott
- Queensland Micro- and Nanotechnology Centre, Griffith University, Nathan, Queensland 4111, Australia
- School of Engineering and Built Environment, Griffith University, Nathan, Queensland 4111, Australia
| | - Stephen G Dale
- Queensland Micro- and Nanotechnology Centre, Griffith University, Nathan, Queensland 4111, Australia
- The Institute for Functional Intelligent Materials (I-FIM), National University of Singapore, 4 Science Drive 2, Singapore 117544, Singapore
| | - James McBroom
- School of Environment and Science, Griffith University, Nathan, Queensland 4111, Australia
| | - Tim Gould
- Queensland Micro- and Nanotechnology Centre, Griffith University, Nathan, Queensland 4111, Australia
- School of Environment and Science, Griffith University, Nathan, Queensland 4111, Australia
| | - Qin Li
- Queensland Micro- and Nanotechnology Centre, Griffith University, Nathan, Queensland 4111, Australia
- School of Engineering and Built Environment, Griffith University, Nathan, Queensland 4111, Australia
| |
Collapse
|
7
|
Mayo Yanes E, Chakraborty S, Gershoni-Poranne R. COMPAS-2: a dataset of cata-condensed hetero-polycyclic aromatic systems. Sci Data 2024; 11:97. [PMID: 38242917 PMCID: PMC10799083 DOI: 10.1038/s41597-024-02927-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2023] [Accepted: 01/05/2024] [Indexed: 01/21/2024] Open
Abstract
Polycyclic aromatic systems are highly important to numerous applications, in particular to organic electronics and optoelectronics. High-throughput screening and generative models that can help to identify new molecules to advance these technologies require large amounts of high-quality data, which is expensive to generate. In this report, we present the largest freely available dataset of geometries and properties of cata-condensed poly(hetero)cyclic aromatic molecules calculated to date. Our dataset contains ~500k molecules comprising 11 types of aromatic and antiaromatic building blocks calculated at the GFN1-xTB level and is representative of a highly diverse chemical space. We detail the structure enumeration process and the methods used to provide various electronic properties (including HOMO-LUMO gap, adiabatic ionization potential, and adiabatic electron affinity). Additionally, we benchmark against a ~50k dataset calculated at the CAM-B3LYP-D3BJ/def2-SVP level and develop a fitting scheme to correct the xTB values to higher accuracy. These new datasets represent the second installment in the COMputational database of Polycyclic Aromatic Systems (COMPAS) Project.
Collapse
Affiliation(s)
- Eduardo Mayo Yanes
- Schulich Faculty of Chemistry, Technion - Israel Institute of Technology, Haifa, 32000, Israel
| | - Sabyasachi Chakraborty
- Schulich Faculty of Chemistry, Technion - Israel Institute of Technology, Haifa, 32000, Israel
| | - Renana Gershoni-Poranne
- Schulich Faculty of Chemistry, Technion - Israel Institute of Technology, Haifa, 32000, Israel.
| |
Collapse
|
8
|
Weiss T, Mayo Yanes E, Chakraborty S, Cosmo L, Bronstein AM, Gershoni-Poranne R. Guided diffusion for inverse molecular design. NATURE COMPUTATIONAL SCIENCE 2023; 3:873-882. [PMID: 38177755 DOI: 10.1038/s43588-023-00532-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/04/2023] [Accepted: 09/06/2023] [Indexed: 01/06/2024]
Abstract
The holy grail of materials science is de novo molecular design, meaning engineering molecules with desired characteristics. The introduction of generative deep learning has greatly advanced efforts in this direction, yet molecular discovery remains challenging and often inefficient. Herein we introduce GaUDI, a guided diffusion model for inverse molecular design that combines an equivariant graph neural net for property prediction and a generative diffusion model. We demonstrate GaUDI's effectiveness in designing molecules for organic electronic applications by using single- and multiple-objective tasks applied to a generated dataset of 475,000 polycyclic aromatic systems. GaUDI shows improved conditional design, generating molecules with optimal properties and even going beyond the original distribution to suggest better molecules than those in the dataset. In addition to point-wise targets, GaUDI can also be guided toward open-ended targets (for example, a minimum or maximum) and in all cases achieves close to 100% validity of generated molecules.
Collapse
Affiliation(s)
- Tomer Weiss
- Department of Computer Science, Technion-Israel Institute of Technology, Haifa, Israel
| | - Eduardo Mayo Yanes
- Schulich Faculty of Chemistry, Technion-Israel Institute of Technology, Haifa, Israel
| | | | - Luca Cosmo
- University Ca' Foscari of Venice, Venice, Italy
| | - Alex M Bronstein
- Department of Computer Science, Technion-Israel Institute of Technology, Haifa, Israel.
| | | |
Collapse
|
9
|
Weiss T, Wahab A, Bronstein AM, Gershoni-Poranne R. Interpretable Deep-Learning Unveils Structure-Property Relationships in Polybenzenoid Hydrocarbons. J Org Chem 2023; 88:9645-9656. [PMID: 36696660 DOI: 10.1021/acs.joc.2c02381] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
In this work, interpretable deep learning was used to identify structure-property relationships governing the HOMO-LUMO gap and the relative stability of polybenzenoid hydrocarbons (PBHs) using a ring-based graph representation. This representation was combined with a subunit-based perception of PBHs, allowing chemical insights to be presented in terms of intuitive and simple structural motifs. The resulting insights agree with conventional organic chemistry knowledge and electronic structure-based analyses and also reveal new behaviors and identify influential structural motifs. In particular, we evaluated and compared the effects of linear, angular, and branching motifs on these two molecular properties and explored the role of dispersion in mitigating the torsional strain inherent in nonplanar PBHs. Hence, the observed regularities and the proposed analysis contribute to a deeper understanding of the behavior of PBHs and form the foundation for design strategies for new functional PBHs.
Collapse
Affiliation(s)
- Tomer Weiss
- Department of Computer Science, Technion - Israel Institute of Technology, Haifa32000, Israel
| | - Alexandra Wahab
- Laboratory for Organic Chemistry, Department of Chemistry and Applied Biosciences, ETH Zurich, Zurich8093, Switzerland
| | - Alex M Bronstein
- Department of Computer Science, Technion - Israel Institute of Technology, Haifa32000, Israel
| | - Renana Gershoni-Poranne
- Schulich Faculty of Chemistry, Technion - Israel Institute of Technology, Haifa32000, Israel
| |
Collapse
|