1
|
Finta S, Kalikadien AV, Pidko EA. Data-Driven Virtual Screening of Conformational Ensembles of Transition-Metal Complexes. J Chem Theory Comput 2025; 21:5334-5345. [PMID: 40340435 PMCID: PMC12120983 DOI: 10.1021/acs.jctc.5c00303] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2025] [Revised: 04/29/2025] [Accepted: 04/29/2025] [Indexed: 05/10/2025]
Abstract
Transition-metal complexes serve as highly enantioselective homogeneous catalysts for various transformations, making them valuable in the pharmaceutical industry. Data-driven prediction models can accelerate high-throughput catalyst design but require computer-readable representations that account for conformational flexibility. This is typically achieved through high-level conformer searches, followed by DFT optimization of the transition-metal complexes. However, conformer selection remains reliant on human assumptions, with no cost-efficient and generalizable workflow available. To address this, we introduce an automated approach to correlate CREST(GFN2-xTB//GFN-FF)-generated conformer ensembles with their DFT-optimized counterparts for systematic conformer selection. We analyzed 24 precatalyst structures, performing CREST conformer searches, followed by full DFT optimization. Three filtering methods were evaluated: (i) geometric ligand descriptors, (ii) PCA-based selection, and (iii) DBSCAN clustering using RMSD and energy. The proposed methods were validated on Rh-based catalysts featuring bisphosphine ligands, which are widely employed in hydrogenation reactions. To assess general applicability, both the precatalyst and its corresponding acrylate-bound complex were analyzed. Our results confirm that CREST overestimates ligand flexibility, and energy-based filtering is ineffective. PCA-based selection failed to distinguish conformers by DFT energy, while RMSD-based filtering improved selection but lacked tunability. DBSCAN clustering provided the most effective approach, eliminating redundancies while preserving key configurations. This method remained robust across data sets and is computationally efficient without requiring molecular descriptor calculations. These findings highlight the limitations of energy-based filtering and the advantages of structure-based approaches for conformer selection. While DBSCAN clustering is a practical solution, its parameters remain system-dependent. For high-accuracy applications, refined energy calculations may be necessary; however, DBSCAN-based clustering offers a computationally accessible strategy for rapid catalyst representations involving conformational flexibility.
Collapse
Affiliation(s)
- Sára Finta
- Inorganic Systems Engineering,
Department of Chemical Engineering, Faculty of Applied Sciences, Delft University of Technology, Van der Maasweg 9, 2629 HZDelft, The Netherlands
| | - Adarsh V. Kalikadien
- Inorganic Systems Engineering,
Department of Chemical Engineering, Faculty of Applied Sciences, Delft University of Technology, Van der Maasweg 9, 2629 HZDelft, The Netherlands
| | - Evgeny A. Pidko
- Inorganic Systems Engineering,
Department of Chemical Engineering, Faculty of Applied Sciences, Delft University of Technology, Van der Maasweg 9, 2629 HZDelft, The Netherlands
| |
Collapse
|
2
|
Abdullayev O, Garay-Ruiz D, Bori-Bru B, Bo C. Microkinetic Assessment of Ligand-Exchanging Catalytic Cycles. ACS Catal 2025; 15:4739-4745. [PMID: 40144675 PMCID: PMC11934266 DOI: 10.1021/acscatal.5c00348] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2025] [Revised: 02/19/2025] [Accepted: 02/19/2025] [Indexed: 03/28/2025]
Abstract
Computational chemistry has become a fundamental part of the understanding and optimization of catalytic processes. Among these, the characterization of homogeneous organometallic catalysts, combining an active transition metal atom and set of ligands, is one of the main fields of application of these kinds of studies. More recently, microkinetic studies have been employed to bridge the gap between experimental measurements such as conversion or selectivity and the Gibbs free energies gathered by computations. In this work, we have developed an automated framework (MicroKatc) for microkinetic analysis, to tackle the yet understudied effect of ligand exchange processes that modify the nature of the catalytic scaffold in situ. We report the application of such a framework to the rhodium-catalyzed hydroformylation of ethylene, confirming the acceleration of the reaction as trimethylphosphine (PMe3) displaces the carbonyl ligands in the catalyst by means of simulations at variable phosphine concentrations, as well as the determination of the degree of rate control (DRC) and apparent activation energies throughout the catalytic process.
Collapse
Affiliation(s)
- Orkhan Abdullayev
- Institute
of Chemical Research of Catalonia (ICIQ), The Barcelona Institute of Science and Technology (BIST), Av. Paisos Catalans, 16, Tarragona 43007, Spain
| | - Diego Garay-Ruiz
- Institute
of Chemical Research of Catalonia (ICIQ), The Barcelona Institute of Science and Technology (BIST), Av. Paisos Catalans, 16, Tarragona 43007, Spain
| | - Berta Bori-Bru
- Institute
of Chemical Research of Catalonia (ICIQ), The Barcelona Institute of Science and Technology (BIST), Av. Paisos Catalans, 16, Tarragona 43007, Spain
| | - Carles Bo
- Institute
of Chemical Research of Catalonia (ICIQ), The Barcelona Institute of Science and Technology (BIST), Av. Paisos Catalans, 16, Tarragona 43007, Spain
- Department
of Physical and Inorganic Chemistry, University
Rovira i Virgili (URV), Marcel·lí
Domingo s/n, Tarragona 43007, Spain
| |
Collapse
|
3
|
Kalikadien AV, Valsecchi C, van Putten R, Maes T, Muuronen M, Dyubankova N, Lefort L, Pidko EA. Probing machine learning models based on high throughput experimentation data for the discovery of asymmetric hydrogenation catalysts. Chem Sci 2024; 15:13618-13630. [PMID: 39211503 PMCID: PMC11352728 DOI: 10.1039/d4sc03647f] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2024] [Accepted: 07/15/2024] [Indexed: 09/04/2024] Open
Abstract
Enantioselective hydrogenation of olefins by Rh-based chiral catalysts has been extensively studied for more than 50 years. Naively, one would expect that everything about this transformation is known and that selecting a catalyst that induces the desired reactivity or selectivity is a trivial task. Nonetheless, ligand engineering or selection for any new prochiral olefin remains an empirical trial-error exercise. In this study, we investigated whether machine learning techniques could be used to accelerate the identification of the most efficient chiral ligand. For this purpose, we used high throughput experimentation to build a large dataset consisting of results for Rh-catalyzed asymmetric olefin hydrogenation, specially designed for applications in machine learning. We showcased its alignment with existing literature while addressing observed discrepancies. Additionally, a computational framework for the automated and reproducible quantum-chemistry based featurization of catalyst structures was created. Together with less computationally demanding representations, these descriptors were fed into our machine learning pipeline for both out-of-domain and in-domain prediction tasks of selectivity and reactivity. For out-of-domain purposes, our models provided limited efficacy. It was found that even the most expensive descriptors do not impart significant meaning to the model predictions. The in-domain application, while partly successful for predictions of conversion, emphasizes the need for evaluating the cost-benefit ratio of computationally intensive descriptors and for tailored descriptor design. Challenges persist in predicting enantioselectivity, calling for caution in interpreting results from small datasets. Our insights underscore the importance of dataset diversity with broad substrate inclusion and suggest that mechanistic considerations could improve the accuracy of statistical models.
Collapse
Affiliation(s)
- Adarsh V Kalikadien
- Inorganic Systems Engineering, Department of Chemical Engineering, Faculty of Applied Sciences, Delft University of Technology Van der Maasweg 9, 2629 HZ Delft The Netherlands
| | - Cecile Valsecchi
- Discovery, Product Development and Supply, Janssen Cilag S.p.A. Viale Fulvio Testi, 280/6 20126 Milano Italy
| | - Robbert van Putten
- Discovery, Product Development and Supply, Janssen Pharmaceutica N.V. Turnhoutseweg 30 2340 Beerse Belgium
| | - Tor Maes
- Discovery, Product Development and Supply, Janssen Pharmaceutica N.V. Turnhoutseweg 30 2340 Beerse Belgium
| | - Mikko Muuronen
- Discovery, Product Development and Supply, Janssen Pharmaceutica N.V. Turnhoutseweg 30 2340 Beerse Belgium
| | - Natalia Dyubankova
- Discovery, Product Development and Supply, Janssen Pharmaceutica N.V. Turnhoutseweg 30 2340 Beerse Belgium
| | - Laurent Lefort
- Discovery, Product Development and Supply, Janssen Pharmaceutica N.V. Turnhoutseweg 30 2340 Beerse Belgium
| | - Evgeny A Pidko
- Inorganic Systems Engineering, Department of Chemical Engineering, Faculty of Applied Sciences, Delft University of Technology Van der Maasweg 9, 2629 HZ Delft The Netherlands
| |
Collapse
|
4
|
Laplaza R, Wodrich MD, Corminboeuf C. Overcoming the Pitfalls of Computing Reaction Selectivity from Ensembles of Transition States. J Phys Chem Lett 2024; 15:7363-7370. [PMID: 38990895 DOI: 10.1021/acs.jpclett.4c01657] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/13/2024]
Abstract
The prediction of reaction selectivity is a challenging task for computational chemistry, not only because many molecules adopt multiple conformations but also due to the exponential relationship between effective activation energies and rate constants. To account for molecular flexibility, an increasing number of methods exist that generate conformational ensembles of transition state (TS) structures. Typically, these TS ensembles are Boltzmann weighted and used to compute selectivity assuming Curtin-Hammett conditions. This strategy, however, can lead to erroneous predictions if the appropriate filtering of the conformer ensembles is not conducted. Here, we demonstrate how any possible selectivity can be obtained by processing the same sets of TS ensembles for a model reaction. To address the burdensome filtering task in a consistent and automated way, we introduce marc, a tool for the modular analysis of representative conformers that aids in avoiding human errors while minimizing the number of reoptimization computations needed to obtain correct reaction selectivity.
Collapse
Affiliation(s)
- Ruben Laplaza
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, École Polytechnique Fédérale de Lausanne (EPFL), 1015 Lausanne, Switzerland
- National Center for Competence in Research-Catalysis (NCCR-Catalysis), École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| | - Matthew D Wodrich
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, École Polytechnique Fédérale de Lausanne (EPFL), 1015 Lausanne, Switzerland
- National Center for Competence in Research-Catalysis (NCCR-Catalysis), École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| | - Clemence Corminboeuf
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, École Polytechnique Fédérale de Lausanne (EPFL), 1015 Lausanne, Switzerland
- National Center for Competence in Research-Catalysis (NCCR-Catalysis), École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| |
Collapse
|
5
|
Baidun M, Kalikadien AV, Lefort L, Pidko EA. Impact of Model Selection and Conformational Effects on the Descriptors for In Silico Screening Campaigns: A Case Study of Rh-Catalyzed Acrylate Hydrogenation. THE JOURNAL OF PHYSICAL CHEMISTRY. C, NANOMATERIALS AND INTERFACES 2024; 128:7987-7998. [PMID: 40291068 PMCID: PMC12025388 DOI: 10.1021/acs.jpcc.4c01631] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/12/2024] [Revised: 04/16/2024] [Accepted: 04/16/2024] [Indexed: 04/30/2025]
Abstract
Data-driven catalyst design is a promising approach for addressing the challenges in identifying suitable catalysts for synthetic transformations. Models with descriptor calculations relying solely on the precatalyst structure are potentially generalizable but may overlook catalyst-substrate interactions. This study explores substrate-specific interactions in the context of Rh-catalyzed asymmetric hydrogenation to elucidate the impact of substrate inclusion on the catalyst structure and on the descriptors derived from it. We compare a catalyst-substrate complex with methyl 2-acetamidoacrylate as a model substrate with the generic precatalyst structure involving a placeholder substrate, norbornadiene, across 11 Rh-based catalysts with bidentate bisphosphine ligands. For these systems, a full conformer ensemble analysis reveals an intriguing finding: the rigid substrate induces conformational freedom in the ligand. This flexibility gives rise to a more diverse conformer landscape, showing a previously overlooked aspect of catalyst-substrate dynamics. Electronic descriptor variations particularly highlight differences between substrate-specific and precatalyst structures. This study suggests that generic precatalyst-like models may lack crucial insights into the conformational freedom of the catalyst. We speculate that such conformational freedom may be a more general phenomenon that can influence the development of generalizable predictive models of computational TM-based catalysis.
Collapse
Affiliation(s)
- Margareth
S. Baidun
- Inorganic
Systems Engineering, Department of Chemical Engineering, Faculty of
Applied Sciences, Delft University of Technology, Van der Maasweg 9, 2629 HZ Delft, The Netherlands
| | - Adarsh V. Kalikadien
- Inorganic
Systems Engineering, Department of Chemical Engineering, Faculty of
Applied Sciences, Delft University of Technology, Van der Maasweg 9, 2629 HZ Delft, The Netherlands
| | - Laurent Lefort
- Discovery,
Product Development and Supply, Janssen Pharmaceutica N.V., Turnhoutseweg 30, 2340 Beerse, Belgium
| | - Evgeny A. Pidko
- Inorganic
Systems Engineering, Department of Chemical Engineering, Faculty of
Applied Sciences, Delft University of Technology, Van der Maasweg 9, 2629 HZ Delft, The Netherlands
| |
Collapse
|