Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Varnek A, Fourches D, Hoonakker F, Solov'ev VP. Substructural fragments: an universal language to encode reactions, molecular and supramolecular structures. J Comput Aided Mol Des 2005;19:693-703. [PMID: 16292611 DOI: 10.1007/s10822-005-9008-0] [Citation(s) in RCA: 139] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2005] [Accepted: 07/28/2005] [Indexed: 10/25/2022]

For:	Varnek A, Fourches D, Hoonakker F, Solov'ev VP. Substructural fragments: an universal language to encode reactions, molecular and supramolecular structures. J Comput Aided Mol Des 2005;19:693-703. [PMID: 16292611 DOI: 10.1007/s10822-005-9008-0] [Citation(s) in RCA: 139] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2005] [Accepted: 07/28/2005] [Indexed: 10/25/2022]

Number

Cited by Other Article(s)

Li J, Reid JP. Connecting the complexity of stereoselective synthesis to the evolution of predictive tools. Chem Sci 2025;16:3832-3851. [PMID: 39911341 PMCID: PMC11791519 DOI: 10.1039/d4sc07461k] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2024] [Accepted: 01/22/2025] [Indexed: 02/07/2025] Open

Chen LY, Li YP. Machine learning-guided strategies for reaction conditions design and optimization. Beilstein J Org Chem 2024;20:2476-2492. [PMID: 39376489 PMCID: PMC11457048 DOI: 10.3762/bjoc.20.212] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2024] [Accepted: 09/19/2024] [Indexed: 10/09/2024] Open

Spiekermann KA, Dong X, Menon A, Green WH, Pfeifle M, Sandfort F, Welz O, Bergeler M. Accurately Predicting Barrier Heights for Radical Reactions in Solution Using Deep Graph Networks. J Phys Chem A 2024;128:8384-8403. [PMID: 39298746 DOI: 10.1021/acs.jpca.4c04121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/22/2024]

Abstract

Quantitative estimates of reaction barriers and solvent effects are essential for developing kinetic mechanisms and predicting reaction outcomes. Here, we create a new data set of 5,600 unique elementary radical reactions calculated using the M06-2X/def2-QZVP//B3LYP-D3(BJ)/def2-TZVP level of theory. A conformer search is done for each species using TPSS/def2-TZVP. Gibbs free energies of activation and of reaction for these radical reactions in 40 common solvents are obtained using COSMO-RS for solvation effects. These balanced reactions involve the elements H, C, N, O, and S, contain up to 19 heavy atoms, and have atom-mapped SMILES. All transition states are verified by an intrinsic reaction coordinate calculation. We next train a deep graph network to directly estimate the Gibbs free energy of activation and of reaction in both gas and solution phases using only the atom-mapped SMILES of the reactant and product and the SMILES of the solvent. This simple input representation avoids computationally expensive optimizations for the reactant, transition state, and product structures during inference, making our model well-suited for high-throughput predictive chemistry and quickly providing information for (retro-)synthesis planning tools. To properly measure model performance, we report results on both interpolative and extrapolative data splits and also compare to several baseline models. During training and testing, the data set is augmented by including the reverse direction of each reaction and variants with different resonance structures. After data augmentation, we have around 2 million entries to train the model, which achieves a testing set mean absolute error of 1.16 kcal mol-1 for the Gibbs free energy of activation in solution. We anticipate this model will accelerate predictions for high-throughput screening to quickly identify relevant reactions in solution, and our data set will serve as a benchmark for future studies.

Collapse

Plyer L, Marcou G, Perves C, Bonachera F, Varnek A. Implementation of a soft grading system for chemistry in a Moodle plugin: reaction handling. J Cheminform 2024;16:90. [PMID: 39090756 PMCID: PMC11295431 DOI: 10.1186/s13321-024-00889-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2024] [Accepted: 07/21/2024] [Indexed: 08/04/2024] Open

Abedin MM, Tabata K, Matsumura Y, Komatsuzaki T. Multi-armed bandit algorithm for sequential experiments of molecular properties with dynamic feature selection. J Chem Phys 2024;161:014115. [PMID: 38958158 DOI: 10.1063/5.0206042] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2024] [Accepted: 06/16/2024] [Indexed: 07/04/2024] Open

Ryzhkov FV, Ryzhkova YE, Elinson MN. Python tools for structural tasks in chemistry. Mol Divers 2024:10.1007/s11030-024-10889-7. [PMID: 38744790 DOI: 10.1007/s11030-024-10889-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2024] [Accepted: 04/27/2024] [Indexed: 05/16/2024]

Chen S, An S, Babazade R, Jung Y. Precise atom-to-atom mapping for organic reactions via human-in-the-loop machine learning. Nat Commun 2024;15:2250. [PMID: 38480709 PMCID: PMC10937625 DOI: 10.1038/s41467-024-46364-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2023] [Accepted: 02/20/2024] [Indexed: 03/17/2024] Open

Sidorov P, Tsuji N. A Primer on 2D Descriptors in Selectivity Modeling for Asymmetric Catalysis. Chemistry 2024;30:e202302837. [PMID: 38010242 DOI: 10.1002/chem.202302837] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2023] [Revised: 11/21/2023] [Accepted: 11/23/2023] [Indexed: 11/29/2023]

Chung Y, Green WH. Machine learning from quantum chemistry to predict experimental solvent effects on reaction rates. Chem Sci 2024;15:2410-2424. [PMID: 38362410 PMCID: PMC10866337 DOI: 10.1039/d3sc05353a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2023] [Accepted: 01/04/2024] [Indexed: 02/17/2024] Open

Voinarovska V, Kabeshov M, Dudenko D, Genheden S, Tetko IV. When Yield Prediction Does Not Yield Prediction: An Overview of the Current Challenges. J Chem Inf Model 2024;64:42-56. [PMID: 38116926 PMCID: PMC10778086 DOI: 10.1021/acs.jcim.3c01524] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Revised: 11/29/2023] [Accepted: 11/30/2023] [Indexed: 12/21/2023]

Zankov D, Madzhidov T, Polishchuk P, Sidorov P, Varnek A. Multi-Instance Learning Approach to the Modeling of Enantioselectivity of Conformationally Flexible Organic Catalysts. J Chem Inf Model 2023;63:6629-6641. [PMID: 37902548 DOI: 10.1021/acs.jcim.3c00393] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/31/2023]

Zankov D, Madzhidov T, Baskin I, Varnek A. Conjugated quantitative structure-property relationship models: Prediction of kinetic characteristics linked by the Arrhenius equation. Mol Inform 2023;42:e2200275. [PMID: 37488968 DOI: 10.1002/minf.202200275] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2022] [Revised: 07/08/2023] [Accepted: 07/24/2023] [Indexed: 07/26/2023]

Sar S, Mitra S, Panda P, Mandal SC, Ghosh N, Halder AK, Cordeiro MNDS. In Silico Modeling and Structural Analysis of Soluble Epoxide Hydrolase Inhibitors for Enhanced Therapeutic Design. Molecules 2023;28:6379. [PMID: 37687207 PMCID: PMC10490281 DOI: 10.3390/molecules28176379] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2023] [Revised: 08/17/2023] [Accepted: 08/28/2023] [Indexed: 09/10/2023] Open

Abstract

Human soluble epoxide hydrolase (sEH), a dual-functioning homodimeric enzyme with hydrolase and phosphatase activities, is known for its pivotal role in the hydrolysis of epoxyeicosatrienoic acids. Inhibitors targeting sEH have shown promising potential in the treatment of various life-threatening diseases. In this study, we employed a range of in silico modeling approaches to investigate a diverse dataset of structurally distinct sEH inhibitors. Our primary aim was to develop predictive and validated models while gaining insights into the structural requirements necessary for achieving higher inhibitory potential. To accomplish this, we initially calculated molecular descriptors using nine different descriptor-calculating tools, coupled with stochastic and non-stochastic feature selection strategies, to identify the most statistically significant linear 2D-QSAR model. The resulting model highlighted the critical roles played by topological characteristics, 2D pharmacophore features, and specific physicochemical properties in enhancing inhibitory potential. In addition to conventional 2D-QSAR modeling, we implemented the Transformer-CNN methodology to develop QSAR models, enabling us to obtain structural interpretations based on the Layer-wise Relevance Propagation (LRP) algorithm. Moreover, a comprehensive 3D-QSAR analysis provided additional insights into the structural requirements of these compounds as potent sEH inhibitors. To validate the findings from the QSAR modeling studies, we performed molecular dynamics (MD) simulations using selected compounds from the dataset. The simulation results offered crucial insights into receptor-ligand interactions, supporting the predictions obtained from the QSAR models. Collectively, our work serves as an essential guideline for the rational design of novel sEH inhibitors with enhanced therapeutic potential. Importantly, all the in silico studies were performed using open-access tools to ensure reproducibility and accessibility.

Collapse

Jiang J, Zhang R, Yuan Y, Li T, Li G, Zhao Z, Yu Z. NoiseMol: A noise-robusted data augmentation via perturbing noise for molecular property prediction. J Mol Graph Model 2023;121:108454. [PMID: 36963306 DOI: 10.1016/j.jmgm.2023.108454] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2023] [Revised: 03/05/2023] [Accepted: 03/13/2023] [Indexed: 03/17/2023]

Abstract

Simplified Molecular-Input Line-Entry System (SMILES) is one of a widely used molecular representation methods for molecular property prediction. We conjecture that all the characters in the SMILES string of a molecule are essential for making up the molecules, but most of them make little contribution to determining a particular property of the molecule. Therefore, we verified the conjecture in the pre-experiment. Motivated by the result, we propose to inject proper noisy information into the SMILES to augment the training data by increasing the diversity of the labeled molecules. To this end, we explore injecting perturbing noise into the original labeled SMILES strings to construct augmented data for alleviating the limitation of the labeled compound data and enhancing the model to extract more useful molecular representation for molecular property prediction. Specifically, we directly adopt mask, swap, deletion, and fusion operations on SMILES strings to randomly mask, swap, and delete atoms in SMILES strings. Then, the augmented data is used by two strategies: each epoch alternately feeds the original and perturbing noisy molecules, or each batch alternately feeds the original and perturbing noisy molecules. We conduct experiments on both Transformer and BiGRU models to validate the effectiveness by adopting widely used datasets from MoleculeNet and ZINC. Experimental results demonstrate that the proposed method outperforms strong baselines on all the datasets. NoiseMol obtains the best performance on BBBP and FDA when compared with state-of-the-art methods. Besides, NoiseMol achieves the best accuracy on LogP. Therefore, injecting perturbing noise into the labeled SMILES strings is an effective and efficient method, which improves the prediction performance, generalization, and robustness of the deep learning models.

Collapse

Ksenofontov AA, Isaev YI, Lukanov MM, Makarov DM, Eventova VA, Khodov IA, Berezin MB. Accurate prediction of ¹¹B NMR chemical shift of BODIPYs via machine learning. Phys Chem Chem Phys 2023;25:9472-9481. [PMID: 36935644 DOI: 10.1039/d3cp00253e] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/08/2023]

Tsuji N, Sidorov P, Zhu C, Nagata Y, Gimadiev T, Varnek A, List B. Predicting Highly Enantioselective Catalysts Using Tunable Fragment Descriptors. Angew Chem Int Ed Engl 2023;62:e202218659. [PMID: 36688354 DOI: 10.1002/anie.202218659] [Citation(s) in RCA: 22] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2022] [Revised: 01/17/2023] [Accepted: 01/19/2023] [Indexed: 01/24/2023]

Kwon Y, Kim S, Choi YS, Kang S. Generative Modeling to Predict Multiple Suitable Conditions for Chemical Reactions. J Chem Inf Model 2022;62:5952-5960. [PMID: 36413480 DOI: 10.1021/acs.jcim.2c01085] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]

Mai H, Le TC, Chen D, Winkler DA, Caruso RA. Machine Learning for Electrocatalyst and Photocatalyst Design and Discovery. Chem Rev 2022;122:13478-13515. [PMID: 35862246 DOI: 10.1021/acs.chemrev.2c00061] [Citation(s) in RCA: 97] [Impact Index Per Article: 32.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]

Lewis‐Atwell T, Townsend PA, Grayson MN. Machine learning activation energies of chemical reactions. WIRES COMPUTATIONAL MOLECULAR SCIENCE 2022. [DOI: 10.1002/wcms.1593] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]

Spiekermann KA, Pattanaik L, Green WH. Fast Predictions of Reaction Barrier Heights: Toward Coupled-Cluster Accuracy. J Phys Chem A 2022;126:3976-3986. [PMID: 35727075 DOI: 10.1021/acs.jpca.2c02614] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]

Heid E, Green WH. Machine Learning of Reaction Properties via Learned Representations of the Condensed Graph of Reaction. J Chem Inf Model 2022;62:2101-2110. [PMID: 34734699 PMCID: PMC9092344 DOI: 10.1021/acs.jcim.1c00975] [Citation(s) in RCA: 60] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2021] [Indexed: 11/28/2022]

Schadow G, Borodina YV, Delannée V, Ihlenfeldt WD, Godfrey AG, Nicklaus MC. Reaction SPL – extension of a public document markup standard to chemical reactions. PURE APPL CHEM 2022. [PMCID: PMC9189732 DOI: 10.1515/pac-2021-2011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]

Baskin I, Epshtein A, Ein-Eli Y. Benchmarking machine learning methods for modeling physical properties of ionic liquids. J Mol Liq 2022. [DOI: 10.1016/j.molliq.2022.118616] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]

Prediction of Carbonate Selectivity of PVC-Plasticized Sensor Membranes with Newly Synthesized Ionophores through QSPR Modeling. CHEMOSENSORS 2022. [DOI: 10.3390/chemosensors10020043] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]

Afonina VA, Mazitov DA, Nurmukhametova A, Shevelev MD, Khasanova DA, Nugmanov RI, Burilov VA, Madzhidov TI, Varnek A. Prediction of Optimal Conditions of Hydrogenation Reaction Using the Likelihood Ranking Approach. Int J Mol Sci 2021;23:ijms23010248. [PMID: 35008674 PMCID: PMC8745269 DOI: 10.3390/ijms23010248] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2021] [Revised: 12/18/2021] [Accepted: 12/23/2021] [Indexed: 11/20/2022] Open

Affiliation(s)

Valentina A. Afonina Chemoinformatics and Molecular Modelling Lab, A.M. Butlerov Institute of Chemistry, Kazan Federal University, Kremlyovskaya Str. 18, 420008 Kazan, Russia; (V.A.A.); (D.A.M.); (A.N.); (M.D.S.); (D.A.K.); (R.I.N.); (V.A.B.)
Daniyar A. Mazitov Chemoinformatics and Molecular Modelling Lab, A.M. Butlerov Institute of Chemistry, Kazan Federal University, Kremlyovskaya Str. 18, 420008 Kazan, Russia; (V.A.A.); (D.A.M.); (A.N.); (M.D.S.); (D.A.K.); (R.I.N.); (V.A.B.)
Albina Nurmukhametova Chemoinformatics and Molecular Modelling Lab, A.M. Butlerov Institute of Chemistry, Kazan Federal University, Kremlyovskaya Str. 18, 420008 Kazan, Russia; (V.A.A.); (D.A.M.); (A.N.); (M.D.S.); (D.A.K.); (R.I.N.); (V.A.B.)
Maxim D. Shevelev Chemoinformatics and Molecular Modelling Lab, A.M. Butlerov Institute of Chemistry, Kazan Federal University, Kremlyovskaya Str. 18, 420008 Kazan, Russia; (V.A.A.); (D.A.M.); (A.N.); (M.D.S.); (D.A.K.); (R.I.N.); (V.A.B.) Laboratory of Chemoinformatics (UMR 7140 CNRS/UniStra), Université de Strasbourg, 4, Rue Blaise Pascal, 67000 Strasbourg, France
Dina A. Khasanova Chemoinformatics and Molecular Modelling Lab, A.M. Butlerov Institute of Chemistry, Kazan Federal University, Kremlyovskaya Str. 18, 420008 Kazan, Russia; (V.A.A.); (D.A.M.); (A.N.); (M.D.S.); (D.A.K.); (R.I.N.); (V.A.B.)
Ramil I. Nugmanov Chemoinformatics and Molecular Modelling Lab, A.M. Butlerov Institute of Chemistry, Kazan Federal University, Kremlyovskaya Str. 18, 420008 Kazan, Russia; (V.A.A.); (D.A.M.); (A.N.); (M.D.S.); (D.A.K.); (R.I.N.); (V.A.B.)
Vladimir A. Burilov Chemoinformatics and Molecular Modelling Lab, A.M. Butlerov Institute of Chemistry, Kazan Federal University, Kremlyovskaya Str. 18, 420008 Kazan, Russia; (V.A.A.); (D.A.M.); (A.N.); (M.D.S.); (D.A.K.); (R.I.N.); (V.A.B.)
Timur I. Madzhidov Chemoinformatics and Molecular Modelling Lab, A.M. Butlerov Institute of Chemistry, Kazan Federal University, Kremlyovskaya Str. 18, 420008 Kazan, Russia; (V.A.A.); (D.A.M.); (A.N.); (M.D.S.); (D.A.K.); (R.I.N.); (V.A.B.) Correspondence: (T.I.M.); (A.V.)
Alexandre Varnek Laboratory of Chemoinformatics (UMR 7140 CNRS/UniStra), Université de Strasbourg, 4, Rue Blaise Pascal, 67000 Strasbourg, France Institute for Chemical Reaction Design and Discovery (WPI-ICReDD), Hokkaido University, Kita 21 Nishi 10, Kita-ku, Sapporo 001-0021, Japan Correspondence: (T.I.M.); (A.V.)

Collapse

Gimadiev T, Nugmanov R, Khakimova A, Fatykhova A, Madzhidov T, Sidorov P, Varnek A. CGRdb2.0: A Python Database Management System for Molecules, Reactions, and Chemical Data. J Chem Inf Model 2021;62:2015-2020. [PMID: 34843251 DOI: 10.1021/acs.jcim.1c01105] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Orlov AA, Demenko DY, Bignaud C, Valtz A, Marcou G, Horvath D, Coquelet C, Varnek A, de Meyer F. Chemoinformatics-Driven Design of New Physical Solvents for Selective CO₂ Absorption. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2021;55:15542-15553. [PMID: 34736317 DOI: 10.1021/acs.est.1c04092] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]

Machine learning modelling of chemical reaction characteristics: yesterday, today, tomorrow. MENDELEEV COMMUNICATIONS 2021. [DOI: 10.1016/j.mencom.2021.11.003] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]

Dong J, Zhao M, Liu Y, Su Y, Zeng X. Deep learning in retrosynthesis planning: datasets, models and tools. Brief Bioinform 2021;23:6375056. [PMID: 34571535 DOI: 10.1093/bib/bbab391] [Citation(s) in RCA: 41] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2021] [Revised: 08/16/2021] [Accepted: 08/30/2021] [Indexed: 12/29/2022] Open

Varnek A, Zankov D, Polishchuk P, Madzhidov T. Multi-Instance Learning Approach to Predictive Modeling of Catalysts Enantioselectivity. Synlett 2021. [DOI: 10.1055/a-1553-0427] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]

Baybekov S, Marcou G, Ramos P, Saurel O, Galzi JL, Varnek A. DMSO Solubility Assessment for Fragment-Based Screening. Molecules 2021;26:3950. [PMID: 34203441 PMCID: PMC8271413 DOI: 10.3390/molecules26133950] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2021] [Revised: 06/23/2021] [Accepted: 06/23/2021] [Indexed: 11/16/2022] Open

Mansouri K, Karmaus AL, Fitzpatrick J, Patlewicz G, Pradeep P, Alberga D, Alepee N, Allen TE, Allen D, Alves VM, Andrade CH, Auernhammer TR, Ballabio D, Bell S, Benfenati E, Bhattacharya S, Bastos JV, Boyd S, Brown J, Capuzzi SJ, Chushak Y, Ciallella H, Clark AM, Consonni V, Daga PR, Ekins S, Farag S, Fedorov M, Fourches D, Gadaleta D, Gao F, Gearhart JM, Goh G, Goodman JM, Grisoni F, Grulke CM, Hartung T, Hirn M, Karpov P, Korotcov A, Lavado GJ, Lawless M, Li X, Luechtefeld T, Lunghini F, Mangiatordi GF, Marcou G, Marsh D, Martin T, Mauri A, Muratov EN, Myatt GJ, Nguyen DT, Nicolotti O, Note R, Pande P, Parks AK, Peryea T, Polash AH, Rallo R, Roncaglioni A, Rowlands C, Ruiz P, Russo DP, Sayed A, Sayre R, Sheils T, Siegel C, Silva AC, Simeonov A, Sosnin S, Southall N, Strickland J, Tang Y, Teppen B, Tetko IV, Thomas D, Tkachenko V, Todeschini R, Toma C, Tripodi I, Trisciuzzi D, Tropsha A, Varnek A, Vukovic K, Wang Z, Wang L, Waters KM, Wedlake AJ, Wijeyesakere SJ, Wilson D, Xiao Z, Yang H, Zahoranszky-Kohalmi G, Zakharov AV, Zhang FF, Zhang Z, Zhao T, Zhu H, Zorn KM, et alMansouri K, Karmaus AL, Fitzpatrick J, Patlewicz G, Pradeep P, Alberga D, Alepee N, Allen TE, Allen D, Alves VM, Andrade CH, Auernhammer TR, Ballabio D, Bell S, Benfenati E, Bhattacharya S, Bastos JV, Boyd S, Brown J, Capuzzi SJ, Chushak Y, Ciallella H, Clark AM, Consonni V, Daga PR, Ekins S, Farag S, Fedorov M, Fourches D, Gadaleta D, Gao F, Gearhart JM, Goh G, Goodman JM, Grisoni F, Grulke CM, Hartung T, Hirn M, Karpov P, Korotcov A, Lavado GJ, Lawless M, Li X, Luechtefeld T, Lunghini F, Mangiatordi GF, Marcou G, Marsh D, Martin T, Mauri A, Muratov EN, Myatt GJ, Nguyen DT, Nicolotti O, Note R, Pande P, Parks AK, Peryea T, Polash AH, Rallo R, Roncaglioni A, Rowlands C, Ruiz P, Russo DP, Sayed A, Sayre R, Sheils T, Siegel C, Silva AC, Simeonov A, Sosnin S, Southall N, Strickland J, Tang Y, Teppen B, Tetko IV, Thomas D, Tkachenko V, Todeschini R, Toma C, Tripodi I, Trisciuzzi D, Tropsha A, Varnek A, Vukovic K, Wang Z, Wang L, Waters KM, Wedlake AJ, Wijeyesakere SJ, Wilson D, Xiao Z, Yang H, Zahoranszky-Kohalmi G, Zakharov AV, Zhang FF, Zhang Z, Zhao T, Zhu H, Zorn KM, Casey W, Kleinstreuer NC. CATMoS: Collaborative Acute Toxicity Modeling Suite. ENVIRONMENTAL HEALTH PERSPECTIVES 2021;129:47013. [PMID: 33929906 PMCID: PMC8086800 DOI: 10.1289/ehp8495] [Show More Authors] [Citation(s) in RCA: 63] [Impact Index Per Article: 15.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/19/2020] [Revised: 03/10/2021] [Accepted: 03/19/2021] [Indexed: 05/02/2023]

Abstract

BACKGROUND

Humans are exposed to tens of thousands of chemical substances that need to be assessed for their potential toxicity. Acute systemic toxicity testing serves as the basis for regulatory hazard classification, labeling, and risk management. However, it is cost- and time-prohibitive to evaluate all new and existing chemicals using traditional rodent acute toxicity tests. In silico models built using existing data facilitate rapid acute toxicity predictions without using animals.

OBJECTIVES

The U.S. Interagency Coordinating Committee on the Validation of Alternative Methods (ICCVAM) Acute Toxicity Workgroup organized an international collaboration to develop in silico models for predicting acute oral toxicity based on five different end points: Lethal Dose 50 (LD 50 value, U.S. Environmental Protection Agency hazard (four) categories, Globally Harmonized System for Classification and Labeling hazard (five) categories, very toxic chemicals [LD 50 (LD 50 ≤ 50 mg / kg )], and nontoxic chemicals (L D 50 > 2,000 mg / kg ).

METHODS

An acute oral toxicity data inventory for 11,992 chemicals was compiled, split into training and evaluation sets, and made available to 35 participating international research groups that submitted a total of 139 predictive models. Predictions that fell within the applicability domains of the submitted models were evaluated using external validation sets. These were then combined into consensus models to leverage strengths of individual approaches.

RESULTS

The resulting consensus predictions, which leverage the collective strengths of each individual model, form the Collaborative Acute Toxicity Modeling Suite (CATMoS). CATMoS demonstrated high performance in terms of accuracy and robustness when compared with in vivo results.

DISCUSSION

CATMoS is being evaluated by regulatory agencies for its utility and applicability as a potential replacement for in vivo rat acute oral toxicity studies. CATMoS predictions for more than 800,000 chemicals have been made available via the National Toxicology Program's Integrated Chemical Environment tools and data sets (ice.ntp.niehs.nih.gov). The models are also implemented in a free, standalone, open-source tool, OPERA, which allows predictions of new and untested chemicals to be made. https://doi.org/10.1289/EHP8495.

Collapse

Affiliation(s)

Kamel Mansouri Integrated Laboratory Systems, LLC, Morrisville, North Carolina, USA National Toxicology Program Interagency Center for the Evaluation of Alternative Toxicological Methods, Research Triangle Park, North Carolina, USA
Agnes L. Karmaus Integrated Laboratory Systems, LLC, Morrisville, North Carolina, USA
Jeremy Fitzpatrick ScitoVation, Research Triangle Park, North Carolina, USA
Grace Patlewicz Center for Computational Toxicology and Exposure, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina, USA
Prachi Pradeep Center for Computational Toxicology and Exposure, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina, USA Oak Ridge Institute for Science and Education (ORISE) Research Participation Program, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina, USA
Domenico Alberga Dipartimento di Farmacia-Scienze del Farmaco, Università degli Studi di Bari “Aldo Moro”, Bari, Italy
Nathalie Alepee L’Oréal Research & Innovation, Aulnay-sous-Bois, France
Timothy E.H. Allen Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Cambridge, UK
Dave Allen Integrated Laboratory Systems, LLC, Morrisville, North Carolina, USA
Vinicius M. Alves Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina, USA Laboratory for Molecular Modeling and Design, Faculty of Pharmacy, Federal University of Goiás, Goiania, Brazil
Carolina H. Andrade Laboratory for Molecular Modeling and Design, Faculty of Pharmacy, Federal University of Goiás, Goiania, Brazil
Tyler R. Auernhammer The Dow Chemical Company, Midland, Michigan, USA
Davide Ballabio Milano Chemometrics & QSAR Research Group, Department of Earth and Environmental Sciences, University of Milano-Bicocca, Milan, Italy
Shannon Bell Integrated Laboratory Systems, LLC, Morrisville, North Carolina, USA
Emilio Benfenati Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milan, Italy
Sudin Bhattacharya Institute for Quantitative Health Science and Engineering, Department of Biomedical Engineering, Michigan State University, East Lansing, Michigan, USA
Joyce V. Bastos Laboratory for Molecular Modeling and Design, Faculty of Pharmacy, Federal University of Goiás, Goiania, Brazil
Stephen Boyd Department of Plant, Soil, and Microbial Sciences, Michigan State University, East Lansing, Michigan, USA
J.B. Brown Kyoto University Graduate School of Medicine, Kyoto, Japan
Stephen J. Capuzzi Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina, USA
Yaroslav Chushak Aeromedical Research Department, Force Health Protection, USAFSAM, Dayton, Ohio, USA Henry M Jackson Foundation for the Advancement of Military Medicine, Dayton, Ohio, USA
Heather Ciallella Center for Computational and Integrative Biology, Rutgers University, Camden, New Jersey, USA
Alex M. Clark Collaborations Pharmaceuticals, Inc., Raleigh, North Carolina, USA
Viviana Consonni Milano Chemometrics & QSAR Research Group, Department of Earth and Environmental Sciences, University of Milano-Bicocca, Milan, Italy
Pankaj R. Daga Simulations Plus, Inc., Lancaster, California, USA
Sean Ekins Collaborations Pharmaceuticals, Inc., Raleigh, North Carolina, USA
Sherif Farag Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina, USA
Maxim Fedorov Skoltech, Skolkovo Institute of Science and Technology, Moscow, Russia
Denis Fourches Department of Chemistry, North Carolina State University, Raleigh, North Carolina, USA Bioinformatics Research Center, North Carolina State University, Raleigh, North Carolina, USA
Domenico Gadaleta Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milan, Italy
Feng Gao Department of Plant, Soil, and Microbial Sciences, Michigan State University, East Lansing, Michigan, USA
Jeffery M. Gearhart Aeromedical Research Department, Force Health Protection, USAFSAM, Dayton, Ohio, USA Henry M Jackson Foundation for the Advancement of Military Medicine, Dayton, Ohio, USA
Garett Goh Pacific Northwest National Laboratory, Richland, Washington, USA
Jonathan M. Goodman Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Cambridge, UK
Francesca Grisoni Milano Chemometrics & QSAR Research Group, Department of Earth and Environmental Sciences, University of Milano-Bicocca, Milan, Italy
Christopher M. Grulke Center for Computational Toxicology and Exposure, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina, USA
Thomas Hartung Underwriters Laboratories, Northbrook, Illinois, USA
Matthew Hirn Department of Computational Mathematics, Science & Engineering, Department of Mathematics, Michigan State University, East Lansing, Michigan, USA
Pavel Karpov Institute of Structural Biology, Helmholtz Zentrum München (GmbH), Neuherberg, Germany
Alexandru Korotcov Science Data Software, LLC, Rockville, Maryland, USA
Giovanna J. Lavado Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milan, Italy
Michael Lawless Simulations Plus, Inc., Lancaster, California, USA
Xinhao Li Department of Chemistry, North Carolina State University, Raleigh, North Carolina, USA
Thomas Luechtefeld Underwriters Laboratories, Northbrook, Illinois, USA
Filippo Lunghini Laboratoire de Chemoinformatique, URM7140, Université de Strasbourg, Strasbourg, France
Giuseppe F. Mangiatordi Dipartimento di Farmacia-Scienze del Farmaco, Università degli Studi di Bari “Aldo Moro”, Bari, Italy
Gilles Marcou Laboratoire de Chemoinformatique, URM7140, Université de Strasbourg, Strasbourg, France
Dan Marsh Underwriters Laboratories, Northbrook, Illinois, USA
Todd Martin Center for Computational Toxicology and Exposure, U.S. Environmental Protection Agency, Cincinnati, Ohio, USA
Andrea Mauri Alvascience Srl, Lecco, Italy
Eugene N. Muratov Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina, USA Laboratory for Molecular Modeling and Design, Faculty of Pharmacy, Federal University of Goiás, Goiania, Brazil
Glenn J. Myatt Leadscope Inc., Columbus, Ohio, USA
Dac-Trung Nguyen National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland, USA
Orazio Nicolotti Dipartimento di Farmacia-Scienze del Farmaco, Università degli Studi di Bari “Aldo Moro”, Bari, Italy
Reine Note L’Oréal Research & Innovation, Aulnay-sous-Bois, France
Paritosh Pande Pacific Northwest National Laboratory, Richland, Washington, USA
Amanda K. Parks The Dow Chemical Company, Midland, Michigan, USA
Tyler Peryea National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland, USA
Ahsan H. Polash Kyoto University Graduate School of Medicine, Kyoto, Japan
Robert Rallo Pacific Northwest National Laboratory, Richland, Washington, USA
Alessandra Roncaglioni Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milan, Italy
Craig Rowlands Underwriters Laboratories, Northbrook, Illinois, USA
Patricia Ruiz Office of Innovation and Analytics, Agency for Toxic Substances and Disease Registry, Centers for Disease Control and Prevention, Atlanta, Georgia, USA
Daniel P. Russo Center for Computational and Integrative Biology, Rutgers University, Camden, New Jersey, USA
Ahmed Sayed Rosettastein Consulting UG, Freising, Germany
Risa Sayre Center for Computational Toxicology and Exposure, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina, USA Oak Ridge Institute for Science and Education (ORISE) Research Participation Program, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina, USA
Timothy Sheils National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland, USA
Charles Siegel Pacific Northwest National Laboratory, Richland, Washington, USA
Arthur C. Silva Laboratory for Molecular Modeling and Design, Faculty of Pharmacy, Federal University of Goiás, Goiania, Brazil
Anton Simeonov National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland, USA
Sergey Sosnin Skoltech, Skolkovo Institute of Science and Technology, Moscow, Russia
Noel Southall National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland, USA
Judy Strickland Integrated Laboratory Systems, LLC, Morrisville, North Carolina, USA
Yun Tang Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai, China
Brian Teppen Department of Plant, Soil, and Microbial Sciences, Michigan State University, East Lansing, Michigan, USA
Igor V. Tetko Institute of Structural Biology, Helmholtz Zentrum München (GmbH), Neuherberg, Germany BIGCHEM GmbH, Unterschleissheim, Germany
Dennis Thomas Pacific Northwest National Laboratory, Richland, Washington, USA
Valery Tkachenko Science Data Software, LLC, Rockville, Maryland, USA
Roberto Todeschini Milano Chemometrics & QSAR Research Group, Department of Earth and Environmental Sciences, University of Milano-Bicocca, Milan, Italy
Cosimo Toma Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milan, Italy
Ignacio Tripodi Computer Science/Interdisciplinary Quantitative Biology, University of Colorado, Boulder, Colorado, USA
Daniela Trisciuzzi Dipartimento di Farmacia-Scienze del Farmaco, Università degli Studi di Bari “Aldo Moro”, Bari, Italy
Alexander Tropsha Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina, USA
Alexandre Varnek Laboratoire de Chemoinformatique, URM7140, Université de Strasbourg, Strasbourg, France
Kristijan Vukovic Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milan, Italy
Zhongyu Wang School of Environmental Sciences and Technology, Dalian University of Technology; Dalian, Liaoning, China
Liguo Wang School of Environmental Sciences and Technology, Dalian University of Technology; Dalian, Liaoning, China
Katrina M. Waters Pacific Northwest National Laboratory, Richland, Washington, USA
Andrew J. Wedlake Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Cambridge, UK
Sanjeeva J. Wijeyesakere The Dow Chemical Company, Midland, Michigan, USA
Dan Wilson The Dow Chemical Company, Midland, Michigan, USA
Zijun Xiao School of Environmental Sciences and Technology, Dalian University of Technology; Dalian, Liaoning, China
Hongbin Yang Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai, China
Gergely Zahoranszky-Kohalmi National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland, USA
Alexey V. Zakharov National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland, USA
Fagen F. Zhang The Dow Chemical Company, Midland, Michigan, USA
Zhen Zhang Dow Agrosciences, Indianapolis, Indiana, USA
Tongan Zhao National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland, USA
Hao Zhu Center for Computational and Integrative Biology, Rutgers University, Camden, New Jersey, USA
Kimberley M. Zorn Collaborations Pharmaceuticals, Inc., Raleigh, North Carolina, USA
Warren Casey National Toxicology Program Interagency Center for the Evaluation of Alternative Toxicological Methods, Research Triangle Park, North Carolina, USA
Nicole C. Kleinstreuer National Toxicology Program Interagency Center for the Evaluation of Alternative Toxicological Methods, Research Triangle Park, North Carolina, USA

Collapse

Kumar S, Kim MH. SMPLIP-Score: predicting ligand binding affinity from simple and interpretable on-the-fly interaction fingerprint pattern descriptors. J Cheminform 2021;13:28. [PMID: 33766140 PMCID: PMC7993508 DOI: 10.1186/s13321-021-00507-1] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2020] [Accepted: 03/16/2021] [Indexed: 12/13/2022] Open

Organic reactivity from mechanism to machine learning. Nat Rev Chem 2021;5:240-255. [PMID: 37117288 DOI: 10.1038/s41570-021-00260-x] [Citation(s) in RCA: 68] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/10/2021] [Indexed: 12/25/2022]

Rakhimbekova A, Akhmetshin TN, Minibaeva GI, Nugmanov RI, Gimadiev TR, Madzhidov TI, Baskin II, Varnek A. Cross-validation strategies in QSPR modelling of chemical reactions. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2021;32:207-219. [PMID: 33601989 DOI: 10.1080/1062936x.2021.1883107] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/18/2020] [Accepted: 01/26/2021] [Indexed: 06/12/2023]

Bort W, Baskin II, Gimadiev T, Mukanov A, Nugmanov R, Sidorov P, Marcou G, Horvath D, Klimchuk O, Madzhidov T, Varnek A. Discovery of novel chemical reactions by deep generative recurrent neural network. Sci Rep 2021;11:3178. [PMID: 33542271 PMCID: PMC7862614 DOI: 10.1038/s41598-021-81889-y] [Citation(s) in RCA: 30] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2020] [Accepted: 01/06/2021] [Indexed: 12/18/2022] Open

Affiliation(s)

William Bort Laboratory of Chemoinformatics, UMR 7140 CNRS, University of Strasbourg, 1, rue Blaise Pascal, 67000, Strasbourg, France
Igor I Baskin Laboratory of Chemoinformatics, UMR 7140 CNRS, University of Strasbourg, 1, rue Blaise Pascal, 67000, Strasbourg, France Laboratory of Chemoinformatics and Molecular Modeling, Butlerov Institute of Chemistry, Kazan Federal University, Kremlyovskaya str. 18, 420008, Kazan, Russia Department of Materials Science and Engineering, Technion - Israel Institute of Technology, 3200003, Haifa, Israel
Timur Gimadiev Institute for Chemical Reaction Design and Discovery (WPI-ICReDD), Hokkaido University, Kita 21 Nishi 10, Kita-ku, Sapporo, 001-0021, Japan
Artem Mukanov Laboratory of Chemoinformatics and Molecular Modeling, Butlerov Institute of Chemistry, Kazan Federal University, Kremlyovskaya str. 18, 420008, Kazan, Russia
Ramil Nugmanov Laboratory of Chemoinformatics and Molecular Modeling, Butlerov Institute of Chemistry, Kazan Federal University, Kremlyovskaya str. 18, 420008, Kazan, Russia
Pavel Sidorov Institute for Chemical Reaction Design and Discovery (WPI-ICReDD), Hokkaido University, Kita 21 Nishi 10, Kita-ku, Sapporo, 001-0021, Japan
Gilles Marcou Laboratory of Chemoinformatics, UMR 7140 CNRS, University of Strasbourg, 1, rue Blaise Pascal, 67000, Strasbourg, France
Dragos Horvath Laboratory of Chemoinformatics, UMR 7140 CNRS, University of Strasbourg, 1, rue Blaise Pascal, 67000, Strasbourg, France
Olga Klimchuk Laboratory of Chemoinformatics, UMR 7140 CNRS, University of Strasbourg, 1, rue Blaise Pascal, 67000, Strasbourg, France
Timur Madzhidov Laboratory of Chemoinformatics and Molecular Modeling, Butlerov Institute of Chemistry, Kazan Federal University, Kremlyovskaya str. 18, 420008, Kazan, Russia
Alexandre Varnek Laboratory of Chemoinformatics, UMR 7140 CNRS, University of Strasbourg, 1, rue Blaise Pascal, 67000, Strasbourg, France. Institute for Chemical Reaction Design and Discovery (WPI-ICReDD), Hokkaido University, Kita 21 Nishi 10, Kita-ku, Sapporo, 001-0021, Japan.

Collapse

Gimadiev T, Nugmanov R, Batyrshin D, Madzhidov T, Maeda S, Sidorov P, Varnek A. Combined Graph/Relational Database Management System for Calculated Chemical Reaction Pathway Data. J Chem Inf Model 2021;61:554-559. [PMID: 33502186 DOI: 10.1021/acs.jcim.0c01280] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]

Jorner K, Brinck T, Norrby PO, Buttar D. Machine learning meets mechanistic modelling for accurate prediction of experimental activation energies. Chem Sci 2021;12:1163-1175. [PMID: 36299676 PMCID: PMC9528810 DOI: 10.1039/d0sc04896h] [Citation(s) in RCA: 88] [Impact Index Per Article: 22.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2020] [Accepted: 11/02/2020] [Indexed: 12/19/2022] Open

Varnek A, Baskin II. Modern Trends in Chemical Reactions Modeling. SYSTEMS MEDICINE 2021. [DOI: 10.1016/b978-0-12-801238-3.11543-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022] Open

Chemical Graph Theory for Property Modeling in QSAR and QSPR—Charming QSAR & QSPR. MATHEMATICS 2020. [DOI: 10.3390/math9010060] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Guan Y, Coley CW, Wu H, Ranasinghe D, Heid E, Struble TJ, Pattanaik L, Green WH, Jensen KF. Regio-selectivity prediction with a machine-learned reaction representation and on-the-fly quantum mechanical descriptors. Chem Sci 2020;12:2198-2208. [PMID: 34163985 PMCID: PMC8179287 DOI: 10.1039/d0sc04823b] [Citation(s) in RCA: 70] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2020] [Accepted: 12/19/2020] [Indexed: 12/20/2022] Open

Cabrera-Andrade A, López-Cortés A, Jaramillo-Koupermann G, González-Díaz H, Pazos A, Munteanu CR, Pérez-Castillo Y, Tejera E. A Multi-Objective Approach for Anti-Osteosarcoma Cancer Agents Discovery through Drug Repurposing. Pharmaceuticals (Basel) 2020;13:ph13110409. [PMID: 33266378 PMCID: PMC7700154 DOI: 10.3390/ph13110409] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2020] [Revised: 11/11/2020] [Accepted: 11/12/2020] [Indexed: 02/08/2023] Open

Affiliation(s)

Alejandro Cabrera-Andrade Grupo de Bio-Quimioinformática, Universidad de Las Américas, Quito 170125, Ecuador; Carrera de Enfermería, Facultad de Ciencias de la Salud, Universidad de Las Américas, Quito 170125, Ecuador Department of Computer Science and Information Technologies, Faculty of Computer Science, University of A Coruña, CITIC, Campus Elviña s/n, 15071 A Coruña, Spain; (A.L.-C.); (A.P.); (C.R.M.) Correspondence: (A.C.-A.); (E.T.)
Andrés López-Cortés Department of Computer Science and Information Technologies, Faculty of Computer Science, University of A Coruña, CITIC, Campus Elviña s/n, 15071 A Coruña, Spain; (A.L.-C.); (A.P.); (C.R.M.) Centro de Investigación Genética y Genómica, Facultad de Ciencias de la Salud Eugenio Espejo, Universidad UTE, Quito 170129, Ecuador Latin American Network for Implementation and Validation of Clinical Pharmacogenomics Guidelines (RELIVAF-CYTED), 28029 Madrid, Spain
Gabriela Jaramillo-Koupermann Laboratorio de Biología Molecular, Subproceso de Anatomía Patológica, Hospital de Especialidades Eugenio Espejo, Quito 170403, Ecuador;
Humberto González-Díaz Department of Organic and Inorganic Chemistry, and Basque Center for Biophysics CSIC-UPV/EHU, University of the Basque Country UPV/EHU, 48940 Leioa, Spain; IKERBASQUE, Basque Foundation for Science, 48011 Bilbao, Spain
Alejandro Pazos Department of Computer Science and Information Technologies, Faculty of Computer Science, University of A Coruña, CITIC, Campus Elviña s/n, 15071 A Coruña, Spain; (A.L.-C.); (A.P.); (C.R.M.) Biomedical Research Institute of A Coruña (INIBIC), University Hospital Complex of A Coruña (CHUAC), 15006 A Coruña, Spain
Cristian R. Munteanu Department of Computer Science and Information Technologies, Faculty of Computer Science, University of A Coruña, CITIC, Campus Elviña s/n, 15071 A Coruña, Spain; (A.L.-C.); (A.P.); (C.R.M.) Biomedical Research Institute of A Coruña (INIBIC), University Hospital Complex of A Coruña (CHUAC), 15006 A Coruña, Spain
Yunierkis Pérez-Castillo Grupo de Bio-Quimioinformática, Universidad de Las Américas, Quito 170125, Ecuador; Escuela de Ciencias Físicas y Matemáticas, Universidad de Las Américas, Quito 170125, Ecuador
Eduardo Tejera Grupo de Bio-Quimioinformática, Universidad de Las Américas, Quito 170125, Ecuador; Facultad de Ingeniería y Ciencias Agropecuarias, Universidad de Las Américas, Quito 170125, Ecuador Correspondence: (A.C.-A.); (E.T.)

Collapse

David L, Thakkar A, Mercado R, Engkvist O. Molecular representations in AI-driven drug discovery: a review and practical guide. J Cheminform 2020;12:56. [PMID: 33431035 PMCID: PMC7495975 DOI: 10.1186/s13321-020-00460-5] [Citation(s) in RCA: 215] [Impact Index Per Article: 43.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2020] [Accepted: 09/05/2020] [Indexed: 02/08/2023] Open

Chaube S, Goverapet Srinivasan S, Rai B. Applied machine learning for predicting the lanthanide-ligand binding affinities. Sci Rep 2020;10:14322. [PMID: 32868845 PMCID: PMC7459320 DOI: 10.1038/s41598-020-71255-9] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2020] [Accepted: 08/12/2020] [Indexed: 11/25/2022] Open

Abstract

Binding affinities of metal-ligand complexes are central to a multitude of applications like drug design, chelation therapy, designing reagents for solvent extraction etc. While state-of-the-art molecular modelling approaches are usually employed to gather structural and chemical insights about the metal complexation with ligands, their computational cost and the limited ability to predict metal-ligand stability constants with reasonable accuracy, renders them impractical to screen large chemical spaces. In this context, leveraging vast amounts of experimental data to learn the metal-binding affinities of ligands becomes a promising alternative. Here, we develop a machine learning framework for predicting binding affinities (logK1) of lanthanide cations with several structurally diverse molecular ligands. Six supervised machine learning algorithms-Random Forest (RF), k-Nearest Neighbours (KNN), Support Vector Machines (SVM), Kernel Ridge Regression (KRR), Multi Layered Perceptrons (MLP) and Adaptive Boosting (AdaBoost)-were trained on a dataset comprising thousands of experimental values of logK1 and validated in an external 10-folds cross-validation procedure. This was followed by a thorough feature engineering and feature importance analysis to identify the molecular, metallic and solvent features most relevant to binding affinity prediction, along with an evaluation of performance metrics against the dimensionality of feature space. Having demonstrated the excellent predictive ability of our framework, we utilized the best performing AdaBoost model to predict the logK1 values of lanthanide cations with nearly 71 million compounds present in the PubChem database. Our methodology opens up an opportunity for significantly accelerating screening and design of ligands for various targeted applications, from vast chemical spaces.

Collapse

Rakhimbekova A, Madzhidov TI, Nugmanov RI, Gimadiev TR, Baskin II, Varnek A. Comprehensive Analysis of Applicability Domains of QSPR Models for Chemical Reactions. Int J Mol Sci 2020;21:E5542. [PMID: 32756326 PMCID: PMC7432167 DOI: 10.3390/ijms21155542] [Citation(s) in RCA: 33] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2020] [Revised: 07/27/2020] [Accepted: 07/30/2020] [Indexed: 01/28/2023] Open

Baskin II, Lozano S, Durot M, Marcou G, Horvath D, Varnek A. Autoignition temperature: comprehensive data analysis and predictive models. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2020;31:597-613. [PMID: 32646236 DOI: 10.1080/1062936x.2020.1785933] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/18/2020] [Accepted: 06/18/2020] [Indexed: 06/11/2023]

Thermodynamic radii of lanthanide ions derived from metal–ligand complexes stability constants. J INCL PHENOM MACRO 2020. [DOI: 10.1007/s10847-020-01010-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]

Bosc N, Muller C, Hoffer L, Lagorce D, Bourg S, Derviaux C, Gourdel ME, Rain JC, Miller TW, Villoutreix BO, Miteva MA, Bonnet P, Morelli X, Sperandio O, Roche P. Fr-PPIChem: An Academic Compound Library Dedicated to Protein-Protein Interactions. ACS Chem Biol 2020;15:1566-1574. [PMID: 32320205 PMCID: PMC7399473 DOI: 10.1021/acschembio.0c00179] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]

Affiliation(s)

Nicolas Bosc Inserm U973 MTi, 25 rue Hélène Brion 75013 Paris Institut Pasteur, Unité de Bioinformatique Structurale, CNRS UMR3528, 28 rue du Dr Roux 75015 Paris
Christophe Muller IPC Drug Discovery Platform, Institut Paoli-Calmettes, 232 Boulevard de Sainte-Marguerite, 13009, Marseille, France
Laurent Hoffer CRCM, CNRS, INSERM, Institut Paoli-Calmettes, Aix-Marseille Univ, 13009 Marseille, France
David Lagorce Université de Paris, INSERM US14, Plateforme Maladies Rares - Orphanet, 75014 Paris, France
Stéphane Bourg Institut de Chimie Organique et Analytique (ICOA), Université d’Orléans, UMR CNRS 7311, BP 6759, 45067 Orléans. France
Carine Derviaux IPC Drug Discovery Platform, Institut Paoli-Calmettes, 232 Boulevard de Sainte-Marguerite, 13009, Marseille, France
Marie-Edith Gourdel Hybrigenics Services SAS, 1 rue Pierre Fontaine, 91000 Evry Courcouronnes, France
Jean-Christophe Rain Hybrigenics Services SAS, 1 rue Pierre Fontaine, 91000 Evry Courcouronnes, France
Thomas W. Miller IPC Drug Discovery Platform, Institut Paoli-Calmettes, 232 Boulevard de Sainte-Marguerite, 13009, Marseille, France
Bruno O. Villoutreix Université de Lille, INSERM, Institut Pasteur de Lille, U1177 - Drugs and Molecules for living Systems, 59000 Lille, France
Maria A. Miteva Inserm U1268 MCTR, CNRS UMR 8038 CiTCoM – Univ. De Paris, Faculté de Pharmacie de Paris, 75006 Paris, France
Pascal Bonnet Institut de Chimie Organique et Analytique (ICOA), Université d’Orléans, UMR CNRS 7311, BP 6759, 45067 Orléans. France
Xavier Morelli IPC Drug Discovery Platform, Institut Paoli-Calmettes, 232 Boulevard de Sainte-Marguerite, 13009, Marseille, France CRCM, CNRS, INSERM, Institut Paoli-Calmettes, Aix-Marseille Univ, 13009 Marseille, France
Olivier Sperandio Inserm U973 MTi, 25 rue Hélène Brion 75013 Paris Institut Pasteur, Unité de Bioinformatique Structurale, CNRS UMR3528, 28 rue du Dr Roux 75015 Paris
Philippe Roche CRCM, CNRS, INSERM, Institut Paoli-Calmettes, Aix-Marseille Univ, 13009 Marseille, France

Collapse

Muratov EN, Bajorath J, Sheridan RP, Tetko IV, Filimonov D, Poroikov V, Oprea TI, Baskin II, Varnek A, Roitberg A, Isayev O, Curtarolo S, Fourches D, Cohen Y, Aspuru-Guzik A, Winkler DA, Agrafiotis D, Cherkasov A, Tropsha A. QSAR without borders. Chem Soc Rev 2020;49:3525-3564. [PMID: 32356548 PMCID: PMC8008490 DOI: 10.1039/d0cs00098a] [Citation(s) in RCA: 384] [Impact Index Per Article: 76.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]

Li X, Fourches D. Inductive transfer learning for molecular activity prediction: Next-Gen QSAR Models with MolPMoFiT. J Cheminform 2020;12:27. [PMID: 33430978 PMCID: PMC7178569 DOI: 10.1186/s13321-020-00430-x] [Citation(s) in RCA: 63] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2020] [Accepted: 04/15/2020] [Indexed: 12/25/2022] Open