1
|
Gadaleta D, Serrano-Candelas E, Ortega-Vallbona R, Colombo E, Garcia de Lomana M, Biava G, Aparicio-Sánchez P, Roncaglioni A, Gozalbes R, Benfenati E. Comprehensive benchmarking of computational tools for predicting toxicokinetic and physicochemical properties of chemicals. J Cheminform 2024; 16:145. [PMID: 39726044 DOI: 10.1186/s13321-024-00931-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2024] [Accepted: 11/11/2024] [Indexed: 12/28/2024] Open
Abstract
Ensuring the safety of chemicals for environmental and human health involves assessing physicochemical (PC) and toxicokinetic (TK) properties, which are crucial for absorption, distribution, metabolism, excretion, and toxicity (ADMET). Computational methods play a vital role in predicting these properties, given the current trends in reducing experimental approaches, especially those that involve animal experimentation. In the present manuscript, twelve software tools implementing Quantitative Structure-Activity Relationship (QSAR) models were selected for the prediction of 17 relevant PC and TK properties. A total of 41 validation datasets were collected from the literature, curated and used for assessing the models' external predictivity, emphasizing the performance of the models inside the applicability domain. Overall, the results confirmed the adequate predictive performance of the majority of the selected tools, with models for PC properties (R2 average = 0.717) generally outperforming those for TK properties (R2 average = 0.639 for regression, average balanced accuracy = 0.780 for classification). Notably, several of the tools evaluated exhibited good predictivity across different properties and were identified as recurring optimal choices. Moreover, a systematic analysis of the chemical space covered by the external validation datasets confirmed the validity of the collected results for relevant chemical categories (e.g., drugs and industrial chemicals), further increasing the confidence in the overall evaluation. The best performing models were ultimately suggested for each investigated property and proposed as robust computational tools for high-throughput assessment of highly relevant chemical properties. SCIENTIFIC CONTRIBUTION: The present manuscript provides an overview of the state-of-the-art available computational tools for predicting the PC and TK properties of chemicals. The results here offer valuable guidance to researchers, regulatory authorities, and the industry in identifying robust computational tools suitable for predicting relevant chemical properties in the context of chemical design, toxicity and environmental fate assessment.
Collapse
Affiliation(s)
- Domenico Gadaleta
- Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milan, Italy.
| | - Eva Serrano-Candelas
- ProtoQSAR SL, CEEI (Centro Europeo de Empresas Innovadoras), 46980, Paterna, Valencia, Spain
| | - Rita Ortega-Vallbona
- ProtoQSAR SL, CEEI (Centro Europeo de Empresas Innovadoras), 46980, Paterna, Valencia, Spain
| | - Erika Colombo
- Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milan, Italy
| | - Marina Garcia de Lomana
- Bayer AG, Machine Learning Research, Research & Development, Pharmaceuticals, Leverkusen, Germany
| | - Giada Biava
- Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milan, Italy
| | - Pablo Aparicio-Sánchez
- ProtoQSAR SL, CEEI (Centro Europeo de Empresas Innovadoras), 46980, Paterna, Valencia, Spain
- Spanish National Cancer Research Center (CNIO), Experimental Therapeutics Programme, Madrid, Spain
| | - Alessandra Roncaglioni
- Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milan, Italy
| | - Rafael Gozalbes
- ProtoQSAR SL, CEEI (Centro Europeo de Empresas Innovadoras), 46980, Paterna, Valencia, Spain
- Moldrug AI Systems SL, c/Olimpia Arozena Torres 45, 46018, Valencia, Spain
| | - Emilio Benfenati
- Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milan, Italy
| |
Collapse
|
2
|
Roy D, Patel C. Revisiting the Use of Quantum Chemical Calculations in LogP octanol-water Prediction. Molecules 2023; 28:801. [PMID: 36677858 PMCID: PMC9866719 DOI: 10.3390/molecules28020801] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2022] [Revised: 01/06/2023] [Accepted: 01/10/2023] [Indexed: 01/15/2023] Open
Abstract
The partition coefficients of drug and drug-like molecules between an aqueous and organic phase are an important property for developing new therapeutics. The predictive power of computational methods is used extensively to predict partition coefficients of molecules. The application of quantum chemical calculations is used to develop methods to develop structure-activity relationship models for such prediction, either based on molecular fragment methods, or via direct calculation of solvation free energy in solvent continuum. The applicability, merits, and shortcomings of these developments are revisited here.
Collapse
Affiliation(s)
- Dipankar Roy
- Department of Biological Sciences, University of Alberta, Edmonton, AB T6G 2E9, Canada
| | - Chandan Patel
- Department of Applied Sciences, COEP Technological University, Wellesely Road, Shivajinagar, Pune 411005, Maharashtra, India
| |
Collapse
|
3
|
Clark RD. Predicting mammalian metabolism and toxicity of pesticides in silico. PEST MANAGEMENT SCIENCE 2018; 74:1992-2003. [PMID: 29762898 PMCID: PMC6099302 DOI: 10.1002/ps.4935] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/21/2017] [Revised: 04/02/2018] [Accepted: 04/03/2018] [Indexed: 05/05/2023]
Abstract
Pesticides must be effective to be commercially viable but they must also be reasonably safe for those who manufacture them, apply them, or consume the food they are used to produce. Animal testing is key to ensuring safety, but it comes late in the agrochemical development process, is expensive, and requires relatively large amounts of material. Surrogate assays used as in vitro models require less material and shift identification of potential mammalian toxicity back to earlier stages in development. Modern in silico methods are cost-effective complements to such in vitro models that make it possible to predict mammalian metabolism, toxicity and exposure for a pesticide, crop residue or other metabolite before it has been synthesized. Their broader use could substantially reduce the amount of time and effort wasted in pesticide development. This contribution reviews the kind of in silico models that are currently available for vetting ideas about what to synthesize and how to focus development efforts; the limitations of those models; and the practical considerations that have slowed development in the area. Detailed discussions are provided of how bacterial mutagenicity, human cytochrome P450 (CYP) metabolism, and bioavailability in humans and rats can be predicted. © 2018 The Authors. Pest Management Science published by John Wiley & Sons Ltd on behalf of Society of Chemical Industry.
Collapse
|
4
|
Zang Q, Mansouri K, Williams AJ, Judson RS, Allen DG, Casey WM, Kleinstreuer NC. In Silico Prediction of Physicochemical Properties of Environmental Chemicals Using Molecular Fingerprints and Machine Learning. J Chem Inf Model 2017; 57:36-49. [PMID: 28006899 PMCID: PMC6131700 DOI: 10.1021/acs.jcim.6b00625] [Citation(s) in RCA: 93] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
There are little available toxicity data on the vast majority of chemicals in commerce. High-throughput screening (HTS) studies, such as those being carried out by the U.S. Environmental Protection Agency (EPA) ToxCast program in partnership with the federal Tox21 research program, can generate biological data to inform models for predicting potential toxicity. However, physicochemical properties are also needed to model environmental fate and transport, as well as exposure potential. The purpose of the present study was to generate an open-source quantitative structure-property relationship (QSPR) workflow to predict a variety of physicochemical properties that would have cross-platform compatibility to integrate into existing cheminformatics workflows. In this effort, decades-old experimental property data sets available within the EPA EPI Suite were reanalyzed using modern cheminformatics workflows to develop updated QSPR models capable of supplying computationally efficient, open, and transparent HTS property predictions in support of environmental modeling efforts. Models were built using updated EPI Suite data sets for the prediction of six physicochemical properties: octanol-water partition coefficient (logP), water solubility (logS), boiling point (BP), melting point (MP), vapor pressure (logVP), and bioconcentration factor (logBCF). The coefficient of determination (R2) between the estimated values and experimental data for the six predicted properties ranged from 0.826 (MP) to 0.965 (BP), with model performance for five of the six properties exceeding those from the original EPI Suite models. The newly derived models can be employed for rapid estimation of physicochemical properties within an open-source HTS workflow to inform fate and toxicity prediction models of environmental chemicals.
Collapse
Affiliation(s)
- Qingda Zang
- Integrated Laboratory Systems, Inc., Research Triangle Park, NC 27709, USA
| | - Kamel Mansouri
- National Center for Computational Toxicology, Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, NC 27711, USA
| | - Antony J. Williams
- National Center for Computational Toxicology, Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, NC 27711, USA
| | - Richard S. Judson
- National Center for Computational Toxicology, Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, NC 27711, USA
| | - David G. Allen
- Integrated Laboratory Systems, Inc., Research Triangle Park, NC 27709, USA
| | - Warren M. Casey
- National Toxicology Program, National Institute of Environmental Health Sciences, Research Triangle Park, NC 27709, USA
| | - Nicole C. Kleinstreuer
- National Toxicology Program, National Institute of Environmental Health Sciences, Research Triangle Park, NC 27709, USA
| |
Collapse
|
5
|
Abstract
It is widely accepted that modern QSAR began in the early 1960s. However, as long ago as 1816 scientists were making predictions about physical and chemical properties. The first investigations into the correlation of biological activities with physicochemical properties such as molecular weight and aqueous solubility began in 1841, almost 60 years before the important work of Overton and Meyer linking aquatic toxicity to lipid-water partitioning. Throughout the 20th century QSAR progressed, though there were many lean years. In 1962 came the seminal work of Corwin Hansch and co-workers, which stimulated a huge interest in the prediction of biological activities. Initially that interest lay largely within medicinal chemistry and drug design, but in the 1970s and 1980s, with increasing ecotoxicological concerns, QSAR modelling of environmental toxicities began to grow, especially once regulatory authorities became involved. Since then QSAR has continued to expand, with over 1400 publications annually from 2011 onwards.
Collapse
|
6
|
Austin TJ, Eadsforth CV. Development of a chronic fish toxicity model for predicting sub-lethal NOEC values for non-polar narcotics. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2014; 25:147-160. [PMID: 24635482 DOI: 10.1080/1062936x.2013.871577] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/06/2013] [Accepted: 10/04/2013] [Indexed: 06/03/2023]
Abstract
To comply with the REACH (Registration, Evaluation, Authorisation and restriction of Chemicals) regulations, the generation of chronic fish toxicity data is required for chemicals produced or imported within or into the EU in quantities greater than 100 tonnes per year. This comes at a great cost to industry and consumers alike and requires the sacrifice of many vertebrates. In acknowledgment of these issues the REACH regulations encourage the use of non-testing methods (NTM). These include read-across, weight-of-evidence and QSAR (quantitative structure-activity relationship) techniques. There are many QSAR tools available to generate predictive values for a number of physico-chemical properties, as well as human and environmental health end points; however, close analysis of the currently available chronic fish models identified room for improvement in both the selection of data used and in its application in model creation. In light of this a model was developed using only sub-lethal no-observed-effect concentration (NOEC) end-point data according to best practice QSAR development. Only the lowest value was taken for each compound, in line with the conservative approach taken by the European Chemicals Agency (ECHA). The model developed meets the Organisation for Economic Co-operation and Development (OECD) principles, has strong internal and external validation statistics, and can reliably predict sub-lethal chronic NOEC values for fish within its defined applicability domain.
Collapse
Affiliation(s)
- T J Austin
- a Shell Risk Science Team, Shell Health , Manchester , UK
| | | |
Collapse
|