1
|
Tian L, Woo W, Canchola A, Chen K, Lin YH. Correlation gas chromatography and two-dimensional volatility basis methods to predict gas-particle partitioning for e-cigarette aerosols. AEROSOL SCIENCE AND TECHNOLOGY : THE JOURNAL OF THE AMERICAN ASSOCIATION FOR AEROSOL RESEARCH 2024; 58:630-643. [PMID: 38774581 PMCID: PMC11105163 DOI: 10.1080/02786826.2024.2326547] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/14/2023] [Accepted: 02/27/2024] [Indexed: 05/24/2024]
Abstract
E-cigarette aerosols contain a complex mixture of harmful and potentially harmful chemicals. Once released into the environment, they evolve and become new sources of indoor air pollutants that could pose a significant threat to both users and non-users. However, current understanding of the physicochemical properties of e-cigarette aerosol constituents that govern gas-particle partitioning in the atmosphere is limited, making it difficult to estimate the health risks associated with exposure. Here, we used correlation gas chromatography (C-GC) and two-dimensional volatility basis set (2D-VBS) methods to determine the vapor pressures and volatility for commonly reported toxic and irritating e-cigarette aerosol constituents. The vapor pressures of target compounds at 298 K were estimated from the Antoine-type linear relationship between the vapor pressure of reference standards and their retention times. Our C-GC results showed an overall positive correlation (R = 0.84) with estimates using the EPI (Estimation Programs Interface) Suite. The volatility calculated by 2D-VBS correlates well with the calculated vapor pressure from both C-GC (R = 0.82) and EPI Suite (R = 0.85). The volatility distribution also indicated fresh e-cigarette aerosol constituents are mainly more volatile organic compounds. Our case study revealed that low-vapor-pressure compounds (e.g., σ-dodecalactone, γ-decalactone, and maltol) become enriched in the e-cigarette aerosols within 2 hours following vaping emissions. Overall, these findings demonstrate the applicability of the C-GC and 2D-VBS methods for determining the physiochemical properties of e-cigarette aerosol constituents, which can aid in assessing the dynamic chemical composition of e-cigarette aerosols and exposures to vaping emissions in indoor environments.
Collapse
Affiliation(s)
- Linhui Tian
- Department of Environmental Sciences, University of California, Riverside, California, USA
| | - Wonsik Woo
- Environmental Toxicology Graduate Program, University of California, Riverside, California, USA
| | - Alexa Canchola
- Environmental Toxicology Graduate Program, University of California, Riverside, California, USA
| | - Kunpeng Chen
- Department of Environmental Sciences, University of California, Riverside, California, USA
| | - Ying-Hsuan Lin
- Department of Environmental Sciences, University of California, Riverside, California, USA
- Environmental Toxicology Graduate Program, University of California, Riverside, California, USA
| |
Collapse
|
2
|
Al-Fakih AM, Algamal ZY, Qasim MK. An improved opposition-based crow search algorithm for biodegradable material classification. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2022; 33:403-415. [PMID: 35469528 DOI: 10.1080/1062936x.2022.2064546] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/04/2022] [Accepted: 04/05/2022] [Indexed: 06/14/2023]
Abstract
The development of a reliable quantitative structure-activity relationship (QSAR) classification model with a small number of molecular descriptors is a crucial step in chemometrics. In this study, an improvement of crow search algorithm (CSA) is proposed by adapting the opposite-based learning (OBL) approach, which is named as OBL-CSA, to improve the exploration and exploitation capability of the CSA in quantitative structure-biodegradation relationship (QSBR) modelling of classifying the biodegradable materials. The results reveal that the performance of OBL-CSA not only manifest in improving the classification performance, but also in reduced computational time required to complete the process when compared to the standard CSA and other four optimization algorithms tested, which are the particle swarm algorithm (PSO), black hole algorithm (BHA), grey wolf algorithm (GWA), and whale optimization algorithm (WOA). In conclusion, the OBL-CSA could be a valuable resource in the classification of biodegradable materials.
Collapse
Affiliation(s)
- A M Al-Fakih
- Department of Chemistry, Faculty of Science, Universiti Teknologi Malaysia, Johor, Malaysia and Department of Chemistry, Faculty of Science, Sana'a University, Sana'a, Yemen
| | - Z Y Algamal
- Department of Statistics and Informatics, University of Mosul, Mosul, Iraq
| | - M K Qasim
- Department of General Science, University of Mosul, Mosul, Iraq
| |
Collapse
|
3
|
Chirico N, Sangion A, Gramatica P, Bertato L, Casartelli I, Papa E. QSARINS-Chem standalone version: A new platform-independent software to profile chemicals for physico-chemical properties, fate, and toxicity. J Comput Chem 2021; 42:1452-1460. [PMID: 33973667 PMCID: PMC8251994 DOI: 10.1002/jcc.26551] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2021] [Revised: 04/13/2021] [Indexed: 01/19/2023]
Abstract
The new software QSARINS-Chem standalone version is a multiplatform tool, freely downloadable, for the in silico profiling of multiple properties and activities of organic chemicals. This software, which is based on the concept of the QSARINS-chem module embedded in the QSARINS software, has been fully redesigned and redeveloped in the Java™ language. In addition to a selection of models included in the old module, the new software predicts biotransformation rates and aquatic toxicities of pharmaceuticals and personal care products in multiple organisms, and offers a suite of tools for the analysis of predictions. Furthermore, a comprehensive and transparent database of molecular structures is provided. The new QSARINS-Chem standalone version is an informative and solid tool, which is useful to support the assessment of the potential hazard and risks related to organic chemicals and is dedicated to users which are interested in the application of QSARs to generate reliable predictions.
Collapse
Affiliation(s)
- Nicola Chirico
- Department of Theoretical and Applied SciencesUniversity of InsubriaVareseItaly
| | - Alessandro Sangion
- Department of Theoretical and Applied SciencesUniversity of InsubriaVareseItaly
- Department of Physical and Environmental SciencesUniversity of Toronto ScarboroughTorontoOntarioCanada
| | - Paola Gramatica
- Department of Theoretical and Applied SciencesUniversity of InsubriaVareseItaly
| | - Linda Bertato
- Department of Theoretical and Applied SciencesUniversity of InsubriaVareseItaly
| | - Ilaria Casartelli
- Department of Theoretical and Applied SciencesUniversity of InsubriaVareseItaly
| | - Ester Papa
- Department of Theoretical and Applied SciencesUniversity of InsubriaVareseItaly
| |
Collapse
|
4
|
Abstract
At the end of her academic career, the author summarizes the main aspects of QSAR modeling, giving comments and suggestions according to her 23 years' experience in QSAR research on environmental topics. The focus is mainly on Multiple Linear Regression, particularly Ordinary Least Squares, using a Genetic Algorithm for variable selection from various theoretical molecular descriptors, but the comments can be useful also for other QSAR methods. The need for rigorous validation, also external, and for applicability domain check to guarantee predictivity and reliability of QSAR models is particularly highlighted. The commented approach is the “predictive” one, based on chemometrics, and is usefully applied to the prioritization of environmental pollutants. All the discussed points and the author's ideas are implemented in the software QSARINS, as a legacy to the QSAR community.
Collapse
|
5
|
Sivaraman G, Jackson NE, Sanchez-Lengeling B, Vázquez-Mayagoitia Á, Aspuru-Guzik A, Vishwanath V, de Pablo JJ. A machine learning workflow for molecular analysis: application to melting points. MACHINE LEARNING: SCIENCE AND TECHNOLOGY 2020. [DOI: 10.1088/2632-2153/ab8aa3] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
Abstract
Computational tools encompassing integrated molecular prediction, analysis, and generation are key for molecular design in a variety of critical applications. In this work, we develop a workflow for molecular analysis (MOLAN) that integrates an ensemble of supervised and unsupervised machine learning techniques to analyze molecular data sets. The MOLAN workflow combines molecular featurization, clustering algorithms, uncertainty analysis, low-bias dataset construction, high-performance regression models, graph-based molecular embeddings and attribution, and a semi-supervised variational autoencoder based on the novel SELFIES representation to enable molecular design. We demonstrate the utility of the MOLAN workflow in the context of a challenging multi-molecule property prediction problem: the determination of melting points solely from single molecule structure. This application serves as a case study for how to employ the MOLAN workflow in the context of molecular property prediction.
Collapse
|
6
|
Kianpour M, Mohammadinasab E, Isfahani TM. Comparison between genetic algorithm‐multiple linear regression and back‐propagation‐artificial neural network methods for predicting the
LD
50
of organo (phosphate and thiophosphate) compounds. J CHIN CHEM SOC-TAIP 2020. [DOI: 10.1002/jccs.201900514] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Affiliation(s)
- Mina Kianpour
- Department of Chemistry, Arak BranchIslamic Azad University Arak Iran
| | | | | |
Collapse
|
7
|
Hologram QSAR study on the critical micelle concentration of Gemini surfactants. Colloids Surf A Physicochem Eng Asp 2020. [DOI: 10.1016/j.colsurfa.2019.124226] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|
8
|
Tutone M, Pecoraro B, Almerico AM. Investigation on Quantitative Structure-Activity Relationships of 1,3,4-Oxadiazole Derivatives as Potential Telomerase Inhibitors. Curr Drug Discov Technol 2020; 17:79-86. [PMID: 30039762 DOI: 10.2174/1570163815666180724113208] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2018] [Revised: 07/17/2018] [Accepted: 07/18/2018] [Indexed: 06/08/2023]
Abstract
BACKGROUND Telomerase, a reverse transcriptase, maintains telomere and chromosomes integrity of dividing cells, while it is inactivated in most somatic cells. In tumor cells, telomerase is highly activated, and works in order to maintain the length of telomeres causing immortality, hence it could be considered as a potential marker to tumorigenesis.A series of 1,3,4-oxadiazole derivatives showed significant broad-spectrum anticancer activity against different cell lines, and demonstrated telomerase inhibition. METHODS This series of 24 N-benzylidene-2-((5-(pyridine-4-yl)-1,3,4-oxadiazol-2yl)thio)acetohydrazide derivatives as telomerase inhibitors has been considered to carry out QSAR studies. The endpoint to build QSAR models is determined by the IC50 values for telomerase inhibition, i.e., the concentration (μM) of inhibitor that produces 50% inhibition. These values were converted to pIC50 (- log IC50) values. We used the most common and transparent method, where models are described by clearly expressed mathematical equations: Multiple Linear Regression (MLR) by Ordinary Least Squares (OLS). RESULTS Validated models with high correlation coefficients were developed. The Multiple Linear Regression (MLR) models, by Ordinary Least Squares (OLS), showed good robustness and predictive capability, according to the Multi-Criteria Decision Making (MCDM = 0.8352), a technique that simultaneously enhances the performances of a certain number of criteria. The descriptors selected for the models, such as electrotopological state (E-state) descriptors, and extended topochemical atom (ETA) descriptors, showed the relevant chemical information contributing to the activity of these compounds. CONCLUSION The results obtained in this study make sure about the identification of potential hits as prospective telomerase inhibitors.
Collapse
Affiliation(s)
- Marco Tutone
- Dipartimento di Scienze e Tecnologie Biologiche, Chimiche e Farmaceutiche (STEBICEF) Universita degli Studi di Palermo, via Archirafi 28, 90123-Palermo, Italy
| | - Beatrice Pecoraro
- Department of Clinical and Pharmaceutical Sciences, School of Life and Medical Sciences, University of Hertfordshire, College Lane, Hatfield, Hertfordshire AL10 9AB, United Kingdom
| | - Anna M Almerico
- Dipartimento di Scienze e Tecnologie Biologiche, Chimiche e Farmaceutiche (STEBICEF) Universita degli Studi di Palermo, via Archirafi 28, 90123-Palermo, Italy
| |
Collapse
|
9
|
Modeling Physico-Chemical ADMET Endpoints with Multitask Graph Convolutional Networks. Molecules 2019; 25:molecules25010044. [PMID: 31877719 PMCID: PMC6982787 DOI: 10.3390/molecules25010044] [Citation(s) in RCA: 45] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2019] [Revised: 12/19/2019] [Accepted: 12/20/2019] [Indexed: 11/19/2022] Open
Abstract
Simple physico-chemical properties, like logD, solubility, or melting point, can reveal a great deal about how a compound under development might later behave. These data are typically measured for most compounds in drug discovery projects in a medium throughput fashion. Collecting and assembling all the Bayer in-house data related to these properties allowed us to apply powerful machine learning techniques to predict the outcome of those assays for new compounds. In this paper, we report our finding that, especially for predicting physicochemical ADMET endpoints, a multitask graph convolutional approach appears a highly competitive choice. For seven endpoints of interest, we compared the performance of that approach to fully connected neural networks and different single task models. The new model shows increased predictive performance compared to previous modeling methods and will allow early prioritization of compounds even before they are synthesized. In addition, our model follows the generalized solubility equation without being explicitly trained under this constraint.
Collapse
|
10
|
Papa E, Sangion A, Chirico N. Celebrating 40 Years of Career. Mol Inform 2019; 38:e1980831. [PMID: 31432627 DOI: 10.1002/minf.201980831] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Affiliation(s)
- Ester Papa
- Department of Theoretical and Applied Sciences, University of Insubria, via J.H. Dunant, 3 -, 21100, Varese, Italy
| | - Alessandro Sangion
- Department of Physical and Environmental Sciences, University of Toronto Scarborough, 1265 Military Trail -, M1C 1A4, Toronto ON, Canada
| | - Nicola Chirico
- Department of Theoretical and Applied Sciences, University of Insubria, via J.H. Dunant, 3 -, 21100, Varese, Italy
| |
Collapse
|
11
|
Yang X, Wang Y, Byrne R, Schneider G, Yang S. Concepts of Artificial Intelligence for Computer-Assisted Drug Discovery. Chem Rev 2019; 119:10520-10594. [PMID: 31294972 DOI: 10.1021/acs.chemrev.8b00728] [Citation(s) in RCA: 350] [Impact Index Per Article: 70.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
Artificial intelligence (AI), and, in particular, deep learning as a subcategory of AI, provides opportunities for the discovery and development of innovative drugs. Various machine learning approaches have recently (re)emerged, some of which may be considered instances of domain-specific AI which have been successfully employed for drug discovery and design. This review provides a comprehensive portrayal of these machine learning techniques and of their applications in medicinal chemistry. After introducing the basic principles, alongside some application notes, of the various machine learning algorithms, the current state-of-the art of AI-assisted pharmaceutical discovery is discussed, including applications in structure- and ligand-based virtual screening, de novo drug design, physicochemical and pharmacokinetic property prediction, drug repurposing, and related aspects. Finally, several challenges and limitations of the current methods are summarized, with a view to potential future directions for AI-assisted drug discovery and design.
Collapse
Affiliation(s)
- Xin Yang
- State Key Laboratory of Biotherapy and Cancer Center, West China Hospital , Sichuan University , Chengdu , Sichuan 610041 , China
| | - Yifei Wang
- State Key Laboratory of Biotherapy and Cancer Center, West China Hospital , Sichuan University , Chengdu , Sichuan 610041 , China
| | - Ryan Byrne
- ETH Zurich , Department of Chemistry and Applied Biosciences , Vladimir-Prelog-Weg 4 , CH-8093 Zurich , Switzerland
| | - Gisbert Schneider
- ETH Zurich , Department of Chemistry and Applied Biosciences , Vladimir-Prelog-Weg 4 , CH-8093 Zurich , Switzerland
| | - Shengyong Yang
- State Key Laboratory of Biotherapy and Cancer Center, West China Hospital , Sichuan University , Chengdu , Sichuan 610041 , China
| |
Collapse
|
12
|
Brandmaier S, Peijnenburg W, Durjava MK, Kolar B, Gramatica P, Papa E, Bhhatarai B, Kovarich S, Cassani S, Roy PP, Rahmberg M, Öberg T, Jeliazkova N, Golsteijn L, Comber M, Charochkina L, Novotarskyi S, Sushko I, Abdelaziz A, D'Onofrio E, Kunwar P, Ruggiu F, Tetko IV. The QSPR-THESAURUS: The Online Platform of the CADASTER Project. Altern Lab Anim 2019; 42:13-24. [DOI: 10.1177/026119291404200104] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Affiliation(s)
- Stefan Brandmaier
- Helmholtz-Zentrum München — German Research Centre for Environmental Health (GmbH), Research Unit of Molecular Epidemiology, Institute of Epidemiology II, Munich, Germany
| | - Willie Peijnenburg
- National Institute of Public Health and the Environment, Centre for Safety of Substances and Products (RIVM), Bilthoven, The Netherlands
- Leiden University, Institute of Environmental Sciences, Department of Conservation Biology, Leiden, The Netherlands
| | - Mojca K. Durjava
- National Institute for Health, Environment and Food, Maribor, Slovenia
| | - Boris Kolar
- National Institute for Health, Environment and Food, Maribor, Slovenia
| | - Paola Gramatica
- University of Insubria, QSAR Research Unit in Environmental Chemistry and Ecotoxicology, DiSTA, Varese, Italy
| | - Ester Papa
- University of Insubria, QSAR Research Unit in Environmental Chemistry and Ecotoxicology, DiSTA, Varese, Italy
| | - Barun Bhhatarai
- University of Insubria, QSAR Research Unit in Environmental Chemistry and Ecotoxicology, DiSTA, Varese, Italy
| | - Simona Kovarich
- University of Insubria, QSAR Research Unit in Environmental Chemistry and Ecotoxicology, DiSTA, Varese, Italy
| | - Stefano Cassani
- University of Insubria, QSAR Research Unit in Environmental Chemistry and Ecotoxicology, DiSTA, Varese, Italy
| | - Partha Pratim Roy
- University of Insubria, QSAR Research Unit in Environmental Chemistry and Ecotoxicology, DiSTA, Varese, Italy
| | - Magnus Rahmberg
- IVL Swedish Environmental Research Institute Ltd, Stockholm, Sweden
| | - Tomas Öberg
- School of Natural Sciences, Linnaeus University, Kalmar, Sweden
| | | | - Laura Golsteijn
- Radboud University Nijmegen, Institute for Wetland and Water Research, Department of Environmental Science, Nijmegen, The Netherlands
| | | | | | | | | | | | - Elisa D'Onofrio
- University of Insubria, QSAR Research Unit in Environmental Chemistry and Ecotoxicology, DiSTA, Varese, Italy
- Helmholtz-Zentrum München — German Research Centre for Environmental Health (GmbH), Institute of Structural Biology, Munich, Germany
| | - Prakash Kunwar
- Helmholtz-Zentrum München — German Research Centre for Environmental Health (GmbH), Institute of Structural Biology, Munich, Germany
| | - Fiorella Ruggiu
- Helmholtz-Zentrum München — German Research Centre for Environmental Health (GmbH), Institute of Structural Biology, Munich, Germany
| | - Igor V. Tetko
- eADMET GmbH, Garching, Germany
- Helmholtz-Zentrum München — German Research Centre for Environmental Health (GmbH), Institute of Structural Biology, Munich, Germany
- Chemistry Department, Faculty of Science, King Abdulaziz University, Jeddah, Saudi Arabia
| |
Collapse
|
13
|
Brown TN, Armitage JM, Arnot JA. Application of an Iterative Fragment Selection (IFS) Method to Estimate Entropies of Fusion and Melting Points of Organic Chemicals. Mol Inform 2019; 38:e1800160. [PMID: 30816634 DOI: 10.1002/minf.201800160] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2018] [Accepted: 02/10/2019] [Indexed: 11/09/2022]
Abstract
The main objective of this study is to develop and evaluate novel Quantitative Structure-Property Relationships (QSPRs) for predicting entropy of fusion (ΔSM ) and melting point (TM ) of organic chemicals from chemical structure. The QSPRs are developed using the Iterative Fragment Selection (IFS) method that requires only 2D structural information from the user (SMILES codes) for property prediction. The QSPRs also provide information on the applicability domain for each calculation and uncertainty estimates for the predictions. The root mean square error (RMSE) for the external validation sets are 11.8 J mol-1 K-1 and 46.9 K for the ΔSM and TM QSPRs, respectively. The performance of the new QSPRs is comparable to other predictive methods but has advantages with respect to availability and ease of use as well as the guidance on applicability domain for each prediction. Limitations of the new QSPRs are discussed. The QSPRs are coded as a user-friendly, freely available tool.
Collapse
Affiliation(s)
| | - James M Armitage
- AES Armitage Environmental Sciences, Inc., Ottawa ON, Canada, K1L 8C3
| | - Jon A Arnot
- ARC Arnot Research and Consulting, Inc., Toronto ON, Canada, M4M 1W4.,Department of Physical and Environmental Sciences, University of Toronto Scarborough, Toronto ON, Canada, M1C 1A4.,Department of Pharmacology and Toxicology, University of Toronto, Toronto ON, Canada, M5S 1A8
| |
Collapse
|
14
|
Alantary D, Yalkowsky SH. Estimating the Physicochemical Properties of Polysubstituted Aromatic Compounds Using UPPER. J Pharm Sci 2018; 107:297-306. [DOI: 10.1016/j.xphs.2017.10.018] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2017] [Revised: 10/12/2017] [Accepted: 10/13/2017] [Indexed: 12/01/2022]
|
15
|
Zang Q, Mansouri K, Williams AJ, Judson RS, Allen DG, Casey WM, Kleinstreuer NC. In Silico Prediction of Physicochemical Properties of Environmental Chemicals Using Molecular Fingerprints and Machine Learning. J Chem Inf Model 2017; 57:36-49. [PMID: 28006899 PMCID: PMC6131700 DOI: 10.1021/acs.jcim.6b00625] [Citation(s) in RCA: 81] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
There are little available toxicity data on the vast majority of chemicals in commerce. High-throughput screening (HTS) studies, such as those being carried out by the U.S. Environmental Protection Agency (EPA) ToxCast program in partnership with the federal Tox21 research program, can generate biological data to inform models for predicting potential toxicity. However, physicochemical properties are also needed to model environmental fate and transport, as well as exposure potential. The purpose of the present study was to generate an open-source quantitative structure-property relationship (QSPR) workflow to predict a variety of physicochemical properties that would have cross-platform compatibility to integrate into existing cheminformatics workflows. In this effort, decades-old experimental property data sets available within the EPA EPI Suite were reanalyzed using modern cheminformatics workflows to develop updated QSPR models capable of supplying computationally efficient, open, and transparent HTS property predictions in support of environmental modeling efforts. Models were built using updated EPI Suite data sets for the prediction of six physicochemical properties: octanol-water partition coefficient (logP), water solubility (logS), boiling point (BP), melting point (MP), vapor pressure (logVP), and bioconcentration factor (logBCF). The coefficient of determination (R2) between the estimated values and experimental data for the six predicted properties ranged from 0.826 (MP) to 0.965 (BP), with model performance for five of the six properties exceeding those from the original EPI Suite models. The newly derived models can be employed for rapid estimation of physicochemical properties within an open-source HTS workflow to inform fate and toxicity prediction models of environmental chemicals.
Collapse
Affiliation(s)
- Qingda Zang
- Integrated Laboratory Systems, Inc., Research Triangle Park, NC 27709, USA
| | - Kamel Mansouri
- National Center for Computational Toxicology, Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, NC 27711, USA
| | - Antony J. Williams
- National Center for Computational Toxicology, Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, NC 27711, USA
| | - Richard S. Judson
- National Center for Computational Toxicology, Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, NC 27711, USA
| | - David G. Allen
- Integrated Laboratory Systems, Inc., Research Triangle Park, NC 27709, USA
| | - Warren M. Casey
- National Toxicology Program, National Institute of Environmental Health Sciences, Research Triangle Park, NC 27709, USA
| | - Nicole C. Kleinstreuer
- National Toxicology Program, National Institute of Environmental Health Sciences, Research Triangle Park, NC 27709, USA
| |
Collapse
|
16
|
Mansouri K, Grulke CM, Richard AM, Judson RS, Williams AJ. An automated curation procedure for addressing chemical errors and inconsistencies in public datasets used in QSAR modelling. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2016; 27:939-965. [PMID: 27885862 DOI: 10.1080/1062936x.2016.1253611] [Citation(s) in RCA: 81] [Impact Index Per Article: 10.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/03/2016] [Accepted: 10/24/2016] [Indexed: 05/18/2023]
Abstract
The increasing availability of large collections of chemical structures and associated experimental data provides an opportunity to build robust QSAR models for applications in different fields. One common concern is the quality of both the chemical structure information and associated experimental data. Here we describe the development of an automated KNIME workflow to curate and correct errors in the structure and identity of chemicals using the publicly available PHYSPROP physicochemical properties and environmental fate datasets. The workflow first assembles structure-identity pairs using up to four provided chemical identifiers, including chemical name, CASRNs, SMILES, and MolBlock. Problems detected included errors and mismatches in chemical structure formats, identifiers and various structure validation issues, including hypervalency and stereochemistry descriptions. Subsequently, a machine learning procedure was applied to evaluate the impact of this curation process. The performance of QSAR models built on only the highest-quality subset of the original dataset was compared with the larger curated and corrected dataset. The latter showed statistically improved predictive performance. The final workflow was used to curate the full list of PHYSPROP datasets, and is being made publicly available for further usage and integration by the scientific community.
Collapse
Affiliation(s)
- K Mansouri
- a Oak Ridge Institute for Science and Education (ORISE) , Oak Ridge , TN , USA
- b US Environmental Protection Agency, Office of Research and Development , National Center for Computational Toxicology , Research Triangle Park, NC , USA
| | - C M Grulke
- b US Environmental Protection Agency, Office of Research and Development , National Center for Computational Toxicology , Research Triangle Park, NC , USA
| | - A M Richard
- b US Environmental Protection Agency, Office of Research and Development , National Center for Computational Toxicology , Research Triangle Park, NC , USA
| | - R S Judson
- b US Environmental Protection Agency, Office of Research and Development , National Center for Computational Toxicology , Research Triangle Park, NC , USA
| | - A J Williams
- b US Environmental Protection Agency, Office of Research and Development , National Center for Computational Toxicology , Research Triangle Park, NC , USA
| |
Collapse
|
17
|
Watkins M, Sizochenko N, Rasulev B, Leszczynski J. Estimation of melting points of large set of persistent organic pollutants utilizing QSPR approach. J Mol Model 2016; 22:55. [PMID: 26874948 DOI: 10.1007/s00894-016-2917-0] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2015] [Accepted: 01/18/2016] [Indexed: 11/28/2022]
Abstract
The presence of polyhalogenated persistent organic pollutants (POPs), such as Cl/Br-substituted benzenes, biphenyls, diphenyl ethers, and naphthalenes has been identified in all environmental compartments. The exposure to these compounds can pose potential risk not only for ecological systems, but also for human health. Therefore, efficient tools for comprehensive environmental risk assessment for POPs are required. Among the factors vital for environmental transport and fate processes is melting point of a compound. In this study, we estimated the melting points of a large group (1419 compounds) of chloro- and bromo- derivatives of dibenzo-p-dioxins, dibenzofurans, biphenyls, naphthalenes, diphenylethers, and benzenes by utilizing quantitative structure-property relationship (QSPR) techniques. The compounds were classified by applying structure-based clustering methods followed by GA-PLS modeling. In addition, random forest method has been applied to develop more general models. Factors responsible for melting point behavior and predictive ability of each method were discussed.
Collapse
Affiliation(s)
- Marquita Watkins
- Interdisciplinary Center for Nanotoxicity, Department of Chemistry and Biochemistry, Jackson State University, P.O. Box: 17910, Jackson, MS, USA
| | - Natalia Sizochenko
- Interdisciplinary Center for Nanotoxicity, Department of Chemistry and Biochemistry, Jackson State University, P.O. Box: 17910, Jackson, MS, USA
| | - Bakhtiyor Rasulev
- Interdisciplinary Center for Nanotoxicity, Department of Chemistry and Biochemistry, Jackson State University, P.O. Box: 17910, Jackson, MS, USA.,Center for Computationally Assisted Science and Technology, North Dakota State University, Fargo, ND, USA
| | - Jerzy Leszczynski
- Interdisciplinary Center for Nanotoxicity, Department of Chemistry and Biochemistry, Jackson State University, P.O. Box: 17910, Jackson, MS, USA.
| |
Collapse
|
18
|
Tetko IV, M. Lowe D, Williams AJ. The development of models to predict melting and pyrolysis point data associated with several hundred thousand compounds mined from PATENTS. J Cheminform 2016; 8:2. [PMID: 26807157 PMCID: PMC4724158 DOI: 10.1186/s13321-016-0113-y] [Citation(s) in RCA: 50] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2015] [Accepted: 01/08/2016] [Indexed: 11/18/2022] Open
Abstract
BACKGROUND Melting point (MP) is an important property in regards to the solubility of chemical compounds. Its prediction from chemical structure remains a highly challenging task for quantitative structure-activity relationship studies. Success in this area of research critically depends on the availability of high quality MP data as well as accurate chemical structure representations in order to develop models. Currently, available datasets for MP predictions have been limited to around 50k molecules while lots more data are routinely generated following the synthesis of novel materials. Significant amounts of MP data are freely available within the patent literature and, if it were available in the appropriate form, could potentially be used to develop predictive models. RESULTS We have developed a pipeline for the automated extraction and annotation of chemical data from published PATENTS. Almost 300,000 data points have been collected and used to develop models to predict melting and pyrolysis (decomposition) points using tools available on the OCHEM modeling platform (http://ochem.eu). A number of technical challenges were simultaneously solved to develop models based on these data. These included the handing of sparse data matrices with >200,000,000,000 entries and parallel calculations using 32 × 6 cores per task using 13 descriptor sets totaling more than 700,000 descriptors. We showed that models developed using data collected from PATENTS had similar or better prediction accuracy compared to the highly curated data used in previous publications. The separation of data for chemicals that decomposed rather than melting, from compounds that did undergo a normal melting transition, was performed and models for both pyrolysis and MPs were developed. The accuracy of the consensus MP models for molecules from the drug-like region of chemical space was similar to their estimated experimental accuracy, 32 °C. Last but not least, important structural features related to the pyrolysis of chemicals were identified, and a model to predict whether a compound will decompose instead of melting was developed. CONCLUSIONS We have shown that automated tools for the analysis of chemical information have reached a mature stage allowing for the extraction and collection of high quality data to enable the development of structure-activity relationship models. The developed models and data are publicly available at http://ochem.eu/article/99826.
Collapse
Affiliation(s)
- Igor V. Tetko
- />Institute of Structural Biology, Helmholtz Zentrum München für Gesundheit und Umwelt (HMGU), Ingolstädter Landstraße 1, b. 60w, 85764 Neuherberg, Germany
- />BigChem GmbH, 85764 Neuherberg, Germany
| | - Daniel M. Lowe
- />NextMove Software Limited, Innovation Centre (Unit 23), Cambridge Science Park, Cambridge, CB4 0EY UK
| | | |
Collapse
|
19
|
Nieto-Draghi C, Fayet G, Creton B, Rozanska X, Rotureau P, de Hemptinne JC, Ungerer P, Rousseau B, Adamo C. A General Guidebook for the Theoretical Prediction of Physicochemical Properties of Chemicals for Regulatory Purposes. Chem Rev 2015; 115:13093-164. [PMID: 26624238 DOI: 10.1021/acs.chemrev.5b00215] [Citation(s) in RCA: 72] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Affiliation(s)
- Carlos Nieto-Draghi
- IFP Energies nouvelles , 1 et 4 avenue de Bois-Préau, 92852 Rueil-Malmaison, France
| | - Guillaume Fayet
- INERIS, Parc Technologique Alata, BP2 , 60550 Verneuil-en-Halatte, France
| | - Benoit Creton
- IFP Energies nouvelles , 1 et 4 avenue de Bois-Préau, 92852 Rueil-Malmaison, France
| | - Xavier Rozanska
- Materials Design S.A.R.L. , 18, rue de Saisset, 92120 Montrouge, France
| | - Patricia Rotureau
- INERIS, Parc Technologique Alata, BP2 , 60550 Verneuil-en-Halatte, France
| | | | - Philippe Ungerer
- Materials Design S.A.R.L. , 18, rue de Saisset, 92120 Montrouge, France
| | - Bernard Rousseau
- Laboratoire de Chimie-Physique, Université Paris Sud , UMR 8000 CNRS, Bât. 349, 91405 Orsay Cedex, France
| | - Carlo Adamo
- Institut de Recherche Chimie Paris, PSL Research University, CNRS, Chimie Paristech , 11 rue P. et M. Curie, F-75005 Paris, France.,Institut Universitaire de France , 103 Boulevard Saint Michel, F-75005 Paris, France
| |
Collapse
|
20
|
Tetko IV, Sushko Y, Novotarskyi S, Patiny L, Kondratov I, Petrenko AE, Charochkina L, Asiri AM. How accurately can we predict the melting points of drug-like compounds? J Chem Inf Model 2014; 54:3320-9. [PMID: 25489863 PMCID: PMC4702524 DOI: 10.1021/ci5005288] [Citation(s) in RCA: 58] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
This article contributes a highly accurate model for predicting the melting points (MPs) of medicinal chemistry compounds. The model was developed using the largest published data set, comprising more than 47k compounds. The distributions of MPs in drug-like and drug lead sets showed that >90% of molecules melt within [50,250]°C. The final model calculated an RMSE of less than 33 °C for molecules from this temperature interval, which is the most important for medicinal chemistry users. This performance was achieved using a consensus model that performed calculations to a significantly higher accuracy than the individual models. We found that compounds with reactive and unstable groups were overrepresented among outlying compounds. These compounds could decompose during storage or measurement, thus introducing experimental errors. While filtering the data by removing outliers generally increased the accuracy of individual models, it did not significantly affect the results of the consensus models. Three analyzed distance to models did not allow us to flag molecules, which had MP values fell outside the applicability domain of the model. We believe that this negative result and the public availability of data from this article will encourage future studies to develop better approaches to define the applicability domain of models. The final model, MP data, and identified reactive groups are available online at http://ochem.eu/article/55638.
Collapse
Affiliation(s)
- Igor V Tetko
- Helmholtz-Zentrum München - German Research Centre for Environmental Health (GmbH), Institute of Structural Biology , Munich 85764, Germany
| | | | | | | | | | | | | | | |
Collapse
|
21
|
Wang D, He G, Chen H. Prediction for the detonation velocity of the nitrogen-rich energetic compounds based on quantum chemistry. RUSSIAN JOURNAL OF PHYSICAL CHEMISTRY A 2014. [DOI: 10.1134/s0036024414130032] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
22
|
Gramatica P, Chirico N, Papa E, Cassani S, Kovarich S. QSARINS: A new software for the development, analysis, and validation of QSAR MLR models. J Comput Chem 2013. [DOI: 10.1002/jcc.23361] [Citation(s) in RCA: 378] [Impact Index Per Article: 34.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Affiliation(s)
- Paola Gramatica
- Department of Theoretical and Applied Sciences; QSAR Research Unit in Environmental Chemistry and Ecotoxicology; University of Insubria; Via Dunant 3; 21100; Varese; Italy. E-mail:
| | - Nicola Chirico
- Department of Theoretical and Applied Sciences; QSAR Research Unit in Environmental Chemistry and Ecotoxicology; University of Insubria; Via Dunant 3; 21100; Varese; Italy. E-mail:
| | - Ester Papa
- Department of Theoretical and Applied Sciences; QSAR Research Unit in Environmental Chemistry and Ecotoxicology; University of Insubria; Via Dunant 3; 21100; Varese; Italy. E-mail:
| | - Stefano Cassani
- Department of Theoretical and Applied Sciences; QSAR Research Unit in Environmental Chemistry and Ecotoxicology; University of Insubria; Via Dunant 3; 21100; Varese; Italy. E-mail:
| | - Simona Kovarich
- Department of Theoretical and Applied Sciences; QSAR Research Unit in Environmental Chemistry and Ecotoxicology; University of Insubria; Via Dunant 3; 21100; Varese; Italy. E-mail:
| |
Collapse
|
23
|
Piir G, Sild S, Maran U. Comparative analysis of local and consensus quantitative structure-activity relationship approaches for the prediction of bioconcentration factor. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2013; 24:175-199. [PMID: 23410132 DOI: 10.1080/1062936x.2012.762426] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/01/2023]
Abstract
Quantitative structure-activity relationships (QSARs) are broadly classified as global or local, depending on their molecular constitution. Global models use large and diverse training sets covering a wide range of chemical space. Local models focus on smaller structurally or chemically similar subsets that are conventionally selected by human experts or alternatively using clustering analysis. The current study focuses on the comparative analysis of different clustering algorithms (expectation-maximization, K-means and hierarchical) for seven different descriptor sets as structural characteristics and two rule-based approaches to select subsets for designing local QSAR models. A total of 111 local QSAR models are developed for predicting bioconcentration factor. Predictions from local models were compared with corresponding predictions from the global model. The comparison of coefficients of determination (r(2)) and standard deviations for local models with similar subsets from the global model show improved prediction quality in 97% of cases. The descriptor content of derived QSARs is discussed and analyzed. Local QSAR models were further consolidated within the framework of consensus approach. All different consensus approaches increased performance over the global and local models. The consensus approach reduced the number of strongly deviating predictions by evening out prediction errors, which were produced by some local QSARs.
Collapse
Affiliation(s)
- G Piir
- Institute of Chemistry, University of Tartu, Tartu, Estonia
| | | | | |
Collapse
|
24
|
Salahinejad M, Le TC, Winkler DA. Capturing the crystal: prediction of enthalpy of sublimation, crystal lattice energy, and melting points of organic compounds. J Chem Inf Model 2013; 53:223-9. [PMID: 23215043 DOI: 10.1021/ci3005012] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Abstract
Accurate computational prediction of melting points and aqueous solubilities of organic compounds would be very useful but is notoriously difficult. Predicting the lattice energies of compounds is key to understanding and predicting their melting behavior and ultimately their solubility behavior. We report robust, predictive, quantitative structure-property relationship (QSPR) models for enthalpies of sublimation, crystal lattice energies, and melting points for a very large and structurally diverse set of small organic compounds. Sparse Bayesian feature selection and machine learning methods were employed to select the most relevant molecular descriptors for the model and to generate parsimonious quantitative models. The final enthalpy of sublimation model is a four-parameter multilinear equation that has an r(2) value of 0.96 and an average absolute error of 7.9 ± 0.3 kJ.mol(-1). The melting point model can predict this property with a standard error of 45° ± 1 K and r(2) value of 0.79. Given the size and diversity of the training data, these conceptually transparent and accurate models can be used to predict sublimation enthalpy, lattice energy, and melting points of organic compounds in general.
Collapse
Affiliation(s)
- Maryam Salahinejad
- Faculty of Chemistry, Tarbiat Moallem University, Tehran 15719-14911, Iran
| | | | | |
Collapse
|
25
|
Prana V, Fayet G, Rotureau P, Adamo C. Development of validated QSPR models for impact sensitivity of nitroaliphatic compounds. JOURNAL OF HAZARDOUS MATERIALS 2012; 235-236:169-177. [PMID: 22871414 DOI: 10.1016/j.jhazmat.2012.07.036] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/08/2012] [Revised: 06/11/2012] [Accepted: 07/16/2012] [Indexed: 06/01/2023]
Abstract
The European regulation of chemicals named REACH implies the assessment of a large number of substances based on their hazardous properties. However, the complete characterization of physico-chemical, toxicological and eco-toxicological properties by experimental means is incompatible with the imposed calendar of REACH. Hence, there is a real need in evaluating the capabilities of alternative methods such as quantitative structure-property relationship (QSPR) models, notably for physico-chemical properties. In the present work, the molecular structures of 50 itroaliphatic compounds were correlated with their impact sensitivities (h(50%)) using such predictive models. More than 400 olecular descriptors (constitutional, topological, geometrical, quantum chemical) were calculated and linear and multi-linear regressions were performed to find accurate quantitative relationships with experimental impact sensitivities. Considering different sets of descriptors, four predictive models were obtained and two of them were selected for their predictive reliability. To our knowledge, these QSPR models for the impact sensitivity of nitroaliphatic compounds are the first ones being rigorously validated (both internally and externally) with defined applicability domains. They hence follow all OECD principles for regulatory acceptability of QSPRs, allowing possible application in REACH.
Collapse
Affiliation(s)
- Vinca Prana
- Laboratoire d'Electrochimie, Chimie des Interfaces et Modélisation pour l'Energie, CNRS UMR-7575, Chimie ParisTech, 11 rue P. et M. Curie, 75231 Paris Cedex 05, France
| | | | | | | |
Collapse
|
26
|
Fayet G, Rotureau P, Prana V, Adamo C. Global and local quantitative structure-property relationship models to predict the impact sensitivity of nitro compounds. PROCESS SAFETY PROGRESS 2012. [DOI: 10.1002/prs.11499] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
|
27
|
Digles D, Ecker GF. Self-Organizing Maps for In Silico Screening and Data Visualization. Mol Inform 2011; 30:838-46. [PMID: 27468103 DOI: 10.1002/minf.201100082] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2011] [Accepted: 08/05/2011] [Indexed: 02/04/2023]
Abstract
Self-organizing maps, which are unsupervised artificial neural networks, have become a very useful tool in a wide area of disciplines, including medicinal chemistry. Here, we will focus on two applications of self-organizing maps: the use of self-organizing maps for in silico screening and for clustering and visualisation of large datasets. Additionally, the importance of parameter selection is discussed and some modifications to the original algorithm are summarised.
Collapse
Affiliation(s)
- Daniela Digles
- Department of Medicinal Chemistry, University of Vienna, Althanstrasse 14, 1090 Vienna, Austria phone/fax: +43-1-4277-55110/+43-1-4277-9551
| | - Gerhard F Ecker
- Department of Medicinal Chemistry, University of Vienna, Althanstrasse 14, 1090 Vienna, Austria phone/fax: +43-1-4277-55110/+43-1-4277-9551.
| |
Collapse
|
28
|
Chirico N, Gramatica P. Real external predictivity of QSAR models: how to evaluate it? Comparison of different validation criteria and proposal of using the concordance correlation coefficient. J Chem Inf Model 2011; 51:2320-35. [PMID: 21800825 DOI: 10.1021/ci200211n] [Citation(s) in RCA: 447] [Impact Index Per Article: 34.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Abstract
The main utility of QSAR models is their ability to predict activities/properties for new chemicals, and this external prediction ability is evaluated by means of various validation criteria. As a measure for such evaluation the OECD guidelines have proposed the predictive squared correlation coefficient Q(2)(F1) (Shi et al.). However, other validation criteria have been proposed by other authors: the Golbraikh-Tropsha method, r(2)(m) (Roy), Q(2)(F2) (Schüürmann et al.), Q(2)(F3) (Consonni et al.). In QSAR studies these measures are usually in accordance, though this is not always the case, thus doubts can arise when contradictory results are obtained. It is likely that none of the aforementioned criteria is the best in every situation, so a comparative study using simulated data sets is proposed here, using threshold values suggested by the proponents or those widely used in QSAR modeling. In addition, a different and simple external validation measure, the concordance correlation coefficient (CCC), is proposed and compared with other criteria. Huge data sets were used to study the general behavior of validation measures, and the concordance correlation coefficient was shown to be the most restrictive. On using simulated data sets of a more realistic size, it was found that CCC was broadly in agreement, about 96% of the time, with other validation measures in accepting models as predictive, and in almost all the examples it was the most precautionary. The proposed concordance correlation coefficient also works well on real data sets, where it seems to be more stable, and helps in making decisions when the validation measures are in conflict. Since it is conceptually simple, and given its stability and restrictiveness, we propose the concordance correlation coefficient as a complementary, or alternative, more prudent measure of a QSAR model to be externally predictive.
Collapse
Affiliation(s)
- Nicola Chirico
- QSAR Research Group in Environmental Chemistry and Ecotoxicology, Department of Structural and Functional Biology, University of Insubria, Varese, Italy
| | | |
Collapse
|