1
|
Das S, Samal A, Ojha PK. Chemometrics-driven prediction and prioritization of diverse pesticides on chickens for addressing hazardous effects on public health. JOURNAL OF HAZARDOUS MATERIALS 2024; 471:134326. [PMID: 38636230 DOI: 10.1016/j.jhazmat.2024.134326] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/30/2024] [Revised: 04/09/2024] [Accepted: 04/15/2024] [Indexed: 04/20/2024]
Abstract
The extensive use of various pesticides in the agriculture field badly affects both chickens and humans, primarily through residues in food products and environmental exposure. This study offers the first quantitative structure-toxicity relationship (QSTR) and quantitative read-across-structure toxicity relationship (q-RASTR) models encompassing the LOEL and NOEL endpoints for acute toxicity in chicken, a widely consumed protein. The study's significance lies in the direct link between chemical toxicity in chicken, human intake, and environmental damage. Both the QSTR and the similarity-based read-across algorithms are applied concurrently to improve the predictability of the models. The q-RASTR models were generated by combining read-across derived similarity and error-based parameters, alongside structural and physicochemical descriptors. Machine Learning approaches (SVM and RR) were also employed with the optimization of relevant hyperparameters based on the cross-validation approach, and the final test set prediction results were compared. The PLS-based q-RASTR models for NOEL and LOEL endpoints showed good statistical performance, as traced from the external validation metrics Q2F1: 0.762-0.844; Q2F2: 0.759-0.831 and MAEtest: 0.195-0.214. The developed models were further used to screen the Pesticide Properties DataBase (PPDB) for potential toxicants in chickens. Thus, established models can address eco-toxicological data gaps and development of novel and safe eco-friendly pesticides.
Collapse
Affiliation(s)
- Shubha Das
- Drug Discovery and Development Laboratory (DDD Lab), Department of Pharmaceutical Technology, Jadavpur University, Kolkata 700032, India
| | - Abhisek Samal
- Drug Discovery and Development Laboratory (DDD Lab), Department of Pharmaceutical Technology, Jadavpur University, Kolkata 700032, India
| | - Probir Kumar Ojha
- Drug Discovery and Development Laboratory (DDD Lab), Department of Pharmaceutical Technology, Jadavpur University, Kolkata 700032, India.
| |
Collapse
|
2
|
Zukić S, Maran U. Modelling of antiproliferative activity measured in HeLa cervical cancer cells in a series of xanthene derivatives. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2020; 31:905-921. [PMID: 33236957 DOI: 10.1080/1062936x.2020.1839131] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/13/2020] [Accepted: 10/15/2020] [Indexed: 06/11/2023]
Abstract
Cancer remains one of the leading causes of death in humans, and new drug substances are therefore being developed. Thus, the anti-cancer activity of xanthene derivatives has become an important topic in the development of new and potent anti-cancer drug substances. Previously published novel series of xanthen-3-one and xanthen-1,8-dione derivatives have been synthesized in one of our laboratories and showed anti-proliferative activity in HeLa cancer cell lines. This series serves as a good basis to develop quantitative structure-activity relationship (QSAR), to study the relations between anti-proliferative activity and chemical structures. A QSAR model has been derived that relies only on two-dimensional molecular descriptors, providing mechanistic insight into the anti-proliferative activity of xanthene derivatives. The model is validated internally and externally and additionally with the set of inactive compounds of the original data, confirming model applicability for the design and discovery of novel xanthene derivatives. The QSAR model is available at the QsarDB repository (http://dx.doi.10.15152/QDB.237).
Collapse
Affiliation(s)
- S Zukić
- Department of Pharmaceutical Chemistry, University of Sarajevo , Sarajevo, Bosnia and Herzegovina
| | - U Maran
- Department of Chemistry, University of Tartu , Tartu, Estonia
| |
Collapse
|
3
|
Seth A, Ojha PK, Roy K. QSAR modeling with ETA indices for cytotoxicity and enzymatic activity of diverse chemicals. JOURNAL OF HAZARDOUS MATERIALS 2020; 394:122498. [PMID: 32199202 DOI: 10.1016/j.jhazmat.2020.122498] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/26/2019] [Revised: 03/07/2020] [Accepted: 03/07/2020] [Indexed: 06/10/2023]
Abstract
The discharge of huge amount of chemicals from industries into the environment has led to toxicity towards different living species. Therefore, risk assessment of these chemicals is essential. In order to comply with the ethical issues, in this present work, we have developed quantitative structure-activity relationship (QSAR) models for cytotoxicity against GFS (goldfish scale) tissue (Crassius auratus) and enzymatic activity against PLHC-1 cell line (topminnow hepatoma cell line) (Poeciliopsis lucida). The final models were developed by means of PLS (Partial Least Squares) regression method applying only ETA (extended topochemical atom) descriptors. The results obtained from various validation parameters (obtained from the both datasets) suggested that the developed models are statistically robust and predictive. From the insights obtained from the models developed from the Neutral Red dye (NR) dataset, it can be concluded that presence of bulky atoms, unsaturation, branching and hetero atoms (most importantly N, Cl) enhance the cytotoxicity towards the Goldfish scale tissue. On the other hand, in case of the Ethoxyresorufin-O-deethylase (EROD) dataset, presence of higher electronegative atoms (O, Cl), polycyclic aromatic hydrocarbons (PAHs) with more number of rings and absence of polar groups and hydrogen bond acceptors enhance enzymatic activity of the PLHC-1 cell line.
Collapse
Affiliation(s)
- Arnab Seth
- Drug Theoretics and Cheminformatics Laboratory, Department of Pharmaceutical Technology, Jadavpur University, Kolkata, 700032, India
| | - Probir Kumar Ojha
- Drug Theoretics and Cheminformatics Laboratory, Department of Pharmaceutical Technology, Jadavpur University, Kolkata, 700032, India
| | - Kunal Roy
- Drug Theoretics and Cheminformatics Laboratory, Department of Pharmaceutical Technology, Jadavpur University, Kolkata, 700032, India.
| |
Collapse
|
4
|
Viira B, García-Sosa AT, Maran U. Chemical structure and correlation analysis of HIV-1 NNRT and NRT inhibitors and database-curated, published inhibition constants with chemical structure in diverse datasets. J Mol Graph Model 2017; 76:205-223. [PMID: 28738270 DOI: 10.1016/j.jmgm.2017.06.019] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2017] [Revised: 06/18/2017] [Accepted: 06/19/2017] [Indexed: 01/26/2023]
Abstract
Human immunodeficiency virus (HIV-1) reverse transcriptase is a major target for designing anti-HIV drugs. Developed inhibitors are divided into non-nucleoside analog reverse-transcriptase inhibitors (NNRTIs) and nucleoside analog reverse-transcriptase inhibitors (NRTIs) depending on their mechanism. Given that many inhibitors have been studied and for many of them binding affinity constants have been calculated, it is beneficial to analyze the chemical landscape of these families of inhibitors and correlate these inhibition constants with molecular structure descriptors. For this, the HIV-1 RT data was retrieved from the ChEMBL database, carefully curated, and original literature verified, grouped into NRTIs and NNRTIs, analyzed using a hierarchical scaffold classification method and modelled with best multi-linear regression approach. Analysis of the HIV-1 NNRTIs subset results in ten different common structural parent types of oxazepanone, piperazinone, pyrazine, oxazinanone, diazinanone, pyridine, pyrrole, diazepanone, thiazole, and triazine. The same analysis for HIV-1 NRTIs groups structures into four different parent types of uracil, pyrimide, pyrimidione, and imidazole. Each scaffold tree corresponding to the parent types has been carefully analyzed and examined, and changes in chemical structure favorable to potency and stability are highlighted. For both subsets, descriptive and predictive QSAR models are derived, discussed and externally validated, revealing general trends in relationships between molecular structure and binding affinity constants in structurally diverse datasets. Data and QSAR models are available at the QsarDB repository (http://dx.doi.org/10.15152/QDB.202).
Collapse
Affiliation(s)
- Birgit Viira
- Institute of Chemistry, University of Tartu, Tartu 50411, Estonia
| | | | - Uko Maran
- Institute of Chemistry, University of Tartu, Tartu 50411, Estonia.
| |
Collapse
|
5
|
Drgan V, Župerl Š, Vračko M, Como F, Novič M. Robust modelling of acute toxicity towards fathead minnow (Pimephales promelas) using counter-propagation artificial neural networks and genetic algorithm. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2016; 27:501-519. [PMID: 27322761 DOI: 10.1080/1062936x.2016.1196388] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/13/2016] [Accepted: 05/28/2016] [Indexed: 06/06/2023]
Abstract
Large worldwide use of chemicals has caused great concern about their possible adverse effects on human health, flora and fauna. Increased production of new chemicals has also increased demand for their risk assessment. Traditionally, results from animal tests have been used to assess toxicity of chemicals. However, such methods are ethically questionable since they involve killing and causing suffering of the test animals. Therefore, new in silico methods are being sought to replace the traditional in vivo and in vitro testing methods. In this article we report on one method that can be used to build robust models for the prediction of compounds' properties from their chemical structure. The method has been developed by combining a genetic algorithm, a counter-propagation artificial neural network and cross-validation. It has been tested using existing data on toxicity to fathead minnow (Pimephales promelas). The results show that the method may give reliable results for chemicals belonging to the applicability domain of the developed models. Therefore, it can aid the risk assessment of chemicals and consequently reduce demand for animal tests.
Collapse
Affiliation(s)
- V Drgan
- a National Institute of Chemistry , Ljubljana , Slovenia
| | - Š Župerl
- a National Institute of Chemistry , Ljubljana , Slovenia
| | - M Vračko
- a National Institute of Chemistry , Ljubljana , Slovenia
| | - F Como
- b Istituto di Ricerche Farmacologiche 'Mario Negri' , Milan , Italy
| | - M Novič
- a National Institute of Chemistry , Ljubljana , Slovenia
| |
Collapse
|
6
|
He J, Fu L, Wang Y, Li JJ, Wang XH, Su LM, Sheng LX, Zhao YH. Investigation on baseline toxicity to rats based on aliphatic compounds and comparison with toxicity to fish: Effect of exposure routes on toxicity. Regul Toxicol Pharmacol 2014; 70:98-106. [DOI: 10.1016/j.yrtph.2014.06.019] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2014] [Revised: 06/16/2014] [Accepted: 06/18/2014] [Indexed: 11/27/2022]
|
7
|
QSAR Classification Models of Acute Toxicity of Organic Compounds with Respect to Daphnia magna. Pharm Chem J 2014. [DOI: 10.1007/s11094-014-1086-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
8
|
Aruoja V, Moosus M, Kahru A, Sihtmäe M, Maran U. Measurement of baseline toxicity and QSAR analysis of 50 non-polar and 58 polar narcotic chemicals for the alga Pseudokirchneriella subcapitata. CHEMOSPHERE 2014; 96:23-32. [PMID: 23895738 DOI: 10.1016/j.chemosphere.2013.06.088] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/20/2013] [Revised: 06/28/2013] [Accepted: 06/30/2013] [Indexed: 06/02/2023]
Abstract
In this paper a set of homogenous experimental algal toxicity data was measured for 50 non-polar narcotic chemicals using the alga Pseudokirchneriella subcapitata in a closed test with a growth rate endpoint. Most of the tested compounds are high volume industrial chemicals that so far lacked published REACH-compliant algal growth inhibition values. The test protocol fulfilled the criteria set forth in the OECD guideline 201 and had the same sensitivity as the open test which allowed direct comparison of toxicity values. Baseline QSAR model for non-polar narcotic compounds was established and compared with previous analogous models. Multi-linear QSAR model was derived for the non-polar and 58 previously tested polar (anilines and phenols) narcotic compounds modulating hydrophobicity, molecular size, electronic and molecular stability effects coded in the molecular descriptors. Descriptors in the model were analyzed and applicability domain was assessed providing further guidelines for the in silico prediction purposes in decision support while performing risk assessment. QSAR models in the manuscript are available on-line through QsarDB repository for exploring and prediction services (http://hdl.handle.net/10967/106).
Collapse
Affiliation(s)
- Villem Aruoja
- Laboratory of Environmental Toxicology, National Institute of Chemical Physics and Biophysics, Akadeemia tee 23, Tallinn 12618, Estonia.
| | | | | | | | | |
Collapse
|
9
|
Moosus M, Hiob R, Maran U. Quantitative relationship between rate constants and molecular structure descriptors for the gas phase hydrogen abstraction reactions. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2013; 24:501-518. [PMID: 23724929 DOI: 10.1080/1062936x.2013.792869] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/02/2023]
Abstract
The abstraction of hydrogen by general radicals has a wide role in environmental and also in technological processes because it results in reactive free radicals that play a vital role in atmospheric chemistry and also in biochemical processes. In addition to experimental studies, the theoretical modelling of this elementary reaction has been important for understanding and predicting respective rate constants. In this paper, molecular descriptors in the context of a QSAR approach are used to codify the relationship between molecular structure and rate constants. Unique experimental data is collected from the literature for the reaction R(i)• + R(j)H → R(i)H + R(j)•, where R(i)• = H• and R(j)• are diverse radicals. The four-parameter QSAR model (n = 34, r(2) = 0.81, r(2)(CV) = 0.74, r(2)(scr) = 0.12, s(2) = 0.19) is presented for the bimolecular rate constants, accompanied with model diagnostics and analysis of descriptors in the model.
Collapse
Affiliation(s)
- Maikki Moosus
- Institute of Chemistry, University of Tartu, Tartu, Estonia
| | | | | |
Collapse
|
10
|
Sousa IJ, Ferreira MJU, Molnár J, Fernandes MX. QSAR studies of macrocyclic diterpenes with P-glycoprotein inhibitory activity. Eur J Pharm Sci 2013; 48:542-53. [DOI: 10.1016/j.ejps.2012.11.012] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2012] [Revised: 10/23/2012] [Accepted: 11/17/2012] [Indexed: 10/27/2022]
|
11
|
Chakraborty A, Pan S, Chattaraj PK. Biological Activity and Toxicity: A Conceptual DFT Approach. STRUCTURE AND BONDING 2013. [DOI: 10.1007/978-3-642-32750-6_5] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
|
12
|
Acute toxicity of chemicals with respect to guppy studied using a linear discriminant – regression approach. Pharm Chem J 2012. [DOI: 10.1007/s11094-012-0826-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
13
|
Berhanu WM, Pillai GG, Oliferenko AA, Katritzky AR. Quantitative Structure-Activity/Property Relationships: The Ubiquitous Links between Cause and Effect. Chempluschem 2012. [DOI: 10.1002/cplu.201200038] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
|
14
|
Abstract
Chemometrics in Medicine and PharmacyThis minireview summarizes the basic ways of application of chemometrics in medicine and pharmacy. It brings a collection of applications of chemometric used for the solution of diverse practical problems, e.g. exploitation of biologically active species, effective use of biomarkers, advancement of clinical diagnosis, monitoring of the patient's state and prediction of its perspectives, drug design or classification of toxic chemical substances. The aim of this contribution is a brief presentation of versatile potentialities of contemporary chemometrical techniques and relevant software. They are exemplified by typical cases from literature as well as by own research results of the Chemometrics group at Department of Chemistry, the University of Ss. Cyril & Methodius in Trnava.
Collapse
|
15
|
Recurrent model of acute toxicity in homologous series of organic compounds. Pharm Chem J 2011. [DOI: 10.1007/s11094-011-0643-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/15/2022]
|
16
|
Moosus M, Maran U. Quantitative structure-activity relationship analysis of acute toxicity of diverse chemicals to Daphnia magna with whole molecule descriptors. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2011; 22:757-774. [PMID: 21999753 DOI: 10.1080/1062936x.2011.623317] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
Quantitative structure-activity relationship analysis and estimation of toxicological effects at lower-mid trophic levels provide first aid means to understand the toxicity of chemicals. Daphnia magna serves as a good starting point for such toxicity studies and is also recognized for regulatory use in estimating the risk of chemicals. The ECOTOX database was queried and analysed for available data and a homogenous subset of 253 compounds for the endpoint LC50 48 h was established. A four-parameter quantitative structure-activity relationship was derived (coefficient of determination, r (2) = 0.740) for half of the compounds and internally validated (leave-one-out cross-validated coefficient of determination, [Formula: see text] = 0.714; leave-many-out coefficient of determination, [Formula: see text] = 0.738). External validation was carried out with the remaining half of the compounds (coefficient of determination for external validation, [Formula: see text] = 0.634). Two of the descriptors in the model (log P, average bonding information content) capture the structural characteristics describing penetration through bio-membranes. Another two descriptors (energy of highest occupied molecular orbital, weighted partial negative surface area) capture the electronic structural characteristics describing the interaction between the chemical and its hypothetic target in the cell. The applicability domain was subsequently analysed and discussed.
Collapse
Affiliation(s)
- M Moosus
- Institute of Chemistry, University of Tartu, Estonia
| | | |
Collapse
|
17
|
Fatemi MH, Dorostkar F, Ghorbannezhad Z. In silico prediction of free-radical chain transfer constants for some organic agents in styrene polymerization. MONATSHEFTE FUR CHEMIE 2011. [DOI: 10.1007/s00706-011-0527-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
18
|
Si H, Zhao J, Cui L, Lian N, Feng H, Duan YB, Hu Z. Study of Human Dopamine Sulfotransferases Based on Gene Expression Programming. Chem Biol Drug Des 2011; 78:370-7. [DOI: 10.1111/j.1747-0285.2011.01155.x] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
19
|
Luan F, Liu H, Gao Y, Guo L, Zhang X, Guo Y. A Quantitative Structure-Activity Relationship Study of Some Commercially Available Cephalosporins. ACTA ACUST UNITED AC 2009. [DOI: 10.1002/qsar.200810201] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
|
20
|
|
21
|
Xia B, Liu K, Gong Z, Zheng B, Zhang X, Fan B. Rapid toxicity prediction of organic chemicals to Chlorella vulgaris using quantitative structure-activity relationships methods. ECOTOXICOLOGY AND ENVIRONMENTAL SAFETY 2009; 72:787-794. [PMID: 18950860 DOI: 10.1016/j.ecoenv.2008.09.002] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/22/2008] [Revised: 06/25/2008] [Accepted: 09/06/2008] [Indexed: 05/27/2023]
Abstract
This paper presents the results of an optimization study on the toxicity of 91 aliphatic and aromatic compounds as well as a small subset of pesticides to algae Chlorella vulgaris, which was accomplished by using quantitative structure-activity relationships (QSAR). The linear (HM) and the nonlinear method radial basis function neural networks (RBFNN) were used to develop the QSAR models and both of them can give satisfactory prediction results. At the same time, by interpreting the descriptors, we can get some insight into structural features (molecular surface area, electrostatic repulsion, and hydrogen bonds) related to the toxic action. Finally, a detailed analysis on the model application domain defined the compounds, whose estimation can be accepted with confidence. The results of this study suggest that the proposed approaches could be successfully used as a general tool for the estimate of novel toxic compounds.
Collapse
Affiliation(s)
- Binbin Xia
- Department of Chemistry, Lanzhou University, Lanzhou 730000, Gansu, PR China
| | | | | | | | | | | |
Collapse
|
22
|
Katritzky AR, Slavov SH, Stoyanova-Slavova IS, Kahn I, Karelson M. Quantitative structure-activity relationship (QSAR) modeling of EC50 of aquatic toxicities for Daphnia magna. JOURNAL OF TOXICOLOGY AND ENVIRONMENTAL HEALTH. PART A 2009; 72:1181-1190. [PMID: 20077186 DOI: 10.1080/15287390903091863] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/28/2023]
Abstract
The experimental EC(50) toxicities toward Daphnia magna for a series of 130 benzoic acids, benzaldehydes, phenylsulfonyl acetates, cycloalkane-carboxylates, benzanilides, and other esters were studied using the Best multilinear regression algorithm (BMLR) implemented in CODESSA. A modified quantitative structure-activity relationships (QSAR) procedure was applied guaranteeing the stability and reproducibility of the results. Separating the initial data set into training and test subsets generated three independent models with an average R(2) of .735. A five-descriptor general model including all 130 compounds, constructed using the descriptors found effective for the independent subsets, was characterized by the following statistical parameters: R(2) = .712; R(2)(cv) = .676; F = 61.331; s(2) = 0.6. The removal of two extreme outliers improved significantly the statistical parameters: R(2) = .759; R(2)(cv) = .728; F = 77.032; s(2) = 0.499. The sensitivity of the general model to chance correlations was estimated by applying a scrambling procedure involving 20 randomizations of the original property values. The resulting R(2) = .192 demonstrated the high robustness of the model proposed. The descriptors appearing in the obtained models are related to the biochemical nature of the adverse effects. An additional study of the EC(50)/LC(50) relationship for a series of 28 compounds (part of our general data set) revealed that these endpoints correlated with R(2) = .98.
Collapse
Affiliation(s)
- Alan R Katritzky
- Center for Heterocyclic Compounds, Department of Chemistry, University of Florida, Gainesville, Florida, USA.
| | | | | | | | | |
Collapse
|
23
|
Classification and Quantification of the Toxicity of Chemicals to Guppy, Fathead Minnow, and Rainbow Trout. Part 2. Polar Narcosis Mode of Action. ACTA ACUST UNITED AC 2008. [DOI: 10.1002/qsar.200860016] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
24
|
Luan F, Liu HT, Ma WP, Fan BT. QSPR analysis of air-to-blood distribution of volatile organic compounds. ECOTOXICOLOGY AND ENVIRONMENTAL SAFETY 2008; 71:731-739. [PMID: 18067958 DOI: 10.1016/j.ecoenv.2007.10.024] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/06/2007] [Revised: 10/11/2007] [Accepted: 10/21/2007] [Indexed: 05/25/2023]
Abstract
Quantitative structure property relationship (QSPR) models for the prediction of human blood:air partition coefficient (log K(blood)) of volatile organic compounds (VOCs) has been developed based on the linear heuristic method (HM) and non-linear radial basis function neural networks (RBFNNs). Molecular descriptors that are calculated from the structures alone were used to represent the characteristics of the compounds. HM was used both to pre-select the whole descriptor sets and to build the linear model. RBFNN was performed to obtain more accurate models. Both the linear and the non-linear models can give very satisfactory prediction results: the correlation coefficient R was 0.964 and 0.979, and the root-mean-square (RMS) error was 0.3303 and 0.2542 for the whole data set, respectively. The prediction result of the non-linear model is better than that obtained by the linear model. In addition, this paper provides an effective method for predicting log K(blood) from its structures and gives some insight into the structural features related to the solubility of VOCs in human blood.
Collapse
Affiliation(s)
- F Luan
- Department of Applied Chemistry, Yantai University, Yantai 264005, PR China.
| | | | | | | |
Collapse
|
25
|
Raevsky O, Grigor'ev V, Weber E, Dearden J. Classification and Quantification of the Toxicity of Chemicals to Guppy, Fathead Minnow and Rainbow Trout: Part 1. Nonpolar Narcosis Mode of Action. ACTA ACUST UNITED AC 2008. [DOI: 10.1002/qsar.200860014] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
26
|
Gong Z, Xia B, Zhang R, Zhang X, Fan B. Quantitative Structure-Activity Relationship Study on Fish Toxicity of Substituted Benzenes. ACTA ACUST UNITED AC 2008. [DOI: 10.1002/qsar.200710096] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
27
|
Gorinchoy NN, Ogurtsov IY, Tihonovschi A, Balan I, Bersuker IB, Marenich A, Boggs J. Toxicophores and Quantitative Structure -Toxicity Relationships for Some Environmental Pollutants. CHEMISTRY JOURNAL OF MOLDOVA 2008. [DOI: 10.19261/cjm.2008.03(1).13] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
The electron-conformational (EC) method is employed to reveal the toxicophore and to predict aquatic toxicity quantitatively using as a training set a series of 51 compounds that have aquatic toxicity to fish. By performing conformational analysis (optimization of geometries of the low-energy conformers by the PM3 method) and electronic structure calculations (by ab initio method corrected within the SM54/PM3 solvatation model), the Electron-Conformational Matrix of Congruity (ECMC) was constructed for each conformation of these compounds. The toxicophore defined as the EC sub-matrix of activity (ECSA), a sub-matrix with matrix elements common to all the active compounds under consideration within minimal tolerances, is determined by an iterative procedure of comparison of their ECMC’s, gradually minimizing the tolerances. Starting with only the four most toxic compounds, their ECSA (toxicophore) was found to consists of a 4x4 matrix (four sites with certain electronic and topologic characteristics) which was shown to be present in 17 most active compounds. A structure-toxicity correlation between three toxicophore parameters and the activities of these 17 compounds with R2=0.94 was found. It is shown that the same toxicophore with larger tolerances satisfies the compounds with les activity, thus explicitly demonstrating how the activity is controlled by the tolerances quantitatively and which atoms (sites) are most flexible in this respect. This allows for getting slightly different toxicophores for different levels of activity. For some active compounds that have no toxicophore a bimolecular mechanism of activity is suggested. Distinguished from other QSAR methods, no arbitrary descriptors and no statistics are involved in this EC structure-activity investigation.
Collapse
|
28
|
Colombo A, Benfenati E, Karelson M, Maran U. The proposal of architecture for chemical splitting to optimize QSAR models for aquatic toxicity. CHEMOSPHERE 2008; 72:772-780. [PMID: 18471854 DOI: 10.1016/j.chemosphere.2008.03.016] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/16/2007] [Revised: 02/20/2008] [Accepted: 03/11/2008] [Indexed: 05/26/2023]
Abstract
One of the challenges in the field of quantitative structure-activity relationship (QSAR) analysis is the correct classification of a chemical compound to an appropriate model for the prediction of activity. Thus, in previous studies, compounds have been divided into distinct groups according to their mode of action or chemical class. In the current study, theoretical molecular descriptors were used to divide 568 organic substances into subsets with toxicity measured for the 96-h lethal median concentration for the Fathead minnow (Pimephales promelas). Simple constitutional descriptors such as the number of aliphatic and aromatic rings and a quantum chemical descriptor, maximum bond order of a carbon atom divide compounds into nine subsets. For each subset of compounds the automatic forward selection of descriptors was applied to construct QSAR models. Significant correlations were achieved for each subset of chemicals and all models were validated with the leave-one-out internal validation procedure (R(2)(cv) approximately 0.80). The results encourage to consider this alternative way for the prediction of toxicity using QSAR subset models without direct reference to the mechanism of toxic action or the traditional chemical classification.
Collapse
Affiliation(s)
- Andrea Colombo
- Institute of Chemistry, University of Tartu, Tartu, Estonia
| | | | | | | |
Collapse
|
29
|
Classification structure–activity relationship (CSAR) studies for prediction of genotoxicity of thiophene derivatives. Toxicol Lett 2008; 177:10-9. [DOI: 10.1016/j.toxlet.2007.12.005] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2007] [Revised: 12/12/2007] [Accepted: 12/12/2007] [Indexed: 11/20/2022]
|
30
|
Lü W, Chen Y, Liu M, Chen X, Hu Z. QSPR prediction of n-octanol/water partition coefficient for polychlorinated biphenyls. CHEMOSPHERE 2007; 69:469-78. [PMID: 17568650 DOI: 10.1016/j.chemosphere.2007.04.044] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/02/2007] [Revised: 04/11/2007] [Accepted: 04/12/2007] [Indexed: 05/15/2023]
Abstract
The logarithmic n-octanol/water partition coefficient (logK(ow)) is a very important property which concerns water-solubility, bioconcentration factor, toxicity and soil absorption coefficient of organic compounds. Quantitative structure-property relationship (QSPR) model for logK(ow) of 133 polychlorinated biphenyls (PCBs) is analyzed using heuristic method (HM) implemented in CODESSA. In order to indicate the influence of different molecular descriptors on logK(ow) values and well understand the important structural factors affecting the experimental values, three multivariable linear models derived from three groups of different molecular descriptors were built. Moreover, each molecular descriptor in these models was discussed to well understand the relationship between molecular structures and their logK(ow) values. The proposed models gave the following results: the square of correlation coefficient, R(2), for the models with one, two and three molecular descriptors was 0.8854, 0.9239 and 0.9285, respectively.
Collapse
Affiliation(s)
- Wenjuan Lü
- Department of Chemistry, Lanzhou University, Lanzhou 730000, Gansu Province, China
| | | | | | | | | |
Collapse
|
31
|
Liu H, Yao X, Liu M, Hu Z, Fan B. Study on adsorption behavior of volatile and semivolatile organic vapors to air-dry soils based on QSPR methods. ENVIRONMENTAL POLLUTION (BARKING, ESSEX : 1987) 2007; 147:41-9. [PMID: 17240022 DOI: 10.1016/j.envpol.2006.08.030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/07/2005] [Revised: 08/23/2006] [Accepted: 08/27/2006] [Indexed: 05/13/2023]
Abstract
The accurate non-linear quantitive structure-property relationship model for predicting the adsorption constant of volatile and semivolatile organic vapors in soil was firstly developed based on support vector machine (SVM) by using the compounds' molecular descriptors calculated from the structure alone and the features of soil and air. Multiple linear regression (MLR) was used to build the linear QSPR model. Both the linear and non-linear models can give satisfactory prediction results: the correlation coefficient R was 0.953 and 0.995, the mean square error (MSE) was 0.0517 and 0.0057, respectively, for the whole dataset. The prediction result of the SVM model was better than that obtained by the MLR model, which proved non-linear model can simulate the relationship between the structural descriptors, the environmental condition and the soil/air distribution more accurately as well as SVM was a useful tool in the prediction of the adsorption constant of compounds.
Collapse
Affiliation(s)
- Huanxiang Liu
- Department of Chemistry, Lanzhou University, Lanzhou 730000, China
| | | | | | | | | |
Collapse
|
32
|
Bowen KR, Flanagan KB, Acree WE, Abraham MH, Rafols C. Correlation of the toxicity of organic compounds to tadpoles using the Abraham model. THE SCIENCE OF THE TOTAL ENVIRONMENT 2006; 371:99-109. [PMID: 17011022 DOI: 10.1016/j.scitotenv.2006.08.030] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/16/2006] [Revised: 08/11/2006] [Accepted: 08/18/2006] [Indexed: 05/12/2023]
Abstract
The narcotic and lethal concentrations of organic compounds have been compiled for several tadpole species (Rana temporaria, Rana pipiens, Rana japonica, Xenopus laevis and Rana brevipoda porosa). The narcotic and lethal concentrations have been correlated using the Abraham solvation parameter model to yield an equation that can be used to predict the narcotic concentrations of additional nonpolar and polar narcotic compounds to R. temporaria, and a more general correlation that should be applicable to different species of tadpoles. The more general equation is based on 240 experimental data points. A training set of 123 compounds could be fitted with the Abraham solvation parameters with R(2)=0.931 and S.D.=0.343 log units. The training equation predicted the test set of 122 values with AE=-0.022 log units, S.D.=0.300 log units and an average absolute error, AAE, of 0.227 log units. The structural features that are important in narcosis of tadpoles have been examined; it is concluded that hydrogen bond basicity reduces narcotic activity of compounds and that compound size increases narcotic activity. The solvation parameter model enables narcosis of tadpoles to be compared to various other biological processes and to physicochemical processes that might be used as models for narcosis.
Collapse
Affiliation(s)
- Kaci R Bowen
- Department of Chemistry, P. O. Box 305070, University of North Texas, Denton, TX 76203-5070, USA
| | | | | | | | | |
Collapse
|
33
|
Bowen KR, Flanagan KB, Acree WE, Abraham MH. Correlating toxicities of organic compounds to select protozoa using the Abraham model. THE SCIENCE OF THE TOTAL ENVIRONMENT 2006; 369:109-18. [PMID: 16759684 DOI: 10.1016/j.scitotenv.2006.05.008] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/16/2006] [Revised: 05/03/2006] [Accepted: 05/05/2006] [Indexed: 05/10/2023]
Abstract
The Abraham solvation parameter model is used to construct mathematical correlations for describing the nonspecific toxicity of organic compounds to three protozoas (Entosiphon sulcantum, Uronema parduczi and Chilomonas paramecium). The derived mathematical correlations describe the observed published toxicity data to within an overall average standard deviation of approximately 0.35 log units. The correlations can be used to estimate aquatic toxicities of organic chemicals to the three aquatic organisms studied, and to help in identifying compounds whose toxic mode of action might involve chemical specific reactivity, rather than nonpolar or polar narcosis. A principal component analysis of the correlation equations found in this work shows that no water-solvent system we have investigated is a good model for nonspecific aquatic toxicity towards the three protozoas. Furthermore, correlation equations for nonspecific aqueous toxicity towards various biological systems, that we have found in this work and in previous studies, cover such a wide range that no single water-solvent system could ever be a good model for all the biological systems.
Collapse
Affiliation(s)
- Kaci R Bowen
- Department of Chemistry, P O Box 305070, University of North Texas, Denton, TX 76203-5070, USA
| | | | | | | |
Collapse
|
34
|
Wang Y, Zhao C, Ma W, Liu H, Wang T, Jiang G. Quantitative structure-activity relationship for prediction of the toxicity of polybrominated diphenyl ether (PBDE) congeners. CHEMOSPHERE 2006; 64:515-24. [PMID: 16406101 DOI: 10.1016/j.chemosphere.2005.11.061] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/28/2005] [Revised: 10/31/2005] [Accepted: 11/05/2005] [Indexed: 05/06/2023]
Abstract
Levels of Polybrominated diphenyl ether (PBDEs) are increasing in the environment due to their use as flame retardants. The similarities of structure to polychlorinated biphenyl (PCB) congeners suggest that they may share similar toxicological properties, such as hepatic enzyme induction. In this work, quantitative structure-activity relationship (QSAR) models were constructed based on 406 descriptors for the logarithm of toxicology index (aryl hydrocarbon receptor relative binding affinities, AhR, I) of 18 PBDE congeners. The method used for building model is the Heuristic method, which is included in comprehensive descriptors for structural and statistical analysis (CODESSA) software. The best regression model involved four descriptors, which were related to the conformational changes, atomic reactivity, molecular electrostatic field, and non-uniformity of mass distribution in a molecule of PBDEs, etc. The high square of the correlation coefficient R(2)(0.903) showed the model was satisfactory.
Collapse
Affiliation(s)
- Yawei Wang
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, P.O. Box 2871, Beijing 100085, China
| | | | | | | | | | | |
Collapse
|
35
|
Simon-Hettich B, Rothfuss A, Steger-Hartmann T. Use of computer-assisted prediction of toxic effects of chemical substances. Toxicology 2006; 224:156-62. [PMID: 16707203 DOI: 10.1016/j.tox.2006.04.032] [Citation(s) in RCA: 70] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2006] [Revised: 04/13/2006] [Accepted: 04/13/2006] [Indexed: 11/16/2022]
Abstract
The current revision of the European policy for the evaluation of chemicals (REACH) has lead to a controversy with regard to the need of additional animal safety testing. To avoid increases in animal testing but also to save time and resources, alternative in silico or in vitro tests for the assessment of toxic effects of chemicals are advocated. The draft of the original document issued in 29th October 2003 by the European Commission foresees the use of alternative methods but does not give further specification on which methods should be used. Computer-assisted prediction models, so-called predictive tools, besides in vitro models, will likely play an essential role in the proposed repertoire of "alternative methods". The current discussion has urged the Advisory Committee of the German Toxicology Society to present its position on the use of predictive tools in toxicology. Acceptable prediction models already exist for those toxicological endpoints which are based on well-understood mechanism, such as mutagenicity and skin sensitization, whereas mechanistically more complex endpoints such as acute, chronic or organ toxicities currently cannot be satisfactorily predicted. A potential strategy to assess such complex toxicities will lie in their dissection into models for the different steps or pathways leading to the final endpoint. Integration of these models should result in a higher predictivity. Despite these limitations, computer-assisted prediction tools already today play a complementary role for the assessment of chemicals for which no data is available or for which toxicological testing is impractical due to the lack of availability of sufficient compounds for testing. Furthermore, predictive tools offer support in the screening and the subsequent prioritization of compound for further toxicological testing, as expected within the scope of the European REACH program. This program will also lead to the collection of high-quality data which will broaden the database for further (Q)SAR approaches and will in turn increase the predictivity of predictive tools.
Collapse
|
36
|
de Roode D, Hoekzema C, de Vries-Buitenweg S, van de Waart B, van der Hoeven J. QSARs in ecotoxicological risk assessment. Regul Toxicol Pharmacol 2006; 45:24-35. [PMID: 16529851 DOI: 10.1016/j.yrtph.2006.01.012] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2005] [Indexed: 11/27/2022]
Abstract
The need for more ecotoxicological data encourages the use of QSARs because of the reduction of (animal) testing, time and cost. QSARs may however only be used if they prove to be reliable and accurate. In this paper, four QSARs were attempted to predict toxicity for 170 compounds from a broad chemical class, using them as a black-box. Predictions were obtained for 122 compounds, indicating an important drawback of QSARs, i.e., for 28% of the compounds QSARs cannot be used at all. Ecosar, Topkat, and QSARs for non-polar and polar narcosis generated predictions for 120, 39, 24, and 11 compounds, respectively. Correlations between experimental and predicted effect concentrations were significant for Topkat and the QSAR for polar narcosis, but generally poor for Ecosar and the QSAR for non-polar narcosis. When predicted effect concentrations for fish were allowed to deviate from experimental values by a factor of 5, correct predictions were generated for 77%, 54%, 68%, and 91% of the compounds using Ecosar, Topkat, and the QSARs for non-polar and polar narcosis, respectively. It was impossible to indicate specific chemical classes for which a QSAR should be used or not. The results show that currently available QSARs cannot be used as a black-box.
Collapse
Affiliation(s)
- Daphne de Roode
- NOTOX B.V., Hambakenwetering 7, P.O. Box 3476, 5203 DL 's Hertogenbosch, The Netherlands.
| | | | | | | | | |
Collapse
|
37
|
Liu H, Yao X, Zhang R, Liu M, Hu Z, Fan B. The accurate QSPR models to predict the bioconcentration factors of nonionic organic compounds based on the heuristic method and support vector machine. CHEMOSPHERE 2006; 63:722-33. [PMID: 16226786 DOI: 10.1016/j.chemosphere.2005.08.031] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/15/2005] [Revised: 07/08/2005] [Accepted: 08/12/2005] [Indexed: 05/04/2023]
Abstract
The heuristic method (HM) and support vector machine (SVM) were used to build the linear and nonlinear quantitive structure-property relationship (QSPR) models for the prediction of the fish bioconcentration factors (BCF) for 122 diverse nonionic organic chemicals using the three descriptors calculated from the molecular structure alone and selected by HM. Both the linear and nonlinear model can give very satisfactory prediction results: the square of correlation coefficient R(2) was 0.929 and 0.953, the root mean square (RMS) error was 0.404 and 0.331, respectively for the whole dataset. The prediction result of the SVM model is better than that obtained by heuristic method, which proved SVM was a useful tool in the prediction of the BCF. At the same time, the HM model showed the influencing degree of different molecular descriptors on bioconcentration factors and then could improve the understanding for the bioconcentration mechanism of organic pollutants from molecular level.
Collapse
Affiliation(s)
- Huanxiang Liu
- Department of Chemistry, Lanzhou University, Lanzhou 730000, China
| | | | | | | | | | | |
Collapse
|
38
|
Roy K, Sanyal I. QSTR with Extended Topochemical Atom Indices. 7. QSAR of Substituted Benzenes toSaccharomyces cerevisiae. ACTA ACUST UNITED AC 2006. [DOI: 10.1002/qsar.200530172] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
39
|
Tämm K, Burk P. QSPR analysis for infinite dilution activity coefficients of organic compounds. J Mol Model 2005; 12:417-21. [PMID: 16333643 DOI: 10.1007/s00894-005-0062-2] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2005] [Accepted: 09/21/2005] [Indexed: 11/30/2022]
Abstract
A quantitative structure-property relationship study of the infinite-dilution activity coefficients for a set of 38 organic compounds in ionic liquids such as 1-methyl-3-ethylimidazolium bis((trifluoromethyl)sulfonyl)imide, 1,2-dimethyl-3-ethylimidazolium bis((trifluoromethyl)-sulfonyl)imide, and 4-methyl-N-butylpyridinium tetrafluoroborate. QSPR study was carried out using the CODESSA PRO program. A general three-parameter QSPR model was obtained. Three orthogonal theoretical molecular descriptors satisfactorily correlate with the activity coefficients. The descriptors, such as the complementary information content, the fractional partial negative surface area and the count of hydrogen donor sites describe the dilution process in ILs.
Collapse
Affiliation(s)
- Kaido Tämm
- University of Tartu, 2 Jakobi str., Tartu, 51014, Estonia.
| | | |
Collapse
|
40
|
Liu HX, Yao XJ, Zhang RS, Liu MC, Hu ZD, Fan BT. Prediction of the tissue/blood partition coefficients of organic compounds based on the molecular structure using least-squares support vector machines. J Comput Aided Mol Des 2005; 19:499-508. [PMID: 16317501 DOI: 10.1007/s10822-005-9003-5] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2005] [Accepted: 07/06/2005] [Indexed: 11/29/2022]
Abstract
The accurate nonlinear model for predicting the tissue/blood partition coefficients (PC) of organic compounds in different tissues was firstly developed based on least-squares support vector machines (LS-SVM), as a novel machine learning technique, by using the compounds' molecular descriptors calculated from the structure alone and the composition features of tissues. The heuristic method (HM) was used to select the appropriate molecular descriptors and build the linear model. The prediction result of the LS-SVM model is much better than that obtained by HM method and the prediction values of tissue/blood partition coefficients based on the LS-SVM model are in good agreement with the experimental values, which proved that nonlinear model can simulate the relationship between the structural descriptors, the tissue composition and the tissue/blood partition coefficients more accurately as well as LS-SVM was a powerful and promising tool in the prediction of the tissue/blood partition behaviour of compounds. Furthermore, this paper provided a new and effective method for predicting the tissue/blood partition behaviour of the compounds in the different tissues from their structures and gave some insight into structural features related to the partition process of the organic compounds in different tissues.
Collapse
Affiliation(s)
- H X Liu
- Department of Chemistry, Lanzhou University, 730000, Lanzhou, China
| | | | | | | | | | | |
Collapse
|
41
|
Liu H, Yao X, Zhang R, Liu M, Hu Z, Fan B. Accurate Quantitative Structure−Property Relationship Model To Predict the Solubility of C60 in Various Solvents Based on a Novel Approach Using a Least-Squares Support Vector Machine. J Phys Chem B 2005; 109:20565-71. [PMID: 16853662 DOI: 10.1021/jp052223n] [Citation(s) in RCA: 108] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
A least-squares support vector machine (LSSVM) was used for the first time as a novel machine-learning technique for the prediction of the solubility of C60 in a large number of diverse solvents using calculated molecular descriptors from the molecular structure alone and on the basis of the software CODESSA as inputs. The heuristic method of CODESSA was used to select the correlated descriptors and build the linear model. Both the linear and the nonlinear models can give very satisfactory prediction results: the square of the correlation coefficient R(2) was 0.892 and 0.903, and the root-mean-square error was 0.126 and 0.116, respectively, for the whole data set. The prediction result of the LSSVM model is better than that obtained by the heuristic method and the reference, which proved LSSVM was a useful tool in the prediction of the solubility of C60. In addition, this paper provided a new and effective method for predicting the solubility of C60 from its structures and gave some insight into the structural features related to the solubility of C60 in different solvents.
Collapse
Affiliation(s)
- Huanxiang Liu
- Department of Chemistry, Lanzhou University, Lanzhou 730000, People's Republic of China
| | | | | | | | | | | |
Collapse
|
42
|
Hoover KR, Acree WE, Abraham MH. Chemical Toxicity Correlations for Several Fish Species Based on the Abraham Solvation Parameter Model. Chem Res Toxicol 2005; 18:1497-505. [PMID: 16167843 DOI: 10.1021/tx050164z] [Citation(s) in RCA: 52] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
The Abraham solvation parameter model is used to construct mathematical correlations for describing the nonspecific aquatic toxicity of organic compounds to the fathead minnow, guppy, bluegill, goldfish, golden orfe, and high-eyes medaka. The derived mathematical correlations describe the observed published toxicity data to within an overall average standard deviation of approximately 0.28 log units. In the case of ester solutes, the descriptions were improved by introducing an indicator variable into the basic model. Derived correlations can be used to estimate aquatic toxicities of organic chemicals to the six fish species studied and to help in identifying compounds whose toxic mode of action might involve chemical specific reactivity, rather than nonpolar or polar narcosis. A principal component analysis of the correlation equations shows that the water-octanol system is a poor model for nonspecific aquatic toxicity but that the water-isobutanol and water-pentanol systems are much better models.
Collapse
Affiliation(s)
- Kaci R Hoover
- Department of Chemistry, University of North Texas, P.O. Box 305070, Denton, Texas 76203-5070, USA
| | | | | |
Collapse
|
43
|
Liu H, Yao X, Xue C, Zhang R, Liu M, Hu Z, Fan B. Study of quantitative structure–mobility relationship of the peptides based on the structural descriptors and support vector machines. Anal Chim Acta 2005. [DOI: 10.1016/j.aca.2005.04.006] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
|
44
|
Jónsdóttir SO, Jørgensen FS, Brunak S. Prediction methods and databases within chemoinformatics: emphasis on drugs and drug candidates. Bioinformatics 2005; 21:2145-60. [PMID: 15713739 DOI: 10.1093/bioinformatics/bti314] [Citation(s) in RCA: 65] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION To gather information about available databases and chemoinformatics methods for prediction of properties relevant to the drug discovery and optimization process. RESULTS We present an overview of the most important databases with 2-dimensional and 3-dimensional structural information about drugs and drug candidates, and of databases with relevant properties. Access to experimental data and numerical methods for selecting and utilizing these data is crucial for developing accurate predictive in silico models. Many interesting predictive methods for classifying the suitability of chemical compounds as potential drugs, as well as for predicting their physico-chemical and ADMET properties have been proposed in recent years. These methods are discussed, and some possible future directions in this rapidly developing field are described.
Collapse
Affiliation(s)
- Svava Osk Jónsdóttir
- Center for Biological Sequence Analysis, BioCentrum-DTU, Technical University of Denmark, DK-2800 Kongens Lyngby, Denmark.
| | | | | |
Collapse
|
45
|
Liu HX, Hu RJ, Zhang RS, Yao XJ, Liu MC, Hu ZD, Fan BT. The prediction of human oral absorption for diffusion rate-limited drugs based on heuristic method and support vector machine. J Comput Aided Mol Des 2005; 19:33-46. [PMID: 16059665 DOI: 10.1007/s10822-005-0095-8] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2004] [Accepted: 01/03/2005] [Indexed: 10/25/2022]
Abstract
Support vector machine (SVM), as a novel machine learning technique, was used for the prediction of the human oral absorption for a large and diverse data set using the five descriptors calculated from the molecular structure alone. The molecular descriptors were selected by heuristic method (HM) implemented in CODESSA. At the same time, in order to show the influence of different molecular descriptors on absorption and to well understand the absorption mechanism, HM was used to build several multivariable linear models using different numbers of molecular descriptors. Both the linear and non-linear model can give satisfactory prediction results: the square of correlation coefficient R(2) was 0.78 and 0.86 for the training set, and 0.70 and 0.73 for the test set respectively. In addition, this paper provides a new and effective method for predicting the absorption of the drugs from their structures and gives some insight into structural features related to the absorption of the drugs.
Collapse
Affiliation(s)
- H X Liu
- Department of Chemistry, Lanzhou University, Lanzhou 730000, P.R. China
| | | | | | | | | | | | | |
Collapse
|
46
|
Katritzky AR, Kuanar M, Fara DC, Karelson M, Acree WE. QSPR treatment of rat blood:air, saline:air and olive oil:air partition coefficients using theoretical molecular descriptors. Bioorg Med Chem 2004; 12:4735-48. [PMID: 15294307 DOI: 10.1016/j.bmc.2004.05.028] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2004] [Revised: 05/13/2004] [Accepted: 05/25/2004] [Indexed: 11/19/2022]
Abstract
A QSPR treatment has been applied to a data set that consists of 100 diverse organic compounds to relate the logarithmic function of rat blood:air, saline:air and olive oil:air partition coefficients (denoted by log K(b:a), log K(s:a), and log K(o:a), respectively), with theoretical molecular and fragment descriptors. Three QSPR models with squared correlation coefficients of 0.881, 0.926, and 0.922, respectively, were obtained. The verification of the predictive power of these models on a test set of 33 organic chemicals that were not included in the training set gave satisfactory squared correlation coefficients: 0.791 for rat blood:air, 0.794 for saline:air and 0.846 for olive oil:air.
Collapse
Affiliation(s)
- Alan R Katritzky
- Center for Heterocyclic Compounds, Department of Chemistry, University of Florida, Gainesville, FL 32611-17200, USA.
| | | | | | | | | |
Collapse
|
47
|
Tämm K, Fara DC, Katritzky AR, Burk P, Karelson M. A Quantitative Structure−Property Relationship Study of Lithium Cation Basicities. J Phys Chem A 2004. [DOI: 10.1021/jp037594n] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Kaido Tämm
- Center for Heterocyclic Compounds, Department of Chemistry, University of Florida, Gainesville, Florida 32611, and Department of Chemistry, University of Tartu, 2 Jakobi Street, Tartu 51014, Estonia
| | - Dan C. Fara
- Center for Heterocyclic Compounds, Department of Chemistry, University of Florida, Gainesville, Florida 32611, and Department of Chemistry, University of Tartu, 2 Jakobi Street, Tartu 51014, Estonia
| | - Alan R. Katritzky
- Center for Heterocyclic Compounds, Department of Chemistry, University of Florida, Gainesville, Florida 32611, and Department of Chemistry, University of Tartu, 2 Jakobi Street, Tartu 51014, Estonia
| | - Peeter Burk
- Center for Heterocyclic Compounds, Department of Chemistry, University of Florida, Gainesville, Florida 32611, and Department of Chemistry, University of Tartu, 2 Jakobi Street, Tartu 51014, Estonia
| | - Mati Karelson
- Center for Heterocyclic Compounds, Department of Chemistry, University of Florida, Gainesville, Florida 32611, and Department of Chemistry, University of Tartu, 2 Jakobi Street, Tartu 51014, Estonia
| |
Collapse
|
48
|
Hawkins DM, Basak SC, Mills D. QSARs for chemical mutagens from structure: ridge regression fitting and diagnostics. ENVIRONMENTAL TOXICOLOGY AND PHARMACOLOGY 2004; 16:37-44. [PMID: 21782692 DOI: 10.1016/j.etap.2003.09.001] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/01/2003] [Accepted: 09/08/2003] [Indexed: 05/31/2023]
Abstract
QSAR models have been developed for a diverse set of mutagens using computed molecular descriptors. Such models can be used in predicting mutagenicity from structure. All common methods-regression, neural nets, k-nearest neighbors-are 'linear smoothers'-weighted averages of the activities in the calibration data with weights dependent on the descriptors. While they have been studied extensively, a vital but overlooked area is 'case diagnostics', pointers to compounds that are poorly fitted, or are unusually influential in fitting the model. This is particularly true where the measured activity is binary-present or absent. We illustrate the use of numeric and graphic diagnostics, particularly that of the FF plot, with a data set with 508 compounds and 307 structural descriptors used to predict mutagenicity.
Collapse
Affiliation(s)
- Douglas M Hawkins
- School of Statistics, University of Minnesota, 313 Ford Hall, 224 Church Street S.E., Minneapolis, MN 55455, USA
| | | | | |
Collapse
|
49
|
de Wolf E, Ruelle P, van den Broeke J, Deelman BJ, van Koten G. Prediction of Partition Coefficients of Fluorous and Nonfluorous Solutes in Fluorous Biphasic Solvent Systems by Mobile Order and Disorder Theory. J Phys Chem B 2004. [DOI: 10.1021/jp036751f] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Elwin de Wolf
- Department of Metal-Mediated Synthesis, Debye Institute, Utrecht University, Padualaan 8, 3584 CH Utrecht, The Netherlands, Institut d'Analyse Pharmaceutique, Section de Pharmacie, Université de Lausanne, BEP, CH-1015, Lausanne, Switzerland, and Atofina Vlissingen B.V., P.O. Box 70, 4380 AB, The Netherlands
| | - Paul Ruelle
- Department of Metal-Mediated Synthesis, Debye Institute, Utrecht University, Padualaan 8, 3584 CH Utrecht, The Netherlands, Institut d'Analyse Pharmaceutique, Section de Pharmacie, Université de Lausanne, BEP, CH-1015, Lausanne, Switzerland, and Atofina Vlissingen B.V., P.O. Box 70, 4380 AB, The Netherlands
| | - Joep van den Broeke
- Department of Metal-Mediated Synthesis, Debye Institute, Utrecht University, Padualaan 8, 3584 CH Utrecht, The Netherlands, Institut d'Analyse Pharmaceutique, Section de Pharmacie, Université de Lausanne, BEP, CH-1015, Lausanne, Switzerland, and Atofina Vlissingen B.V., P.O. Box 70, 4380 AB, The Netherlands
| | - Berth-Jan Deelman
- Department of Metal-Mediated Synthesis, Debye Institute, Utrecht University, Padualaan 8, 3584 CH Utrecht, The Netherlands, Institut d'Analyse Pharmaceutique, Section de Pharmacie, Université de Lausanne, BEP, CH-1015, Lausanne, Switzerland, and Atofina Vlissingen B.V., P.O. Box 70, 4380 AB, The Netherlands
| | - Gerard van Koten
- Department of Metal-Mediated Synthesis, Debye Institute, Utrecht University, Padualaan 8, 3584 CH Utrecht, The Netherlands, Institut d'Analyse Pharmaceutique, Section de Pharmacie, Université de Lausanne, BEP, CH-1015, Lausanne, Switzerland, and Atofina Vlissingen B.V., P.O. Box 70, 4380 AB, The Netherlands
| |
Collapse
|
50
|
Katritzky AR, Oliferenko P, Oliferenko A, Lomaka A, Karelson M. Nitrobenzene toxicity: QSAR correlations and mechanistic interpretations. J PHYS ORG CHEM 2003. [DOI: 10.1002/poc.643] [Citation(s) in RCA: 39] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|