Reference Citation Analysis

For: Santana R, Zuluaga R, Gañán P, Arrasate S, Onieva E, González-Díaz H. Designing nanoparticle release systems for drug-vitamin cancer co-therapy with multiplicative perturbation-theory machine learning (PTML) models. Nanoscale 2019;11:21811-21823. [PMID: 31691701 DOI: 10.1039/c9nr05070a] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Indexed: 06/10/2023]

Cited by Other Article(s)

He S, Segura Abarrategi J, Bediaga H, Arrasate S, González-Díaz H. On the additive artificial intelligence-based discovery of nanoparticle neurodegenerative disease drug delivery systems. BEILSTEIN JOURNAL OF NANOTECHNOLOGY 2024;15:535-555. [PMID: 38774585 PMCID: PMC11106676 DOI: 10.3762/bjnano.15.47] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/15/2024] [Accepted: 04/23/2024] [Indexed: 05/24/2024]

Abstract

Neurodegenerative diseases are characterized by slowly progressing neuronal cell death. Conventional drug treatment strategies often fail because of poor solubility, low bioavailability, and the inability of the drugs to effectively cross the blood-brain barrier. Therefore, the development of new neurodegenerative disease drugs (NDDs) requires immediate attention. Nanoparticle (NP) systems are of increasing interest for transporting NDDs to the central nervous system. However, discovering effective nanoparticle neuronal disease drug delivery systems (N2D3Ss) is challenging because of the vast number of combinations of NP and NDD compounds, as well as the various assays involved. Artificial intelligence/machine learning (AI/ML) algorithms have the potential to accelerate this process by predicting the most promising NDD and NP candidates for assaying. Nevertheless, the relatively limited amount of reported data on N2D3S activity compared to assayed NDDs makes AI/ML analysis challenging. In this work, the IFPTML technique, which combines information fusion (IF), perturbation theory (PT), and machine learning (ML), was employed to address this challenge. Initially, we conducted the fusion into a unified dataset comprising 4403 NDD assays from ChEMBL and 260 NP cytotoxicity assays from journal articles. Through a resampling process, three new working datasets were generated, each containing 500,000 cases. We utilized linear discriminant analysis (LDA) along with artificial neural network (ANN) algorithms, such as multilayer perceptron (MLP) and deep learning networks (DLN), to construct linear and non-linear IFPTML models. The IFPTML-LDA models exhibited sensitivity (Sn) and specificity (Sp) values in the range of 70% to 73% (>375,000 training cases) and 70% to 80% (>125,000 validation cases), respectively. In contrast, the IFPTML-MLP and IFPTML-DLN achieved Sn and Sp values in the range of 85% to 86% for both training and validation series. 
Additionally, IFPTML-ANN models showed an area under the receiver operating curve (AUROC) of approximately 0.93 to 0.95. These results indicate that the IFPTML models could serve as valuable tools in the design of drug delivery systems for neurosciences.
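
The sensitivity (Sn) and specificity (Sp) figures quoted in this abstract are standard confusion-matrix ratios. As a minimal sketch, not the authors' code, the following Python function computes both from binary labels. The function name, the 1 = active / 0 = inactive encoding, and the toy data are assumptions made only for illustration.

```python
def sensitivity_specificity(y_true, y_pred):
    """Return (Sn, Sp) in percent for binary labels (assumed: 1 = active, 0 = inactive)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # actives predicted active
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # actives predicted inactive
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # inactives predicted inactive
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # inactives predicted active
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one active and one inactive case")
    sn = 100.0 * tp / (tp + fn)  # Sn = TP / (TP + FN)
    sp = 100.0 * tn / (tn + fp)  # Sp = TN / (TN + FP)
    return sn, sp


# Toy data, invented for illustration only (not from the paper).
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 0, 0, 0, 0, 1, 0]
sn, sp = sensitivity_specificity(y_true, y_pred)
print(f"Sn = {sn:.1f}%, Sp = {sp:.1f}%")  # Sn = 75.0%, Sp = 83.3%
```

Reporting Sn and Sp separately for the training and validation series, as the abstract does, shows whether a model favors one class over the other.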

Kleandrova VV, Cordeiro MNDS, Speck-Planche A. Optimizing drug discovery using multitasking models for quantitative structure-biological effect relationships: an update of the literature. Expert Opin Drug Discov 2023;18:1231-1243. [PMID: 37639708 DOI: 10.1080/17460441.2023.2251385] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/08/2023] [Accepted: 08/21/2023] [Indexed: 08/31/2023]

Shirokii N, Din Y, Petrov I, Seregin Y, Sirotenko S, Razlivina J, Serov N, Vinogradov V. Quantitative Prediction of Inorganic Nanomaterial Cellular Toxicity via Machine Learning. SMALL (WEINHEIM AN DER BERGSTRASSE, GERMANY) 2023;19:e2207106. [PMID: 36772908 DOI: 10.1002/smll.202207106] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 11/15/2022] [Revised: 01/09/2023] [Indexed: 05/11/2023]

Johnston ST, Faria M. Equation learning to identify nano-engineered particle-cell interactions: an interpretable machine learning approach. NANOSCALE 2022;14:16502-16515. [PMID: 36314284 DOI: 10.1039/d2nr04668g] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Indexed: 06/16/2023]

Speck-Planche A, Kleandrova VV. Multi-Condition QSAR Model for the Virtual Design of Chemicals with Dual Pan-Antiviral and Anti-Cytokine Storm Profiles. ACS OMEGA 2022;7:32119-32130. [PMID: 36120024 PMCID: PMC9476185 DOI: 10.1021/acsomega.2c03363] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 05/30/2022] [Accepted: 08/19/2022] [Indexed: 06/15/2023]

Konstantopoulos G, Koumoulos EP, Charitidis CA. Digital Innovation Enabled Nanomaterial Manufacturing; Machine Learning Strategies and Green Perspectives. NANOMATERIALS 2022;12:nano12152646. [PMID: 35957077 PMCID: PMC9370746 DOI: 10.3390/nano12152646] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 06/30/2022] [Revised: 07/28/2022] [Accepted: 07/29/2022] [Indexed: 02/05/2023]

Palai D, Tahara H, Chikami S, Latag GV, Maeda S, Komura C, Kurioka H, Hayashi T. Prediction of Serum Adsorption onto Polymer Brush Films by Machine Learning. ACS Biomater Sci Eng 2022;8:3765-3772. [PMID: 35905395 DOI: 10.1021/acsbiomaterials.2c00441] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Indexed: 11/29/2022]

Diéguez-Santana K, Casañola-Martin GM, Torres R, Rasulev B, Green JR, González-Díaz H. Machine Learning Study of Metabolic Networks vs ChEMBL Data of Antibacterial Compounds. Mol Pharm 2022;19:2151-2163. [PMID: 35671399 PMCID: PMC9986951 DOI: 10.1021/acs.molpharmaceut.2c00029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/29/2022]

Serov N, Vinogradov V. Artificial intelligence to bring nanomedicine to life. Adv Drug Deliv Rev 2022;184:114194. [PMID: 35283223 DOI: 10.1016/j.addr.2022.114194] [Citation(s) in RCA: 25] [Impact Index Per Article: 12.5] [Received: 12/16/2021] [Revised: 03/04/2022] [Accepted: 03/07/2022] [Indexed: 12/13/2022]

Abstract

The technology of drug delivery systems (DDSs) has demonstrated outstanding performance and effectiveness in the production of pharmaceuticals, as shown by the many FDA-approved nanomedicines that have enhanced selectivity, manageable drug release kinetics, and synergistic therapeutic actions. Nonetheless, to date, the rational design and high-throughput development of nanomaterial-based DDSs for specific purposes is far from routine practice and is still in its infancy, mainly because of the limits on scientists' capacity to effectively acquire, analyze, manage, and comprehend complex and ever-growing sets of experimental data, which is vital for developing DDSs with a set of desired functionalities. At the same time, this task is feasible for data-driven approaches, high-throughput experimentation techniques, process automation, artificial intelligence (AI) technology, and machine learning (ML) approaches, which together are referred to as the Fourth Paradigm of scientific research. Therefore, integrating these approaches with nanomedicine and nanotechnology can potentially accelerate the rational design and high-throughput development of highly efficient nanoformulated drugs and smart materials with pre-defined functionalities. In this Review, we survey the important results and milestones achieved to date in the application of data science, high-throughput, and automation approaches, combined with AI and ML, to design and optimize DDSs and related nanomaterials. The aim of this manuscript is not only to reflect the state of the art in data-driven nanomedicine but also to show how recent findings in related fields can transform the image of nanomedicine. 
We discuss how all these results can be used to boost nanomedicine translation to the clinic, as well as highlight the future directions for the development, data-driven, high throughput experimentation-, and AI-assisted design, as well as the production of nanoformulated drugs and smart materials with pre-defined properties and behavior. This Review will be of high interest to the chemists involved in materials science, nanotechnology, and DDSs development for biomedical applications, although the general nature of the presented approaches enables knowledge translation to many other fields of science.

PTML Modeling for Pancreatic Cancer Research: In Silico Design of Simultaneous Multi-Protein and Multi-Cell Inhibitors. Biomedicines 2022;10:biomedicines10020491. [PMID: 35203699 PMCID: PMC8962338 DOI: 10.3390/biomedicines10020491] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Received: 01/29/2022] [Revised: 02/10/2022] [Accepted: 02/15/2022] [Indexed: 02/07/2023]

Smart materials: rational design in biosystems via artificial intelligence. Trends Biotechnol 2022;40:987-1003. [DOI: 10.1016/j.tibtech.2022.01.005] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Received: 11/01/2021] [Revised: 01/09/2022] [Accepted: 01/10/2022] [Indexed: 12/12/2022]

Quevedo-Tumailli V, Ortega-Tenezaca B, González-Díaz H. IFPTML Mapping of Drug Graphs with Protein and Chromosome Structural Networks vs. Pre-Clinical Assay Information for Discovery of Antimalarial Compounds. Int J Mol Sci 2021;22:ijms222313066. [PMID: 34884870 PMCID: PMC8657696 DOI: 10.3390/ijms222313066] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 10/18/2021] [Revised: 11/23/2021] [Accepted: 11/24/2021] [Indexed: 11/16/2022]

Abstract

Parasite species of the genus Plasmodium cause malaria, which remains a major global health problem because of parasite resistance to available antimalarial drugs and increasing treatment costs. Consequently, computational prediction of new antimalarial compounds with novel targets in the proteome of Plasmodium sp. is a very important goal for the pharmaceutical industry. We can expect that the success of a pre-clinical assay depends on the conditions of the assay per se, the chemical structure of the drug, the structure of the target protein, as well as on factors governing the expression of this protein in the proteome, such as gene (deoxyribonucleic acid, DNA) sequence and/or chromosome structure. However, there are no reports of computational models that consider all these factors simultaneously. Some of the difficulties for this kind of analysis are the dispersion of data in different datasets, the high heterogeneity of data, etc. In this work, we analyzed three databases to achieve this goal: ChEMBL (Chemical database of the European Molecular Biology Laboratory), UniProt (Universal Protein Resource), and NCBI-GDV (National Center for Biotechnology Information, Genome Data Viewer). The ChEMBL dataset contains outcomes for 17,758 unique assays of potential antimalarial compounds, including numeric descriptors (variables) for the structure of compounds as well as a large amount of information about the conditions of assays. The NCBI-GDV and UniProt datasets include the sequences of genes, proteins, and their functions. In addition, we created two partitions (c_assayj = c_aj and c_dataj = c_dj) of categorical variables from the ChEMBL dataset. These partitions contain variables that encode information about experimental conditions of preclinical assays (c_aj) or about the nature and quality of data (c_dj). These categorical variables include information about 22 parameters of biological activity (c_a0), 28 target proteins (c_a1), and 9 organisms of assay (c_a2), etc. We also created another partition (c_protj = c_pj) including categorical variables with biological information about the target proteins, genes, and chromosomes. These variables cover 32 genes (c_p0), 10 chromosomes (c_p1), gene orientation (c_p2), and 31 protein functions (c_p3). We used a Perturbation-Theory Machine Learning Information Fusion (IFPTML) algorithm to fuse all this information (from the three databases) and train a predictive model. Shannon's entropy measures Sh_k (numerical variables) were used to quantify the information about the structure of drugs, protein sequences, gene sequences, and chromosomes on the same information scale. Perturbation Theory Operators (PTOs) in the form of Moving Average (MA) operators were used to quantify perturbations (deviations) in the structural variables with respect to their expected values for different subsets (partitions) of categorical variables. We obtained three IFPTML models using General Discriminant Analysis (GDA), Classification Tree with Univariate Splits (CTUS), and Classification Tree with Linear Combinations (CTLC). The IFPTML-CTLC model showed the best performance, with Sensitivity Sn(%) = 83.6/85.1 and Specificity Sp(%) = 89.8/89.7 for training/validation sets, respectively. This model could become a useful tool for the optimization of preclinical assays of new antimalarial compounds vs. different proteins in the proteome of Plasmodium.
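
The abstract describes Moving Average (MA) operators as deviations of a structural variable from its expected value within a subset of cases that share a categorical condition. A minimal Python sketch of that idea follows. It assumes the expected value is the plain arithmetic mean over cases with the same label; the function name and the toy values are hypothetical and are not taken from the paper or its software.

```python
from collections import defaultdict


def moving_average_deviations(values, labels):
    """For each case, return value - mean(values of cases sharing its label).

    Assumption: the 'expected value' of a descriptor under a condition is
    the arithmetic mean over all cases with that condition.
    """
    sums = defaultdict(float)
    counts = defaultdict(int)
    for v, c in zip(values, labels):
        sums[c] += v
        counts[c] += 1
    means = {c: sums[c] / counts[c] for c in sums}
    return [v - means[c] for v, c in zip(values, labels)]


# Toy data, invented for illustration: one entropy-like descriptor per case
# and one categorical assay condition (e.g. the activity parameter c_a0).
sh = [2.0, 4.0, 3.0, 5.0]
c_a0 = ["IC50", "IC50", "Ki", "Ki"]
dev = moving_average_deviations(sh, c_a0)
print(dev)  # [-1.0, 1.0, -1.0, 1.0]
```

In this sketch each case is expressed relative to its own assay condition, so cases measured under different conditions share a common scale before being passed to a classifier.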

Diéguez-Santana K, González-Díaz H. Towards machine learning discovery of dual antibacterial drug-nanoparticle systems. NANOSCALE 2021;13:17854-17870. [PMID: 34671801 DOI: 10.1039/d1nr04178a] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Indexed: 06/13/2023]

Abstract

Artificial Intelligence/Machine Learning (AI/ML) algorithms may speed up the design of DADNP systems formed by Antibacterial Drugs (AD) and Nanoparticles (NP). In this work, we used the IFPTML = Information Fusion (IF) + Perturbation-Theory (PT) + Machine Learning (ML) algorithm for the first time to study a large dataset of putative DADNP systems composed of >165 000 ChEMBL AD assays and 300 NP assays vs. multiple bacteria species. We trained alternative models with Linear Discriminant Analysis (LDA), Artificial Neural Networks (ANN), Bayesian Networks (BNN), K-Nearest Neighbour (KNN), and other algorithms. The IFPTML-LDA model was simpler, with values of Sp ≈ 90% and Sn ≈ 74% in both training (>124 K cases) and validation (>41 K cases) series. The IFPTML-ANN and KNN models are notably more complicated, although they are more balanced, with Sn ≈ Sp ≈ 88.5%-99.0% and AUROC ≈ 0.94-0.99 in both series. We also carried out a simulation (>1900 calculations) of the expected behavior of putative DADNPs in 72 different biological assays. The putative DADNPs studied are formed by 27 different drugs with multiple classes of NP and types of coats. In addition, we tested the validity of our additive model with 80 DADNP complexes that were experimentally synthesized and biologically tested (reported in >45 papers). All these DADNPs show values of MIC < 50 μg mL^-1 (cutoff used), better than the MIC of the AD and NP alone (synergistic or additive effect). The assays involve DADNP complexes with 10 types of NP, 6 coating materials, and an NP size range of 5-100 nm vs. 15 different antibiotics and 12 bacteria species. The IFPTML-LDA model correctly classified 100% (80 out of 80) of the DADNP complexes as biologically active. The IFPTML additive strategy may become a useful tool to assist the design of DADNP systems for antibacterial therapy, taking into consideration only information about the AD and NP components separately.

Prediction of Anti-Glioblastoma Drug-Decorated Nanoparticle Delivery Systems Using Molecular Descriptors and Machine Learning. Int J Mol Sci 2021;22:ijms222111519. [PMID: 34768951 PMCID: PMC8584266 DOI: 10.3390/ijms222111519] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 09/14/2021] [Revised: 10/08/2021] [Accepted: 10/22/2021] [Indexed: 12/22/2022]

Gomes SIL, Amorim MJB, Pokhrel S, Mädler L, Fasano M, Chiavazzo E, Asinari P, Jänes J, Tämm K, Burk J, Scott-Fordsmand JJ. Machine learning and materials modelling interpretation of in vivo toxicological response to TiO₂ nanoparticles library (UV and non-UV exposure). NANOSCALE 2021;13:14666-14678. [PMID: 34533558 DOI: 10.1039/d1nr03231c] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Indexed: 05/24/2023]

Abstract

Assessing the risks of nanomaterials/nanoparticles (NMs/NPs) under various environmental conditions requires a more systematic approach, including the comparison of effects across many NMs with different but related identified characteristics/descriptors. Hence, there is an urgent need to provide coherent (eco)toxicological datasets containing comprehensive toxicity information relating to a diverse spectrum of NP characteristics. These datasets are test benches for developing holistic methodologies with broader applicability. In the present study we assessed the effects of a custom-designed Fe-doped TiO₂ NP library, using the soil invertebrate Enchytraeus crypticus (Oligochaeta), via a 5-day pulse aqueous exposure followed by a 21-day recovery period in soil (survival, reproduction assessment). When testing TiO₂, realistic conditions should include UV exposure. The library of 11 Fe-TiO₂ NPs covers a size range of 5-27 nm with varying %Fe (enabling the photoactivation of TiO₂ at wavelengths in the visible-light range). The NPs were each described by 122 descriptors, a mixture of measured and atomistic model descriptors. The data were explored using single and univariate statistical methods, combined with machine learning and multiscale modelling techniques. An iterative pruning process was adopted to automatically identify the most significant descriptors. TiO₂ NP toxicity decreased when combined with UV. Notably, the short-term water exposure induced lasting biological responses even after longer-term recovery in clean exposure. The correspondence with Fe content correlated with the band gap, hence the reduction of UV oxidative stress. The inclusion of both measured and modelled materials data benefitted the explanation of the results when combined with machine learning.

Computational Drug Repurposing for Antituberculosis Therapy: Discovery of Multi-Strain Inhibitors. Antibiotics (Basel) 2021;10:antibiotics10081005. [PMID: 34439055 PMCID: PMC8388932 DOI: 10.3390/antibiotics10081005] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Received: 08/01/2021] [Revised: 08/15/2021] [Accepted: 08/17/2021] [Indexed: 12/13/2022]

Zhang H, Barnard AS. Impact of atomistic or crystallographic descriptors for classification of gold nanoparticles. NANOSCALE 2021;13:11887-11898. [PMID: 34190263 DOI: 10.1039/d1nr02258j] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Indexed: 06/13/2023]

Ortega-Tenezaca B, González-Díaz H. IFPTML mapping of nanoparticle antibacterial activity vs. pathogen metabolic networks. NANOSCALE 2021;13:1318-1330. [PMID: 33410431 DOI: 10.1039/d0nr07588d] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Indexed: 06/12/2023]

Abstract

Nanoparticles are useful antimicrobial drug-release systems, but some nanoparticles also exhibit antibacterial activity. However, investigation of their antibacterial activity is a difficult and slow process due to the numerous combinations of nanoparticle size, shape, and composition vs. biological tests, assay organisms, and multiple activity parameters to be measured. Additionally, the overuse of antibiotics has led to the emergence of resistant bacterial strains with different metabolic networks. Computational models may speed up this process, but the models reported to date do not consider all of these factors, and the data sources are dispersed and not curated. Thus, herein, we used an information fusion, perturbation-theory machine learning (IFPTML) approach, which we introduce here for the first time, to fit a model for the discovery of antibacterial nanoparticles. The dataset studied had 15 classes of nanoparticles (1-100 nm), with most cases in the range of 1-50 nm, vs. >20 pathogenic bacteria species with different metabolic networks. The nanoparticles studied included metal nanoparticles of Au, Ag, and Cu; oxide nanoparticles of Zn, Cu, La, Al, Fe, Sn, Ti, Cd, and Si; and metal salt nanoparticles of CuI and CdS. We used the SOFT.PTML software (our own application) with a user-friendly interface for the IFPTML calculations and a control statistics package. Using SOFT.PTML, we found a linear logistic regression equation that could model 4 biological activity parameters using only 8 variables with χ² = 2265.75, p-level <0.05, sensitivity, Sn = 79.4, and specificity, Sp = 99.3, for 3213 cases (nanoparticle-bacteria pairs) in the training series. The model had Sn = 80.8 and Sp = 99.3 for 2114 cases in the external validation series. We also developed a random forest non-linear model with higher values of Sn and Sp = 98-99% in the training/validation series, although it was more complicated to use. 
SOFT.PTML has been demonstrated to be a useful tool for the analysis of complex data in nanotechnology. We also introduced a new anabolism-catabolism unbalance index of metabolic networks to reveal the biological connotation of the IFPTML predictions for antibacterial nanoparticles. These new models open a new door for the discovery of NPs vs. new bacterial species and strains with different topological structures of their metabolic networks.

Chan C, Du S, Dong Y, Cheng X. Computational and Experimental Approaches to Investigate Lipid Nanoparticles as Drug and Gene Delivery Systems. Curr Top Med Chem 2021;21:92-114. [PMID: 33243123 PMCID: PMC8191596 DOI: 10.2174/1568026620666201126162945] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Received: 08/01/2020] [Revised: 10/16/2020] [Accepted: 10/22/2020] [Indexed: 02/06/2023]

Santana R, Zuluaga R, Gañán P, Arrasate S, Onieva E, Montemore MM, González-Díaz H. PTML Model for Selection of Nanoparticles, Anticancer Drugs, and Vitamins in the Design of Drug-Vitamin Nanoparticle Release Systems for Cancer Cotherapy. Mol Pharm 2020;17:2612-2627. [PMID: 32459098 DOI: 10.1021/acs.molpharmaceut.0c00308] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Indexed: 01/08/2023]

Abstract

Nanosystems are gaining momentum in pharmaceutical sciences because of the wide variety of possibilities for designing these systems to have specific functions. Specifically, studies of new cancer cotherapy drug-vitamin release nanosystems (DVRNs) including anticancer compounds and vitamins or vitamin derivatives have revealed encouraging results. However, the number of possible combinations of design and synthesis conditions is remarkably high. In addition, a large number of anticancer compounds and vitamin derivatives have already been assayed, but notably fewer DVRNs have been assayed as a whole (with the anticancer compound and the vitamin linked to them). Our approach uses a perturbation theory and machine learning (PTML) model to predict the probability of obtaining an interesting DVRN by changing the anticancer compound and/or the vitamin present in a DVRN that is already tested for other anticancer compounds or vitamins that have not been tested yet as part of a DVRN. In a previous work, we built a linear PTML model useful for the design of these nanosystems. In doing so, we used information fusion (IF) techniques to carry out data enrichment of DVRN data compiled from the literature with the data for preclinical assays of vitamins from the ChEMBL database. The design features of DVRNs and the assay conditions of nanoparticles (NPs) and vitamins were included as multiplicative PT operators (PTOs) to the system, which indicates the importance of these variables. However, the previous work omitted experiments with nonlinear ML techniques and different types of PTOs such as metric-based PTOs. More importantly, the previous work did not consider the structure of the anticancer drug to be included in the new DVRNs. In this work, we address three main objectives (tasks). In the first task, we found a new model, alternative to the one published before, for the rational design of DVRNs using metric-based PTOs. 
The most accurate PTML model was the artificial neural network model, which showed values of specificity, sensitivity, and accuracy in the range of 90-95% in training and external validation series for more than 130,000 cases (DVRNs vs ChEMBL assays). Furthermore, in the second task, we used IF techniques to carry out data enrichment of our previous data set. In doing so, we constructed a new working data set of >970,000 cases with the data of preclinical assays of DVRNs, vitamins, and anticancer compounds from the ChEMBL database. All these assays have multiple continuous variables or descriptors d_k and categorical variables c_j (conditions of the assays) for drugs (d_ack, c_acj), vitamins (d_vk, c_vj), and NPs (d_nk, c_nj). These data include >20,000 potential anticancer compounds with >270 protein targets (c_ac1), >580 assay cell organisms (c_ac2), and so forth. Furthermore, we include >36,000 assay vitamin derivatives in >6200 types of cells (c_2vit), >120 assay organisms (c_3vit), >60 assay strains (c_4vit), and so forth. The enriched data set also contains >20 types of DVRNs (c_5n) with 9 NP core materials (c_4n), 8 synthesis methods (c_7n), and so forth. We expressed all this information with PTOs and developed a qualitatively new PTML model that incorporates information of the anticancer drugs. This new model presents 96-97% of accuracy for training and external validation subsets. In the last task, we carried out a comparative study of ML and/or PTML models published and described how the models we are presenting cover the gap of knowledge in terms of drug delivery. In conclusion, we present here for the first time a multipurpose PTML model that is able to select NPs, anticancer compounds, and vitamins and their conditions of assay for DVRN design.

Santana R, Zuluaga R, Gañán P, Arrasate S, Onieva E, González-Díaz H. Predicting coated-nanoparticle drug release systems with perturbation-theory machine learning (PTML) models. NANOSCALE 2020;12:13471-13483. [PMID: 32613998 DOI: 10.1039/d0nr01849j] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Indexed: 06/11/2023]

Abstract

Nanoparticles (NPs) decorated with coating agents (polymers, gels, proteins, etc.) form Nanoparticle Drug Delivery Systems (DDNS), which are of high interest in nanotechnology and biomaterials science. There have been increasing reports of experimental data sets of biological activity, toxicity, and delivery properties of DDNS. However, these data sets are still dispersed and not as large as the datasets of DDNS components (NP and drugs). This has prompted researchers to train Machine Learning (ML) algorithms that are able to design new DDNS based on the properties of their components. However, most ML models reported to date predict the specific activities of NP or drugs against a given target or cell line. In this paper, we combine Perturbation Theory and Machine Learning (PTML algorithm) to train a model that is able to predict the best components (NP, coating agent, and drug) for DDNS design. In so doing, we downloaded a dataset of >30 000 preclinical assays of drugs from ChEMBL. We also downloaded an NP data set formed by preclinical assays of coated Metal Oxide Nanoparticles (MONPs) from public sources. Both the drug and NP datasets of preclinical assays cover multiple conditions of assays that can be listed as two arrays, namely, cjdrug and cjNP. The cjdrug array includes >504 biological activity parameters (c0drug), >340 target proteins (c1drug), >650 types of cells (c2drug), >120 assay organisms (c3drug), and >60 assay strains (c4drug). On the other hand, the cjNP array includes 3 biological activity parameters (c0NP), 40 types of proteins (c1NP), 10 shapes of nanoparticles (c2NP), 6 assay media (c3NP), and 12 coating agents (c4NP). After downloading, we pre-processed the two data sets separately, calculating PT operators that account for changes (perturbations) in the drug, coating agent, and NP chemical structure and/or physicochemical properties as well as for the assay conditions. Next, we carried out an information fusion process to form a final dataset of more than 500 000 DDNS (drug + MONP pairs). We also trained other linear and non-linear PTML models using R studio scripts for comparative purposes. To the best of our knowledge, this is the first multi-label PTML model that is useful for the selection of drugs, coating agents, and metal or metal-oxide nanoparticles to be assembled in order to design new DDNS with optimal activity/toxicity profiles.

Halder AK, Melo A, Cordeiro MNDS. A unified in silico model based on perturbation theory for assessing the genotoxicity of metal oxide nanoparticles. CHEMOSPHERE 2020;244:125489. [PMID: 31812055 DOI: 10.1016/j.chemosphere.2019.125489] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Received: 08/28/2019] [Revised: 11/19/2019] [Accepted: 11/26/2019] [Indexed: 06/10/2023]

Carracedo-Reboredo P, Corona R, Martinez-Nunes M, Fernandez-Lozano C, Tsiliki G, Sarimveis H, Aranzamendi E, Arrasate S, Sotomayor N, Lete E, Munteanu CR, González-Díaz H. MCDCalc: Markov Chain Molecular Descriptors Calculator for Medicinal Chemistry. Curr Top Med Chem 2019;20:305-317. [PMID: 31878856 DOI: 10.2174/1568026620666191226092431] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2019] [Revised: 09/17/2019] [Accepted: 09/17/2019] [Indexed: 11/22/2022]

Abstract

AIMS

Cheminformatics models are able to predict different outputs (activity, property, chemical reactivity) in single molecules or complex molecular systems (catalyzed organic synthesis, metabolic reactions, nanoparticles, etc.).

BACKGROUND

OBJECTIVE

Cheminformatics prediction of complex catalytic enantioselective reactions is a major goal in organic synthesis research and the chemical industry. Markov Chain Molecular Descriptors (MCDs) have been widely used to solve cheminformatics problems. There are different types of Markov chain descriptors, such as Markov-Shannon entropies (Shk), Markov means (Mk), and Markov moments (πk). However, there are other possible MCDs that have not been used before. In addition, MCDs are often calculated using specific software that is not always available to general users, and there is no publicly available R library for calculating MCDs. This limits the availability of MCD-based cheminformatics procedures.

METHODS

We studied the enantiomeric excess ee(%)[Rcat] for 324 α-amidoalkylation reactions. These reactions have a complex mechanism that depends on various factors. The model includes MCDs of the substrate, solvent, chiral catalyst, and product, along with values of reaction time, temperature, catalyst load, etc. We tested several Machine Learning regression algorithms. The Random Forest regression model achieved R2 > 0.90 in both training and test. Secondly, the biological activity of 5644 compounds against colorectal cancer was studied.

RESULTS

We developed a model able to predict the cases of preclinical assays with specificity and sensitivity of 70-82% in both training and validation series.

CONCLUSION

The work shows the potential of the new tool for computational studies in organic and medicinal chemistry.
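For readers unfamiliar with the sensitivity and specificity figures quoted in the results, the following minimal Python sketch shows how the two values are computed from binary labels. The function name and the toy labels are hypothetical; this is not code from MCDCalc or from the paper.

```python
def sensitivity_specificity(y_true, y_pred):
    """Return (sensitivity, specificity) for binary labels coded as 0/1."""
    tp = fn = tn = fp = 0
    for truth, pred in zip(y_true, y_pred):
        if truth == 1:
            if pred == 1:
                tp += 1  # active case predicted active
            else:
                fn += 1  # active case missed
        else:
            if pred == 0:
                tn += 1  # inactive case predicted inactive
            else:
                fp += 1  # inactive case predicted active
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one positive and one negative case")
    return tp / (tp + fn), tn / (tn + fp)


# Toy, made-up labels (not data from the paper).
sn, sp = sensitivity_specificity([1, 1, 1, 1, 0, 0], [1, 1, 1, 0, 0, 1])
```

Computing both values separately on the training and validation series, as the abstract reports, helps reveal overfitting that a single pooled figure would hide.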

Affiliation(s)

Paula Carracedo-Reboredo Department of Computer Science and Information Technologies, Faculty of Computer Science, University of A Coruña, CITIC, Campus Elviña s/n, 15071, A Coruña, Spain; Group of Artificial Neural Networks and Adaptative Systems, Medical Imaging, and Diagnostic Radiology (RNASA-IMEDIR), Institute of Biomedical Research of Coruna (INIBIC), Hospital Complex of University of A Coruna (CHUAC), Sergas, University of Coruna (UDC), Xubias de arriba 84, 15006, A Coruna, Spain; Department of Organic Chemistry II, University of the Basque Country UPV/EHU, 48940, Leioa, Bilbao, Spain
Ramiro Corona Department of Organic Chemistry II, University of the Basque Country UPV/EHU, 48940, Leioa, Bilbao, Spain
Mikel Martinez-Nunes Department of Organic Chemistry II, University of the Basque Country UPV/EHU, 48940, Leioa, Bilbao, Spain
Carlos Fernandez-Lozano Department of Computer Science and Information Technologies, Faculty of Computer Science, University of A Coruña, CITIC, Campus Elviña s/n, 15071, A Coruña, Spain; Group of Artificial Neural Networks and Adaptative Systems, Medical Imaging, and Diagnostic Radiology (RNASA-IMEDIR), Institute of Biomedical Research of Coruna (INIBIC), Hospital Complex of University of A Coruna (CHUAC), Sergas, University of Coruna (UDC), Xubias de arriba 84, 15006, A Coruna, Spain
Georgia Tsiliki Institute for the Management of Information Systems, ATHENA Research and Innovation Centre, 15125, Athens, Greece
Haralambos Sarimveis School of Chemical Engineering, National Technical University of Athens, Zografou, Campus, 15780, Athens, Greece; Pharma-Informatics Unit, ATHENA Research and Innovation Centre, 15125, Athens, Greece
Eider Aranzamendi Department of Organic Chemistry II, University of the Basque Country UPV/EHU, 48940, Leioa, Bilbao, Spain
Sonia Arrasate Department of Organic Chemistry II, University of the Basque Country UPV/EHU, 48940, Leioa, Bilbao, Spain
Nuria Sotomayor Group of Artificial Neural Networks and Adaptative Systems, Medical Imaging, and Diagnostic Radiology (RNASA-IMEDIR), Institute of Biomedical Research of Coruna (INIBIC), Hospital Complex of University of A Coruna (CHUAC), Sergas, University of Coruna (UDC), Xubias de arriba 84, 15006, A Coruna, Spain
Esther Lete Department of Organic Chemistry II, University of the Basque Country UPV/EHU, 48940, Leioa, Bilbao, Spain
Cristian Robert Munteanu Department of Computer Science and Information Technologies, Faculty of Computer Science, University of A Coruña, CITIC, Campus Elviña s/n, 15071, A Coruña, Spain; Group of Artificial Neural Networks and Adaptative Systems, Medical Imaging, and Diagnostic Radiology (RNASA-IMEDIR), Institute of Biomedical Research of Coruna (INIBIC), Hospital Complex of University of A Coruna (CHUAC), Sergas, University of Coruna (UDC), Xubias de arriba 84, 15006, A Coruna, Spain
Humbert González-Díaz Basque Center for Biophysics, University of the Basque Country UPV/EHU, 48940, Leioa, Bilbao, Spain; IKERBASQUE, Basque Foundation for Science, 48011, Bilbao, Spain
