1
|
Toropova AP, Meneses J, Alfaro-Moreno E, Toropov AA. The system of self-consistent models based on quasi-SMILES as a tool to predict the potential of nano-inhibitors of human lung carcinoma cell line A549 for different experimental conditions. Drug Chem Toxicol 2024; 47:306-313. [PMID: 36744523 DOI: 10.1080/01480545.2023.2174986] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2022] [Revised: 09/28/2022] [Accepted: 10/11/2022] [Indexed: 02/07/2023]
Abstract
The different features of the impact of nanoparticles on cells, such as the structure of the core, presence/absence of doping, quality of surface, diameter, and dose, were used to define quasi-SMILES, a line of symbols encoded the above physicochemical features of the impact of nanoparticles. The correlation weight for each code in the quasi-SMILES has been calculated by the Monte Carlo method. The descriptor, which is the sum of the correlation weights, is the basis for a one-variable model of the biological activity of nano-inhibitors of human lung carcinoma cell line A549. The system of models obtained by the above scheme was checked on the self-consistence, i.e., reproducing the statistical quality of these models observed for different distributions of available nanomaterials into the training and validation sets. The computational experiments confirm the excellent potential of the approach as a tool to predict the impact of nanomaterials under different experimental conditions. In conclusion, our model is a self-consistent model system that provides a user to assess the reliability of the statistical quality of the used approach.
Collapse
Affiliation(s)
- Alla P Toropova
- Department of Environmental Health Science, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milano, Italy
| | - João Meneses
- Nanosafety Group, International Iberian Nanotechnology Laboratory, Braga, Portugal
| | | | - Andrey A Toropov
- Department of Environmental Health Science, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milano, Italy
| |
Collapse
|
2
|
Furxhi I, Willighagen E, Evelo C, Costa A, Gardini D, Ammar A. A data reusability assessment in the nanosafety domain based on the NSDRA framework followed by an exploratory quantitative structure activity relationships (QSAR) modeling targeting cellular viability. NANOIMPACT 2023; 31:100475. [PMID: 37423508 DOI: 10.1016/j.impact.2023.100475] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/28/2023] [Revised: 07/03/2023] [Accepted: 07/04/2023] [Indexed: 07/11/2023]
Abstract
INTRODUCTION The current effort towards the digital transformation across multiple scientific domains requires data that is Findable, Accessible, Interoperable and Reusable (FAIR). In addition to the FAIR data, what is required for the application of computational tools, such as Quantitative Structure Activity Relationships (QSARs), is a sufficient data volume and the ability to merge sources into homogeneous digital assets. In the nanosafety domain there is a lack of FAIR available metadata. METHODOLOGY To address this challenge, we utilized 34 datasets from the nanosafety domain by exploiting the NanoSafety Data Reusability Assessment (NSDRA) framework, which allowed the annotation and assessment of dataset's reusability. From the framework's application results, eight datasets targeting the same endpoint (i.e. numerical cellular viability) were selected, processed and merged to test several hypothesis including universal versus nanogroup-specific QSAR models (metal oxide and nanotubes), and regression versus classification Machine Learning (ML) algorithms. RESULTS Universal regression and classification QSARs reached an 0.86 R2 and 0.92 accuracy, respectively, for the test set. Nanogroup-specific regression models reached 0.88 R2 for nanotubes test set followed by metal oxide (0.78). Nanogroup-specific classification models reached 0.99 accuracy for nanotubes test set, followed by metal oxide (0.91). Feature importance revealed different patterns depending on the dataset with common influential features including core size, exposure conditions and toxicological assay. Even in the case where the available experimental knowledge was merged, the models still failed to correctly predict the outputs of an unseen dataset, revealing the cumbersome conundrum of scientific reproducibility in realistic applications of QSAR for nanosafety. To harness the full potential of computational tools and ensure their long-term applications, embracing FAIR data practices is imperative in driving the development of responsible QSAR models. CONCLUSIONS This study reveals that the digitalization of nanosafety knowledge in a reproducible manner has a long way towards its successful pragmatic implementation. The workflow carried out in the study shows a promising approach to increase the FAIRness across all the elements of computational studies, from dataset's annotation, selection, merging to FAIR modeling reporting. This has significant implications for future research as it provides an example of how to utilize and report different tools available in the nanosafety knowledge system, while increasing the transparency of the results. One of the main benefits of this workflow is that it promotes data sharing and reuse, which is essential for advancing scientific knowledge by making data and metadata FAIR compliant. In addition, the increased transparency and reproducibility of the results can enhance the trustworthiness of the computational findings.
Collapse
Affiliation(s)
- Irini Furxhi
- Transgero Limited, Cullinagh, Newcastle West, Co. Limerick, Ireland; Dept. of Accounting and Finance, Kemmy Business School, University of Limerick, V94PH93, Ireland.
| | - Egon Willighagen
- Department of Bioinformatics - BiGCaT, NUTRIM, Maastricht University, the Netherlands.
| | - Chris Evelo
- Department of Bioinformatics - BiGCaT, NUTRIM, Maastricht University, the Netherlands.
| | - Anna Costa
- National Research Council, Institute of Science, Technology and Sustainability for Ceramics, Faenza, Italy.
| | - Davide Gardini
- National Research Council, Institute of Science, Technology and Sustainability for Ceramics, Faenza, Italy.
| | - Ammar Ammar
- Department of Bioinformatics - BiGCaT, NUTRIM, Maastricht University, the Netherlands.
| |
Collapse
|
3
|
Yan X, Yue T, Winkler DA, Yin Y, Zhu H, Jiang G, Yan B. Converting Nanotoxicity Data to Information Using Artificial Intelligence and Simulation. Chem Rev 2023. [PMID: 37262026 DOI: 10.1021/acs.chemrev.3c00070] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
Decades of nanotoxicology research have generated extensive and diverse data sets. However, data is not equal to information. The question is how to extract critical information buried in vast data streams. Here we show that artificial intelligence (AI) and molecular simulation play key roles in transforming nanotoxicity data into critical information, i.e., constructing the quantitative nanostructure (physicochemical properties)-toxicity relationships, and elucidating the toxicity-related molecular mechanisms. For AI and molecular simulation to realize their full impacts in this mission, several obstacles must be overcome. These include the paucity of high-quality nanomaterials (NMs) and standardized nanotoxicity data, the lack of model-friendly databases, the scarcity of specific and universal nanodescriptors, and the inability to simulate NMs at realistic spatial and temporal scales. This review provides a comprehensive and representative, but not exhaustive, summary of the current capability gaps and tools required to fill these formidable gaps. Specifically, we discuss the applications of AI and molecular simulation, which can address the large-scale data challenge for nanotoxicology research. The need for model-friendly nanotoxicity databases, powerful nanodescriptors, new modeling approaches, molecular mechanism analysis, and design of the next-generation NMs are also critically discussed. Finally, we provide a perspective on future trends and challenges.
Collapse
Affiliation(s)
- Xiliang Yan
- Institute of Environmental Research at the Greater Bay Area, Key Laboratory for Water Quality and Conservation of the Pearl River Delta, Ministry of Education, Guangzhou University, Guangzhou 510006, China
| | - Tongtao Yue
- Key Laboratory of Marine Environment and Ecology, Ministry of Education, Institute of Coastal Environmental Pollution Control, Ocean University of China, Qingdao 266100, China
| | - David A Winkler
- Monash Institute of Pharmaceutical Sciences, Monash University, Parkville, Victoria 3052, Australia
- School of Pharmacy, University of Nottingham, Nottingham NG7 2QL, U.K
- Department of Biochemistry and Chemistry, La Trobe Institute for Molecular Science, La Trobe University, Melbourne, Victoria 3086, Australia
| | - Yongguang Yin
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, China
| | - Hao Zhu
- Department of Chemistry and Biochemistry, Rowan University, Glassboro, New Jersey 08028, United States
| | - Guibin Jiang
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, China
| | - Bing Yan
- Institute of Environmental Research at the Greater Bay Area, Key Laboratory for Water Quality and Conservation of the Pearl River Delta, Ministry of Education, Guangzhou University, Guangzhou 510006, China
| |
Collapse
|
4
|
Meneses J, González-Durruthy M, Fernandez-de-Gortari E, Toropova AP, Toropov AA, Alfaro-Moreno E. A Nano-QSTR model to predict nano-cytotoxicity: an approach using human lung cells data. Part Fibre Toxicol 2023; 20:21. [PMID: 37211608 DOI: 10.1186/s12989-023-00530-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2023] [Accepted: 05/01/2023] [Indexed: 05/23/2023] Open
Abstract
BACKGROUND The widespread use of new engineered nanomaterials (ENMs) in industries such as cosmetics, electronics, and diagnostic nanodevices, has been revolutionizing our society. However, emerging studies suggest that ENMs present potentially toxic effects on the human lung. In this regard, we developed a machine learning (ML) nano-quantitative-structure-toxicity relationship (QSTR) model to predict the potential human lung nano-cytotoxicity induced by exposure to ENMs based on metal oxide nanoparticles. RESULTS Tree-based learning algorithms (e.g., decision tree (DT), random forest (RF), and extra-trees (ET)) were able to predict ENMs' cytotoxic risk in an efficient, robust, and interpretable way. The best-ranked ET nano-QSTR model showed excellent statistical performance with R2 and Q2-based metrics of 0.95, 0.80, and 0.79 for training, internal validation, and external validation subsets, respectively. Several nano-descriptors linked to the core-type and surface coating reactivity properties were identified as the most relevant characteristics to predict human lung nano-cytotoxicity. CONCLUSIONS The proposed model suggests that a decrease in the ENMs diameter could significantly increase their potential ability to access lung subcellular compartments (e.g., mitochondria and nuclei), promoting strong nano-cytotoxicity and epithelial barrier dysfunction. Additionally, the presence of polyethylene glycol (PEG) as a surface coating could prevent the potential release of cytotoxic metal ions, promoting lung cytoprotection. Overall, the current work could pave the way for efficient decision-making, prediction, and mitigation of the potential occupational and environmental ENMs risks.
Collapse
Affiliation(s)
- João Meneses
- NanoSafety Group, International Iberian Nanotechnology Laboratory, Braga, 4715-330, Portugal
| | | | | | - Alla P Toropova
- Instituto di Ricerche Farmacologiche Mario Negri IRCCS, Via Mario Negri 2, Milano, 20156, Italy
| | - Andrey A Toropov
- Instituto di Ricerche Farmacologiche Mario Negri IRCCS, Via Mario Negri 2, Milano, 20156, Italy
| | - Ernesto Alfaro-Moreno
- NanoSafety Group, International Iberian Nanotechnology Laboratory, Braga, 4715-330, Portugal.
| |
Collapse
|
5
|
Li J, Yue L, Zhao Q, Cao X, Tang W, Chen F, Wang C, Wang Z. Prediction models on biomass and yield of rice affected by metal (oxide) nanoparticles using nano-specific descriptors. NANOIMPACT 2022; 28:100429. [PMID: 36130713 DOI: 10.1016/j.impact.2022.100429] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/25/2022] [Revised: 09/12/2022] [Accepted: 09/14/2022] [Indexed: 06/15/2023]
Abstract
The use of in silico tools to investigate the interactions between metal (oxide) nanoparticles (NPs) and plant biological responses is preferred because it allows us to understand molecular mechanisms and improve prediction efficiency by saving time, labor, and cost. In this study, four models (C5.0 decision tree, discriminant function analysis, random forest, and stepwise multiple linear regression analysis) were applied to predict the effect of NPs on rice biomass and yield. Nano-specific descriptors (size-dependent molecular descriptors and image-based descriptors) were introduced to estimate the behavior of NPs in plants to appropriately represent the wide space of NPs. The results showed that size-dependent molecular descriptors (e.g., E-state and connectivity indices) and image-based descriptors (e.g., extension, area, and minimum ferret diameter) were associated with the behavior of NPs in rice. The performance of the constructed models was within acceptable ranges (correlation coefficient ranged from 0.752 to 0.847 for biomass and from 0.803 to 0.905 for yield, while the accuracy ranged from 64% to 77% for biomass and 81% to 89% for yield). The developed model can be used to quickly and efficiently evaluate the impact of NPs under a wide range of experimental conditions and sufficient training data.
Collapse
Affiliation(s)
- Jing Li
- Institute of Environmental Processotes and Pollution Control, and School of Environment and Civil Engineering, Jiangnan University, Wuxi, Jiangsu 214122, China; Jiangsu Engineering Laboratory for Biomass Energy and Carbon Reduction Technology, Jiangnan University, Wuxi, Jiangsu 214122, China; Jiangsu Key Laboratory of Anaerobic Biotechnology, Jiangnan University, Wuxi, Jiangsu 214122, China
| | - Le Yue
- Institute of Environmental Processotes and Pollution Control, and School of Environment and Civil Engineering, Jiangnan University, Wuxi, Jiangsu 214122, China; Jiangsu Engineering Laboratory for Biomass Energy and Carbon Reduction Technology, Jiangnan University, Wuxi, Jiangsu 214122, China; Jiangsu Key Laboratory of Anaerobic Biotechnology, Jiangnan University, Wuxi, Jiangsu 214122, China
| | - Qing Zhao
- Guangdong Key Laboratory of Integrated Agro-environmental Pollution Control and Management, Institute of Eco-environmental and Soil Sciences, Guangdong Academy of Sciences, Guangzhou 510650, China; National-Regional Joint Engineering Research Center for Soil Pollution Control and Remediation in South China, Guangzhou 510650, China
| | - Xuesong Cao
- Institute of Environmental Processotes and Pollution Control, and School of Environment and Civil Engineering, Jiangnan University, Wuxi, Jiangsu 214122, China; Jiangsu Engineering Laboratory for Biomass Energy and Carbon Reduction Technology, Jiangnan University, Wuxi, Jiangsu 214122, China; Jiangsu Key Laboratory of Anaerobic Biotechnology, Jiangnan University, Wuxi, Jiangsu 214122, China
| | - Weihao Tang
- Guangdong Key Laboratory of Integrated Agro-environmental Pollution Control and Management, Institute of Eco-environmental and Soil Sciences, Guangdong Academy of Sciences, Guangzhou 510650, China; National-Regional Joint Engineering Research Center for Soil Pollution Control and Remediation in South China, Guangzhou 510650, China
| | - Feiran Chen
- Institute of Environmental Processotes and Pollution Control, and School of Environment and Civil Engineering, Jiangnan University, Wuxi, Jiangsu 214122, China; Jiangsu Engineering Laboratory for Biomass Energy and Carbon Reduction Technology, Jiangnan University, Wuxi, Jiangsu 214122, China; Jiangsu Key Laboratory of Anaerobic Biotechnology, Jiangnan University, Wuxi, Jiangsu 214122, China
| | - Chuanxi Wang
- Institute of Environmental Processotes and Pollution Control, and School of Environment and Civil Engineering, Jiangnan University, Wuxi, Jiangsu 214122, China; Jiangsu Engineering Laboratory for Biomass Energy and Carbon Reduction Technology, Jiangnan University, Wuxi, Jiangsu 214122, China; Jiangsu Key Laboratory of Anaerobic Biotechnology, Jiangnan University, Wuxi, Jiangsu 214122, China.
| | - Zhenyu Wang
- Institute of Environmental Processotes and Pollution Control, and School of Environment and Civil Engineering, Jiangnan University, Wuxi, Jiangsu 214122, China; Jiangsu Engineering Laboratory for Biomass Energy and Carbon Reduction Technology, Jiangnan University, Wuxi, Jiangsu 214122, China; Jiangsu Key Laboratory of Anaerobic Biotechnology, Jiangnan University, Wuxi, Jiangsu 214122, China
| |
Collapse
|
6
|
Furxhi I. Health and environmental safety of nanomaterials: O Data, Where Art Thou? NANOIMPACT 2022; 25:100378. [PMID: 35559884 DOI: 10.1016/j.impact.2021.100378] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/05/2021] [Revised: 12/15/2021] [Accepted: 12/17/2021] [Indexed: 06/15/2023]
Abstract
Nanotechnology keeps drawing attention due to the great tunable properties of nanomaterials in comparison to their bulk conventional materials. The growth of nanotechnology in combination with the digitization era has led to an increased need of safety related data. In addition to safety, new data-driven paradigms on safe and sustainable by design materials are stressing the necessity of data even more. Data is a fundamental asset to the scientific community in studying and analysing the entire life-cycle of nanomaterials. Unfortunately, data exist in a scattered fashion, in different sources and formats. To our knowledge, there is no study focusing on aspects of actual data-structure knowledge that exists in literature and databases. The purpose of this review research is to transparently and comprehensively, display to the nanoscience community the datasets readily available for machine learning purposes making it convenient and more efficient for the next users such as modellers or data curators to retrieve information. We systematically recorded the features and descriptors available in the datasets and provide synopsised information on their ranges, forms and metrics in the supplementary material.
Collapse
Affiliation(s)
- Irini Furxhi
- Transgero Limited, Cullinagh, Newcastle West, Co. Limerick, Ireland; Dept. of Accounting and Finance, Kemmy Business School, University of Limerick, V94PH93, Ireland.
| |
Collapse
|
7
|
Shin HK. Topological Distance-Based Electron Interaction Tensor to Apply a Convolutional Neural Network on Drug-like Compounds. ACS OMEGA 2021; 6:35757-35768. [PMID: 34984306 PMCID: PMC8717557 DOI: 10.1021/acsomega.1c05693] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/12/2021] [Accepted: 12/08/2021] [Indexed: 05/15/2023]
Abstract
Deep learning (DL) models in quantitative structure-activity relationship fed the molecular structure directly to the network without using human-designed descriptors by representing molecule as a graph or string (e.g., SMILES code). However, these two representations were oversimplification of real molecules to reflect chemical properties of molecular structures. Given that the choice of molecular representation determines the architecture of the DL model to apply, a novel way of molecular representation can open a way to apply diverse DL networks developed and used in other fields. A topological distance-based electron interaction (TDEi) tensor has been developed in this study inspired by the quantum mechanical model of the molecule, which defines a molecule with electrons and protons. In the TDEi tensor, the atomic orbital (AO) of each atom is represented by an electron configuration (EC) vector, which is a bit string based on the presence and absence of electrons in each AO according to spin indicated by positive and negative signs. Interactions between EC vectors were calculated based on the topological distance between atoms in a molecule. As a molecular structure was translated into 3D array, CNN models (modified VGGNet) were applied using a TDEi tensor to predict four physicochemical properties of drug-like compound datasets: MP (275,131), Lipop (4193), Esol (1127), and Freesolv (639). Models achieved good prediction accuracy. PCA showed that a stronger correlation was observed between the extracted features and the target endpoint as features were extracted from the deeper layer.
Collapse
Affiliation(s)
- Hyun Kil Shin
- Department
of Predictive Toxicology, Korea Institute
of Toxicology, Daejeon 34114, Republic of Korea
- Human
and Environmental Toxicology, University
of Science and Technology, Daejeon 34113, Republic of Korea
| |
Collapse
|
8
|
Robinson RLM, Sarimveis H, Doganis P, Jia X, Kotzabasaki M, Gousiadou C, Harper SL, Wilkins T. Identifying diverse metal oxide nanomaterials with lethal effects on embryonic zebrafish using machine learning. BEILSTEIN JOURNAL OF NANOTECHNOLOGY 2021; 12:1297-1325. [PMID: 34934606 PMCID: PMC8649207 DOI: 10.3762/bjnano.12.97] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/26/2021] [Accepted: 10/28/2021] [Indexed: 06/14/2023]
Abstract
Manufacturers of nanomaterial-enabled products need models of endpoints that are relevant to human safety to support the "safe by design" paradigm and avoid late-stage attrition. Increasingly, embryonic zebrafish (Danio Rerio) are recognised as a key human safety relevant in vivo test system. Hence, machine learning models were developed for identifying metal oxide nanomaterials causing lethality to embryonic zebrafish up to 24 hours post-fertilisation, or excess lethality in the period of 24-120 hours post-fertilisation, at concentrations of 250 ppm or less. Models were developed using data from the Nanomaterial Biological-Interactions Knowledgebase for a dataset of 44 diverse, coated and uncoated metal or, in one case, metalloid oxide nanomaterials. Different modelling approaches were evaluated using nested cross-validation on this dataset. Models were initially developed for both lethality endpoints using multiple descriptors representing the composition of the core, shell and surface functional groups, as well as particle characteristics. However, interestingly, the 24 hours post-fertilisation data were found to be harder to predict, which could reflect different exposure routes. Hence, subsequent analysis focused on the prediction of excess lethality at 120 hours-post fertilisation. The use of two data augmentation approaches, applied for the first time in nano-QSAR research, was explored, yet both failed to boost predictive performance. Interestingly, it was found that comparable results to those originally obtained using multiple descriptors could be obtained using a model based upon a single, simple descriptor: the Pauling electronegativity of the metal atom. Since it is widely recognised that a variety of intrinsic and extrinsic nanomaterial characteristics contribute to their toxicological effects, this is a surprising finding. This may partly reflect the need to investigate more sophisticated descriptors in future studies. Future studies are also required to examine how robust these modelling results are on truly external data, which were not used to select the single descriptor model. This will require further laboratory work to generate comparable data to those studied herein.
Collapse
Affiliation(s)
| | - Haralambos Sarimveis
- School of Chemical Engineering, National Technical University of Athens, 9 Heroon Polytechniou str. Zografou Campus, 15780 Athens, Greece
| | - Philip Doganis
- School of Chemical Engineering, National Technical University of Athens, 9 Heroon Polytechniou str. Zografou Campus, 15780 Athens, Greece
| | - Xiaodong Jia
- School of Chemical and Process Engineering, University of Leeds, Leeds, LS2 9JT, United Kingdom
| | - Marianna Kotzabasaki
- School of Chemical Engineering, National Technical University of Athens, 9 Heroon Polytechniou str. Zografou Campus, 15780 Athens, Greece
| | - Christiana Gousiadou
- School of Chemical Engineering, National Technical University of Athens, 9 Heroon Polytechniou str. Zografou Campus, 15780 Athens, Greece
| | - Stacey Lynn Harper
- School of Chemical, Biological, and Environmental Engineering, Oregon State University, Corvallis, Oregon, USA
- Department of Environmental and Molecular Toxicology, Oregon State University, Corvallis, Oregon, USA
- Oregon Nanoscience and Microtechnologies Institute, Eugene, Oregon, USA
| | - Terry Wilkins
- School of Chemical and Process Engineering, University of Leeds, Leeds, LS2 9JT, United Kingdom
| |
Collapse
|