Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Wu L, Huang R, Tetko IV, Xia Z, Xu J, Tong W. Trade-off Predictivity and Explainability for Machine-Learning Powered Predictive Toxicology: An in-Depth Investigation with Tox21 Data Sets. Chem Res Toxicol 2021;34:541-549. [PMID: 33513003 DOI: 10.1021/acs.chemrestox.0c00373] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

For:	Wu L, Huang R, Tetko IV, Xia Z, Xu J, Tong W. Trade-off Predictivity and Explainability for Machine-Learning Powered Predictive Toxicology: An in-Depth Investigation with Tox21 Data Sets. Chem Res Toxicol 2021;34:541-549. [PMID: 33513003 DOI: 10.1021/acs.chemrestox.0c00373] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Number

Cited by Other Article(s)

Wu L, Xu J, Tong W. PERform: assessing model performance with predictivity and explainability readiness formula. JOURNAL OF ENVIRONMENTAL SCIENCE AND HEALTH. PART C, TOXICOLOGY AND CARCINOGENESIS 2024:1-16. [PMID: 38619534 DOI: 10.1080/26896583.2024.2340391] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/16/2024]

Di Stefano M, Galati S, Piazza L, Granchi C, Mancini S, Fratini F, Macchia M, Poli G, Tuccinardi T. VenomPred 2.0: A Novel In Silico Platform for an Extended and Human Interpretable Toxicological Profiling of Small Molecules. J Chem Inf Model 2024;64:2275-2289. [PMID: 37676238 PMCID: PMC11005041 DOI: 10.1021/acs.jcim.3c00692] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2023] [Indexed: 09/08/2023]

Zubrod JP, Galic N, Vaugeois M, Dreier DA. Bio-QSARs 2.0: Unlocking a new level of predictive power for machine learning-based ecotoxicity predictions by exploiting chemical and biological information. ENVIRONMENT INTERNATIONAL 2024;186:108607. [PMID: 38593686 DOI: 10.1016/j.envint.2024.108607] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/19/2024] [Revised: 03/07/2024] [Accepted: 03/25/2024] [Indexed: 04/11/2024]

Abstract

Practical, legal, and ethical reasons necessitate the development of methods to replace animal experiments. Computational techniques to acquire information that traditionally relied on animal testing are considered a crucial pillar among these so-called new approach methodologies. In this light, we recently introduced the Bio-QSAR concept for multispecies aquatic toxicity regression tasks. These machine learning models, trained on both chemical and biological information, are capable of both cross-chemical and cross-species predictions. Here, we significantly extend these models' applicability. This was realized by increasing the quantity of training data by a factor of approximately 20, accomplished by considering both additional chemicals and aquatic organisms. Additionally, variable test durations and associated random effects were accommodated by employing a machine learning algorithm that combines tree-boosting with mixed-effects modeling (i.e., Gaussian Process Boosting). We also explored various biological descriptors including Dynamic Energy Budget model parameters, taxonomic distances, as well as genus-specific traits and investigated the inclusion of mode-of-action information. Through these efforts, we developed Bio-QSARs for fish and aquatic invertebrates with exceptional predictive power (R squared of up to 0.92 on independent test sets). Moreover, we made considerable strides to make models applicable for a range of use cases in environmental risk assessment as well as research and development of chemicals. Models were made fully explainable by implementing an algorithmic multicollinearity correction combined with SHapley Additive exPlanations. Furthermore, we devised novel approaches for applicability domain construction that take feature importance into account. We are hence confident these models, which are available via open access, will make a significant contribution towards the implementation of new approach methodologies and ultimately have the potential to support "Green Chemistry" and "Green Toxicology".

Collapse

Srithanyarat T, Taoma K, Sutthibutpong T, Ruengjitchatchawalya M, Liangruksa M, Laomettachit T. Interpreting drug synergy in breast cancer with deep learning using target-protein inhibition profiles. BioData Min 2024;17:8. [PMID: 38424554 PMCID: PMC10905801 DOI: 10.1186/s13040-024-00359-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2023] [Accepted: 02/23/2024] [Indexed: 03/02/2024] Open

Abstract

BACKGROUND

Breast cancer is the most common malignancy among women worldwide. Despite advances in treating breast cancer over the past decades, drug resistance and adverse effects remain challenging. Recent therapeutic progress has shifted toward using drug combinations for better treatment efficiency. However, with a growing number of potential small-molecule cancer inhibitors, in silico strategies to predict pharmacological synergy before experimental trials are required to compensate for time and cost restrictions. Many deep learning models have been previously proposed to predict the synergistic effects of drug combinations with high performance. However, these models heavily relied on a large number of drug chemical structural fingerprints as their main features, which made model interpretation a challenge.

RESULTS

This study developed a deep neural network model that predicts synergy between small-molecule pairs based on their inhibitory activities against 13 selected key proteins. The synergy prediction model achieved a Pearson correlation coefficient between model predictions and experimental data of 0.63 across five breast cancer cell lines. BT-549 and MCF-7 achieved the highest correlation of 0.67 when considering individual cell lines. Despite achieving a moderate correlation compared to previous deep learning models, our model offers a distinctive advantage in terms of interpretability. Using the inhibitory activities against key protein targets as the main features allowed a straightforward interpretation of the model since the individual features had direct biological meaning. By tracing the synergistic interactions of compounds through their target proteins, we gained insights into the patterns our model recognized as indicative of synergistic effects.

CONCLUSIONS

The framework employed in the present study lays the groundwork for future advancements, especially in model interpretation. By combining deep learning techniques and target-specific models, this study shed light on potential patterns of target-protein inhibition profiles that could be exploited in breast cancer treatment.

Collapse

Gurmessa DK, Jimma W. Explainable machine learning for breast cancer diagnosis from mammography and ultrasound images: a systematic review. BMJ Health Care Inform 2024;31:e100954. [PMID: 38307616 PMCID: PMC10840064 DOI: 10.1136/bmjhci-2023-100954] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2023] [Accepted: 01/21/2024] [Indexed: 02/04/2024] Open

Abstract

BACKGROUND

Breast cancer is the most common disease in women. Recently, explainable artificial intelligence (XAI) approaches have been dedicated to investigate breast cancer. An overwhelming study has been done on XAI for breast cancer. Therefore, this study aims to review an XAI for breast cancer diagnosis from mammography and ultrasound (US) images. We investigated how XAI methods for breast cancer diagnosis have been evaluated, the existing ethical challenges, research gaps, the XAI used and the relation between the accuracy and explainability of algorithms.

METHODS

In this work, Preferred Reporting Items for Systematic Reviews and Meta-Analyses checklist and diagram were used. Peer-reviewed articles and conference proceedings from PubMed, IEEE Explore, ScienceDirect, Scopus and Google Scholar databases were searched. There is no stated date limit to filter the papers. The papers were searched on 19 September 2023, using various combinations of the search terms 'breast cancer', 'explainable', 'interpretable', 'machine learning', 'artificial intelligence' and 'XAI'. Rayyan online platform detected duplicates, inclusion and exclusion of papers.

RESULTS

This study identified 14 primary studies employing XAI for breast cancer diagnosis from mammography and US images. Out of the selected 14 studies, only 1 research evaluated humans' confidence in using the XAI system-additionally, 92.86% of identified papers identified dataset and dataset-related issues as research gaps and future direction. The result showed that further research and evaluation are needed to determine the most effective XAI method for breast cancer.

CONCLUSION

XAI is not conceded to increase users' and doctors' trust in the system. For the real-world application, effective and systematic evaluation of its trustworthiness in this scenario is lacking.

PROSPERO REGISTRATION NUMBER

CRD42023458665.

Collapse

Li T, Liu Z, Thakkar S, Roberts R, Tong W. DeepAmes: A deep learning-powered Ames test predictive model with potential for regulatory application. Regul Toxicol Pharmacol 2023;144:105486. [PMID: 37633327 DOI: 10.1016/j.yrtph.2023.105486] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2023] [Revised: 07/14/2023] [Accepted: 08/23/2023] [Indexed: 08/28/2023]

Zubrod JP, Galic N, Vaugeois M, Dreier DA. Physiological variables in machine learning QSARs allow for both cross-chemical and cross-species predictions. ECOTOXICOLOGY AND ENVIRONMENTAL SAFETY 2023;263:115250. [PMID: 37487435 DOI: 10.1016/j.ecoenv.2023.115250] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/11/2023] [Revised: 06/23/2023] [Accepted: 07/09/2023] [Indexed: 07/26/2023]

Liu W, Wang Z, Chen J, Tang W, Wang H. Machine Learning Model for Screening Thyroid Stimulating Hormone Receptor Agonists Based on Updated Datasets and Improved Applicability Domain Metrics. Chem Res Toxicol 2023. [PMID: 37209109 DOI: 10.1021/acs.chemrestox.3c00074] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/22/2023]

Patlewicz G, Paul-Friedman K, Houck K, Zhang L, Huang R, Xia M, Brown J, Simmons SO. Evaluating the utility of a high throughput thiol-containing fluorescent probe to screen for reactivity: A case study with the Tox21 library. COMPUTATIONAL TOXICOLOGY (AMSTERDAM, NETHERLANDS) 2023;26:10.1016/j.comtox.2023.100271. [PMID: 37388277 PMCID: PMC10304587 DOI: 10.1016/j.comtox.2023.100271] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/01/2023]

Escher BI, Altenburger R, Blüher M, Colbourne JK, Ebinghaus R, Fantke P, Hein M, Köck W, Kümmerer K, Leipold S, Li X, Scheringer M, Scholz S, Schloter M, Schweizer PJ, Tal T, Tetko I, Traidl-Hoffmann C, Wick LY, Fenner K. Modernizing persistence-bioaccumulation-toxicity (PBT) assessment with high throughput animal-free methods. Arch Toxicol 2023;97:1267-1283. [PMID: 36952002 PMCID: PMC10110678 DOI: 10.1007/s00204-023-03485-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2022] [Accepted: 03/13/2023] [Indexed: 03/24/2023]

Abstract

The assessment of persistence (P), bioaccumulation (B), and toxicity (T) of a chemical is a crucial first step at ensuring chemical safety and is a cornerstone of the European Union's chemicals regulation REACH (Registration, Evaluation, Authorization, and Restriction of Chemicals). Existing methods for PBT assessment are overly complex and cumbersome, have produced incorrect conclusions, and rely heavily on animal-intensive testing. We explore how new-approach methodologies (NAMs) can overcome the limitations of current PBT assessment. We propose two innovative hazard indicators, termed cumulative toxicity equivalents (CTE) and persistent toxicity equivalents (PTE). Together they are intended to replace existing PBT indicators and can also accommodate the emerging concept of PMT (where M stands for mobility). The proposed "toxicity equivalents" can be measured with high throughput in vitro bioassays. CTE refers to the toxic effects measured directly in any given sample, including single chemicals, substitution products, or mixtures. PTE is the equivalent measure of cumulative toxicity equivalents measured after simulated environmental degradation of the sample. With an appropriate panel of animal-free or alternative in vitro bioassays, CTE and PTE comprise key environmental and human health hazard indicators. CTE and PTE do not require analytical identification of transformation products and mixture components but instead prompt two key questions: is the chemical or mixture toxic, and is this toxicity persistent or can it be attenuated by environmental degradation? Taken together, the proposed hazard indicators CTE and PTE have the potential to integrate P, B/M and T assessment into one high-throughput experimental workflow that sidesteps the need for analytical measurements and will support the Chemicals Strategy for Sustainability of the European Union.

Collapse

Affiliation(s)

Beate I Escher Helmholtz Centre for Environmental Research-UFZ, Permoserstr. 15, E04318, Leipzig, Germany. Environmental Toxicology, Department of Geosciences, Eberhard Karls University Tübingen, Schnarrenbergstr. 94-96, E72076, Tübingen, Germany.
Rolf Altenburger Helmholtz Centre for Environmental Research-UFZ, Permoserstr. 15, E04318, Leipzig, Germany
Matthias Blüher Helmholtz Institute for Metabolic, Obesity and Vascular Research (HI-MAG) of the Helmholtz Munich-German Research Centre for Environmental Health (GmbH) at the University of Leipzig and University Hospital Leipzig, Leipzig, Germany
John K Colbourne Environmental Genomics Group, School of Biosciences, University of Birmingham, Birmingham, B15 2TT, UK
Ralf Ebinghaus Institute of Coastal Environmental Chemistry, Helmholtz Zentrum Hereon, Max-Planck-Straße 1, 21502, Geesthacht, Germany
Peter Fantke Quantitative Sustainability Assessment, Department of Environmental and Resource Engineering, Technical University of Denmark, Produktionstorvet 424, 2800, Kgs. Lyngby, Denmark
Michaela Hein Helmholtz Centre for Environmental Research-UFZ, Permoserstr. 15, E04318, Leipzig, Germany
Wolfgang Köck Helmholtz Centre for Environmental Research-UFZ, Permoserstr. 15, E04318, Leipzig, Germany
Klaus Kümmerer Institute of Sustainable and Environmental Chemistry, Leuphana University Lüneburg, Universitätsallee 1, 21335, Lüneburg, Germany International Sustainable Chemistry Collaboration Centre (ISC3), Friedrich-Ebert-Allee 32 + 36, D-53113, Bonn, Germany
Sina Leipold Helmholtz Centre for Environmental Research-UFZ, Permoserstr. 15, E04318, Leipzig, Germany Department for Political Science, Friedrich-Schiller-University Jena, Bachstr. 18k, 07743, Jena, Germany
Xiaojing Li Environmental Genomics Group, School of Biosciences, University of Birmingham, Birmingham, B15 2TT, UK
Martin Scheringer Institute of Biogeochemistry and Pollutant Dynamics, ETH Zürich, 8092, Zurich, Switzerland
Stefan Scholz Helmholtz Centre for Environmental Research-UFZ, Permoserstr. 15, E04318, Leipzig, Germany
Michael Schloter Comparative Microbiome Analysis, Environmental Health Centre, Helmholtz Munich - German Research Centre for Environmental Health (GmbH), Ingolstädter Landstr. 1, 85764, Neuherberg, Germany
Pia-Johanna Schweizer Research Institute for Sustainability-Helmholtz Centre Potsdam, Berliner Strasse 130, 14467, Potsdam, Germany
Tamara Tal Helmholtz Centre for Environmental Research-UFZ, Permoserstr. 15, E04318, Leipzig, Germany
Igor Tetko Institute of Structural Biology, Molecular Targets and Therapeutics Centre, Helmholtz Munich - German Research Centre for Environmental Health (GmbH), Ingolstädter Landstr. 1, 85764, Neuherberg, Germany
Claudia Traidl-Hoffmann Environmental Medicine Faculty of Medicine, University of Augsburg, Stenglinstrasse 2, 86156, Augsburg, Germany Institute of Environmental Medicine, Environmental Health Centre, Helmholtz Munich - German Research Centre for Environmental Health (GmbH), Ingolstädter Landstr. 1, 85764, Neuherberg, Germany
Lukas Y Wick Helmholtz Centre for Environmental Research-UFZ, Permoserstr. 15, E04318, Leipzig, Germany
Kathrin Fenner Department of Environmental Chemistry, Swiss Federal Institute of Aquatic Science and Technology (Eawag), 8600, Dübendorf, Switzerland Department of Chemistry, University of Zürich, 8057, Zurich, Switzerland

Collapse

Molecular Property Prediction by Combining LSTM and GAT. Biomolecules 2023;13:biom13030503. [PMID: 36979438 PMCID: PMC10046625 DOI: 10.3390/biom13030503] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2023] [Revised: 02/10/2023] [Accepted: 03/06/2023] [Indexed: 03/12/2023] Open

John L, Mahanta HJ, Soujanya Y, Sastry GN. Assessing machine learning approaches for predicting failures of investigational drug candidates during clinical trials. Comput Biol Med 2023;153:106494. [PMID: 36587568 DOI: 10.1016/j.compbiomed.2022.106494] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2022] [Revised: 11/30/2022] [Accepted: 12/27/2022] [Indexed: 12/30/2022]

Abstract

One of the major challenges in drug development is having acceptable levels of efficacy and safety throughout all the phases of clinical trials followed by the successful launch in the market. While there are many factors such as molecular properties, toxicity parameters, mechanism of action at the target site, etc. that regulates the therapeutic action of a compound, a holistic approach directed towards data-driven studies will invariably strengthen the predictive toxicological sciences. Our quest for the current study is to find out various reasons as to why an investigational candidate would fail in the clinical trials after multiple iterations of refinement and optimization. We have compiled a dataset that comprises of approved and withdrawn drugs as well as toxic compounds and essentially have used time-split based approach to generate the training and validation set. Five highly robust and scalable machine learning binary classifiers were used to develop the predictive models that were trained with features like molecular descriptors and fingerprints and then validated rigorously to achieve acceptable performance in terms of a set of performance metrics. The mean AUC scores for all the five classifiers with the hold-out test set were obtained in the range of 0.66-0.71. The models were further used to predict the probability score for the clinical candidate dataset. The top compounds predicted to be toxic were analyzed to estimate different dimensions of toxicity. Apparently, through this study, we propose that with the appropriate use of feature extraction and machine learning methods, one can estimate the likelihood of success or failure of investigational drugs candidates thereby opening an avenue for future trends in computational toxicological studies. The models developed in the study can be accessed at https://github.com/gnsastry/predicting_clinical_trials.git.

Collapse

Belfield SJ, Cronin MTD, Enoch SJ, Firman JW. Guidance for good practice in the application of machine learning in development of toxicological quantitative structure-activity relationships (QSARs). PLoS One 2023;18:e0282924. [PMID: 37163504 PMCID: PMC10171609 DOI: 10.1371/journal.pone.0282924] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2022] [Accepted: 02/26/2023] [Indexed: 05/12/2023] Open

Abstract

Recent years have seen a substantial growth in the adoption of machine learning approaches for the purposes of quantitative structure-activity relationship (QSAR) development. Such a trend has coincided with desire to see a shifting in the focus of methodology employed within chemical safety assessment: away from traditional reliance upon animal-intensive in vivo protocols, and towards increased application of in silico (or computational) predictive toxicology. With QSAR central amongst techniques applied in this area, the emergence of algorithms trained through machine learning with the objective of toxicity estimation has, quite naturally, arisen. On account of the pattern-recognition capabilities of the underlying methods, the statistical power of the ensuing models is potentially considerable-appropriate for the handling even of vast, heterogeneous datasets. However, such potency comes at a price: this manifesting as the general practical deficits observed with respect to the reproducibility, interpretability and generalisability of the resulting tools. Unsurprisingly, these elements have served to hinder broader uptake (most notably within a regulatory setting). Areas of uncertainty liable to accompany (and hence detract from applicability of) toxicological QSAR have previously been highlighted, accompanied by the forwarding of suggestions for "best practice" aimed at mitigation of their influence. However, the scope of such exercises has remained limited to "classical" QSAR-that conducted through use of linear regression and related techniques, with the adoption of comparatively few features or descriptors. Accordingly, the intention of this study has been to extend the remit of best practice guidance, so as to address concerns specific to employment of machine learning within the field. In doing so, the impact of strategies aimed at enhancing the transparency (feature importance, feature reduction), generalisability (cross-validation) and predictive power (hyperparameter optimisation) of algorithms, trained upon real toxicity data through six common learning approaches, is evaluated.

Collapse

Hasannejadasl H, Osong B, Bermejo I, van der Poel H, Vanneste B, van Roermund J, Aben K, Zhang Z, Kiemeney L, Van Oort I, Verwey R, Hochstenbach L, Bloemen E, Dekker A, Fijten RRR. A comparison of machine learning models for predicting urinary incontinence in men with localized prostate cancer. Front Oncol 2023;13:1168219. [PMID: 37124522 PMCID: PMC10130634 DOI: 10.3389/fonc.2023.1168219] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2023] [Accepted: 03/13/2023] [Indexed: 05/02/2023] Open

Abstract

Introduction

Urinary incontinence (UI) is a common side effect of prostate cancer treatment, but in clinical practice, it is difficult to predict. Machine learning (ML) models have shown promising results in predicting outcomes, yet the lack of transparency in complex models known as "black-box" has made clinicians wary of relying on them in sensitive decisions. Therefore, finding a balance between accuracy and explainability is crucial for the implementation of ML models. The aim of this study was to employ three different ML classifiers to predict the probability of experiencing UI in men with localized prostate cancer 1-year and 2-year after treatment and compare their accuracy and explainability.

Methods

We used the ProZIB dataset from the Netherlands Comprehensive Cancer Organization (Integraal Kankercentrum Nederland; IKNL) which contained clinical, demographic, and PROM data of 964 patients from 65 Dutch hospitals. Logistic Regression (LR), Random Forest (RF), and Support Vector Machine (SVM) algorithms were applied to predict (in)continence after prostate cancer treatment.

Results

All models have been externally validated according to the TRIPOD Type 3 guidelines and their performance was assessed by accuracy, sensitivity, specificity, and AUC. While all three models demonstrated similar performance, LR showed slightly better accuracy than RF and SVM in predicting the risk of UI one year after prostate cancer treatment, achieving an accuracy of 0.75, a sensitivity of 0.82, and an AUC of 0.79. All models for the 2-year outcome performed poorly in the validation set, with an accuracy of 0.6 for LR, 0.65 for RF, and 0.54 for SVM.

Conclusion

The outcomes of our study demonstrate the promise of using non-black box models, such as LR, to assist clinicians in recognizing high-risk patients and making informed treatment choices. The coefficients of the LR model show the importance of each feature in predicting results, and the generated nomogram provides an accessible illustration of how each feature impacts the predicted outcome. Additionally, the model's simplicity and interpretability make it a more appropriate option in scenarios where comprehending the model's predictions is essential.

Collapse

Affiliation(s)

Hajar Hasannejadasl Department of Radiation Oncology (MAASTRO), GROW School for Oncology and Reproduction, Maastricht University Medical Center, Maastricht, Netherlands
Biche Osong Department of Radiation Oncology (MAASTRO), GROW School for Oncology and Reproduction, Maastricht University Medical Center, Maastricht, Netherlands
Inigo Bermejo Department of Radiation Oncology (MAASTRO), GROW School for Oncology and Reproduction, Maastricht University Medical Center, Maastricht, Netherlands
Henk van der Poel Department of Urology, Netherlands Cancer Institute, Amsterdam, and Amsterdam University Medical Centers, Amsterdam, Netherlands
Ben Vanneste Department of Radiation Oncology (MAASTRO), GROW School for Oncology and Reproduction, Maastricht University Medical Center, Maastricht, Netherlands Department of Human Structure and Repair, Department of Radiation Oncology, Ghent University Hospital, Ghent, Belgium
Joep van Roermund Department of Urology, Maastricht University Medical Center, Maastricht, Netherlands
Katja Aben Department of Research and Development, Netherlands Comprehensive Cancer Organization, Utrecht, Netherlands Radboud Institute for Health Sciences, Radboud University Medical Center, Nijmegen, Netherlands
Zhen Zhang Department of Radiation Oncology (MAASTRO), GROW School for Oncology and Reproduction, Maastricht University Medical Center, Maastricht, Netherlands
Lambertus Kiemeney Radboud Institute for Health Sciences, Radboud University Medical Center, Nijmegen, Netherlands
Inge Van Oort Department of Urology, Radboud University Medical Center, Nijmegen, Netherlands
Renee Verwey Center of Expertise for Innovative Care and Technology (EIZT), School of Nursing, Zuyd University of Applied Sciences, Heerlen, Netherlands
Laura Hochstenbach Center of Expertise for Innovative Care and Technology (EIZT), School of Nursing, Zuyd University of Applied Sciences, Heerlen, Netherlands
Esther Bloemen Center of Expertise for Innovative Care and Technology (EIZT), School of Nursing, Zuyd University of Applied Sciences, Heerlen, Netherlands Expertise Center Empowering Healthy Behavior, Fontys University of Applied Sciences, Eindhoven, Netherlands
Andre Dekker Department of Radiation Oncology (MAASTRO), GROW School for Oncology and Reproduction, Maastricht University Medical Center, Maastricht, Netherlands
Rianne R. R. Fijten Department of Radiation Oncology (MAASTRO), GROW School for Oncology and Reproduction, Maastricht University Medical Center, Maastricht, Netherlands *Correspondence: Rianne R. R. Fijten,

Collapse

Bifarin OO. Interpretable machine learning with tree-based shapley additive explanations: Application to metabolomics datasets for binary classification. PLoS One 2023;18:e0284315. [PMID: 37141218 PMCID: PMC10159207 DOI: 10.1371/journal.pone.0284315] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2022] [Accepted: 03/28/2023] [Indexed: 05/05/2023] Open

Luo Y, Cuneo KC, Lawrence TS, Matuszak MM, Dawson LA, Niraula D, Ten Haken RK, El Naqa I. A human-in-the-loop based Bayesian network approach to improve imbalanced radiation outcomes prediction for hepatocellular cancer patients with stereotactic body radiotherapy. Front Oncol 2022;12:1061024. [PMID: 36568208 PMCID: PMC9782976 DOI: 10.3389/fonc.2022.1061024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2022] [Accepted: 11/01/2022] [Indexed: 12/13/2022] Open

Abstract

Background

Imbalanced outcome is one of common characteristics of oncology datasets. Current machine learning approaches have limitation in learning from such datasets. Here, we propose to resolve this problem by utilizing a human-in-the-loop (HITL) approach, which we hypothesize will also lead to more accurate and explainable outcome prediction models.

Methods

A total of 119 HCC patients with 163 tumors were used in the study. 81 patients with 104 tumors from the University of Michigan Hospital treated with SBRT were considered as a discovery dataset for radiation outcomes model building. The external testing dataset included 59 tumors from 38 patients with SBRT from Princess Margaret Hospital. In the discovery dataset, 100 tumors from 77 patients had local control (LC) (96% of 104 tumors) and 23 patients had at least one grade increment of ALBI (I-ALBI) during six-month follow up (28% of 81 patients). Each patient had a total of 110 features, where 15 or 20 features were identified by physicians as expert knowledge features (EKFs) for LC or I-ALBI prediction. We proposed a HITL based Bayesian network (HITL-BN) approach to enhance the capability of selecting important features from imbalanced data in terms of accuracy and explainability through humans' participation by integrating feature importance ranking and Markov blanket algorithms. A pure data-driven Bayesian network (PD-BN) method was applied to the same discovery dataset of HCC patients as a benchmark.

Results

In the training and testing phases, the areas under receiver operating characteristic curves of the HITL-BN models for LC or I-ALBI prediction during SBRT are 0.85 (95% confidence interval: 0.75-0.95) or 0.89 (0.81-0.95) and 0.77 or 0.78, respectively. They significantly outperformed the during-treatment PD-BN model in predicting LC or I-ALBI based on the discovery cross-validation and testing datasets from the Delong tests.

Conclusion

By allowing the human expert to be part of the model building process, the HITL-BN approach yielded significantly improved accuracy as well as better explainability when dealing with imbalanced outcomes in the prediction of post-SBRT treatment response of HCC patients when compared to the PD-BN method.

Collapse

Chen P, Wang R, Chen G, An B, Liu M, Wang Q, Tao Y. Thyroid endocrine disruption and hepatotoxicity induced by bisphenol AF: Integrated zebrafish embryotoxicity test and deep learning. THE SCIENCE OF THE TOTAL ENVIRONMENT 2022;822:153639. [PMID: 35131240 DOI: 10.1016/j.scitotenv.2022.153639] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/18/2021] [Revised: 01/28/2022] [Accepted: 01/29/2022] [Indexed: 06/14/2023]

Jeong K, Lee JY, Woo S, Kim D, Jeon Y, Ryu TI, Hwang SR, Jeong WH. Vapor Pressure and Toxicity Prediction for Novichok Agent Candidates Using Machine Learning Model: Preparation for Unascertained Nerve Agents after Chemical Weapons Convention Schedule 1 Update. Chem Res Toxicol 2022;35:774-781. [PMID: 35317551 DOI: 10.1021/acs.chemrestox.1c00410] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Liu X, Lu D, Zhang A, Liu Q, Jiang G. Data-Driven Machine Learning in Environmental Pollution: Gains and Problems. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2022;56:2124-2133. [PMID: 35084840 DOI: 10.1021/acs.est.1c06157] [Citation(s) in RCA: 64] [Impact Index Per Article: 32.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]

Guan S, Fu N. Class imbalance learning with Bayesian optimization applied in drug discovery. Sci Rep 2022;12:2069. [PMID: 35136094 PMCID: PMC8827090 DOI: 10.1038/s41598-022-05717-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2021] [Accepted: 01/11/2022] [Indexed: 11/12/2022] Open

Hao Y, Moore JH. TargetTox: A Feature Selection Pipeline for Identifying Predictive Targets Associated with Drug Toxicity. J Chem Inf Model 2021;61:5386-5394. [PMID: 34757743 DOI: 10.1021/acs.jcim.1c00733] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Fuhrman JD, Gorre N, Hu Q, Li H, El Naqa I, Giger ML. A review of explainable and interpretable AI with applications in COVID-19 imaging. Med Phys 2021;49:1-14. [PMID: 34796530 PMCID: PMC8646613 DOI: 10.1002/mp.15359] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2021] [Revised: 10/14/2021] [Accepted: 10/25/2021] [Indexed: 12/24/2022] Open

Kleinstreuer NC, Tetko IV, Tong W. Introduction to Special Issue: Computational Toxicology. Chem Res Toxicol 2021;34:171-175. [PMID: 33583184 DOI: 10.1021/acs.chemrestox.1c00032] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]