101
|
Khened M, Kori A, Rajkumar H, Krishnamurthi G, Srinivasan B. A generalized deep learning framework for whole-slide image segmentation and analysis. Sci Rep 2021; 11:11579. [PMID: 34078928 PMCID: PMC8172839 DOI: 10.1038/s41598-021-90444-8] [Citation(s) in RCA: 77] [Impact Index Per Article: 19.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2021] [Accepted: 05/04/2021] [Indexed: 12/22/2022] Open
Abstract
Histopathology tissue analysis is considered the gold standard in cancer diagnosis and prognosis. Whole-slide imaging (WSI), i.e., the scanning and digitization of entire histology slides, are now being adopted across the world in pathology labs. Trained histopathologists can provide an accurate diagnosis of biopsy specimens based on WSI data. Given the dimensionality of WSIs and the increase in the number of potential cancer cases, analyzing these images is a time-consuming process. Automated segmentation of tumorous tissue helps in elevating the precision, speed, and reproducibility of research. In the recent past, deep learning-based techniques have provided state-of-the-art results in a wide variety of image analysis tasks, including the analysis of digitized slides. However, deep learning-based solutions pose many technical challenges, including the large size of WSI data, heterogeneity in images, and complexity of features. In this study, we propose a generalized deep learning-based framework for histopathology tissue analysis to address these challenges. Our framework is, in essence, a sequence of individual techniques in the preprocessing-training-inference pipeline which, in conjunction, improve the efficiency and the generalizability of the analysis. The combination of techniques we have introduced includes an ensemble segmentation model, division of the WSI into smaller overlapping patches while addressing class imbalances, efficient techniques for inference, and an efficient, patch-based uncertainty estimation framework. Our ensemble consists of DenseNet-121, Inception-ResNet-V2, and DeeplabV3Plus, where all the networks were trained end to end for every task. We demonstrate the efficacy and improved generalizability of our framework by evaluating it on a variety of histopathology tasks including breast cancer metastases (CAMELYON), colon cancer (DigestPath), and liver cancer (PAIP). Our proposed framework has state-of-the-art performance across all these tasks and is ranked within the top 5 currently for the challenges based on these datasets. The entire framework along with the trained models and the related documentation are made freely available at GitHub and PyPi. Our framework is expected to aid histopathologists in accurate and efficient initial diagnosis. Moreover, the estimated uncertainty maps will help clinicians to make informed decisions and further treatment planning or analysis.
Collapse
Affiliation(s)
- Mahendra Khened
- Department of Engineering Design, Indian Institute of Technology Madras, Chennai, 600036, India
| | - Avinash Kori
- Department of Engineering Design, Indian Institute of Technology Madras, Chennai, 600036, India
| | - Haran Rajkumar
- Department of Engineering Design, Indian Institute of Technology Madras, Chennai, 600036, India
| | - Ganapathy Krishnamurthi
- Department of Engineering Design, Indian Institute of Technology Madras, Chennai, 600036, India.
| | - Balaji Srinivasan
- Department of Mechanical Engineering, Indian Institute of Technology Madras, Chennai, 600036, India
| |
Collapse
|
102
|
Wan S, Sinclair RC, Coveney PV. Uncertainty quantification in classical molecular dynamics. PHILOSOPHICAL TRANSACTIONS. SERIES A, MATHEMATICAL, PHYSICAL, AND ENGINEERING SCIENCES 2021; 379:20200082. [PMID: 33775140 PMCID: PMC8059622 DOI: 10.1098/rsta.2020.0082] [Citation(s) in RCA: 43] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Accepted: 11/02/2020] [Indexed: 05/24/2023]
Abstract
Molecular dynamics simulation is now a widespread approach for understanding complex systems on the atomistic scale. It finds applications from physics and chemistry to engineering, life and medical science. In the last decade, the approach has begun to advance from being a computer-based means of rationalizing experimental observations to producing apparently credible predictions for a number of real-world applications within industrial sectors such as advanced materials and drug discovery. However, key aspects concerning the reproducibility of the method have not kept pace with the speed of its uptake in the scientific community. Here, we present a discussion of uncertainty quantification for molecular dynamics simulation designed to endow the method with better error estimates that will enable it to be used to report actionable results. The approach adopted is a standard one in the field of uncertainty quantification, namely using ensemble methods, in which a sufficiently large number of replicas are run concurrently, from which reliable statistics can be extracted. Indeed, because molecular dynamics is intrinsically chaotic, the need to use ensemble methods is fundamental and holds regardless of the duration of the simulations performed. We discuss the approach and illustrate it in a range of applications from materials science to ligand-protein binding free energy estimation. This article is part of the theme issue 'Reliability and reproducibility in computational science: implementing verification, validation and uncertainty quantification in silico'.
Collapse
Affiliation(s)
- Shunzhou Wan
- Centre for Computational Science, University College London, Gordon Street, London WC1H 0AJ, UK
| | - Robert C. Sinclair
- Centre for Computational Science, University College London, Gordon Street, London WC1H 0AJ, UK
| | - Peter V. Coveney
- Centre for Computational Science, University College London, Gordon Street, London WC1H 0AJ, UK
- Institute for Informatics, Science Park 904, University of Amsterdam, 1098 XH Amsterdam, The Netherlands
| |
Collapse
|
103
|
Bhinder B, Gilvary C, Madhukar NS, Elemento O. Artificial Intelligence in Cancer Research and Precision Medicine. Cancer Discov 2021; 11:900-915. [PMID: 33811123 DOI: 10.1158/2159-8290.cd-21-0090] [Citation(s) in RCA: 298] [Impact Index Per Article: 74.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2021] [Revised: 02/06/2021] [Accepted: 02/08/2021] [Indexed: 11/16/2022]
Abstract
Artificial intelligence (AI) is rapidly reshaping cancer research and personalized clinical care. Availability of high-dimensionality datasets coupled with advances in high-performance computing, as well as innovative deep learning architectures, has led to an explosion of AI use in various aspects of oncology research. These applications range from detection and classification of cancer, to molecular characterization of tumors and their microenvironment, to drug discovery and repurposing, to predicting treatment outcomes for patients. As these advances start penetrating the clinic, we foresee a shifting paradigm in cancer care becoming strongly driven by AI. SIGNIFICANCE: AI has the potential to dramatically affect nearly all aspects of oncology-from enhancing diagnosis to personalizing treatment and discovering novel anticancer drugs. Here, we review the recent enormous progress in the application of AI to oncology, highlight limitations and pitfalls, and chart a path for adoption of AI in the cancer clinic.
Collapse
Affiliation(s)
- Bhavneet Bhinder
- Caryl and Israel Englander Institute for Precision Medicine, Weill Cornell Medicine, New York, New York.,Department of Physiology and Biophysics, Weill Cornell Medicine, New York, New York
| | | | | | - Olivier Elemento
- Caryl and Israel Englander Institute for Precision Medicine, Weill Cornell Medicine, New York, New York. .,Department of Physiology and Biophysics, Weill Cornell Medicine, New York, New York.,OneThree Biotech, New York, New York
| |
Collapse
|
104
|
Wang K, Hu Q, Gao B, Lin Q, Zhuge FW, Zhang DY, Wang L, He YH, Scheicher RH, Tong H, Miao XS. Threshold switching memristor-based stochastic neurons for probabilistic computing. MATERIALS HORIZONS 2021; 8:619-629. [PMID: 34821279 DOI: 10.1039/d0mh01759k] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Biological neurons exhibit dynamic excitation behavior in the form of stochastic firing, rather than stiffly giving out spikes upon reaching a fixed threshold voltage, which empowers the brain to perform probabilistic inference in the face of uncertainty. However, owing to the complexity of the stochastic firing process in biological neurons, the challenge of fabricating and applying stochastic neurons with bio-realistic dynamics to probabilistic scenarios remains to be fully addressed. In this work, a novel CuS/GeSe conductive-bridge threshold switching memristor is fabricated and singled out to realize electronic stochastic neurons, which is ascribed to the similarity between the stochastic switching behavior observed in the device and that of biological ion channels. The corresponding electric circuit of a stochastic neuron is then constructed and the probabilistic firing capacity of the neuron is utilized to implement Bayesian inference in a spiking neural network (SNN). The application prospects are demonstrated on the example of a tumor diagnosis task, where common fatal diagnostic errors of a conventional artificial neural network are successfully circumvented. Moreover, in comparison to deterministic neuron-based SNNs, the stochastic neurons enable SNNs to deliver an estimate of the uncertainty in their predictions, and the fidelity of the judgement is drastically improved by 81.2%.
Collapse
Affiliation(s)
- Kuan Wang
- Wuhan National Laboratory for Optoelectronics, School of Optical and Electronic Information, Huazhong University of Science and Technology, Wuhan 430074, China.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
105
|
Qayyum A, Qadir J, Bilal M, Al-Fuqaha A. Secure and Robust Machine Learning for Healthcare: A Survey. IEEE Rev Biomed Eng 2021; 14:156-180. [PMID: 32746371 DOI: 10.1109/rbme.2020.3013489] [Citation(s) in RCA: 106] [Impact Index Per Article: 26.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Recent years have witnessed widespread adoption of machine learning (ML)/deep learning (DL) techniques due to their superior performance for a variety of healthcare applications ranging from the prediction of cardiac arrest from one-dimensional heart signals to computer-aided diagnosis (CADx) using multi-dimensional medical images. Notwithstanding the impressive performance of ML/DL, there are still lingering doubts regarding the robustness of ML/DL in healthcare settings (which is traditionally considered quite challenging due to the myriad security and privacy issues involved), especially in light of recent results that have shown that ML/DL are vulnerable to adversarial attacks. In this paper, we present an overview of various application areas in healthcare that leverage such techniques from security and privacy point of view and present associated challenges. In addition, we present potential methods to ensure secure and privacy-preserving ML for healthcare applications. Finally, we provide insight into the current research challenges and promising directions for future research.
Collapse
|
106
|
|
107
|
Tanno R, Worrall DE, Kaden E, Ghosh A, Grussu F, Bizzi A, Sotiropoulos SN, Criminisi A, Alexander DC. Uncertainty modelling in deep learning for safer neuroimage enhancement: Demonstration in diffusion MRI. Neuroimage 2021; 225:117366. [DOI: 10.1016/j.neuroimage.2020.117366] [Citation(s) in RCA: 33] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2019] [Revised: 08/28/2020] [Accepted: 09/05/2020] [Indexed: 12/14/2022] Open
|
108
|
Verhelst HM, Stannat AW, Mecacci G. Machine Learning Against Terrorism: How Big Data Collection and Analysis Influences the Privacy-Security Dilemma. SCIENCE AND ENGINEERING ETHICS 2020; 26:2975-2984. [PMID: 32696430 PMCID: PMC7755624 DOI: 10.1007/s11948-020-00254-w] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/03/2018] [Accepted: 07/11/2020] [Indexed: 06/11/2023]
Abstract
Rapid advancements in machine learning techniques allow mass surveillance to be applied on larger scales and utilize more and more personal data. These developments demand reconsideration of the privacy-security dilemma, which describes the tradeoffs between national security interests and individual privacy concerns. By investigating mass surveillance techniques that use bulk data collection and machine learning algorithms, we show why these methods are unlikely to pinpoint terrorists in order to prevent attacks. The diverse characteristics of terrorist attacks-especially when considering lone-wolf terrorism-lead to irregular and isolated (digital) footprints. The irregularity of data affects the accuracy of machine learning algorithms and the mass surveillance that depends on them which can be explained by three kinds of known problems encountered in machine learning theory: class imbalance, the curse of dimensionality, and spurious correlations. Proponents of mass surveillance often invoke the distinction between collecting data and metadata, in which the latter is understood as a lesser breach of privacy. Their arguments commonly overlook the ambiguity in the definitions of data and metadata and ignore the ability of machine learning techniques to infer the former from the latter. Given the sparsity of datasets used for machine learning in counterterrorism and the privacy risks attendant with bulk data collection, policymakers and other relevant stakeholders should critically re-evaluate the likelihood of success of the algorithms and the collection of data on which they depend.
Collapse
Affiliation(s)
- H. M. Verhelst
- Delft Institute of Applied Mathematics, Delft University of Technology, Van Mourik Broekmanweg 6, 2628XE Delft, The Netherlands
| | - A. W. Stannat
- Delft Institute of Applied Mathematics, Delft University of Technology, Van Mourik Broekmanweg 6, 2628XE Delft, The Netherlands
| | - G. Mecacci
- Donders Institute for Brain, Cognition and Behaviour, Radboud University, Montessorilaan 3, 6525 HR Nijmegen, The Netherlands
| |
Collapse
|
109
|
Mervin LH, Johansson S, Semenova E, Giblin KA, Engkvist O. Uncertainty quantification in drug design. Drug Discov Today 2020; 26:474-489. [PMID: 33253918 DOI: 10.1016/j.drudis.2020.11.027] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2020] [Revised: 07/13/2020] [Accepted: 11/23/2020] [Indexed: 01/03/2023]
Abstract
Machine learning and artificial intelligence are increasingly being applied to the drug-design process as a result of the development of novel algorithms, growing access, the falling cost of computation and the development of novel technologies for generating chemically and biologically relevant data. There has been recent progress in fields such as molecular de novo generation, synthetic route prediction and, to some extent, property predictions. Despite this, most research in these fields has focused on improving the accuracy of the technologies, rather than on quantifying the uncertainty in the predictions. Uncertainty quantification will become a key component in autonomous decision making and will be crucial for integrating machine learning and chemistry automation to create an autonomous design-make-test-analyse cycle. This review covers the empirical, frequentist and Bayesian approaches to uncertainty quantification, and outlines how they can be used for drug design. We also outline the impact of uncertainty quantification on decision making.
Collapse
Affiliation(s)
- Lewis H Mervin
- Molecular AI, Discovery Sciences, R&D, AstraZeneca, Cambridge, UK.
| | - Simon Johansson
- Department of Computer Science and Engineering, Chalmers University of Technology, Gothenburg, Sweden; Molecular AI, Discovery Sciences, R&D, AstraZeneca, Gothenburg, Sweden
| | - Elizaveta Semenova
- Data Sciences and Quantitative Biology, Discovery Sciences, R&D, AstraZeneca, Cambridge, UK
| | - Kathryn A Giblin
- Medicinal Chemistry, Research and Early Development, Oncology R&D, AstraZeneca, Cambridge, UK
| | - Ola Engkvist
- Molecular AI, Discovery Sciences, R&D, AstraZeneca, Gothenburg, Sweden
| |
Collapse
|
110
|
Lawson CE, Martí JM, Radivojevic T, Jonnalagadda SVR, Gentz R, Hillson NJ, Peisert S, Kim J, Simmons BA, Petzold CJ, Singer SW, Mukhopadhyay A, Tanjore D, Dunn JG, Garcia Martin H. Machine learning for metabolic engineering: A review. Metab Eng 2020; 63:34-60. [PMID: 33221420 DOI: 10.1016/j.ymben.2020.10.005] [Citation(s) in RCA: 124] [Impact Index Per Article: 24.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2020] [Revised: 10/22/2020] [Accepted: 10/31/2020] [Indexed: 12/14/2022]
Abstract
Machine learning provides researchers a unique opportunity to make metabolic engineering more predictable. In this review, we offer an introduction to this discipline in terms that are relatable to metabolic engineers, as well as providing in-depth illustrative examples leveraging omics data and improving production. We also include practical advice for the practitioner in terms of data management, algorithm libraries, computational resources, and important non-technical issues. A variety of applications ranging from pathway construction and optimization, to genetic editing optimization, cell factory testing, and production scale-up are discussed. Moreover, the promising relationship between machine learning and mechanistic models is thoroughly reviewed. Finally, the future perspectives and most promising directions for this combination of disciplines are examined.
Collapse
Affiliation(s)
- Christopher E Lawson
- Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA; Joint BioEnergy Institute, Emeryville, CA, 94608, USA
| | - Jose Manuel Martí
- Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA; Joint BioEnergy Institute, Emeryville, CA, 94608, USA; DOE Agile BioFoundry, Emeryville, CA, 94608, USA
| | - Tijana Radivojevic
- Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA; Joint BioEnergy Institute, Emeryville, CA, 94608, USA; DOE Agile BioFoundry, Emeryville, CA, 94608, USA
| | - Sai Vamshi R Jonnalagadda
- Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA; Joint BioEnergy Institute, Emeryville, CA, 94608, USA; DOE Agile BioFoundry, Emeryville, CA, 94608, USA
| | - Reinhard Gentz
- Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA; Joint BioEnergy Institute, Emeryville, CA, 94608, USA; Computational Research Division, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | - Nathan J Hillson
- Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA; Joint BioEnergy Institute, Emeryville, CA, 94608, USA; DOE Agile BioFoundry, Emeryville, CA, 94608, USA
| | - Sean Peisert
- Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA; Computational Research Division, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA; University of California Davis, Davis, CA, 95616, USA
| | - Joonhoon Kim
- Joint BioEnergy Institute, Emeryville, CA, 94608, USA; Pacific Northwest National Laboratory, Richland, 99354, WA, USA
| | - Blake A Simmons
- Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA; Joint BioEnergy Institute, Emeryville, CA, 94608, USA; DOE Agile BioFoundry, Emeryville, CA, 94608, USA
| | - Christopher J Petzold
- Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA; Joint BioEnergy Institute, Emeryville, CA, 94608, USA; DOE Agile BioFoundry, Emeryville, CA, 94608, USA
| | - Steven W Singer
- Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA; Joint BioEnergy Institute, Emeryville, CA, 94608, USA
| | - Aindrila Mukhopadhyay
- Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA; Joint BioEnergy Institute, Emeryville, CA, 94608, USA; Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, USA
| | - Deepti Tanjore
- Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA; Advanced Biofuels and Bioproducts Process Development Unit, Emeryville, CA, 94608, USA
| | | | - Hector Garcia Martin
- Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA; Joint BioEnergy Institute, Emeryville, CA, 94608, USA; DOE Agile BioFoundry, Emeryville, CA, 94608, USA; Basque Center for Applied Mathematics, 48009, Bilbao, Spain; Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, USA.
| |
Collapse
|
111
|
Char DS, Abràmoff MD, Feudtner C. Identifying Ethical Considerations for Machine Learning Healthcare Applications. THE AMERICAN JOURNAL OF BIOETHICS : AJOB 2020; 20:7-17. [PMID: 33103967 PMCID: PMC7737650 DOI: 10.1080/15265161.2020.1819469] [Citation(s) in RCA: 132] [Impact Index Per Article: 26.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/13/2023]
Abstract
Along with potential benefits to healthcare delivery, machine learning healthcare applications (ML-HCAs) raise a number of ethical concerns. Ethical evaluations of ML-HCAs will need to structure the overall problem of evaluating these technologies, especially for a diverse group of stakeholders. This paper outlines a systematic approach to identifying ML-HCA ethical concerns, starting with a conceptual model of the pipeline of the conception, development, implementation of ML-HCAs, and the parallel pipeline of evaluation and oversight tasks at each stage. Over this model, we layer key questions that raise value-based issues, along with ethical considerations identified in large part by a literature review, but also identifying some ethical considerations that have yet to receive attention. This pipeline model framework will be useful for systematic ethical appraisals of ML-HCA from development through implementation, and for interdisciplinary collaboration of diverse stakeholders that will be required to understand and subsequently manage the ethical implications of ML-HCAs.
Collapse
Affiliation(s)
- Danton S. Char
- Department of Anesthesiology, Stanford University School of Medicine, Division of Pediatric Cardiac Anesthesia, and the Center for Biomedical Ethics, Stanford University School of Medicine
| | - Michael D. Abràmoff
- The Robert C. Watzke Professor of Ophthalmology of Visual Sciences, of Electrical and Computer Engineering and of Biomedical Engineering, University of Iowa, Iowa City, IA; Founder and Executive Chairman, IDx, Coralville, IA
| | - Chris Feudtner
- The Department of Medical Ethics, The Children’s Hospital of Philadelphia; and the Departments of Pediatrics, Medical Ethics and Healthcare Policy, The Perelman School of Medicine, The University of Pennsylvania, Philadelphia, PA
| |
Collapse
|
112
|
Towards Knowledge Uncertainty Estimation for Open Set Recognition. MACHINE LEARNING AND KNOWLEDGE EXTRACTION 2020. [DOI: 10.3390/make2040028] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Uncertainty is ubiquitous and happens in every single prediction of Machine Learning models. The ability to estimate and quantify the uncertainty of individual predictions is arguably relevant, all the more in safety-critical applications. Real-world recognition poses multiple challenges since a model’s knowledge about physical phenomenon is not complete, and observations are incomplete by definition. However, Machine Learning algorithms often assume that train and test data distributions are the same and that all testing classes are present during training. A more realistic scenario is the Open Set Recognition, where unknown classes can be submitted to an algorithm during testing. In this paper, we propose a Knowledge Uncertainty Estimation (KUE) method to quantify knowledge uncertainty and reject out-of-distribution inputs. Additionally, we quantify and distinguish aleatoric and epistemic uncertainty with the classical information-theoretical measures of entropy by means of ensemble techniques. We performed experiments on four datasets with different data modalities and compared our results with distance-based classifiers, SVM-based approaches and ensemble techniques using entropy measures. Overall, the effectiveness of KUE in distinguishing in- and out-distribution inputs obtained better results in most cases and was at least comparable in others. Furthermore, a classification with rejection option based on a proposed combination strategy between different measures of uncertainty is an application of uncertainty with proven results.
Collapse
|
113
|
Lechat B, Hansen K, Catcheside P, Zajamsek B. Beyond K-complex binary scoring during sleep: probabilistic classification using deep learning. Sleep 2020; 43:zsaa077. [PMID: 32301485 PMCID: PMC7751135 DOI: 10.1093/sleep/zsaa077] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2019] [Revised: 03/18/2020] [Indexed: 12/21/2022] Open
Abstract
STUDY OBJECTIVES K-complexes (KCs) are a recognized electroencephalography marker of sensory processing and a defining feature of sleep stage 2. KC frequency and morphology may also be reflective of sleep quality, aging, and a range of sleep and sensory processing deficits. However, manual scoring of K-complexes is impractical, time-consuming, and thus costly and currently not well-standardized. Although automated KC detection methods have been developed, performance and uptake remain limited. METHODS The proposed algorithm is based on a deep neural network and Gaussian process, which gives the input waveform a probability of being a KC ranging from 0% to 100%. The algorithm was trained on half a million synthetic KCs derived from manually scored sleep stage 2 KCs from the Montreal Archive of Sleep Study containing 19 healthy young participants. Algorithm performance was subsequently assessed on 700 independent recordings from the Cleveland Family Study using sleep stages 2 and 3 data. RESULTS The developed algorithm showed an F1 score (a measure of binary classification accuracy) of 0.78 and thus outperforms currently available KC scoring algorithms with F1 = 0.2-0.6. The probabilistic approach also captured expected variability in KC shape and amplitude within individuals and across age groups. CONCLUSIONS An automated probabilistic KC classification is well suited and effective for systematic KC detection for a more in-depth exploration of potential relationships between KCs during sleep and clinical outcomes such as health impacts and daytime symptomatology.
Collapse
Affiliation(s)
- Bastien Lechat
- Adelaide Institute for Sleep Health, College of Science and Engineering, Flinders University, Adelaide, Australia
| | - Kristy Hansen
- Adelaide Institute for Sleep Health, College of Science and Engineering, Flinders University, Adelaide, Australia
| | - Peter Catcheside
- Adelaide Institute for Sleep Health, College of Medicine and Public Health, Flinders University, Adelaide, Australia
| | - Branko Zajamsek
- Adelaide Institute for Sleep Health, College of Medicine and Public Health, Flinders University, Adelaide, Australia
| |
Collapse
|
114
|
Unceta I, Nin J, Pujol O. Environmental Adaptation and Differential Replication in Machine Learning. ENTROPY (BASEL, SWITZERLAND) 2020; 22:E1122. [PMID: 33286891 PMCID: PMC7597251 DOI: 10.3390/e22101122] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/16/2020] [Revised: 09/17/2020] [Accepted: 09/29/2020] [Indexed: 11/16/2022]
Abstract
When deployed in the wild, machine learning models are usually confronted with an environment that imposes severe constraints. As this environment evolves, so do these constraints. As a result, the feasible set of solutions for the considered need is prone to change in time. We refer to this problem as that of environmental adaptation. In this paper, we formalize environmental adaptation and discuss how it differs from other problems in the literature. We propose solutions based on differential replication, a technique where the knowledge acquired by the deployed models is reused in specific ways to train more suitable future generations. We discuss different mechanisms to implement differential replications in practice, depending on the considered level of knowledge. Finally, we present seven examples where the problem of environmental adaptation can be solved through differential replication in real-life applications.
Collapse
Affiliation(s)
- Irene Unceta
- BBVA Data & Analytics, 28050 Madrid, Spain;
- Department of Mathematics and Computer Science, Universitat de Barcelona, 08007 Barcelona, Spain;
| | - Jordi Nin
- Department of Operations, Innovation and Data Sciences, Universitat Ramon Llull, ESADE, 08172 Sant Cugat del Vallès, Spain
| | - Oriol Pujol
- Department of Mathematics and Computer Science, Universitat de Barcelona, 08007 Barcelona, Spain;
| |
Collapse
|
115
|
Radivojević T, Costello Z, Workman K, Garcia Martin H. A machine learning Automated Recommendation Tool for synthetic biology. Nat Commun 2020; 11:4879. [PMID: 32978379 PMCID: PMC7519645 DOI: 10.1038/s41467-020-18008-4] [Citation(s) in RCA: 133] [Impact Index Per Article: 26.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2020] [Accepted: 07/27/2020] [Indexed: 01/07/2023] Open
Abstract
Synthetic biology allows us to bioengineer cells to synthesize novel valuable molecules such as renewable biofuels or anticancer drugs. However, traditional synthetic biology approaches involve ad-hoc engineering practices, which lead to long development times. Here, we present the Automated Recommendation Tool (ART), a tool that leverages machine learning and probabilistic modeling techniques to guide synthetic biology in a systematic fashion, without the need for a full mechanistic understanding of the biological system. Using sampling-based optimization, ART provides a set of recommended strains to be built in the next engineering cycle, alongside probabilistic predictions of their production levels. We demonstrate the capabilities of ART on simulated data sets, as well as experimental data from real metabolic engineering projects producing renewable biofuels, hoppy flavored beer without hops, fatty acids, and tryptophan. Finally, we discuss the limitations of this approach, and the practical consequences of the underlying assumptions failing.
Collapse
Affiliation(s)
- Tijana Radivojević
- DOE Agile BioFoundry, Emeryville, CA, 94608, USA
- Biofuels and Bioproducts Division, DOE Joint BioEnergy Institute, Emeryville, CA, 94608, USA
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | - Zak Costello
- DOE Agile BioFoundry, Emeryville, CA, 94608, USA
- Biofuels and Bioproducts Division, DOE Joint BioEnergy Institute, Emeryville, CA, 94608, USA
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | - Kenneth Workman
- DOE Agile BioFoundry, Emeryville, CA, 94608, USA
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
- Department of Bioengineering, University of California, Berkeley, CA, 94720, USA
| | - Hector Garcia Martin
- DOE Agile BioFoundry, Emeryville, CA, 94608, USA.
- Biofuels and Bioproducts Division, DOE Joint BioEnergy Institute, Emeryville, CA, 94608, USA.
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA.
- BCAM, Basque Center for Applied Mathematics, Bilbao, 48009, Spain.
| |
Collapse
|
116
|
Cheng M, Nazarian S, Bogdan P. There Is Hope After All: Quantifying Opinion and Trustworthiness in Neural Networks. Front Artif Intell 2020; 3:54. [PMID: 33733171 PMCID: PMC7861320 DOI: 10.3389/frai.2020.00054] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2019] [Accepted: 06/26/2020] [Indexed: 11/13/2022] Open
Abstract
Artificial Intelligence (AI) plays a fundamental role in the modern world, especially when used as an autonomous decision maker. One common concern nowadays is "how trustworthy the AIs are." Human operators follow a strict educational curriculum and performance assessment that could be exploited to quantify how much we entrust them. To quantify the trust of AI decision makers, we must go beyond task accuracy especially when facing limited, incomplete, misleading, controversial or noisy datasets. Toward addressing these challenges, we describe DeepTrust, a Subjective Logic (SL) inspired framework that constructs a probabilistic logic description of an AI algorithm and takes into account the trustworthiness of both dataset and inner algorithmic workings. DeepTrust identifies proper multi-layered neural network (NN) topologies that have high projected trust probabilities, even when trained with untrusted data. We show that uncertain opinion of data is not always malicious while evaluating NN's opinion and trustworthiness, whereas the disbelief opinion hurts trust the most. Also trust probability does not necessarily correlate with accuracy. DeepTrust also provides a projected trust probability of NN's prediction, which is useful when the NN generates an over-confident output under problematic datasets. These findings open new analytical avenues for designing and improving the NN topology by optimizing opinion and trustworthiness, along with accuracy, in a multi-objective optimization formulation, subject to space and time constraints.
Collapse
Affiliation(s)
- Mingxi Cheng
- Ming Hsieh Department of Electrical and Computer Engineering, University of Southern California, Los Angeles, CA, United States
| | - Shahin Nazarian
- Ming Hsieh Department of Electrical and Computer Engineering, University of Southern California, Los Angeles, CA, United States
| | - Paul Bogdan
- Ming Hsieh Department of Electrical and Computer Engineering, University of Southern California, Los Angeles, CA, United States
| |
Collapse
|
117
|
Jin EH, Lee D, Bae JH, Kang HY, Kwak MS, Seo JY, Yang JI, Yang SY, Lim SH, Yim JY, Lim JH, Chung GE, Chung SJ, Choi JM, Han YM, Kang SJ, Lee J, Chan Kim H, Kim JS. Improved Accuracy in Optical Diagnosis of Colorectal Polyps Using Convolutional Neural Networks with Visual Explanations. Gastroenterology 2020; 158:2169-2179.e8. [PMID: 32119927 DOI: 10.1053/j.gastro.2020.02.036] [Citation(s) in RCA: 87] [Impact Index Per Article: 17.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/19/2019] [Revised: 01/10/2020] [Accepted: 02/20/2020] [Indexed: 02/08/2023]
Abstract
BACKGROUND & AIMS Narrow-band imaging (NBI) can be used to determine whether colorectal polyps are adenomatous or hyperplastic. We investigated whether an artificial intelligence (AI) system can increase the accuracy of characterizations of polyps by endoscopists of different skill levels. METHODS We developed convolutional neural networks (CNNs) for evaluation of diminutive colorectal polyps, based on efficient neural architecture searches via parameter sharing with augmentation using NBIs of diminutive (≤5 mm) polyps, collected from October 2015 through October 2017 at the Seoul National University Hospital, Healthcare System Gangnam Center (training set). We trained the CNN using images from 1100 adenomatous polyps and 1050 hyperplastic polyps from 1379 patients. We then tested the system using 300 images of 180 adenomatous polyps and 120 hyperplastic polyps, obtained from January 2018 to May 2019. We compared the accuracy of 22 endoscopists of different skill levels (7 novices, 4 experts, and 11 NBI-trained experts) vs the CNN in evaluation of images (adenomatous vs hyperplastic) from 180 adenomatous and 120 hyperplastic polyps. The endoscopists then evaluated the polyp images with knowledge of the CNN-processed results. We conducted mixed-effect logistic and linear regression analyses to determine the effects of AI assistance on the accuracy of analysis of diminutive colorectal polyps by endoscopists (primary outcome). RESULTS The CNN distinguished adenomatous vs hyperplastic diminutive polyps with 86.7% accuracy, based on histologic analysis as the reference standard. Endoscopists distinguished adenomatous vs hyperplastic diminutive polyps with 82.5% overall accuracy (novices, 73.8% accuracy; experts, 83.8% accuracy; and NBI-trained experts, 87.6% accuracy). With knowledge of the CNN-processed results, the overall accuracy of the endoscopists increased to 88.5% (P < .05). With knowledge of the CNN-processed results, the accuracy of novice endoscopists increased to 85.6% (P < .05). The CNN-processed results significantly reduced endoscopist time of diagnosis (from 3.92 to 3.37 seconds per polyp, P = .042). CONCLUSIONS We developed a CNN that significantly increases the accuracy of evaluation of diminutive colorectal polyps (as adenomatous vs hyperplastic) and reduces the time of diagnosis by endoscopists. This AI assistance system significantly increased the accuracy of analysis by novice endoscopists, who achieved near-expert levels of accuracy without extra training. The CNN assistance system can reduce the skill-level dependence of endoscopists and costs.
Collapse
Affiliation(s)
- Eun Hyo Jin
- Department of Internal Medicine, Healthcare Research Institute, Seoul National University Hospital Healthcare System Gangnam Center, Seoul, Korea
| | - Dongheon Lee
- Interdisciplinary Program in Bioengineering, Graduate School, Seoul National University, Seoul, Korea
| | - Jung Ho Bae
- Department of Internal Medicine, Healthcare Research Institute, Seoul National University Hospital Healthcare System Gangnam Center, Seoul, Korea
| | - Hae Yeon Kang
- Department of Internal Medicine, Healthcare Research Institute, Seoul National University Hospital Healthcare System Gangnam Center, Seoul, Korea
| | - Min-Sun Kwak
- Department of Internal Medicine, Healthcare Research Institute, Seoul National University Hospital Healthcare System Gangnam Center, Seoul, Korea
| | - Ji Yeon Seo
- Department of Internal Medicine, Healthcare Research Institute, Seoul National University Hospital Healthcare System Gangnam Center, Seoul, Korea
| | - Jong In Yang
- Department of Internal Medicine, Healthcare Research Institute, Seoul National University Hospital Healthcare System Gangnam Center, Seoul, Korea
| | - Sun Young Yang
- Department of Internal Medicine, Healthcare Research Institute, Seoul National University Hospital Healthcare System Gangnam Center, Seoul, Korea
| | - Seon Hee Lim
- Department of Internal Medicine, Healthcare Research Institute, Seoul National University Hospital Healthcare System Gangnam Center, Seoul, Korea
| | - Jeong Yoon Yim
- Department of Internal Medicine, Healthcare Research Institute, Seoul National University Hospital Healthcare System Gangnam Center, Seoul, Korea
| | - Joo Hyun Lim
- Department of Internal Medicine, Healthcare Research Institute, Seoul National University Hospital Healthcare System Gangnam Center, Seoul, Korea
| | - Goh Eun Chung
- Department of Internal Medicine, Healthcare Research Institute, Seoul National University Hospital Healthcare System Gangnam Center, Seoul, Korea
| | - Su Jin Chung
- Department of Internal Medicine, Healthcare Research Institute, Seoul National University Hospital Healthcare System Gangnam Center, Seoul, Korea
| | - Ji Min Choi
- Department of Internal Medicine, Healthcare Research Institute, Seoul National University Hospital Healthcare System Gangnam Center, Seoul, Korea
| | - Yoo Min Han
- Department of Internal Medicine, Healthcare Research Institute, Seoul National University Hospital Healthcare System Gangnam Center, Seoul, Korea
| | - Seung Joo Kang
- Department of Internal Medicine, Healthcare Research Institute, Seoul National University Hospital Healthcare System Gangnam Center, Seoul, Korea
| | - Jooyoung Lee
- Department of Internal Medicine, Liver Research Institute, Seoul National University College of Medicine, Seoul, Korea
| | - Hee Chan Kim
- Interdisciplinary Program in Bioengineering, Graduate School, Seoul National University, Seoul, Korea; Department of Biomedical Engineering College of Medicine, Seoul National University, Seoul, Korea; Institute of Medical & Biological Engineering, Medical Research Center, Seoul National University, Seoul, Korea.
| | - Joo Sung Kim
- Department of Internal Medicine, Healthcare Research Institute, Seoul National University Hospital Healthcare System Gangnam Center, Seoul, Korea; Department of Internal Medicine, Liver Research Institute, Seoul National University College of Medicine, Seoul, Korea.
| |
Collapse
|
118
|
Natekar P, Kori A, Krishnamurthi G. Demystifying Brain Tumor Segmentation Networks: Interpretability and Uncertainty Analysis. Front Comput Neurosci 2020; 14:6. [PMID: 32116620 PMCID: PMC7025464 DOI: 10.3389/fncom.2020.00006] [Citation(s) in RCA: 37] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2019] [Accepted: 01/17/2020] [Indexed: 11/13/2022] Open
Abstract
The accurate automatic segmentation of gliomas and its intra-tumoral structures is important not only for treatment planning but also for follow-up evaluations. Several methods based on 2D and 3D Deep Neural Networks (DNN) have been developed to segment brain tumors and to classify different categories of tumors from different MRI modalities. However, these networks are often black-box models and do not provide any evidence regarding the process they take to perform this task. Increasing transparency and interpretability of such deep learning techniques is necessary for the complete integration of such methods into medical practice. In this paper, we explore various techniques to explain the functional organization of brain tumor segmentation models and to extract visualizations of internal concepts to understand how these networks achieve highly accurate tumor segmentations. We use the BraTS 2018 dataset to train three different networks with standard architectures and outline similarities and differences in the process that these networks take to segment brain tumors. We show that brain tumor segmentation networks learn certain human-understandable disentangled concepts on a filter level. We also show that they take a top-down or hierarchical approach to localizing the different parts of the tumor. We then extract visualizations of some internal feature maps and also provide a measure of uncertainty with regards to the outputs of the models to give additional qualitative evidence about the predictions of these networks. We believe that the emergence of such human-understandable organization and concepts might aid in the acceptance and integration of such methods in medical diagnosis.
Collapse
Affiliation(s)
- Parth Natekar
- Department of Engineering Design, Indian Institute of Technology Madras, Chennai, India
| | - Avinash Kori
- Department of Engineering Design, Indian Institute of Technology Madras, Chennai, India
| | | |
Collapse
|
119
|
Lee S, Chu YS, Yoo SK, Choi S, Choe SJ, Koh SB, Chung KY, Xing L, Oh B, Yang S. Augmented decision-making for acral lentiginous melanoma detection using deep convolutional neural networks. J Eur Acad Dermatol Venereol 2020; 34:1842-1850. [PMID: 31919901 DOI: 10.1111/jdv.16185] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2019] [Accepted: 12/13/2019] [Indexed: 12/14/2022]
Abstract
BACKGROUND Several studies have achieved high-level performance of melanoma detection using convolutional neural networks (CNNs). However, few have described the extent to which the implementation of CNNs improves the diagnostic performance of the physicians. OBJECTIVE This study is aimed at developing a CNN for detecting acral lentiginous melanoma (ALM) and investigating whether its implementation can improve the initial decision for ALM detection made by the physicians. METHODS A CNN was trained using 1072 dermoscopic images of acral benign nevi, ALM and intermediate tumours. To investigate whether the implementation of CNN can improve the initial decision for ALM detection, 60 physicians completed a three-stage survey. In Stage I, they were asked for their decisions solely on the basis of dermoscopic images provided to them. In Stage II, they were also provided with clinical information. In Stage III, they were provided with the additional diagnosis and probability predicted by the CNN. RESULTS The accuracy of ALM detection in the participants was 74.7% (95% confidence interval [CI], 72.6-76.8%) in Stage I and 79.0% (95% CI, 76.7-81.2%) in Stage II. In Stage III, it was 86.9% (95% CI, 85.3-88.4%), which exceeds the accuracy delivered in Stage I by 12.2%p (95% CI, 10.1-14.3%p) and Stage II by 7.9%p (95% CI, 6.0-9.9%p). Moreover, the concordance between the participants considerably increased (Fleiss-κ of 0.436 [95% CI, 0.437-0.573] in Stage I, 0.506 [95% CI, 0.621-0.749] in Stage II and 0.684 [95% CI, 0.621-0.749] in Stage III). CONCLUSIONS Augmented decision-making improved the performance of and concordance between the clinical decisions of a diverse group of experts. This study demonstrates the potential use of CNNs as an adjoining, decision-supporting system for physicians' decisions.
Collapse
Affiliation(s)
- S Lee
- Department of Dermatology, Yonsei University Wonju College of Medicine, Wonju, Korea.,Department of Preventive Medicine, Yonsei University Wonju College of Medicine, Wonju, Korea
| | - Y S Chu
- Department of Biomedical Engineering, Yonsei University, Wonju, Korea
| | - S K Yoo
- Department of Biomedical Engineering, Yonsei University, Wonju, Korea
| | - S Choi
- Department of Dermatology and Cutaneous Biology Research Institute, Severance Hospital, Yonsei University College of Medicine, Seoul, Korea
| | - S J Choe
- Department of Dermatology, Yonsei University Wonju College of Medicine, Wonju, Korea
| | - S B Koh
- Department of Preventive Medicine, Yonsei University Wonju College of Medicine, Wonju, Korea
| | - K Y Chung
- Department of Dermatology and Cutaneous Biology Research Institute, Severance Hospital, Yonsei University College of Medicine, Seoul, Korea
| | - L Xing
- Department of Radiation Oncology, Stanford University, Stanford, CA, USA
| | - B Oh
- Department of Dermatology and Cutaneous Biology Research Institute, Severance Hospital, Yonsei University College of Medicine, Seoul, Korea
| | - S Yang
- Department of Biomedical Engineering, Yonsei University, Wonju, Korea
| |
Collapse
|
120
|
Interpretable multimodal deep learning for real-time pan-tissue pan-disease pathology search on social media. Mod Pathol 2020; 33:2169-2185. [PMID: 32467650 PMCID: PMC7581495 DOI: 10.1038/s41379-020-0540-1] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2020] [Revised: 03/17/2020] [Accepted: 03/17/2020] [Indexed: 02/07/2023]
Abstract
Pathologists are responsible for rapidly providing a diagnosis on critical health issues. Challenging cases benefit from additional opinions of pathologist colleagues. In addition to on-site colleagues, there is an active worldwide community of pathologists on social media for complementary opinions. Such access to pathologists worldwide has the capacity to improve diagnostic accuracy and generate broader consensus on next steps in patient care. From Twitter we curate 13,626 images from 6,351 tweets from 25 pathologists from 13 countries. We supplement the Twitter data with 113,161 images from 1,074,484 PubMed articles. We develop machine learning and deep learning models to (i) accurately identify histopathology stains, (ii) discriminate between tissues, and (iii) differentiate disease states. Area Under Receiver Operating Characteristic (AUROC) is 0.805-0.996 for these tasks. We repurpose the disease classifier to search for similar disease states given an image and clinical covariates. We report precision@k = 1 = 0.7618 ± 0.0018 (chance 0.397 ± 0.004, mean ±stdev ). The classifiers find that texture and tissue are important clinico-visual features of disease. Deep features trained only on natural images (e.g., cats and dogs) substantially improved search performance, while pathology-specific deep features and cell nuclei features further improved search to a lesser extent. We implement a social media bot (@pathobot on Twitter) to use the trained classifiers to aid pathologists in obtaining real-time feedback on challenging cases. If a social media post containing pathology text and images mentions the bot, the bot generates quantitative predictions of disease state (normal/artifact/infection/injury/nontumor, preneoplastic/benign/low-grade-malignant-potential, or malignant) and lists similar cases across social media and PubMed. Our project has become a globally distributed expert system that facilitates pathological diagnosis and brings expertise to underserved regions or hospitals with less expertise in a particular disease. This is the first pan-tissue pan-disease (i.e., from infection to malignancy) method for prediction and search on social media, and the first pathology study prospectively tested in public on social media. We will share data through http://pathobotology.org . We expect our project to cultivate a more connected world of physicians and improve patient care worldwide.
Collapse
|
121
|
Ethical considerations about artificial intelligence for prognostication in intensive care. Intensive Care Med Exp 2019; 7:70. [PMID: 31823128 PMCID: PMC6904702 DOI: 10.1186/s40635-019-0286-6] [Citation(s) in RCA: 50] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2019] [Accepted: 11/28/2019] [Indexed: 11/25/2022] Open
Abstract
Background Prognosticating the course of diseases to inform decision-making is a key component of intensive care medicine. For several applications in medicine, new methods from the field of artificial intelligence (AI) and machine learning have already outperformed conventional prediction models. Due to their technical characteristics, these methods will present new ethical challenges to the intensivist. Results In addition to the standards of data stewardship in medicine, the selection of datasets and algorithms to create AI prognostication models must involve extensive scrutiny to avoid biases and, consequently, injustice against individuals or groups of patients. Assessment of these models for compliance with the ethical principles of beneficence and non-maleficence should also include quantification of predictive uncertainty. Respect for patients’ autonomy during decision-making requires transparency of the data processing by AI models to explain the predictions derived from these models. Moreover, a system of continuous oversight can help to maintain public trust in this technology. Based on these considerations as well as recent guidelines, we propose a pathway to an ethical implementation of AI-based prognostication. It includes a checklist for new AI models that deals with medical and technical topics as well as patient- and system-centered issues. Conclusion AI models for prognostication will become valuable tools in intensive care. However, they require technical refinement and a careful implementation according to the standards of medical ethics.
Collapse
|
122
|
Lovis C. Unlocking the Power of Artificial Intelligence and Big Data in Medicine. J Med Internet Res 2019; 21:e16607. [PMID: 31702565 PMCID: PMC6874800 DOI: 10.2196/16607] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2019] [Revised: 10/18/2019] [Accepted: 10/20/2019] [Indexed: 12/17/2022] Open
Abstract
Data-driven science and its corollaries in machine learning and the wider field of artificial intelligence have the potential to drive important changes in medicine. However, medicine is not a science like any other: It is deeply and tightly bound with a large and wide network of legal, ethical, regulatory, economical, and societal dependencies. As a consequence, the scientific and technological progresses in handling information and its further processing and cross-linking for decision support and predictive systems must be accompanied by parallel changes in the global environment, with numerous stakeholders, including citizen and society. What can be seen at the first glance as a barrier and a mechanism slowing down the progression of data science must, however, be considered an important asset. Only global adoption can transform the potential of big data and artificial intelligence into an effective breakthroughs in handling health and medicine. This requires science and society, scientists and citizens, to progress together.
Collapse
Affiliation(s)
- Christian Lovis
- Division of Medical Information Sciences, University Hospitals of Geneva, Geneva, Switzerland
- Department of Radiology and Medical Informatics, University of Geneva, Geneva, Switzerland
| |
Collapse
|
123
|
Lemke HU. Some germinal roots of AI and their impact on Computer Assisted Radiology and Surgery (CARS). Int J Comput Assist Radiol Surg 2019; 14:1619-1625. [PMID: 31606791 DOI: 10.1007/s11548-019-02069-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Affiliation(s)
- Heinz U Lemke
- International Foundation for Computer Assisted Radiology and Surgery (IFCARS), Kussaberg, Germany.
| |
Collapse
|
124
|
Bhattacharya T, Brettin T, Doroshow JH, Evrard YA, Greenspan EJ, Gryshuk AL, Hoang TT, Lauzon CBV, Nissley D, Penberthy L, Stahlberg E, Stevens R, Streitz F, Tourassi G, Xia F, Zaki G. AI Meets Exascale Computing: Advancing Cancer Research With Large-Scale High Performance Computing. Front Oncol 2019; 9:984. [PMID: 31632915 PMCID: PMC6783509 DOI: 10.3389/fonc.2019.00984] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2019] [Accepted: 09/16/2019] [Indexed: 12/02/2022] Open
Abstract
The application of data science in cancer research has been boosted by major advances in three primary areas: (1) Data: diversity, amount, and availability of biomedical data; (2) Advances in Artificial Intelligence (AI) and Machine Learning (ML) algorithms that enable learning from complex, large-scale data; and (3) Advances in computer architectures allowing unprecedented acceleration of simulation and machine learning algorithms. These advances help build in silico ML models that can provide transformative insights from data including: molecular dynamics simulations, next-generation sequencing, omics, imaging, and unstructured clinical text documents. Unique challenges persist, however, in building ML models related to cancer, including: (1) access, sharing, labeling, and integration of multimodal and multi-institutional data across different cancer types; (2) developing AI models for cancer research capable of scaling on next generation high performance computers; and (3) assessing robustness and reliability in the AI models. In this paper, we review the National Cancer Institute (NCI) -Department of Energy (DOE) collaboration, Joint Design of Advanced Computing Solutions for Cancer (JDACS4C), a multi-institution collaborative effort focused on advancing computing and data technologies to accelerate cancer research on three levels: molecular, cellular, and population. This collaboration integrates various types of generated data, pre-exascale compute resources, and advances in ML models to increase understanding of basic cancer biology, identify promising new treatment options, predict outcomes, and eventually prescribe specialized treatments for patients with cancer.
Collapse
Affiliation(s)
- Tanmoy Bhattacharya
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM, United States
| | - Thomas Brettin
- Computing, Environment and Life Sciences Directorate, Argonne National Laboratory, Lemont, IL, United States
| | - James H Doroshow
- Division of Cancer Treatment and Diagnosis, National Cancer Institute, Bethesda, MD, United States
| | - Yvonne A Evrard
- Applied Development and Research Directorate, Frederick National Laboratory for Cancer Research, Frederick, MD, United States
| | - Emily J Greenspan
- Center for Biomedical Informatics and Information Technology, National Cancer Institute, Bethesda, MD, United States
| | - Amy L Gryshuk
- Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore, CA, United States
| | - Thuc T Hoang
- National Nuclear Security Administration, U.S. Department of Energy, Advanced Simulation and Computing, Washington, DC, United States
| | - Carolyn B Vea Lauzon
- Office of Science, U.S. Department of Energy, Advanced Scientific Computing Research, Washington, DC, United States
| | - Dwight Nissley
- NCI RAS Initiative, Cancer Research Technology Program, Frederick National Laboratory for Cancer Research, Frederick, MD, United States
| | - Lynne Penberthy
- Division of Cancer Control and Population Sciences, National Cancer Institute, Bethesda, MD, United States
| | - Eric Stahlberg
- Biomedical Informatics and Data Science Directorate, Frederick National Laboratory for Cancer Research, Frederick, MD, United States
| | - Rick Stevens
- Computing, Environment and Life Sciences Directorate, Argonne National Laboratory, Lemont, IL, United States.,Computer Science Department, University of Chicago, Chicago, IL, United States
| | - Fred Streitz
- High Performance Computing Innovation Center, Lawrence Livermore National Laboratory, Livermore, CA, United States
| | - Georgia Tourassi
- Health Data Sciences Institute, Oak Ridge National Laboratory, Oak Ridge, TN, United States
| | - Fangfang Xia
- Data Science and Learning Division, Argonne National Laboratory, Lemont, IL, United States
| | - George Zaki
- Biomedical Informatics and Data Science Directorate, Frederick National Laboratory for Cancer Research, Frederick, MD, United States
| |
Collapse
|
125
|
Artificial intelligence's essential role in the process of drug discovery. FUTURE DRUG DISCOVERY 2019. [DOI: 10.4155/fdd-2019-0026] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023] Open
|
126
|
Ryu S, Kwon Y, Kim WY. A Bayesian graph convolutional network for reliable prediction of molecular properties with uncertainty quantification. Chem Sci 2019; 10:8438-8446. [PMID: 31803423 PMCID: PMC6839511 DOI: 10.1039/c9sc01992h] [Citation(s) in RCA: 61] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2019] [Accepted: 07/21/2019] [Indexed: 01/14/2023] Open
Abstract
Deep neural networks have been increasingly used in various chemical fields. In the nature of a data-driven approach, their performance strongly depends on data used in training. Therefore, models developed in data-deficient situations can cause highly uncertain predictions, leading to vulnerable decision making. Here, we show that Bayesian inference enables more reliable prediction with quantitative uncertainty analysis. Decomposition of the predictive uncertainty into model- and data-driven uncertainties allows us to elucidate the source of errors for further improvements. For molecular applications, we devised a Bayesian graph convolutional network (GCN) and evaluated its performance for molecular property predictions. Our study on the classification problem of bio-activity and toxicity shows that the confidence of prediction can be quantified in terms of the predictive uncertainty, leading to more accurate virtual screening of drug candidates than standard GCNs. The result of log P prediction illustrates that data noise affects the data-driven uncertainty more significantly than the model-driven one. Based on this finding, we could identify artefacts that arose from quantum mechanical calculations in the Harvard Clean Energy Project dataset. Consequently, the Bayesian GCN is critical for molecular applications under data-deficient conditions.
Collapse
Affiliation(s)
- Seongok Ryu
- Department of Chemistry , KAIST , 291 Daehak-ro, Yuseong-gu , Daejeon 34141 , Republic of Korea .
| | - Yongchan Kwon
- Department of Statistics , Seoul National University , 1 Gwanak-ro, Gwanak-gu , Seoul 08826 , Republic of Korea
| | - Woo Youn Kim
- Department of Chemistry , KAIST , 291 Daehak-ro, Yuseong-gu , Daejeon 34141 , Republic of Korea .
- KI for Artificial Intelligence , KAIST , 291 Daehak-ro, Yuseong-gu , Daejeon 34141 , Republic of Korea
| |
Collapse
|
127
|
|
128
|
Dewey M, Wilkens U. The Bionic Radiologist: avoiding blurry pictures and providing greater insights. NPJ Digit Med 2019; 2:65. [PMID: 31388567 PMCID: PMC6616477 DOI: 10.1038/s41746-019-0142-9] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2019] [Accepted: 05/28/2019] [Indexed: 12/11/2022] Open
Abstract
Radiology images and reports have long been digitalized. However, the potential of the more than 3.6 billion radiology examinations performed annually worldwide has largely gone unused in the effort to digitally transform health care. The Bionic Radiologist is a concept that combines humanity and digitalization for better health care integration of radiology. At a practical level, this concept will achieve critical goals: (1) testing decisions being made scientifically on the basis of disease probabilities and patient preferences; (2) image analysis done consistently at any time and at any site; and (3) treatment suggestions that are closely linked to imaging results and are seamlessly integrated with other information. The Bionic Radiologist will thus help avoiding missed care opportunities, will provide continuous learning in the work process, and will also allow more time for radiologists' primary roles: interacting with patients and referring physicians. To achieve that potential, one has to cope with many implementation barriers at both the individual and institutional levels. These include: reluctance to delegate decision making, a possible decrease in image interpretation knowledge and the perception that patient safety and trust are at stake. To facilitate implementation of the Bionic Radiologist the following will be helpful: uncertainty quantifications for suggestions, shared decision making, changes in organizational culture and leadership style, maintained expertise through continuous learning systems for training, and role development of the involved experts. With the support of the Bionic Radiologist, disparities are reduced and the delivery of care is provided in a humane and personalized fashion.
Collapse
Affiliation(s)
- Marc Dewey
- Charité—Universitätsmedizin Berlin and Berlin Institute of Health, Berlin, Germany
| | - Uta Wilkens
- Ruhr-University Bochum, Institute of Work Science, Bochum, Germany
| |
Collapse
|
129
|
Lemke HU, Melzer A. Back to the roots of AI and their relevance for health care today. MINIM INVASIV THER 2019; 28:65-68. [PMID: 30950663 DOI: 10.1080/13645706.2019.1596955] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Affiliation(s)
- Heinz U Lemke
- a IMSaT University Dundee, UK & ICCAS University Leipzig , Leipzig , Germany
| | - Andreas Melzer
- a IMSaT University Dundee, UK & ICCAS University Leipzig , Leipzig , Germany
| |
Collapse
|
130
|
|
131
|
Buendia R, Kogej T, Engkvist O, Carlsson L, Linusson H, Johansson U, Toccaceli P, Ahlberg E. Accurate Hit Estimation for Iterative Screening Using Venn-ABERS Predictors. J Chem Inf Model 2019; 59:1230-1237. [PMID: 30726080 DOI: 10.1021/acs.jcim.8b00724] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Iterative screening has emerged as a promising approach to increase the efficiency of high-throughput screening (HTS) campaigns in drug discovery. By learning from a subset of the compound library, inferences on what compounds to screen next can be made by predictive models. One of the challenges of iterative screening is to decide how many iterations to perform. This is mainly related to difficulties in estimating the prospective hit rate in any given iteration. In this article, a novel method based on Venn-ABERS predictors is proposed. The method provides accurate estimates of the number of hits retrieved in any given iteration during an HTS campaign. The estimates provide the necessary information to support the decision on the number of iterations needed to maximize the screening outcome. Thus, this method offers a prospective screening strategy for early-stage drug discovery.
Collapse
Affiliation(s)
- Ruben Buendia
- Department of Information Technology , University of Borås , SE-501 90 Borås , Sweden
| | - Thierry Kogej
- Discovery Sciences , AstraZeneca IMED Biotech Unit , SE-431 83 Mölndal , Sweden
| | - Ola Engkvist
- Discovery Sciences , AstraZeneca IMED Biotech Unit , SE-431 83 Mölndal , Sweden
| | - Lars Carlsson
- Discovery Sciences , AstraZeneca IMED Biotech Unit , SE-431 83 Mölndal , Sweden.,Department of Computer Science, Royal Holloway , University of London , Egham , Surrey TW20 0EX , United Kingdom
| | - Henrik Linusson
- Department of Information Technology , University of Borås , SE-501 90 Borås , Sweden
| | - Ulf Johansson
- Department of Information Technology , University of Borås , SE-501 90 Borås , Sweden
| | - Paolo Toccaceli
- Department of Computer Science, Royal Holloway , University of London , Egham , Surrey TW20 0EX , United Kingdom
| | - Ernst Ahlberg
- Data Science and AI, Drug Safety & Metabolism , AstraZeneca IMED Biotech Unit , SE-431 83 Mölndal , Sweden
| |
Collapse
|