1
|
Du J, Li W, Liu P, Vong CM, You Y, Lei B, Wang T. Federated learning using model projection for multi-center disease diagnosis with non-IID data. Neural Netw 2024; 178:106409. [PMID: 38823069 DOI: 10.1016/j.neunet.2024.106409] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Revised: 04/28/2024] [Accepted: 05/23/2024] [Indexed: 06/03/2024]
Abstract
Multi-center disease diagnosis aims to build a global model for all involved medical centers. Due to privacy concerns, it is infeasible to collect data from multiple centers for training (i.e., centralized learning). Federated Learning (FL) is a decentralized framework that enables multiple clients (e.g., medical centers) to collaboratively train a global model while retaining patient data locally for privacy. However, in practice, the data across medical centers are not independently and identically distributed (Non-IID), causing two challenging issues: (1) catastrophic forgetting at clients, i.e., the local model at clients will forget the knowledge received from the global model after local training, causing reduced performance; and (2) invalid aggregation at the server, i.e., the global model at the server may not be favorable to some clients after model aggregation, resulting in a slow convergence rate. To mitigate these issues, an innovative Federated learning using Model Projection (FedMoP) is proposed, which guarantees: (1) the loss of local model on global data does not increase after local training without accessing the global data so that the performance will not be degenerated; and (2) the loss of global model on local data does not increase after aggregation without accessing local data so that convergence rate can be improved. Extensive experimental results show that our FedMoP outperforms state-of-the-art FL methods in terms of accuracy, convergence rate and communication cost. In particular, our FedMoP also achieves comparable or even higher accuracy than centralized learning. Thus, our FedMoP can ensure privacy protection while outperforming centralized learning in accuracy and communication cost.
Collapse
Affiliation(s)
- Jie Du
- National-Regional Key Technology Engineering Laboratory for Medical Ultrasound, Guangdong Key Laboratory for Biomedical Measurements and Ultrasound Imaging, School of Biomedical Engineering, Medical School, Shenzhen University, Shenzhen 518060, Guangdong, China.
| | - Wei Li
- National-Regional Key Technology Engineering Laboratory for Medical Ultrasound, Guangdong Key Laboratory for Biomedical Measurements and Ultrasound Imaging, School of Biomedical Engineering, Medical School, Shenzhen University, Shenzhen 518060, Guangdong, China.
| | - Peng Liu
- Artificial Intelligence Industrial Innovation Research Center, Shenzhen Institute for Advanced Study, University of Electronic Science and Technology of China, Shenzhen, 518110, China.
| | - Chi-Man Vong
- Department of Computer and Information Science, University of Macau, Macau SAR, 999078, China.
| | - Yongke You
- Department of nephrology, Shenzhen University General Hospital, Shenzhen University, Shenzhen 518060, Guangdong, China.
| | - Baiying Lei
- National-Regional Key Technology Engineering Laboratory for Medical Ultrasound, Guangdong Key Laboratory for Biomedical Measurements and Ultrasound Imaging, School of Biomedical Engineering, Medical School, Shenzhen University, Shenzhen 518060, Guangdong, China.
| | - Tianfu Wang
- National-Regional Key Technology Engineering Laboratory for Medical Ultrasound, Guangdong Key Laboratory for Biomedical Measurements and Ultrasound Imaging, School of Biomedical Engineering, Medical School, Shenzhen University, Shenzhen 518060, Guangdong, China.
| |
Collapse
|
2
|
Darzi E, Sijtsema NM, van Ooijen PMA. A comparative study of federated learning methods for COVID-19 detection. Sci Rep 2024; 14:3944. [PMID: 38365940 PMCID: PMC10873416 DOI: 10.1038/s41598-024-54323-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2023] [Accepted: 02/11/2024] [Indexed: 02/18/2024] Open
Abstract
Deep learning has proven to be highly effective in diagnosing COVID-19; however, its efficacy is contingent upon the availability of extensive data for model training. The data sharing among hospitals, which is crucial for training robust models, is often restricted by privacy regulations. Federated learning (FL) emerges as a solution by enabling model training across multiple hospitals while preserving data privacy. However, the deployment of FL can be resource-intensive, necessitating efficient utilization of computational and network resources. In this study, we evaluate the performance and resource efficiency of five FL algorithms in the context of COVID-19 detection using Convolutional Neural Networks (CNNs) in a decentralized setting. The evaluation involves varying the number of participating entities, the number of federated rounds, and the selection algorithms. Our findings indicate that the Cyclic Weight Transfer algorithm exhibits superior performance, particularly when the number of participating hospitals is limited. These insights hold practical implications for the deployment of FL algorithms in COVID-19 detection and broader medical image analysis.
Collapse
Affiliation(s)
- Erfan Darzi
- Harvard Medical school, Harvard University, 300 Longwood avenue, Boston, United States.
| | - Nanna M Sijtsema
- Department of Radiotherapy, University Medical Center Groningen, University of Groningen, Hanzeplein 1, Groningen, The Netherlands
- Machine Learning Lab, Data Science Center in Health (DASH), University Medical Groningen, University of Groningen, Hanzeplein 1, Groningen, The Netherlands
| | - P M A van Ooijen
- Department of Radiotherapy, University Medical Center Groningen, University of Groningen, Hanzeplein 1, Groningen, The Netherlands
- Machine Learning Lab, Data Science Center in Health (DASH), University Medical Groningen, University of Groningen, Hanzeplein 1, Groningen, The Netherlands
| |
Collapse
|
3
|
Souza R, Stanley EAM, Camacho M, Camicioli R, Monchi O, Ismail Z, Wilms M, Forkert ND. A multi-center distributed learning approach for Parkinson's disease classification using the traveling model paradigm. Front Artif Intell 2024; 7:1301997. [PMID: 38384277 PMCID: PMC10879577 DOI: 10.3389/frai.2024.1301997] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2023] [Accepted: 01/23/2024] [Indexed: 02/23/2024] Open
Abstract
Distributed learning is a promising alternative to central learning for machine learning (ML) model training, overcoming data-sharing problems in healthcare. Previous studies exploring federated learning (FL) or the traveling model (TM) setup for medical image-based disease classification often relied on large databases with a limited number of centers or simulated artificial centers, raising doubts about real-world applicability. This study develops and evaluates a convolution neural network (CNN) for Parkinson's disease classification using data acquired by 83 diverse real centers around the world, mostly contributing small training samples. Our approach specifically makes use of the TM setup, which has proven effective in scenarios with limited data availability but has never been used for image-based disease classification. Our findings reveal that TM is effective for training CNN models, even in complex real-world scenarios with variable data distributions. After sufficient training cycles, the TM-trained CNN matches or slightly surpasses the performance of the centrally trained counterpart (AUROC of 83% vs. 80%). Our study highlights, for the first time, the effectiveness of TM in 3D medical image classification, especially in scenarios with limited training samples and heterogeneous distributed data. These insights are relevant for situations where ML models are supposed to be trained using data from small or remote medical centers, and rare diseases with sparse cases. The simplicity of this approach enables a broad application to many deep learning tasks, enhancing its clinical utility across various contexts and medical facilities.
Collapse
Affiliation(s)
- Raissa Souza
- Department of Radiology, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
- Hotchkiss Brain Institute, University of Calgary, Calgary, AB, Canada
- Biomedical Engineering Graduate Program, University of Calgary, Calgary, AB, Canada
- Alberta Children's Hospital Research Institute, University of Calgary, Calgary, AB, Canada
| | - Emma A. M. Stanley
- Department of Radiology, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
- Hotchkiss Brain Institute, University of Calgary, Calgary, AB, Canada
- Biomedical Engineering Graduate Program, University of Calgary, Calgary, AB, Canada
- Alberta Children's Hospital Research Institute, University of Calgary, Calgary, AB, Canada
| | - Milton Camacho
- Department of Radiology, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
- Hotchkiss Brain Institute, University of Calgary, Calgary, AB, Canada
- Biomedical Engineering Graduate Program, University of Calgary, Calgary, AB, Canada
- Alberta Children's Hospital Research Institute, University of Calgary, Calgary, AB, Canada
| | - Richard Camicioli
- Department of Medicine (Neurology), Neuroscience and Mental Health Institute, University of Alberta, Edmonton, AB, Canada
| | - Oury Monchi
- Department of Radiology, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
- Hotchkiss Brain Institute, University of Calgary, Calgary, AB, Canada
- Department of Radiology, Radio-oncology and Nuclear Medicine, Université de Montréal, Montréal, QC, Canada
- Centre de Recherche, Institut Universitaire de Gériatrie de Montréal, Montréal, QC, Canada
- Department of Clinical Neurosciences, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
| | - Zahinoor Ismail
- Hotchkiss Brain Institute, University of Calgary, Calgary, AB, Canada
- Department of Clinical Neurosciences, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
- Department of Psychiatry, University of Calgary, Calgary, AB, Canada
- Clinical and Biomedical Sciences, Faculty of Health and Life Sciences, University of Exeter, Exeter, United Kingdom
| | - Matthias Wilms
- Hotchkiss Brain Institute, University of Calgary, Calgary, AB, Canada
- Alberta Children's Hospital Research Institute, University of Calgary, Calgary, AB, Canada
- Department of Pediatrics, University of Calgary, Calgary, AB, Canada
- Department of Community Health Sciences, University of Calgary, Calgary, AB, Canada
| | - Nils D. Forkert
- Department of Radiology, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
- Hotchkiss Brain Institute, University of Calgary, Calgary, AB, Canada
- Alberta Children's Hospital Research Institute, University of Calgary, Calgary, AB, Canada
- Department of Clinical Neurosciences, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
| |
Collapse
|
4
|
Ji Y, Gao Y, Bao R, Li Q, Liu D, Sun Y, Ye Y. Prediction of COVID-19 Patients' Emergency Room Revisit using Multi-Source Transfer Learning. IEEE INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS. IEEE INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS 2023; 2023:138-144. [PMID: 38486663 PMCID: PMC10939709 DOI: 10.1109/ichi57859.2023.00028] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/17/2024]
Abstract
The coronavirus disease 2019 (COVID-19) has led to a global pandemic of significant severity. In addition to its high level of contagiousness, COVID-19 can have a heterogeneous clinical course, ranging from asymptomatic carriers to severe and potentially life-threatening health complications. Many patients have to revisit the emergency room (ER) within a short time after discharge, which significantly increases the workload for medical staff. Early identification of such patients is crucial for helping physicians focus on treating life-threatening cases. In this study, we obtained Electronic Health Records (EHRs) of 3,210 encounters from 13 affiliated ERs within the University of Pittsburgh Medical Center between March 2020 and January 2021. We leveraged a Natural Language Processing technique, ScispaCy, to extract clinical concepts and used the 1001 most frequent concepts to develop 7-day revisit models for COVID-19 patients in ERs. The research data we collected were obtained from 13 ERs, which may have distributional differences that could affect the model development. To address this issue, we employed a classic deep transfer learning method called the Domain Adversarial Neural Network (DANN) and evaluated different modeling strategies, including the Multi-DANN algorithm (which considers the source differences), the Single-DANN algorithm (which doesn't consider the source differences), and three baseline methods: using only source data, using only target data, and using a mixture of source and target data. Results showed that the Multi-DANN models outperformed the Single-DANN models and baseline models in predicting revisits of COVID-19 patients to the ER within 7 days after discharge (median AUROC = 0.8 vs. 0.5). Notably, the Multi-DANN strategy effectively addressed the heterogeneity among multiple source domains and improved the adaptation of source data to the target domain. Moreover, the high performance of Multi-DANN models indicates that EHRs are informative for developing a prediction model to identify COVID-19 patients who are very likely to revisit an ER within 7 days after discharge.
Collapse
Affiliation(s)
- Yuelyu Ji
- Department of Information Science, School of Computing and Information, University of Pittsburgh, Pittsburgh,USA
| | - Yuhe Gao
- Department of Biomedical Informatics, School of Medicine, University of Pittsburgh, Pittsburgh, USA
| | - Runxue Bao
- Department of Electrical and Computer Engineering, Swanson School of Engineering, University of Pittsburgh, Pittsburgh, USA
| | - Qi Li
- School of Business, State University of New York at New Paltz, New Paltz, USA
| | - Disheng Liu
- Department of Information Science, School of Computing and Information, University of Pittsburgh Pittsburgh, USA
| | - Yiming Sun
- Department of Electrical and Computer Engineering, Swanson School of Engineering, University of Pittsburgh Pittsburgh, USA
| | - Ye Ye
- Department of Biomedical Informatics, School of Medicine, University of Pittsburgh, Pittsburgh, USA
| |
Collapse
|
5
|
You S, Reyes M. Influence of contrast and texture based image modifications on the performance and attention shift of U-Net models for brain tissue segmentation. FRONTIERS IN NEUROIMAGING 2022; 1:1012639. [PMID: 37555149 PMCID: PMC10406260 DOI: 10.3389/fnimg.2022.1012639] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/05/2022] [Accepted: 10/12/2022] [Indexed: 08/10/2023]
Abstract
Contrast and texture modifications applied during training or test-time have recently shown promising results to enhance the generalization performance of deep learning segmentation methods in medical image analysis. However, a deeper understanding of this phenomenon has not been investigated. In this study, we investigated this phenomenon using a controlled experimental setting, using datasets from the Human Connectome Project and a large set of simulated MR protocols, in order to mitigate data confounders and investigate possible explanations as to why model performance changes when applying different levels of contrast and texture-based modifications. Our experiments confirm previous findings regarding the improved performance of models subjected to contrast and texture modifications employed during training and/or testing time, but further show the interplay when these operations are combined, as well as the regimes of model improvement/worsening across scanning parameters. Furthermore, our findings demonstrate a spatial attention shift phenomenon of trained models, occurring for different levels of model performance, and varying in relation to the type of applied image modification.
Collapse
Affiliation(s)
- Suhang You
- Medical Image Analysis Group, ARTORG, Graduate School for Cellular and Biomedical Sciences, University of Bern, Bern, Switzerland
| | | |
Collapse
|
6
|
Polson JS, Zhang H, Nael K, Salamon N, Yoo BY, El-Saden S, Starkman S, Kim N, Kang DW, Speier WF, Arnold CW. Identifying acute ischemic stroke patients within the thrombolytic treatment window using deep learning. J Neuroimaging 2022; 32:1153-1160. [PMID: 36068184 DOI: 10.1111/jon.13043] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2022] [Revised: 08/11/2022] [Accepted: 08/16/2022] [Indexed: 11/28/2022] Open
Abstract
BACKGROUND AND PURPOSE Treatment of acute ischemic stroke is heavily contingent upon time, as there is a strong relationship between time clock and tissue progression. Work has established imaging biomarker assessments as surrogates for time since stroke (TSS), namely, by comparing signal mismatch between diffusion-weighted imaging (DWI) and fluid-attenuated inversion recovery (FLAIR) imaging. Our goal was to develop an automatic technique for determining TSS from imaging that does not require subspecialist radiology expertise. METHODS Using 772 patients (66 ± 9 years, 319 women), we developed and externally evaluated a deep learning network for classifying TSS from MR images and compared algorithm predictions to neuroradiologist assessments of DWI-FLAIR mismatch. Models were trained to classify TSS within 4.5 hours and performance metrics with confidence intervals were reported on both internal and external evaluation sets. RESULTS Three board-certified neuroradiologists' DWI-FLAIR mismatch assessments, based on majority vote, yielded a sensitivity of .62, a specificity of .86, and a Fleiss' kappa of .46 when used to classify TSS. The deep learning method performed similarly to radiologists and outperformed previously reported methods, with the best model achieving an average evaluation accuracy, sensitivity, and specificity of .726, .712, and .741, respectively, on an internal cohort and .724, .757, and .679, respectively, on an external cohort. CONCLUSION Our model achieved higher generalization performance on external evaluation datasets than the current state-of-the-art for TSS classification. These results demonstrate the potential of automatic assessment of onset time from imaging without the need for expertly trained radiologists.
Collapse
Affiliation(s)
- Jennifer S Polson
- Computational Diagnostics Lab, University of California, Los Angeles, Los Angeles, California, USA.,Department of Bioengineering, University of California, Los Angeles, Los Angeles, California, USA
| | - Haoyue Zhang
- Computational Diagnostics Lab, University of California, Los Angeles, Los Angeles, California, USA.,Department of Bioengineering, University of California, Los Angeles, Los Angeles, California, USA
| | - Kambiz Nael
- Department of Radiology, University of California, Los Angeles, Los Angeles, California, USA
| | - Noriko Salamon
- Department of Radiology, University of California, Los Angeles, Los Angeles, California, USA
| | - Bryan Y Yoo
- Department of Radiology, University of California, Los Angeles, Los Angeles, California, USA
| | - Suzie El-Saden
- Department of Radiology, VA Phoenix Healthcare System, Phoenix, Arizona, USA
| | - Sidney Starkman
- Departments of Emergency Medicine and Neurology, University of California, Los Angeles, Los Angeles, California, USA
| | - Namkug Kim
- Department of Neurology, Asan Medical Center, University of Ulsan College of Medicine, Seoul, Korea
| | - Dong-Wha Kang
- Department of Neurology, Asan Medical Center, University of Ulsan College of Medicine, Seoul, Korea
| | - William F Speier
- Computational Diagnostics Lab, University of California, Los Angeles, Los Angeles, California, USA.,Department of Radiology, University of California, Los Angeles, Los Angeles, California, USA
| | - Corey W Arnold
- Computational Diagnostics Lab, University of California, Los Angeles, Los Angeles, California, USA.,Department of Bioengineering, University of California, Los Angeles, Los Angeles, California, USA.,Department of Radiology, University of California, Los Angeles, Los Angeles, California, USA.,Department of Pathology, University of California, Los Angeles, Los Angeles, California, USA
| |
Collapse
|
7
|
Zhang M, Qu L, Singh P, Kalpathy-Cramer J, Rubin DL. SplitAVG: A Heterogeneity-Aware Federated Deep Learning Method for Medical Imaging. IEEE J Biomed Health Inform 2022; 26:4635-4644. [PMID: 35749336 PMCID: PMC9749741 DOI: 10.1109/jbhi.2022.3185956] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Federated learning is an emerging research paradigm for enabling collaboratively training deep learning models without sharing patient data. However, the data from different institutions are usually heterogeneous across institutions, which may reduce the performance of models trained using federated learning. In this study, we propose a novel heterogeneity-aware federated learning method, SplitAVG, to overcome the performance drops from data heterogeneity in federated learning. Unlike previous federated methods that require complex heuristic training or hyper parameter tuning, our SplitAVG leverages the simple network split and feature map concatenation strategies to encourage the federated model training an unbiased estimator of the target data distribution. We compare SplitAVG with seven state-of-the-art federated learning methods, using centrally hosted training data as the baseline on a suite of both synthetic and real-world federated datasets. We find that the performance of models trained using all the comparison federated learning methods degraded significantly with the increasing degrees of data heterogeneity. In contrast, SplitAVG method achieves comparable results to the baseline method under all heterogeneous settings, that it achieves 96.2% of the accuracy and 110.4% of the mean absolute error obtained by the baseline in a diabetic retinopathy binary classification dataset and a bone age prediction dataset, respectively, on highly heterogeneous data partitions. We conclude that SplitAVG method can effectively overcome the performance drops from variability in data distributions across institutions. Experimental results also show that SplitAVG can be adapted to different base convolutional neural networks (CNNs) and generalized to various types of medical imaging tasks. The code is publicly available at https://github.com/zm17943/SplitAVG.
Collapse
|
8
|
Large-scale investigation of deep learning approaches for ventilated lung segmentation using multi-nuclear hyperpolarized gas MRI. Sci Rep 2022; 12:10566. [PMID: 35732795 PMCID: PMC9217976 DOI: 10.1038/s41598-022-14672-2] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2022] [Accepted: 06/10/2022] [Indexed: 11/08/2022] Open
Abstract
Respiratory diseases are leading causes of mortality and morbidity worldwide. Pulmonary imaging is an essential component of the diagnosis, treatment planning, monitoring, and treatment assessment of respiratory diseases. Insights into numerous pulmonary pathologies can be gleaned from functional lung MRI techniques. These include hyperpolarized gas ventilation MRI, which enables visualization and quantification of regional lung ventilation with high spatial resolution. Segmentation of the ventilated lung is required to calculate clinically relevant biomarkers. Recent research in deep learning (DL) has shown promising results for numerous segmentation problems. Here, we evaluate several 3D convolutional neural networks to segment ventilated lung regions on hyperpolarized gas MRI scans. The dataset consists of 759 helium-3 (3He) or xenon-129 (129Xe) volumetric scans and corresponding expert segmentations from 341 healthy subjects and patients with a wide range of pathologies. We evaluated segmentation performance for several DL experimental methods via overlap, distance and error metrics and compared them to conventional segmentation methods, namely, spatial fuzzy c-means (SFCM) and K-means clustering. We observed that training on combined 3He and 129Xe MRI scans using a 3D nn-UNet outperformed other DL methods, achieving a mean ± SD Dice coefficient of 0.963 ± 0.018, average boundary Hausdorff distance of 1.505 ± 0.969 mm, Hausdorff 95th percentile of 5.754 ± 6.621 mm and relative error of 0.075 ± 0.039. Moreover, limited differences in performance were observed between 129Xe and 3He scans in the testing set. Combined training on 129Xe and 3He yielded statistically significant improvements over the conventional methods (p < 0.0001). In addition, we observed very strong correlation and agreement between DL and expert segmentations, with Pearson correlation of 0.99 (p < 0.0001) and Bland-Altman bias of - 0.8%. The DL approach evaluated provides accurate, robust and rapid segmentations of ventilated lung regions and successfully excludes non-lung regions such as the airways and artefacts. This approach is expected to eliminate the need for, or significantly reduce, subsequent time-consuming manual editing.
Collapse
|
9
|
Qu L, Zhou Y, Liang PP, Xia Y, Wang F, Adeli E, Fei-Fei L, Rubin D. Rethinking Architecture Design for Tackling Data Heterogeneity in Federated Learning. PROCEEDINGS. IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION 2022; 2022:10051-10061. [PMID: 36624800 PMCID: PMC9826695 DOI: 10.1109/cvpr52688.2022.00982] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]
Abstract
Federated learning is an emerging research paradigm enabling collaborative training of machine learning models among different organizations while keeping data private at each institution. Despite recent progress, there remain fundamental challenges such as the lack of convergence and the potential for catastrophic forgetting across real-world heterogeneous devices. In this paper, we demonstrate that self-attention-based architectures (e.g., Transformers) are more robust to distribution shifts and hence improve federated learning over heterogeneous data. Concretely, we conduct the first rigorous empirical investigation of different neural architectures across a range of federated algorithms, real-world benchmarks, and heterogeneous data splits. Our experiments show that simply replacing convolutional networks with Transformers can greatly reduce catastrophic forgetting of previous devices, accelerate convergence, and reach a better global model, especially when dealing with heterogeneous data. We release our code and pretrained models to encourage future exploration in robust architectures as an alternative to current research efforts on the optimization front.
Collapse
|
10
|
One-Shot Distributed Generalized Eigenvalue Problem (DGEP): Concept, Algorithm and Experiments. APPLIED SCIENCES-BASEL 2022. [DOI: 10.3390/app12105128] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
This paper focuses on the design of a distributed algorithm for generalized eigenvalue problems (GEPs) in one-shot communication. Since existing distributed methods for eigenvalue decomposition cannot be applied to GEP, a general one-shot distributed GEP framework is proposed. The theoretical analysis of the approximation error reveals its relation to the divergence of the data covariance, the eigenvalues of the empirical data covariance, and the number of local servers. If the symmetric data covariance has repeated eigenvalues in GEP, e.g., in canonical component analysis, we further modify the method for better convergence and prove the necessity experimentally. Numerical experiments validate the effectiveness of the proposed algorithms both on synthetic and real-world datasets.
Collapse
|
11
|
Darzidehkalani E, Ghasemi-Rad M, van Ooijen PMA. Federated Learning in Medical Imaging: Part I: Toward Multicentral Health Care Ecosystems. J Am Coll Radiol 2022; 19:969-974. [PMID: 35483439 DOI: 10.1016/j.jacr.2022.03.015] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2021] [Revised: 03/28/2022] [Accepted: 03/29/2022] [Indexed: 11/28/2022]
Abstract
With recent developments in medical imaging facilities, extensive medical imaging data are produced every day. This increasing amount of data provides an opportunity for researchers to develop data-driven methods and deliver better health care. However, data-driven models require a large amount of data to be adequately trained. Furthermore, there is always a limited amount of data available in each data center. Hence, deep learning models trained on local data centers might not reach their total performance capacity. One solution could be to accumulate all data from different centers into one center. However, data privacy regulations do not allow medical institutions to easily combine their data, and this becomes increasingly difficult when institutions from multiple countries are involved. Another solution is to use privacy-preserving algorithms, which can make use of all the data available in multiple centers while keeping the sensitive data private. Federated learning (FL) is such a mechanism that enables deploying large-scale machine learning models trained on different data centers without sharing sensitive data. In FL, instead of transferring data, a general model is trained on local data sets and transferred between data centers. FL has been identified as a promising field of research, with extensive possible uses in medical research and practice. This article introduces FL, with a comprehensive look into its concepts and recent research trends in medical imaging.
Collapse
Affiliation(s)
- Erfan Darzidehkalani
- Department of Radiation Oncology, University Medical Center Groningen, University of Groningen, Groningen, the Netherlands; Machine Learning Lab, Data Science Center in Health, University Medical Center Groningen, University of Groningen, the Netherlands.
| | - Mohammad Ghasemi-Rad
- Department of Interventional Radiology, Baylor College of Medicine, Houston, Texas
| | - P M A van Ooijen
- Department of Radiation Oncology, University Medical Center Groningen, University of Groningen, Groningen, the Netherlands; Machine Learning Lab, Data Science Center in Health, University Medical Center Groningen, University of Groningen, the Netherlands
| |
Collapse
|
12
|
An edge-driven multi-agent optimization model for infectious disease detection. APPL INTELL 2022; 52:14362-14373. [PMID: 35280108 PMCID: PMC8898659 DOI: 10.1007/s10489-021-03145-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/24/2021] [Indexed: 11/25/2022]
Abstract
This research work introduces a new intelligent framework for infectious disease detection by exploring various emerging and intelligent paradigms. We propose new deep learning architectures such as entity embedding networks, long-short term memory, and convolution neural networks, for accurately learning heterogeneous medical data in identifying disease infection. The multi-agent system is also consolidated for increasing the autonomy behaviours of the proposed framework, where each agent can easily share the derived learning outputs with the other agents in the system. Furthermore, evolutionary computation algorithms, such as memetic algorithms, and bee swarm optimization controlled the exploration of the hyper-optimization parameter space of the proposed framework. Intensive experimentation has been established on medical data. Strong results obtained confirm the superiority of our framework against the solutions that are state of the art, in both detection rate, and runtime performance, where the detection rate reaches 98% for handling real use cases.
Collapse
|
13
|
Torkzadehmahani R, Nasirigerdeh R, Blumenthal DB, Kacprowski T, List M, Matschinske J, Spaeth J, Wenke NK, Baumbach J. Privacy-Preserving Artificial Intelligence Techniques in Biomedicine. Methods Inf Med 2022; 61:e12-e27. [PMID: 35062032 PMCID: PMC9246509 DOI: 10.1055/s-0041-1740630] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Background
Artificial intelligence (AI) has been successfully applied in numerous scientific domains. In biomedicine, AI has already shown tremendous potential, e.g., in the interpretation of next-generation sequencing data and in the design of clinical decision support systems.
Objectives
However, training an AI model on sensitive data raises concerns about the privacy of individual participants. For example, summary statistics of a genome-wide association study can be used to determine the presence or absence of an individual in a given dataset. This considerable privacy risk has led to restrictions in accessing genomic and other biomedical data, which is detrimental for collaborative research and impedes scientific progress. Hence, there has been a substantial effort to develop AI methods that can learn from sensitive data while protecting individuals' privacy.
Method
This paper provides a structured overview of recent advances in privacy-preserving AI techniques in biomedicine. It places the most important state-of-the-art approaches within a unified taxonomy and discusses their strengths, limitations, and open problems.
Conclusion
As the most promising direction, we suggest combining federated machine learning as a more scalable approach with other additional privacy-preserving techniques. This would allow to merge the advantages to provide privacy guarantees in a distributed way for biomedical applications. Nonetheless, more research is necessary as hybrid approaches pose new challenges such as additional network or computation overhead.
Collapse
Affiliation(s)
- Reihaneh Torkzadehmahani
- Institute for Artificial Intelligence in Medicine and Healthcare, Technical University of Munich, Munich, Germany
| | - Reza Nasirigerdeh
- Institute for Artificial Intelligence in Medicine and Healthcare, Technical University of Munich, Munich, Germany.,Klinikum Rechts der Isar, Technical University of Munich, Munich, Germany
| | - David B Blumenthal
- Department of Artificial Intelligence in Biomedical Engineering (AIBE), Friedrich-Alexander University Erlangen-Nürnberg (FAU), Erlangen, Germany
| | - Tim Kacprowski
- Division of Data Science in Biomedicine, Peter L. Reichertz Institute for Medical Informatics of TU Braunschweig and Medical School Hannover, Braunschweig, Germany.,Braunschweig Integrated Centre of Systems Biology (BRICS), TU Braunschweig, Braunschweig, Germany
| | - Markus List
- Chair of Experimental Bioinformatics, Technical University of Munich, Munich, Germany
| | - Julian Matschinske
- E.U. Horizon2020 FeatureCloud Project Consortium.,Chair of Computational Systems Biology, University of Hamburg, Hamburg, Germany
| | - Julian Spaeth
- E.U. Horizon2020 FeatureCloud Project Consortium.,Chair of Computational Systems Biology, University of Hamburg, Hamburg, Germany
| | - Nina Kerstin Wenke
- E.U. Horizon2020 FeatureCloud Project Consortium.,Chair of Computational Systems Biology, University of Hamburg, Hamburg, Germany
| | - Jan Baumbach
- E.U. Horizon2020 FeatureCloud Project Consortium.,Chair of Computational Systems Biology, University of Hamburg, Hamburg, Germany.,Institute of Mathematics and Computer Science, University of Southern Denmark, Odense, Denmark
| |
Collapse
|
14
|
Bento M, Fantini I, Park J, Rittner L, Frayne R. Deep Learning in Large and Multi-Site Structural Brain MR Imaging Datasets. Front Neuroinform 2022; 15:805669. [PMID: 35126080 PMCID: PMC8811356 DOI: 10.3389/fninf.2021.805669] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2021] [Accepted: 12/27/2021] [Indexed: 12/22/2022] Open
Abstract
Large, multi-site, heterogeneous brain imaging datasets are increasingly required for the training, validation, and testing of advanced deep learning (DL)-based automated tools, including structural magnetic resonance (MR) image-based diagnostic and treatment monitoring approaches. When assembling a number of smaller datasets to form a larger dataset, understanding the underlying variability between different acquisition and processing protocols across the aggregated dataset (termed “batch effects”) is critical. The presence of variation in the training dataset is important as it more closely reflects the true underlying data distribution and, thus, may enhance the overall generalizability of the tool. However, the impact of batch effects must be carefully evaluated in order to avoid undesirable effects that, for example, may reduce performance measures. Batch effects can result from many sources, including differences in acquisition equipment, imaging technique and parameters, as well as applied processing methodologies. Their impact, both beneficial and adversarial, must be considered when developing tools to ensure that their outputs are related to the proposed clinical or research question (i.e., actual disease-related or pathological changes) and are not simply due to the peculiarities of underlying batch effects in the aggregated dataset. We reviewed applications of DL in structural brain MR imaging that aggregated images from neuroimaging datasets, typically acquired at multiple sites. We examined datasets containing both healthy control participants and patients that were acquired using varying acquisition protocols. First, we discussed issues around Data Access and enumerated the key characteristics of some commonly used publicly available brain datasets. Then we reviewed methods for correcting batch effects by exploring the two main classes of approaches: Data Harmonization that uses data standardization, quality control protocols or other similar algorithms and procedures to explicitly understand and minimize unwanted batch effects; and Domain Adaptation that develops DL tools that implicitly handle the batch effects by using approaches to achieve reliable and robust results. In this narrative review, we highlighted the advantages and disadvantages of both classes of DL approaches, and described key challenges to be addressed in future studies.
Collapse
Affiliation(s)
- Mariana Bento
- Electrical and Software Engineering, Schulich School of Engineering, University of Calgary, Calgary, AB, Canada
- Hotchkiss Brain Institute, University of Calgary, Calgary, AB, Canada
- Calgary Image Processing and Analysis Centre, Foothills Medical Centre, Calgary, AB, Canada
- *Correspondence: Mariana Bento
| | - Irene Fantini
- School of Electrical and Computer Engineering, University of Campinas, Campinas, Brazil
| | - Justin Park
- Hotchkiss Brain Institute, University of Calgary, Calgary, AB, Canada
- Calgary Image Processing and Analysis Centre, Foothills Medical Centre, Calgary, AB, Canada
- Radiology and Clinical Neurosciences, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
| | - Leticia Rittner
- School of Electrical and Computer Engineering, University of Campinas, Campinas, Brazil
| | - Richard Frayne
- Hotchkiss Brain Institute, University of Calgary, Calgary, AB, Canada
- Calgary Image Processing and Analysis Centre, Foothills Medical Centre, Calgary, AB, Canada
- Radiology and Clinical Neurosciences, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
- Seaman Family MR Research Centre, Foothills Medical Centre, Calgary, AB, Canada
| |
Collapse
|
15
|
Designing clinically translatable artificial intelligence systems for high-dimensional medical imaging. NAT MACH INTELL 2021. [DOI: 10.1038/s42256-021-00399-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]
|
16
|
Çallı E, Sogancioglu E, van Ginneken B, van Leeuwen KG, Murphy K. Deep learning for chest X-ray analysis: A survey. Med Image Anal 2021; 72:102125. [PMID: 34171622 DOI: 10.1016/j.media.2021.102125] [Citation(s) in RCA: 95] [Impact Index Per Article: 31.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2021] [Revised: 05/17/2021] [Accepted: 05/27/2021] [Indexed: 12/14/2022]
Abstract
Recent advances in deep learning have led to a promising performance in many medical image analysis tasks. As the most commonly performed radiological exam, chest radiographs are a particularly important modality for which a variety of applications have been researched. The release of multiple, large, publicly available chest X-ray datasets in recent years has encouraged research interest and boosted the number of publications. In this paper, we review all studies using deep learning on chest radiographs published before March 2021, categorizing works by task: image-level prediction (classification and regression), segmentation, localization, image generation and domain adaptation. Detailed descriptions of all publicly available datasets are included and commercial systems in the field are described. A comprehensive discussion of the current state of the art is provided, including caveats on the use of public datasets, the requirements of clinically useful systems and gaps in the current literature.
Collapse
Affiliation(s)
- Erdi Çallı
- Radboud University Medical Center, Institute for Health Sciences, Department of Medical Imaging, Nijmegen, the Netherlands.
| | - Ecem Sogancioglu
- Radboud University Medical Center, Institute for Health Sciences, Department of Medical Imaging, Nijmegen, the Netherlands
| | - Bram van Ginneken
- Radboud University Medical Center, Institute for Health Sciences, Department of Medical Imaging, Nijmegen, the Netherlands
| | - Kicky G van Leeuwen
- Radboud University Medical Center, Institute for Health Sciences, Department of Medical Imaging, Nijmegen, the Netherlands
| | - Keelin Murphy
- Radboud University Medical Center, Institute for Health Sciences, Department of Medical Imaging, Nijmegen, the Netherlands
| |
Collapse
|
17
|
Kirienko M, Sollini M, Ninatti G, Loiacono D, Giacomello E, Gozzi N, Amigoni F, Mainardi L, Lanzi PL, Chiti A. Distributed learning: a reliable privacy-preserving strategy to change multicenter collaborations using AI. Eur J Nucl Med Mol Imaging 2021; 48:3791-3804. [PMID: 33847779 PMCID: PMC8041944 DOI: 10.1007/s00259-021-05339-7] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2021] [Accepted: 03/24/2021] [Indexed: 12/12/2022]
Abstract
Purpose The present scoping review aims to assess the non-inferiority of distributed learning over centrally and locally trained machine learning (ML) models in medical applications. Methods We performed a literature search using the term “distributed learning” OR “federated learning” in the PubMed/MEDLINE and EMBASE databases. No start date limit was used, and the search was extended until July 21, 2020. We excluded articles outside the field of interest; guidelines or expert opinion, review articles and meta-analyses, editorials, letters or commentaries, and conference abstracts; articles not in the English language; and studies not using medical data. Selected studies were classified and analysed according to their aim(s). Results We included 26 papers aimed at predicting one or more outcomes: namely risk, diagnosis, prognosis, and treatment side effect/adverse drug reaction. Distributed learning was compared to centralized or localized training in 21/26 and 14/26 selected papers, respectively. Regardless of the aim, the type of input, the method, and the classifier, distributed learning performed close to centralized training, but two experiments focused on diagnosis. In all but 2 cases, distributed learning outperformed locally trained models. Conclusion Distributed learning resulted in a reliable strategy for model development; indeed, it performed equally to models trained on centralized datasets. Sensitive data can get preserved since they are not shared for model development. Distributed learning constitutes a promising solution for ML-based research and practice since large, diverse datasets are crucial for success.
Collapse
Affiliation(s)
- Margarita Kirienko
- Fondazione IRCCS Istituto Nazionale dei Tumori, Milan, Italy.,Department of Biomedical Sciences, Humanitas University, Pieve Emanuele, Milan, Italy
| | - Martina Sollini
- Department of Biomedical Sciences, Humanitas University, Pieve Emanuele, Milan, Italy. .,IRCCS Humanitas Research Hospital, Rozzano, Milan, Italy.
| | - Gaia Ninatti
- Department of Biomedical Sciences, Humanitas University, Pieve Emanuele, Milan, Italy
| | | | | | - Noemi Gozzi
- IRCCS Humanitas Research Hospital, Rozzano, Milan, Italy
| | | | | | | | - Arturo Chiti
- Department of Biomedical Sciences, Humanitas University, Pieve Emanuele, Milan, Italy.,IRCCS Humanitas Research Hospital, Rozzano, Milan, Italy
| |
Collapse
|
18
|
Sarma KV, Harmon S, Sanford T, Roth HR, Xu Z, Tetreault J, Xu D, Flores MG, Raman AG, Kulkarni R, Wood BJ, Choyke PL, Priester AM, Marks LS, Raman SS, Enzmann D, Turkbey B, Speier W, Arnold CW. Federated learning improves site performance in multicenter deep learning without data sharing. J Am Med Inform Assoc 2021; 28:1259-1264. [PMID: 33537772 PMCID: PMC8200268 DOI: 10.1093/jamia/ocaa341] [Citation(s) in RCA: 57] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2020] [Revised: 11/30/2020] [Accepted: 12/23/2020] [Indexed: 02/06/2023] Open
Abstract
Objective To demonstrate enabling multi-institutional training without centralizing or sharing the underlying physical data via federated learning (FL). Materials and Methods Deep learning models were trained at each participating institution using local clinical data, and an additional model was trained using FL across all of the institutions. Results We found that the FL model exhibited superior performance and generalizability to the models trained at single institutions, with an overall performance level that was significantly better than that of any of the institutional models alone when evaluated on held-out test sets from each institution and an outside challenge dataset. Discussion The power of FL was successfully demonstrated across 3 academic institutions while avoiding the privacy risk associated with the transfer and pooling of patient data. Conclusion Federated learning is an effective methodology that merits further study to enable accelerated development of models across institutions, enabling greater generalizability in clinical use.
Collapse
Affiliation(s)
- Karthik V Sarma
- Department of Radiological Sciences, University of California, Los Angeles, Los Angeles, California, USA.,Department of Bioengineering, University of California, Los Angeles, Los Angeles, California, USA
| | - Stephanie Harmon
- National Cancer Institute, National Institutes of Health, Bethesda, Maryland, USA.,Clinical Monitoring Research Program Directorate, Frederick National Laboratory for Cancer Research, Frederick, Maryland, USA
| | - Thomas Sanford
- Department of Urology, SUNY Upstate Medical Center, Syracuse, New York, USA
| | | | - Ziyue Xu
- NVIDIA Corporation, Bethesda, Maryland, USA
| | | | - Daguang Xu
- NVIDIA Corporation, Bethesda, Maryland, USA
| | | | - Alex G Raman
- Department of Radiological Sciences, University of California, Los Angeles, Los Angeles, California, USA
| | - Rushikesh Kulkarni
- Department of Radiological Sciences, University of California, Los Angeles, Los Angeles, California, USA
| | - Bradford J Wood
- National Cancer Institute, National Institutes of Health, Bethesda, Maryland, USA
| | - Peter L Choyke
- National Cancer Institute, National Institutes of Health, Bethesda, Maryland, USA
| | - Alan M Priester
- Department of Bioengineering, University of California, Los Angeles, Los Angeles, California, USA.,Department of Urology, University of California, Los Angeles, Los Angeles, California, USA
| | - Leonard S Marks
- Department of Urology, University of California, Los Angeles, Los Angeles, California, USA
| | - Steven S Raman
- Department of Radiological Sciences, University of California, Los Angeles, Los Angeles, California, USA
| | - Dieter Enzmann
- Department of Radiological Sciences, University of California, Los Angeles, Los Angeles, California, USA
| | - Baris Turkbey
- National Cancer Institute, National Institutes of Health, Bethesda, Maryland, USA
| | - William Speier
- Department of Radiological Sciences, University of California, Los Angeles, Los Angeles, California, USA
| | - Corey W Arnold
- Department of Radiological Sciences, University of California, Los Angeles, Los Angeles, California, USA.,Department of Bioengineering, University of California, Los Angeles, Los Angeles, California, USA.,Department of Pathology and Laboratory Medicine University of California, Los Angeles, Los Angeles, California, USA
| |
Collapse
|
19
|
Harrison JH, Gilbertson JR, Hanna MG, Olson NH, Seheult JN, Sorace JM, Stram MN. Introduction to Artificial Intelligence and Machine Learning for Pathology. Arch Pathol Lab Med 2021; 145:1228-1254. [PMID: 33493264 DOI: 10.5858/arpa.2020-0541-cp] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/10/2020] [Indexed: 11/06/2022]
Abstract
CONTEXT.— Recent developments in machine learning have stimulated intense interest in software that may augment or replace human experts. Machine learning may impact pathology practice by offering new capabilities in analysis, interpretation, and outcomes prediction using images and other data. The principles of operation and management of machine learning systems are unfamiliar to pathologists, who anticipate a need for additional education to be effective as expert users and managers of the new tools. OBJECTIVE.— To provide a background on machine learning for practicing pathologists, including an overview of algorithms, model development, and performance evaluation; to examine the current status of machine learning in pathology and consider possible roles and requirements for pathologists in local deployment and management of machine learning systems; and to highlight existing challenges and gaps in deployment methodology and regulation. DATA SOURCES.— Sources include the biomedical and engineering literature, white papers from professional organizations, government reports, electronic resources, and authors' experience in machine learning. References were chosen when possible for accessibility to practicing pathologists without specialized training in mathematics, statistics, or software development. CONCLUSIONS.— Machine learning offers an array of techniques that in recent published results show substantial promise. Data suggest that human experts working with machine learning tools outperform humans or machines separately, but the optimal form for this combination in pathology has not been established. Significant questions related to the generalizability of machine learning systems, local site verification, and performance monitoring remain to be resolved before a consensus on best practices and a regulatory environment can be established.
Collapse
Affiliation(s)
- James H Harrison
- From the Department of Pathology, University of Virginia School of Medicine, Charlottesville (Harrison)
| | - John R Gilbertson
- the Departments of Biomedical Informatics and Pathology, University of Pittsburgh, Pittsburgh, Pennsylvania (Gilbertson)
| | - Matthew G Hanna
- the Department of Pathology, Memorial Sloan Kettering Cancer Center, New York, New York (Hanna)
| | - Niels H Olson
- the Defense Innovation Unit, Mountain View, California (Olson).,the Department of Pathology, Uniformed Services University, Bethesda, Maryland (Olson)
| | - Jansen N Seheult
- the Department of Pathology, University of Pittsburgh, and Vitalant Specialty Labs, Pittsburgh, Pennsylvania (Seheult)
| | - James M Sorace
- the US Department of Health and Human Services, retired, Lutherville, Maryland (Sorace)
| | - Michelle N Stram
- the Department of Forensic Medicine, New York University, and Office of Chief Medical Examiner, New York, New York (Stram)
| |
Collapse
|
20
|
Chang K, Singh P, Vepakomma P, Poirot MG, Raskar R, Rubin DL, Kalpathy-Cramer J. Privacy-preserving collaborative deep learning methods for multiinstitutional training without sharing patient data. Artif Intell Med 2021. [DOI: 10.1016/b978-0-12-821259-2.00006-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
|
21
|
Tajgardoon M, Visweswaran S. Using Distribution Divergence to Predict Changes in the Performance of Clinical Predictive Models. Artif Intell Med 2021. [DOI: 10.1007/978-3-030-77211-6_14] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
22
|
Aledhari M, Razzak R, Parizi RM, Saeed F. Federated Learning: A Survey on Enabling Technologies, Protocols, and Applications. IEEE ACCESS : PRACTICAL INNOVATIONS, OPEN SOLUTIONS 2020; 8:140699-140725. [PMID: 32999795 PMCID: PMC7523633 DOI: 10.1109/access.2020.3013541] [Citation(s) in RCA: 39] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
This paper provides a comprehensive study of Federated Learning (FL) with an emphasis on enabling software and hardware platforms, protocols, real-life applications and use-cases. FL can be applicable to multiple domains but applying it to different industries has its own set of obstacles. FL is known as collaborative learning, where algorithm(s) get trained across multiple devices or servers with decentralized data samples without having to exchange the actual data. This approach is radically different from other more established techniques such as getting the data samples uploaded to servers or having data in some form of distributed infrastructure. FL on the other hand generates more robust models without sharing data, leading to privacy-preserved solutions with higher security and access privileges to data. This paper starts by providing an overview of FL. Then, it gives an overview of technical details that pertain to FL enabling technologies, protocols, and applications. Compared to other survey papers in the field, our objective is to provide a more thorough summary of the most relevant protocols, platforms, and real-life use-cases of FL to enable data scientists to build better privacy-preserving solutions for industries in critical need of FL. We also provide an overview of key challenges presented in the recent literature and provide a summary of related research work. Moreover, we explore both the challenges and advantages of FL and present detailed service use-cases to illustrate how different architectures and protocols that use FL can fit together to deliver desired results.
Collapse
Affiliation(s)
- Mohammed Aledhari
- College of Computing and Software Engineering, Kennesaw State University, Marietta, GA, 30060 USA
| | - Rehma Razzak
- College of Computing and Software Engineering, Kennesaw State University, Marietta, GA, 30060 USA
| | - Reza M Parizi
- College of Computing and Software Engineering, Kennesaw State University, Marietta, GA, 30060 USA
| | - Fahad Saeed
- School of Computing and Information Sciences, Florida International University, Miami, FL, 33199 USA
| |
Collapse
|