1
|
Benouis M, Andre E, Can YS. Balancing Between Privacy and Utility for Affect Recognition Using Multitask Learning in Differential Privacy-Added Federated Learning Settings: Quantitative Study. JMIR Ment Health 2024; 11:e60003. [PMID: 39714484 DOI: 10.2196/60003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/28/2024] [Revised: 08/21/2024] [Accepted: 08/22/2024] [Indexed: 12/24/2024] Open
Abstract
Background The rise of wearable sensors marks a significant development in the era of affective computing. Their popularity is continuously increasing, and they have the potential to improve our understanding of human stress. A fundamental aspect within this domain is the ability to recognize perceived stress through these unobtrusive devices. Objective This study aims to enhance the performance of emotion recognition using multitask learning (MTL), a technique extensively explored across various machine learning tasks, including affective computing. By leveraging the shared information among related tasks, we seek to augment the accuracy of emotion recognition while confronting the privacy threats inherent in the physiological data captured by these sensors. Methods To address the privacy concerns associated with the sensitive data collected by wearable sensors, we proposed a novel framework that integrates differential privacy and federated learning approaches with MTL. This framework was designed to efficiently identify mental stress while preserving private identity information. Through this approach, we aimed to enhance the performance of emotion recognition tasks while preserving user privacy. Results Comprehensive evaluations of our framework were conducted using 2 prominent public datasets. The results demonstrate a significant improvement in emotion recognition accuracy, achieving a rate of 90%. Furthermore, our approach effectively mitigates privacy risks, as evidenced by limiting reidentification accuracies to 47%. Conclusions This study presents a promising approach to advancing emotion recognition capabilities while addressing privacy concerns in the context of empathetic sensors. By integrating MTL with differential privacy and federated learning, we have demonstrated the potential to achieve high levels of accuracy in emotion recognition while ensuring the protection of user privacy. This research contributes to the ongoing efforts to use affective computing in a privacy-aware and ethical manner.
Collapse
Affiliation(s)
- Mohamed Benouis
- Faculty of Applied Computer Science, Augsburg University, Augsburg, Germany
| | - Elisabeth Andre
- Faculty of Applied Computer Science, Augsburg University, Augsburg, Germany
| | - Yekta Said Can
- Faculty of Applied Computer Science, Augsburg University, Augsburg, Germany
| |
Collapse
|
2
|
Qiu W, Quan C, Zhu L, Yu Y, Wang Z, Ma Y, Sun M, Chang Y, Qian K, Hu B, Yamamoto Y, Schuller BW. Heart Sound Abnormality Detection From Multi-Institutional Collaboration: Introducing a Federated Learning Framework. IEEE Trans Biomed Eng 2024; 71:2802-2813. [PMID: 38700959 DOI: 10.1109/tbme.2024.3393557] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/05/2024]
Abstract
OBJECTIVE Early diagnosis of cardiovascular diseases is a crucial task in medical practice. With the application of computer audition in the healthcare field, artificial intelligence (AI) has been applied to clinical non-invasive intelligent auscultation of heart sounds to provide rapid and effective pre-screening. However, AI models generally require large amounts of data which may cause privacy issues. Unfortunately, it is difficult to collect large amounts of healthcare data from a single centre. METHODS In this study, we propose federated learning (FL) optimisation strategies for the practical application in multi-centre institutional heart sound databases. The horizontal FL is mainly employed to tackle the privacy problem by aligning the feature spaces of FL participating institutions without information leakage. In addition, techniques based on deep learning have poor interpretability due to their "black-box" property, which limits the feasibility of AI in real medical data. To this end, vertical FL is utilised to address the issues of model interpretability and data scarcity. CONCLUSION Experimental results demonstrate that, the proposed FL framework can achieve good performance for heart sound abnormality detection by taking the personal privacy protection into account. Moreover, using the federated feature space is beneficial to balance the interpretability of the vertical FL and the privacy of the data. SIGNIFICANCE This work realises the potential of FL from research to clinical practice, and is expected to have extensive application in the federated smart medical system.
Collapse
|
3
|
Qiu W, Quan C, Yu Y, Kara E, Qian K, Hu B, Schuller BW, Yamamoto Y. Federated Abnormal Heart Sound Detection with Weak to No Labels. CYBORG AND BIONIC SYSTEMS 2024; 5:0152. [PMID: 39257898 PMCID: PMC11382922 DOI: 10.34133/cbsystems.0152] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2024] [Accepted: 06/14/2024] [Indexed: 09/12/2024] Open
Abstract
Cardiovascular diseases are a prominent cause of mortality, emphasizing the need for early prevention and diagnosis. Utilizing artificial intelligence (AI) models, heart sound analysis emerges as a noninvasive and universally applicable approach for assessing cardiovascular health conditions. However, real-world medical data are dispersed across medical institutions, forming "data islands" due to data sharing limitations for security reasons. To this end, federated learning (FL) has been extensively employed in the medical field, which can effectively model across multiple institutions. Additionally, conventional supervised classification methods require fully labeled data classes, e.g., binary classification requires labeling of positive and negative samples. Nevertheless, the process of labeling healthcare data is time-consuming and labor-intensive, leading to the possibility of mislabeling negative samples. In this study, we validate an FL framework with a naive positive-unlabeled (PU) learning strategy. Semisupervised FL model can directly learn from a limited set of positive samples and an extensive pool of unlabeled samples. Our emphasis is on vertical-FL to enhance collaboration across institutions with different medical record feature spaces. Additionally, our contribution extends to feature importance analysis, where we explore 6 methods and provide practical recommendations for detecting abnormal heart sounds. The study demonstrated an impressive accuracy of 84%, comparable to outcomes in supervised learning, thereby advancing the application of FL in abnormal heart sound detection.
Collapse
Affiliation(s)
- Wanyong Qiu
- Key Laboratory of Brain Health Intelligent Evaluation and Intervention, Ministry of Education (Beijing Institute of Technology), Beijing 100081, China
- School of Medical Technology and School of Computer Science, Beijing Institute of Technology, Beijing 100081, China
| | - Chen Quan
- Key Laboratory of Brain Health Intelligent Evaluation and Intervention, Ministry of Education (Beijing Institute of Technology), Beijing 100081, China
- School of Medical Technology and School of Computer Science, Beijing Institute of Technology, Beijing 100081, China
| | - Yongzi Yu
- Key Laboratory of Brain Health Intelligent Evaluation and Intervention, Ministry of Education (Beijing Institute of Technology), Beijing 100081, China
- School of Medical Technology and School of Computer Science, Beijing Institute of Technology, Beijing 100081, China
| | - Eda Kara
- Key Laboratory of Brain Health Intelligent Evaluation and Intervention, Ministry of Education (Beijing Institute of Technology), Beijing 100081, China
- School of Medical Technology and School of Computer Science, Beijing Institute of Technology, Beijing 100081, China
| | - Kun Qian
- Key Laboratory of Brain Health Intelligent Evaluation and Intervention, Ministry of Education (Beijing Institute of Technology), Beijing 100081, China
- School of Medical Technology and School of Computer Science, Beijing Institute of Technology, Beijing 100081, China
| | - Bin Hu
- Key Laboratory of Brain Health Intelligent Evaluation and Intervention, Ministry of Education (Beijing Institute of Technology), Beijing 100081, China
- School of Medical Technology and School of Computer Science, Beijing Institute of Technology, Beijing 100081, China
| | - Björn W Schuller
- CHI-Chair of Health Informatics, MRI, Technical University of Munich, 80290 Munich, Germany
- GLAM-Group on Language, Audio, & Music, Imperial College London, London SW7 2AZ, UK
| | - Yoshiharu Yamamoto
- Edicational Physiology Laboratory, Graduate School of Education, The University of Tokyo, Tokyo 113-0033, Japan
| |
Collapse
|
4
|
Qiu W, Feng Y, Li Y, Chang Y, Qian K, Hu B, Yamamoto Y, Schuller BW. Fed-MStacking: Heterogeneous Federated Learning With Stacking Misaligned Labels for Abnormal Heart Sound Detection. IEEE J Biomed Health Inform 2024; 28:5055-5066. [PMID: 39012744 DOI: 10.1109/jbhi.2024.3428512] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/18/2024]
Abstract
Ubiquitous sensing has been widely applied in smart healthcare, providing an opportunity for intelligent heart sound auscultation. However, smart devices contain sensitive information, raising user privacy concerns. To this end, federated learning (FL) has been adopted as an effective solution, enabling decentralised learning without data sharing, thus preserving data privacy in the Internet of Health Things (IoHT). Nevertheless, traditional FL requires the same architectural models to be trained across local clients and global servers, leading to a lack of model heterogeneity and client personalisation. For medical institutions with private data clients, this study proposes Fed-MStacking, a heterogeneous FL framework that incorporates a stacking ensemble learning strategy to support clients in building their own models. The secondary objective of this study is to address scenarios involving local clients with data characterised by inconsistent labelling. Specifically, the local client contains only one case type, and the data cannot be shared within or outside the institution. To train a global multi-class classifier, we aggregate missing class information from all clients at each institution and build meta-data, which then participates in FL training via a meta-learner. We apply the proposed framework to a multi-institutional heart sound database. The experiments utilise random forests (RFs), feedforward neural networks (FNNs), and convolutional neural networks (CNNs) as base classifiers. The results show that the heterogeneous stacking of local models performs better compared to homogeneous stacking.
Collapse
|
5
|
Ferrara E. Large Language Models for Wearable Sensor-Based Human Activity Recognition, Health Monitoring, and Behavioral Modeling: A Survey of Early Trends, Datasets, and Challenges. SENSORS (BASEL, SWITZERLAND) 2024; 24:5045. [PMID: 39124092 PMCID: PMC11314694 DOI: 10.3390/s24155045] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/10/2024] [Revised: 07/29/2024] [Accepted: 07/31/2024] [Indexed: 08/12/2024]
Abstract
The proliferation of wearable technology enables the generation of vast amounts of sensor data, offering significant opportunities for advancements in health monitoring, activity recognition, and personalized medicine. However, the complexity and volume of these data present substantial challenges in data modeling and analysis, which have been addressed with approaches spanning time series modeling to deep learning techniques. The latest frontier in this domain is the adoption of large language models (LLMs), such as GPT-4 and Llama, for data analysis, modeling, understanding, and human behavior monitoring through the lens of wearable sensor data. This survey explores the current trends and challenges in applying LLMs for sensor-based human activity recognition and behavior modeling. We discuss the nature of wearable sensor data, the capabilities and limitations of LLMs in modeling them, and their integration with traditional machine learning techniques. We also identify key challenges, including data quality, computational requirements, interpretability, and privacy concerns. By examining case studies and successful applications, we highlight the potential of LLMs in enhancing the analysis and interpretation of wearable sensor data. Finally, we propose future directions for research, emphasizing the need for improved preprocessing techniques, more efficient and scalable models, and interdisciplinary collaboration. This survey aims to provide a comprehensive overview of the intersection between wearable sensor data and LLMs, offering insights into the current state and future prospects of this emerging field.
Collapse
Affiliation(s)
- Emilio Ferrara
- Thomas Lord Department of Computer Science, University of Southern California, Los Angeles, CA 90007, USA;
- Information Sciences Institute, School of Advanced Computing, University of Southern California, Los Angeles, CA 90007, USA
| |
Collapse
|
6
|
De Ridder D, Siddiqi MA, Dauwels J, Serdijn WA, Strydis C. NeuroDots: From Single-Target to Brain-Network Modulation: Why and What Is Needed? Neuromodulation 2024; 27:711-729. [PMID: 38639704 DOI: 10.1016/j.neurom.2024.01.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2023] [Revised: 11/05/2023] [Accepted: 01/10/2024] [Indexed: 04/20/2024]
Abstract
OBJECTIVES Current techniques in brain stimulation are still largely based on a phrenologic approach that a single brain target can treat a brain disorder. Nevertheless, meta-analyses of brain implants indicate an overall success rate of 50% improvement in 50% of patients, irrespective of the brain-related disorder. Thus, there is still a large margin for improvement. The goal of this manuscript is to 1) develop a general theoretical framework of brain functioning that is amenable to surgical neuromodulation, and 2) describe the engineering requirements of the next generation of implantable brain stimulators that follow from this theoretic model. MATERIALS AND METHODS A neuroscience and engineering literature review was performed to develop a universal theoretical model of brain functioning and dysfunctioning amenable to surgical neuromodulation. RESULTS Even though a single target can modulate an entire network, research in network science reveals that many brain disorders are the consequence of maladaptive interactions among multiple networks rather than a single network. Consequently, targeting the main connector hubs of those multiple interacting networks involved in a brain disorder is theoretically more beneficial. We, thus, envision next-generation network implants that will rely on distributed, multisite neuromodulation targeting correlated and anticorrelated interacting brain networks, juxtaposing alternative implant configurations, and finally providing solid recommendations for the realization of such implants. In doing so, this study pinpoints the potential shortcomings of other similar efforts in the field, which somehow fall short of the requirements. CONCLUSION The concept of network stimulation holds great promise as a universal approach for treating neurologic and psychiatric disorders.
Collapse
Affiliation(s)
- Dirk De Ridder
- Section of Neurosurgery, Department of Surgical Sciences, Dunedin School of Medicine, University of Otago, Dunedin, New Zealand.
| | - Muhammad Ali Siddiqi
- Department of Electrical Engineering, Lahore University of Management Sciences, Lahore, Pakistan; Neuroscience Department, Erasmus Medical Center, Rotterdam, The Netherlands; Quantum and Computer Engineering Department, Delft University of Technology, Delft, The Netherlands
| | - Justin Dauwels
- Microelectronics Department, Delft University of Technology, Delft, The Netherlands
| | - Wouter A Serdijn
- Neuroscience Department, Erasmus Medical Center, Rotterdam, The Netherlands; Section Bioelectronics, Delft University of Technology, Delft, The Netherlands
| | - Christos Strydis
- Neuroscience Department, Erasmus Medical Center, Rotterdam, The Netherlands; Quantum and Computer Engineering Department, Delft University of Technology, Delft, The Netherlands
| |
Collapse
|
7
|
Li J, Washington P. A Comparison of Personalized and Generalized Approaches to Emotion Recognition Using Consumer Wearable Devices: Machine Learning Study. JMIR AI 2024; 3:e52171. [PMID: 38875573 PMCID: PMC11127131 DOI: 10.2196/52171] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/25/2023] [Revised: 02/19/2024] [Accepted: 03/23/2024] [Indexed: 06/16/2024]
Abstract
BACKGROUND There are a wide range of potential adverse health effects, ranging from headaches to cardiovascular disease, associated with long-term negative emotions and chronic stress. Because many indicators of stress are imperceptible to observers, the early detection of stress remains a pressing medical need, as it can enable early intervention. Physiological signals offer a noninvasive method for monitoring affective states and are recorded by a growing number of commercially available wearables. OBJECTIVE We aim to study the differences between personalized and generalized machine learning models for 3-class emotion classification (neutral, stress, and amusement) using wearable biosignal data. METHODS We developed a neural network for the 3-class emotion classification problem using data from the Wearable Stress and Affect Detection (WESAD) data set, a multimodal data set with physiological signals from 15 participants. We compared the results between a participant-exclusive generalized, a participant-inclusive generalized, and a personalized deep learning model. RESULTS For the 3-class classification problem, our personalized model achieved an average accuracy of 95.06% and an F1-score of 91.71%; our participant-inclusive generalized model achieved an average accuracy of 66.95% and an F1-score of 42.50%; and our participant-exclusive generalized model achieved an average accuracy of 67.65% and an F1-score of 43.05%. CONCLUSIONS Our results emphasize the need for increased research in personalized emotion recognition models given that they outperform generalized models in certain contexts. We also demonstrate that personalized machine learning models for emotion classification are viable and can achieve high performance.
Collapse
Affiliation(s)
- Joe Li
- Information and Computer Sciences, University of Hawai`i at Mānoa, Honolulu, HI, United States
| | - Peter Washington
- Information and Computer Sciences, University of Hawai`i at Mānoa, Honolulu, HI, United States
| |
Collapse
|
8
|
Shin H, Ryu K, Kim JY, Lee S. Application of privacy protection technology to healthcare big data. Digit Health 2024; 10:20552076241282242. [PMID: 39502481 PMCID: PMC11536567 DOI: 10.1177/20552076241282242] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2023] [Accepted: 08/23/2024] [Indexed: 11/08/2024] Open
Abstract
With the advent of the big data era, data security issues are becoming more common. Healthcare organizations have more data to use for analysis, but they lose money every year due to their inability to prevent data leakage. To overcome these challenges, research on the use of data protection technologies in healthcare is actively underway, particularly research on state-of-the-art technologies, such as federated learning announced by Google and blockchain technology, which has recently attracted attention. To learn about these research efforts, we explored the research, methods, and limitations of the most widely used privacy technologies. After investigating related papers published between 2017 and 2023 and identifying the latest technology trends, we selected related papers and reviewed related technologies. In the process, four technologies were the focus of this study: blockchain, federated learning, isomorphic encryption, and differential privacy. Overall, our analysis provides researchers with insight into privacy technology research by suggesting the limitations of current privacy technologies and suggesting future research directions.
Collapse
Affiliation(s)
- Hyunah Shin
- Department of Healthcare Data Science Center, Konyang University Hospital, Daejeon, Republic of Korea
| | - Kyeongmin Ryu
- Department of Healthcare Data Science Center, Konyang University Hospital, Daejeon, Republic of Korea
| | - Jong-Yeup Kim
- Department of Healthcare Data Science Center, Konyang University Hospital, Daejeon, Republic of Korea
- Department of Otorhinolaryngology—Head and Neck Surgery, Konyang University College of Medicine, Daejeon, Republic of Korea
- Department of Biomedical Informatics, Konyang University College of Medicine, Daejeon, Republic of Korea
| | - Suehyun Lee
- College of IT Convergence, Gachon University, Seongnam, Republic of Korea
| |
Collapse
|
9
|
Ghadi YY, Mazhar T, Shah SFA, Haq I, Ahmad W, Ouahada K, Hamam H. Integration of federated learning with IoT for smart cities applications, challenges, and solutions. PeerJ Comput Sci 2023; 9:e1657. [PMID: 38192447 PMCID: PMC10773731 DOI: 10.7717/peerj-cs.1657] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2023] [Accepted: 09/29/2023] [Indexed: 01/10/2024]
Abstract
In the past few years, privacy concerns have grown, making the financial models of businesses more vulnerable to attack. In many cases, it is hard to emphasize the importance of monitoring things in real-time with data from Internet of Things (IoT) devices. The people who make the IoT devices and those who use them face big problems when they try to use Artificial Intelligence (AI) techniques in real-world applications, where data must be collected and processed at a central location. Federated learning (FL) has made a decentralized, cooperative AI system that can be used by many IoT apps that use AI. It is possible because it can train AI on IoT devices that are spread out and do not need to share data. FL allows local models to be trained on local data and share their knowledge to improve a global model. Also, shared learning allows models from all over the world to be trained using data from all over the world. This article looks at the IoT in all of its forms, including "smart" businesses, "smart" cities, "smart" transportation, and "smart" healthcare. This study looks at the safety problems that the federated learning with IoT (FL-IoT) area has brought to market. This research is needed to explore because federated learning is a new technique, and a small amount of work is done on challenges faced during integration with IoT. This research also helps in the real world in such applications where encrypted data must be sent from one place to another. Researchers and graduate students are the audience of our article.
Collapse
Affiliation(s)
- Yazeed Yasin Ghadi
- Department of Computer Science and Software Engineering, Al Ain University, Abu Dhabi, UAE
| | - Tehseen Mazhar
- Department of Computer Science, Virtual University of Pakistan, Lahore, Punjab, Pakistan
| | - Syed Faisal Abbas Shah
- Department of Computer Science, Virtual University of Pakistan, Lahore, Punjab, Pakistan
| | - Inayatul Haq
- School of Electrical and Information Engineering, Zhengzhou University, Zhengzhou, Henan, China
| | - Wasim Ahmad
- Department of Computer Science and Information Technology, University of Malakand, Chakdara, Dir, Pakistan
| | - Khmaies Ouahada
- School of Electrical Engineering, Department of Electrical and Electronic Engineering Science, University of Johannesburg, Johannesburg, South Africa
| | - Habib Hamam
- School of Electrical Engineering, Department of Electrical and Electronic Engineering Science, University of Johannesburg, Johannesburg, South Africa
- Commune d’Akanda, International Institute of Technology and Management, BP Libreville, Estuaire, Gabon
- Faculty of Engineering, University of Moncton, Moncton, New Brunswick, Canada
- College of Computer Science and Engineering, University of Ha’il, Ha’il, Saudi Arabia
- Production & Skills Development, Spectrum of Knowledge Production & Skills Development, Sfax, Tunisia
| |
Collapse
|
10
|
Allareddy V, Rampa S, Venugopalan SR, Elnagar MH, Lee MK, Oubaidin M, Yadav S. Blockchain technology and federated machine learning for collaborative initiatives in orthodontics and craniofacial health. Orthod Craniofac Res 2023; 26 Suppl 1:118-123. [PMID: 37036565 DOI: 10.1111/ocr.12662] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2023] [Revised: 03/25/2023] [Accepted: 03/29/2023] [Indexed: 04/11/2023]
Abstract
There is a paucity of largescale collaborative initiatives in orthodontics and craniofacial health. Such nationally representative projects would yield findings that are generalizable. The lack of large-scale collaborative initiatives in the field of orthodontics creates a deficiency in study outcomes that can be applied to the population at large. The objective of this study is to provide a narrative review of potential applications of blockchain technology and federated machine learning to improve collaborative care. We conducted a narrative review of articles published from 2018 to 2023 to provide a high level overview of blockchain technology, federated machine learning, remote monitoring, and genomics and how they can be leveraged together to establish a patient centered model of care. To strengthen the empirical framework for clinical decision making in healthcare, we suggest use of blockchain technology and integrating it with federated machine learning. There are several challenges to adoption of these technologies in the current healthcare ecosystem. Nevertheless, this may be an ideal time to explore how best we can integrate these technologies to deliver high quality personalized care. This article provides an overview of blockchain technology and federated machine learning and how they can be leveraged to initiate collaborative projects that will have the patient at the center of care.
Collapse
Affiliation(s)
- Veerasathpurush Allareddy
- Department of Orthodontics, University of Illinois Chicago College of Dentistry, Chicago, Illinois, USA
| | | | | | - Mohammed H Elnagar
- University of Illinois Chicago College of Dentistry, Chicago, Illinois, USA
| | - Min Kyeong Lee
- University of Illinois Chicago College of Dentistry, Chicago, Illinois, USA
| | - Maysaa Oubaidin
- University of Illinois Chicago College of Dentistry, Chicago, Illinois, USA
| | - Sumit Yadav
- UNMC College of Dentistry, Lincoln, Nebraska, USA
| |
Collapse
|
11
|
Liu Y, Bi D. Quantitative risk analysis of treatment plans for patients with tumor by mining historical similar patients from electronic health records using federated learning. RISK ANALYSIS : AN OFFICIAL PUBLICATION OF THE SOCIETY FOR RISK ANALYSIS 2023; 43:2422-2449. [PMID: 36906293 DOI: 10.1111/risa.14124] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/24/2022] [Revised: 12/11/2022] [Accepted: 02/06/2023] [Indexed: 06/18/2023]
Abstract
The determination of a treatment plan for a target patient with tumor is a difficult problem due to the existence of heterogeneity in patients' responses, incomplete information about tumor states, and asymmetric knowledge between doctors and patients, and so on. In this paper, a method for quantitative risk analysis of treatment plans for patients with tumor is proposed. To reduce the impacts of the heterogeneity in patients' responses on analysis results, the method conducts risk analysis by mining historical similar patients from Electronic Health Records (EHRs) in multiple hospitals using federated learning (FL). For this, the Recursive Feature Elimination based on the Support Vector Machine (SVM) and Deep Learning Important FeaTures (DeepLIFT) are extended into the FL framework to select key features and determine key feature weights for identifying historical similar patients. Then, in the database of each collaborative hospital, the similarities between the target patient and all historical patients are calculated, and the historical similar patients are determined. According to the statistics of tumor states and treatment outcomes of historical similar patients in all collaborative hospitals, the related data (including the probabilities of different tumor states and possible outcomes of different treatment plans) for risk analysis of the alternative treatment plans can be obtained, which can eliminate the asymmetric knowledge between doctors and patients. The related data are valuable for the doctor and patient to make their decisions. Experimental studies have been conducted to verify the feasibility and effectiveness of the proposed method.
Collapse
Affiliation(s)
- Yang Liu
- School of Economics and Management, Dalian University of Technology, Dalian, China
| | - Donghai Bi
- School of Economics and Management, Dalian University of Technology, Dalian, China
| |
Collapse
|
12
|
Wang T, Du Y, Gong Y, Choo KKR, Guo Y. Applications of Federated Learning in Mobile Health: Scoping Review. J Med Internet Res 2023; 25:e43006. [PMID: 37126398 PMCID: PMC10186185 DOI: 10.2196/43006] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2022] [Revised: 01/24/2023] [Accepted: 03/10/2023] [Indexed: 03/12/2023] Open
Abstract
BACKGROUND The proliferation of mobile health (mHealth) applications is partly driven by the advancements in sensing and communication technologies, as well as the integration of artificial intelligence techniques. Data collected from mHealth applications, for example, on sensor devices carried by patients, can be mined and analyzed using artificial intelligence-based solutions to facilitate remote and (near) real-time decision-making in health care settings. However, such data often sit in data silos, and patients are often concerned about the privacy implications of sharing their raw data. Federated learning (FL) is a potential solution, as it allows multiple data owners to collaboratively train a machine learning model without requiring access to each other's raw data. OBJECTIVE The goal of this scoping review is to gain an understanding of FL and its potential in dealing with sensitive and heterogeneous data in mHealth applications. Through this review, various stakeholders, such as health care providers, practitioners, and policy makers, can gain insight into the limitations and challenges associated with using FL in mHealth and make informed decisions when considering implementing FL-based solutions. METHODS We conducted a scoping review following the guidelines of PRISMA-ScR (Preferred Reporting Items for Systematic Reviews and Meta-Analyses Extension for Scoping Reviews). We searched 7 commonly used databases. The included studies were analyzed and summarized to identify the possible real-world applications and associated challenges of using FL in mHealth settings. RESULTS A total of 1095 articles were retrieved during the database search, and 26 articles that met the inclusion criteria were included in the review. The analysis of these articles revealed 2 main application areas for FL in mHealth, that is, remote monitoring and diagnostic and treatment support. More specifically, FL was found to be commonly used for monitoring self-care ability, health status, and disease progression, as well as in diagnosis and treatment support of diseases. The review also identified several challenges (eg, expensive communication, statistical heterogeneity, and system heterogeneity) and potential solutions (eg, compression schemes, model personalization, and active sampling). CONCLUSIONS This scoping review has highlighted the potential of FL as a privacy-preserving approach in mHealth applications and identified the technical limitations associated with its use. The challenges and opportunities outlined in this review can inform the research agenda for future studies in this field, to overcome these limitations and further advance the use of FL in mHealth.
Collapse
Affiliation(s)
- Tongnian Wang
- Department of Information Systems and Cyber Security, The University of Texas at San Antonio, San Antonio, TX, United States
| | - Yan Du
- School of Nursing, The University of Texas Health Science Center at San Antonio, San Antonio, TX, United States
| | - Yanmin Gong
- Department of Electrical and Computer Engineering, The University of Texas at San Antonio, San Antonio, TX, United States
| | - Kim-Kwang Raymond Choo
- Department of Information Systems and Cyber Security, The University of Texas at San Antonio, San Antonio, TX, United States
| | - Yuanxiong Guo
- Department of Information Systems and Cyber Security, The University of Texas at San Antonio, San Antonio, TX, United States
| |
Collapse
|
13
|
Tewari A. mHealth Systems Need a Privacy-by-Design Approach: Commentary on "Federated Machine Learning, Privacy-Enhancing Technologies, and Data Protection Laws in Medical Research: Scoping Review". J Med Internet Res 2023; 25:e46700. [PMID: 36995757 PMCID: PMC10131640 DOI: 10.2196/46700] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2023] [Accepted: 02/22/2023] [Indexed: 02/24/2023] Open
Abstract
Brauneck and colleagues have combined technical and legal perspectives in their timely and valuable paper "Federated Machine Learning, Privacy-Enhancing Technologies, and Data Protection Laws in Medical Research: Scoping Review." Researchers who design mobile health (mHealth) systems must adopt the same privacy-by-design approach that privacy regulations (eg, General Data Protection Regulation) do. In order to do this successfully, we will have to overcome implementation challenges in privacy-enhancing technologies such as differential privacy. We will also have to pay close attention to emerging technologies such as private synthetic data generation.
Collapse
Affiliation(s)
- Ambuj Tewari
- Department of Statistics, University of Michigan, Ann Arbor, MI, United States
- Department of Electrical Engineering and Computer Science, University of Michigan, Ann Arbor, MI, United States
| |
Collapse
|
14
|
Issa GF, Shaalan K, Shaalan Y, Saeed HA, Fatima N, Rehman AU. Brain Stroke Prediction Using ANN. 2022 INTERNATIONAL CONFERENCE ON CYBER RESILIENCE (ICCR) 2022. [DOI: 10.1109/iccr56254.2022.9995885] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/01/2023]
Affiliation(s)
- Ghassan F. Issa
- Skyline University College, University City Sharjah,School of Information Technology,Sharjah,UAE,1797
| | - Khaled Shaalan
- The British University in Dubai,Faculty of Engineering and IT,United Arab Emirates
| | | | - Hafiza Afia Saeed
- Agriculture University,Department of Computer Science,Faisalabad,Pakistan
| | - Noor Fatima
- Fatima memorial hospital college of medicine and dentistry Lahore,Lahore,Pakistan,54000
| | - Abd Ur Rehman
- Riphah international University,Riphah School of Computing,Lahore,Pakistan
| |
Collapse
|
15
|
Fauzi MA, Yang B, Blobel B. Comparative Analysis between Individual, Centralized, and Federated Learning for Smartwatch Based Stress Detection. J Pers Med 2022; 12:1584. [PMID: 36294724 PMCID: PMC9605246 DOI: 10.3390/jpm12101584] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2022] [Revised: 09/14/2022] [Accepted: 09/20/2022] [Indexed: 11/17/2022] Open
Abstract
Machine learning has been proven to provide good performances on stress detection tasks using multi-modal sensor data from a smartwatch. Generally, machine learning techniques need a sufficient amount of data to train a robust model. Thus, we need to collect data from several users and send them to a central server to feed the algorithm. However, the uploaded data may contain sensitive information that can jeopardize the user's privacy. Federated learning can tackle this challenge by enabling the model to be trained using data from all users without the user's data leaving the user's device. In this study, we implement federated learning-based stress detection and provide a comparative analysis between individual, centralized, and federated learning. The experiment was conducted on WESAD dataset by using Logistic Regression as the classifier. The experiment results show that in terms of accuracy, federated learning cannot reach the performance level of both individual and centralized learning. The individual learning strategy performs best with an average accuracy of 0.9998 and an average F1-measure of 0.9996.
Collapse
Affiliation(s)
- Muhammad Ali Fauzi
- Department of Information Security and Communication Technology, Norwegian University of Science and Technology (NTNU), 2815 Gjøvik, Norway
| | - Bian Yang
- Department of Information Security and Communication Technology, Norwegian University of Science and Technology (NTNU), 2815 Gjøvik, Norway
| | - Bernd Blobel
- Medical Faculty, University of Regensburg, 93053 Regensburg, Germany
- eHealth Competence Center Bavaria, Deggendorf Institute of Technology, 94469 Deggendorf, Germany
- First Medical Faculty, Charles University Prague, 12800 Prague, Czech Republic
| |
Collapse
|
16
|
Zhang L, Vashisht H, Totev A, Trinh N, Ward T. A comparison of distributed machine learning methods for the support of “many labs” collaborations in computational modeling of decision making. Front Psychol 2022; 13:943198. [PMID: 36092038 PMCID: PMC9453750 DOI: 10.3389/fpsyg.2022.943198] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2022] [Accepted: 07/25/2022] [Indexed: 12/03/2022] Open
Abstract
Deep learning models are powerful tools for representing the complex learning processes and decision-making strategies used by humans. Such neural network models make fewer assumptions about the underlying mechanisms thus providing experimental flexibility in terms of applicability. However, this comes at the cost of involving a larger number of parameters requiring significantly more data for effective learning. This presents practical challenges given that most cognitive experiments involve relatively small numbers of subjects. Laboratory collaborations are a natural way to increase overall dataset size. However, data sharing barriers between laboratories as necessitated by data protection regulations encourage the search for alternative methods to enable collaborative data science. Distributed learning, especially federated learning (FL), which supports the preservation of data privacy, is a promising method for addressing this issue. To verify the reliability and feasibility of applying FL to train neural networks models used in the characterization of decision making, we conducted experiments on a real-world, many-labs data pool including experiment data-sets from ten independent studies. The performance of single models trained on single laboratory data-sets was poor. This unsurprising finding supports the need for laboratory collaboration to train more reliable models. To that end we evaluated four collaborative approaches. The first approach represents conventional centralized learning (CL-based) and is the optimal approach but requires complete sharing of data which we wish to avoid. The results however establish a benchmark for the other three approaches, federated learning (FL-based), incremental learning (IL-based), and cyclic incremental learning (CIL-based). We evaluate these approaches in terms of prediction accuracy and capacity to characterize human decision-making strategies. The FL-based model achieves performance most comparable to that of the CL-based model. This indicates that FL has value in scaling data science methods to data collected in computational modeling contexts when data sharing is not convenient, practical or permissible.
Collapse
Affiliation(s)
- Lili Zhang
- School of Computing, Dublin City University, Dublin, Ireland
- Insight Science Foundation Ireland Research Centre for Data Analytics, Dublin, Ireland
- *Correspondence: Lili Zhang
| | | | - Andrey Totev
- School of Computing, Dublin City University, Dublin, Ireland
| | - Nam Trinh
- School of Computing, Dublin City University, Dublin, Ireland
| | - Tomas Ward
- School of Computing, Dublin City University, Dublin, Ireland
- Insight Science Foundation Ireland Research Centre for Data Analytics, Dublin, Ireland
| |
Collapse
|
17
|
Antunes RS, da Costa CA, Küderle A, Yari IA, Eskofier B. Federated Learning for Healthcare: Systematic Review and Architecture Proposal. ACM T INTEL SYST TEC 2022. [DOI: 10.1145/3501813] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
The use of machine learning (ML) with electronic health records (EHR) is growing in popularity as a means to extract knowledge that can improve the decision-making process in healthcare. Such methods require training of high-quality learning models based on diverse and comprehensive datasets, which are hard to obtain due to the sensitive nature of medical data from patients. In this context, federated learning (FL) is a methodology that enables the distributed training of machine learning models with remotely hosted datasets without the need to accumulate data and, therefore, compromise it. FL is a promising solution to improve ML-based systems, better aligning them to regulatory requirements, improving trustworthiness and data sovereignty. However, many open questions must be addressed before the use of FL becomes widespread. This article aims at presenting a systematic literature review on current research about FL in the context of EHR data for healthcare applications. Our analysis highlights the main research topics, proposed solutions, case studies, and respective ML methods. Furthermore, the article discusses a general architecture for FL applied to healthcare data based on the main insights obtained from the literature review. The collected literature corpus indicates that there is extensive research on the privacy and confidentiality aspects of training data and model sharing, which is expected given the sensitive nature of medical data. Studies also explore improvements to the aggregation mechanisms required to generate the learning model from distributed contributions and case studies with different types of medical data.
Collapse
Affiliation(s)
| | | | | | | | - Björn Eskofier
- Friedrich-Alexander-Universität Erlangen-Nürnberg, Germany
| |
Collapse
|
18
|
A Systematic Review of Federated Learning in the Healthcare Area: From the Perspective of Data Properties and Applications. APPLIED SCIENCES-BASEL 2021. [DOI: 10.3390/app112311191] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Recent advances in deep learning have shown many successful stories in smart healthcare applications with data-driven insight into improving clinical institutions’ quality of care. Excellent deep learning models are heavily data-driven. The more data trained, the more robust and more generalizable the performance of the deep learning model. However, pooling the medical data into centralized storage to train a robust deep learning model faces privacy, ownership, and strict regulation challenges. Federated learning resolves the previous challenges with a shared global deep learning model using a central aggregator server. At the same time, patient data remain with the local party, maintaining data anonymity and security. In this study, first, we provide a comprehensive, up-to-date review of research employing federated learning in healthcare applications. Second, we evaluate a set of recent challenges from a data-centric perspective in federated learning, such as data partitioning characteristics, data distributions, data protection mechanisms, and benchmark datasets. Finally, we point out several potential challenges and future research directions in healthcare applications.
Collapse
|