1
|
Zhang J, Wu J, Qiu Y, Song A, Li W, Li X, Liu Y. Intelligent speech technologies for transcription, disease diagnosis, and medical equipment interactive control in smart hospitals: A review. Comput Biol Med 2023; 153:106517. [PMID: 36623438 PMCID: PMC9814440 DOI: 10.1016/j.compbiomed.2022.106517] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2022] [Revised: 12/23/2022] [Accepted: 12/31/2022] [Indexed: 01/07/2023]
Abstract
The growing and aging of the world population have driven the shortage of medical resources in recent years, especially during the COVID-19 pandemic. Fortunately, the rapid development of robotics and artificial intelligence technologies help to adapt to the challenges in the healthcare field. Among them, intelligent speech technology (IST) has served doctors and patients to improve the efficiency of medical behavior and alleviate the medical burden. However, problems like noise interference in complex medical scenarios and pronunciation differences between patients and healthy people hamper the broad application of IST in hospitals. In recent years, technologies such as machine learning have developed rapidly in intelligent speech recognition, which is expected to solve these problems. This paper first introduces IST's procedure and system architecture and analyzes its application in medical scenarios. Secondly, we review existing IST applications in smart hospitals in detail, including electronic medical documentation, disease diagnosis and evaluation, and human-medical equipment interaction. In addition, we elaborate on an application case of IST in the early recognition, diagnosis, rehabilitation training, evaluation, and daily care of stroke patients. Finally, we discuss IST's limitations, challenges, and future directions in the medical field. Furthermore, we propose a novel medical voice analysis system architecture that employs active hardware, active software, and human-computer interaction to realize intelligent and evolvable speech recognition. This comprehensive review and the proposed architecture offer directions for future studies on IST and its applications in smart hospitals.
Collapse
Affiliation(s)
- Jun Zhang
- The State Key Laboratory of Bioelectronics, School of Instrument Science and Engineering, Southeast University, Nanjing, 210096, China,Corresponding author
| | - Jingyue Wu
- The State Key Laboratory of Bioelectronics, School of Instrument Science and Engineering, Southeast University, Nanjing, 210096, China
| | - Yiyi Qiu
- The State Key Laboratory of Bioelectronics, School of Instrument Science and Engineering, Southeast University, Nanjing, 210096, China
| | - Aiguo Song
- The State Key Laboratory of Bioelectronics, School of Instrument Science and Engineering, Southeast University, Nanjing, 210096, China
| | - Weifeng Li
- Department of Emergency Medicine, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Guangzhou, 510080, China
| | - Xin Li
- Department of Emergency Medicine, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Guangzhou, 510080, China
| | - Yecheng Liu
- Emergency Department, State Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College Hospital, Chinese Academy of Medical Science and Peking Union Medical College, Beijing, 100730, China
| |
Collapse
|
2
|
Tao H, Shan S, Hu Z, Zhu C, Ge H. Strong Generalized Speech Emotion Recognition Based on Effective Data Augmentation. ENTROPY (BASEL, SWITZERLAND) 2022; 25:68. [PMID: 36673208 PMCID: PMC9857941 DOI: 10.3390/e25010068] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/10/2022] [Revised: 12/24/2022] [Accepted: 12/26/2022] [Indexed: 06/17/2023]
Abstract
The absence of labeled samples limits the development of speech emotion recognition (SER). Data augmentation is an effective way to address sample sparsity. However, there is a lack of research on data augmentation algorithms in the field of SER. In this paper, the effectiveness of classical acoustic data augmentation methods in SER is analyzed, based on which a strong generalized speech emotion recognition model based on effective data augmentation is proposed. The model uses a multi-channel feature extractor consisting of multiple sub-networks to extract emotional representations. Different kinds of augmented data that can effectively improve SER performance are fed into the sub-networks, and the emotional representations are obtained by the weighted fusion of the output feature maps of each sub-network. And in order to make the model robust to unseen speakers, we employ adversarial training to generalize emotion representations. A discriminator is used to estimate the Wasserstein distance between the feature distributions of different speakers and to force the feature extractor to learn the speaker-invariant emotional representations by adversarial training. The simulation experimental results on the IEMOCAP corpus show that the performance of the proposed method is 2-9% ahead of the related SER algorithm, which proves the effectiveness of the proposed method.
Collapse
Affiliation(s)
- Huawei Tao
- Key Laboratory of Food Information Processing and Control, Ministry of Education, Henan University of Technology, Zhengzhou 450001, China
- Henan Engineering Laboratory of Grain IOT Technology, Henan University of Technology, Zhengzhou 450001, China
| | - Shuai Shan
- Key Laboratory of Food Information Processing and Control, Ministry of Education, Henan University of Technology, Zhengzhou 450001, China
| | - Ziyi Hu
- Key Laboratory of Food Information Processing and Control, Ministry of Education, Henan University of Technology, Zhengzhou 450001, China
| | - Chunhua Zhu
- Key Laboratory of Food Information Processing and Control, Ministry of Education, Henan University of Technology, Zhengzhou 450001, China
- Henan Engineering Laboratory of Grain IOT Technology, Henan University of Technology, Zhengzhou 450001, China
| | - Hongyi Ge
- Key Laboratory of Food Information Processing and Control, Ministry of Education, Henan University of Technology, Zhengzhou 450001, China
- Henan Engineering Laboratory of Grain IOT Technology, Henan University of Technology, Zhengzhou 450001, China
| |
Collapse
|
3
|
Cowden RG, Wȩziak-Białowolska D, McNeely E, VanderWeele TJ. Are depression and suffering distinct? An empirical analysis. Front Psychol 2022; 13:970466. [PMID: 36186371 PMCID: PMC9518749 DOI: 10.3389/fpsyg.2022.970466] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2022] [Accepted: 08/08/2022] [Indexed: 11/13/2022] Open
Abstract
Depression and the subjective experience of suffering are distinct forms of distress, but they are sometimes commingled with one another. Using a cross-sectional sample of flight attendants (n = 4,652), we tested for further empirical evidence distinguishing depression and suffering. Correlations with 15 indices covering several dimensions of well-being (i.e., physical health, emotional well-being, psychological well-being, character strengths, social well-being, financial/material well-being) indicated that associations with worse well-being were mostly stronger for depression than suffering. There was a large positive correlation between depression and suffering, but we also found evidence of notable non-concurrent depression and suffering in the sample. After dividing participants into four groups that varied based on severity of depression and suffering, regression analyses showed higher levels of well-being among those with both none-mild depression and none-mild suffering compared to those with moderate-severe depression, moderate-severe suffering, or both. All indices of well-being were lowest among the group of participants with moderate-severe depression and moderate-severe suffering. In addition to providing further evidence supporting a distinction between depression and suffering, our findings suggest that concurrent depression and suffering may be more disruptive to well-being than when either is present alone.
Collapse
Affiliation(s)
- Richard G. Cowden
- Human Flourishing Program, Institute for Quantitative Social Science, Harvard University, Cambridge, MA, United States
- *Correspondence: Richard G. Cowden,
| | - Dorota Wȩziak-Białowolska
- Human Flourishing Program, Institute for Quantitative Social Science, Harvard University, Cambridge, MA, United States
- Sustainability and Health Initiative (SHINE), Department of Environmental Health, Harvard T. H. Chan School of Public Health, Boston, MA, United States
- Centre for Evaluation and Analysis of Public Policies, Faculty of Philosophy, Jagiellonian University, Cracow, Poland
| | - Eileen McNeely
- Sustainability and Health Initiative (SHINE), Department of Environmental Health, Harvard T. H. Chan School of Public Health, Boston, MA, United States
| | - Tyler J. VanderWeele
- Human Flourishing Program, Institute for Quantitative Social Science, Harvard University, Cambridge, MA, United States
| |
Collapse
|
4
|
A Prospective Observational Pilot Study of Young Women Undergoing Initial Breast Cancer Treatment and Their Biopsychosocial Profile. REHABILITATION ONCOLOGY 2022. [DOI: 10.1097/01.reo.0000000000000298] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
5
|
Not seeing the forest for the trees: a systematic review of comprehensive distress management programs and implementation strategies. Curr Opin Support Palliat Care 2021; 14:220-231. [PMID: 32657813 DOI: 10.1097/spc.0000000000000513] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
PURPOSE OF REVIEW Clinically significant distress is common in patients with cancer and if untreated can be associated with adverse outcomes. This article offers a review of current approaches to implementing and reporting the minimum components of distress screening and management interventions in cancer services. RECENT FINDINGS Twenty-two relevant published articles were identified from January 2018 to February 2020. The reporting of recommended minimum components of distress screening and management interventions in these articles was not consistent. The majority of studies used validated tools to conduct initial screening. However, recommendations were either not reported or not followed regarding subsequent pathway components, secondary assessment, referral pathways linked to screening results and rescreening. The majority of studies did not include a description of the implementation of the distress screening program. A small number of studies described a comprehensive set of implementation strategies. SUMMARY Distress screening and management interventions in cancer are an important component of comprehensive cancer care. To improve patient outcomes and guide researchers and services to identify effective models, studies must include and evaluate minimum recommended components and implementation strategies. Addressing these limitations with high-quality, robust interventions is vital for advancing the implementation of effective distress management.
Collapse
|
6
|
Mustaqeem, Kwon S. Att-Net: Enhanced emotion recognition system using lightweight self-attention module. Appl Soft Comput 2021. [DOI: 10.1016/j.asoc.2021.107101] [Citation(s) in RCA: 35] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
|
7
|
Latif S, Qadir J, Qayyum A, Usama M, Younis S. Speech Technology for Healthcare: Opportunities, Challenges, and State of the Art. IEEE Rev Biomed Eng 2021; 14:342-356. [PMID: 32746367 DOI: 10.1109/rbme.2020.3006860] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Speech technology is not appropriately explored even though modern advances in speech technology-especially those driven by deep learning (DL) technology-offer unprecedented opportunities for transforming the healthcare industry. In this paper, we have focused on the enormous potential of speech technology for revolutionising the healthcare domain. More specifically, we review the state-of-the-art approaches in automatic speech recognition (ASR), speech synthesis or text to speech (TTS), and health detection and monitoring using speech signals. We also present a comprehensive overview of various challenges hindering the growth of speech-based services in healthcare. To make speech-based healthcare solutions more prevalent, we discuss open issues and suggest some possible research directions aimed at fully leveraging the advantages of other technologies for making speech-based healthcare solutions more effective.
Collapse
|
8
|
Conclusions. RESEARCHES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE TO MITIGATE PANDEMICS 2021. [PMCID: PMC8085314 DOI: 10.1016/b978-0-323-90959-4.00006-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
This chapter presents the usage of data science, which further helps in exploring the global pandemic COVID-19. This disease suppresses an overwhelming burden, not only to healthcare systems but to the world's economy too. In this era of techniques and technologies, it is believed that data science can better utilize scarce healthcare resources. In this chapter, we provide an introduction of data science and its applications, which helps in combating different aspects of COVID-19. Publicly available datasets related to disease are used as community resources. Different kinds of datasets are used to analyze various aspects of pandemic at different scales. These different kinds of datasets can be audio, video, textual, speech, and sensor data. More than hundreds of research articles are also studied to prepare a bibliometric study. Apart from grabbing all the advantages from datasets, this paper highlights a few challenges, such as surety of correct data, need of multidisciplinary collaboration, new data modality, security issues, and availability of data.
Collapse
|
9
|
Towards Automatic Depression Detection: A BiLSTM/1D CNN-Based Model. APPLIED SCIENCES-BASEL 2020. [DOI: 10.3390/app10238701] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Depression is a global mental health problem, the worst cases of which can lead to self-injury or suicide. An automatic depression detection system is of great help in facilitating clinical diagnosis and early intervention of depression. In this work, we propose a new automatic depression detection method utilizing speech signals and linguistic content from patient interviews. Specifically, the proposed method consists of three components, which include a Bidirectional Long Short-Term Memory (BiLSTM) network with an attention layer to deal with linguistic content, a One-Dimensional Convolutional Neural Network (1D CNN) to deal with speech signals, and a fully connected network integrating the outputs of the previous two models to assess the depressive state. Evaluated on two publicly available datasets, our method achieves state-of-the-art performance compared with the existing methods. In addition, our method utilizes audio and text features simultaneously. Therefore, it can get rid of the misleading information provided by the patients. As a conclusion, our method can automatically evaluate the depression state and does not require an expert to conduct the psychological evaluation on site. Our method greatly improves the detection accuracy, as well as the efficiency.
Collapse
|
10
|
Latif S, Usman M, Manzoor S, Iqbal W, Qadir J, Tyson G, Castro I, Razi A, Boulos MNK, Weller A, Crowcroft J. Leveraging Data Science to Combat COVID-19: A Comprehensive Review. IEEE TRANSACTIONS ON ARTIFICIAL INTELLIGENCE 2020; 1:85-103. [PMID: 37982070 PMCID: PMC8545032 DOI: 10.1109/tai.2020.3020521] [Citation(s) in RCA: 54] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/27/2020] [Revised: 07/07/2020] [Accepted: 08/26/2020] [Indexed: 11/17/2023]
Abstract
COVID-19, an infectious disease caused by the SARS-CoV-2 virus, was declared a pandemic by the World Health Organisation (WHO) in March 2020. By mid-August 2020, more than 21 million people have tested positive worldwide. Infections have been growing rapidly and tremendous efforts are being made to fight the disease. In this paper, we attempt to systematise the various COVID-19 research activities leveraging data science, where we define data science broadly to encompass the various methods and tools-including those from artificial intelligence (AI), machine learning (ML), statistics, modeling, simulation, and data visualization-that can be used to store, process, and extract insights from data. In addition to reviewing the rapidly growing body of recent research, we survey public datasets and repositories that can be used for further work to track COVID-19 spread and mitigation strategies. As part of this, we present a bibliometric analysis of the papers produced in this short span of time. Finally, building on these insights, we highlight common challenges and pitfalls observed across the surveyed works. We also created a live resource repository at https://github.com/Data-Science-and-COVID-19/Leveraging-Data-Science-To-Combat-COVID-19-A-Comprehensive-Review that we intend to keep updated with the latest resources including new papers and datasets.
Collapse
Affiliation(s)
- Siddique Latif
- University of Southern QueenslandSpringfieldQueensland4300Australia
- Distributed Sensing Systems Group, Data61CSIROPullenvaleQLD4069Australia
| | - Muhammad Usman
- Seoul National UniversitySeoul08700South Korea
- Center for Artificial Intelligence in Medicine and Imaging, HealthHub Company Ltd.Seoul06524South Korea
| | - Sanaullah Manzoor
- Center for Artificial Intelligence in Medicine and Imaging, HealthHub Company Ltd.Seoul06524South Korea
| | - Waleed Iqbal
- Information Technology UniversityPunjab5400Pakistan
| | | | - Gareth Tyson
- Queen Mary University of LondonLondonE1 4NSU.K.
- Queen Mary University of LondonLondonE1 4NSU.K.
| | | | | | - Maged N. Kamel Boulos
- Turner Institute for Brain and Mental Health & Monash Biomedical Imaging, Monash UniversityMelbourne3800Australia
| | - Adrian Weller
- the School of Information Management, Sun Yat-sen UniversityGuangzhou510006China
- University of CambridgeCambridgeCB2 1PZU.K.
| | - Jon Crowcroft
- Alan Turing InstituteLondonNW1 2DBU.K.
- University of CambridgeCambridgeCB2 1TNU.K.
| |
Collapse
|
11
|
Raphael D, Frey R, Gott M. Distress in post-treatment hematological cancer survivors: Prevalence and predictors. J Psychosoc Oncol 2019; 38:328-342. [PMID: 31642396 DOI: 10.1080/07347332.2019.1679320] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Abstract
Objectives: To calculate the prevalence of psychosocial distress, and identify factors that predict distress, in early post-treatment hematological cancer survivors.Design: Cross-sectional survey containing self-report measures.Sample/Methods: Post-treatment hematological cancer survivors in remission (>18 years) (n = 409) completed questionnaires. Distress was measured with the distress thermometer (DT). Logistic regression was used to identify predictors of distress.Findings: Overall 21.9% (n = 93) of respondents reported significant distress (DT ≥4). Significant distress was twice as high in those born overseas (OR = 2.09, p = .03), 3.5 times higher in those with lower social support (OR = 3.51, p = <.001) and five times higher in those with increased fear of recurrence (OR = 0.17, p = <.001).Implications for Psychosocial Providers: Early identification of distress may decrease psychosocial issues in the post-treatment period, especially as psychosocial services have been shown to improve wellbeing for those who are distressed.
Collapse
Affiliation(s)
- Deborah Raphael
- School of Nursing, Faculty of Medical and Health Sciences, The University of Auckland, Auckland, New Zealand
| | - Rosemary Frey
- School of Nursing, Faculty of Medical and Health Sciences, The University of Auckland, Auckland, New Zealand
| | - Merryn Gott
- School of Nursing, Faculty of Medical and Health Sciences, The University of Auckland, Auckland, New Zealand
| |
Collapse
|
12
|
Chambers SK, Dunn J. Re-imagining Psycho-oncology. Eur J Cancer Care (Engl) 2019; 28:e13136. [PMID: 31318133 DOI: 10.1111/ecc.13136] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/09/2022]
Affiliation(s)
- Suzanne K Chambers
- University of Technology Sydney, Sydney, New South Wales, Australia.,Menzies Health Institute Queensland, Griffith University, Brisbane, Queensland, Australia.,Exercise Medicine Research Institute, Edith Cowan University, Joondalup, Western Australia, Australia.,University of Southern Queensland, Toowoomba, Queensland, Australia
| | - Jeff Dunn
- University of Technology Sydney, Sydney, New South Wales, Australia.,University of Southern Queensland, Toowoomba, Queensland, Australia.,Prostate Cancer Foundation of Australia, Sydney, New South Wales, Australia.,Cancer Council Queensland, Brisbane, Queensland, Australia
| |
Collapse
|