1
|
Paradise Vit A, Magid A. Exploring Topics, Emotions, and Sentiments in Health Organization Posts and Public Responses on Instagram: Content Analysis. JMIR INFODEMIOLOGY 2025; 5:e70576. [PMID: 40315451 DOI: 10.2196/70576] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/25/2024] [Revised: 01/26/2025] [Accepted: 04/13/2025] [Indexed: 05/04/2025]
Abstract
BACKGROUND Social media is a vital tool for health organizations, enabling them to share evidence-based information, educate the public, correct misinformation, and support a more informed and healthier society. OBJECTIVE This study aimed to categorize health organizations' content on social media into topics; examine public engagement, sentiment, and emotional responses to these topics; and identify gaps in fear between health organizations' messages and the public response. METHODS Real data were collected from the official Instagram accounts of health organizations worldwide. The BERTopic algorithm for topic modeling was used to categorize health organizations' posts into distinct topics. For each identified topic, we analyzed the engagement metrics (number of comments and likes) of posts categorized under the same topic, calculating the average engagement received. We examined the sentiment and emotional content of both posts and responses within the same topic, providing insights into the distributions of sentiment and emotions for each topic. Special attention was given to identifying emotions, such as fear, expressed in the posts and responses. In addition, a linguistic analysis and an analysis of sentiments and emotions over time were conducted. RESULTS A total of 6082 posts and 82,982 comments were collected from the official Instagram accounts of 8 health organizations. The study revealed that topics related to COVID-19, vaccines, and humanitarian crises (such as the Ukraine conflict and the war in Gaza) generated the highest engagement. Our sentiment analysis of the responses to health organizations' posts showed that topics related to vaccines and monkeypox generated the highest percentage of negative responses. Fear was the dominant emotion expressed in the posts' text, while the public's responses showed more varied emotions, with anger notably high in discussions around vaccines. Gaps were observed between the level of fear conveyed in posts published by health organizations and in the fear conveyed in the public's responses to such posts, especially regarding mask wearing during COVID-19 and the influenza vaccine. CONCLUSIONS This study underscores the importance of transparent communication that considers the emotional and sentiment-driven responses of the public on social media, particularly regarding vaccines. Understanding the psychological and social dynamics associated with public interaction with health information online can help health organizations achieve public health goals, fostering trust, countering misinformation, and promoting informed health behavior.
Collapse
Affiliation(s)
- Abigail Paradise Vit
- Department of Information Systems, The Max Stern Emek Yezreel College, Jezreel Valley Regional Council, Israel
| | - Avi Magid
- Management, Rambam Healthcare Campus, Haifa, Israel
- Department of International Health, Maastricht University, Maastricht, The Netherlands
| |
Collapse
|
2
|
Zamir MT, Ullah F, Tariq R, Bangyal WH, Arif M, Gelbukh A. Machine and deep learning algorithms for sentiment analysis during COVID-19: A vision to create fake news resistant society. PLoS One 2024; 19:e0315407. [PMID: 39700256 DOI: 10.1371/journal.pone.0315407] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2023] [Accepted: 11/26/2024] [Indexed: 12/21/2024] Open
Abstract
Informal education via social media plays a crucial role in modern learning, offering self-directed and community-driven opportunities to gain knowledge, skills, and attitudes beyond traditional educational settings. These platforms provide access to a broad range of learning materials, such as tutorials, blogs, forums, and interactive content, making education more accessible and tailored to individual interests and needs. However, challenges like information overload and the spread of misinformation highlight the importance of digital literacy in ensuring users can critically evaluate the credibility of information. Consequently, the significance of sentiment analysis has grown in contemporary times due to the widespread utilization of social media platforms as a means for individuals to articulate their viewpoints. Twitter (now X) is well recognized as a prominent social media platform that is predominantly utilized for microblogging. Individuals commonly engage in expressing their viewpoints regarding contemporary events, hence presenting a significant difficulty for scholars to categorize the sentiment associated with such expressions effectively. This research study introduces a highly effective technique for detecting misinformation related to the COVID-19 pandemic. The spread of fake news during the COVID-19 pandemic has created significant challenges for public health and safety because misinformation about the virus, its transmission, and treatments has led to confusion and distrust among the public. This research study introduce highly effective techniques for detecting misinformation related to the COVID-19 pandemic. The methodology of this work includes gathering a dataset comprising fabricated news articles sourced from a corpus and subjected to the natural language processing (NLP) cycle. After applying some filters, a total of five machine learning classifiers and three deep learning classifiers were employed to forecast the sentiment of news articles, distinguishing between those that are authentic and those that are fabricated. This research employs machine learning classifiers, namely Support Vector Machine, Logistic Regression, K-Nearest Neighbors, Decision Trees, and Random Forest, to analyze and compare the obtained results. This research employs Convolutional Neural Networks, Long Short-Term Memory (LSTM), and Gated Recurrent Unit (GRU) as deep learning classifiers, and afterwards compares the obtained results. The results indicate that the BiGRU deep learning classifier demonstrates high accuracy and efficiency, with the following indicators: accuracy of 0.91, precision of 0.90, recall of 0.93, and F1-score of 0.92. For the same algorithm, the true negatives, and true positives came out to be 555 and 580, respectively, whereas, the false negatives and false positives came out to be 81, and 68, respectively. In conclusion, this research highlights the effectiveness of the BiGRU deep learning classifier in detecting misinformation related to COVID-19, emphasizing its significance for fostering media literacy and resilience against fake news in contemporary society. The implications of this research are significant for higher education and lifelong learners as it highlights the potential for using advanced machine learning to help educators and institutions in the process of combating the spread of misinformation and promoting critical thinking skills among students. By applying these methods to analyze and classify news articles, educators can develop more effective tools and curricula for teaching media literacy and information validation, equipping students with the skills needed to discern between authentic and fabricated information in the context of the COVID-19 pandemic and beyond. The implications of this research extrapolate to the creation of a society that is resistant to the spread of fake news through social media platforms.
Collapse
Affiliation(s)
- Muhammad Tayyab Zamir
- Centro de Investigación en Computación (CIC), Instituto Politécnico Nacional, Ciudad de México, México
| | - Fida Ullah
- Centro de Investigación en Computación (CIC), Instituto Politécnico Nacional, Ciudad de México, México
| | - Rasikh Tariq
- Tecnologico de Monterrey, Institute for the Future of Education, Monterrey, N.L., México
| | | | - Muhammad Arif
- Centro de Investigación en Computación (CIC), Instituto Politécnico Nacional, Ciudad de México, México
| | - Alexander Gelbukh
- Centro de Investigación en Computación (CIC), Instituto Politécnico Nacional, Ciudad de México, México
| |
Collapse
|
3
|
Jahin MA, Shovon MSH, Mridha MF, Islam MR, Watanobe Y. A hybrid transformer and attention based recurrent neural network for robust and interpretable sentiment analysis of tweets. Sci Rep 2024; 14:24882. [PMID: 39438715 PMCID: PMC11496622 DOI: 10.1038/s41598-024-76079-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2024] [Accepted: 10/10/2024] [Indexed: 10/25/2024] Open
Abstract
Sentiment analysis is a pivotal tool in understanding public opinion, consumer behavior, and social trends, underpinning applications ranging from market research to political analysis. However, existing sentiment analysis models frequently encounter challenges related to linguistic diversity, model generalizability, explainability, and limited availability of labeled datasets. To address these shortcomings, we propose the Transformer and Attention-based Bidirectional LSTM for Sentiment Analysis (TRABSA) model, a novel hybrid sentiment analysis framework that integrates transformer-based architecture, attention mechanism, and recurrent neural networks like BiLSTM. The TRABSA model leverages the powerful RoBERTa-based transformer model for initial feature extraction, capturing complex linguistic nuances from a vast corpus of tweets. This is followed by an attention mechanism that highlights the most informative parts of the text, enhancing the model's focus on critical sentiment-bearing elements. Finally, the BiLSTM networks process these refined features, capturing temporal dependencies and improving the overall sentiment classification into positive, neutral, and negative classes. Leveraging the latest RoBERTa-based transformer model trained on a vast corpus of 124M tweets, our research bridges existing gaps in sentiment analysis benchmarks, ensuring state-of-the-art accuracy and relevance. Furthermore, we contribute to data diversity by augmenting existing datasets with 411,885 tweets from 32 English-speaking countries and 7,500 tweets from various US states. This study also compares six word-embedding techniques, identifying the most robust preprocessing and embedding methodologies crucial for accurate sentiment analysis and model performance. We meticulously label tweets into positive, neutral, and negative classes using three distinct lexicon-based approaches and select the best one, ensuring optimal sentiment analysis outcomes and model efficacy. Here, we demonstrate that the TRABSA model outperforms the current seven traditional machine learning models, four stacking models, and four hybrid deep learning models, yielding notable gain in accuracy (94%) and effectiveness with a macro average precision of 94%, recall of 93%, and F1-score of 94%. Our further evaluation involves two extended and four external datasets, demonstrating the model's consistent superiority, robustness, and generalizability across diverse contexts and datasets. Finally, by conducting a thorough study with SHAP and LIME explainable visualization approaches, we offer insights into the interpretability of the TRABSA model, improving comprehension and confidence in the model's predictions. Our study results make it easier to analyze how citizens respond to resources and events during pandemics since they are integrated into a decision-support system. Applications of this system provide essential assistance for efficient pandemic management, such as resource planning, crowd control, policy formation, vaccination tactics, and quick reaction programs.
Collapse
Affiliation(s)
- Md Abrar Jahin
- Department of Industrial Engineering and Management, Khulna University of Engineering & Technology (KUET), Khulna, 9203, Bangladesh
- Advanced Machine Intelligence Research (AMIR) Lab, Dhaka, 1229, Bangladesh
| | - Md Sakib Hossain Shovon
- Department of Computer Science, American International University-Bangladesh, Dhaka, 1229, Bangladesh
- Advanced Machine Intelligence Research (AMIR) Lab, Dhaka, 1229, Bangladesh
| | - M F Mridha
- Department of Computer Science, American International University-Bangladesh, Dhaka, 1229, Bangladesh.
- Advanced Machine Intelligence Research (AMIR) Lab, Dhaka, 1229, Bangladesh.
| | - Md Rashedul Islam
- Offshore AI Development Group, Department of R &D, Chowagiken Corp., Hokkaido, Japan.
- Department of Computer Science and Engineering, University of Asia Pacific, Dhaka, 1216, Bangladesh.
| | - Yutaka Watanobe
- Department of Computer Science and Engineering, University of Aizu, Aizu-Wakamatsu, 965-8580, Japan
| |
Collapse
|
4
|
Paradise Vit A, Magid A. Differences in Fear and Negativity Levels Between Formal and Informal Health-Related Websites: Analysis of Sentiments and Emotions. J Med Internet Res 2024; 26:e55151. [PMID: 39120928 PMCID: PMC11344190 DOI: 10.2196/55151] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2023] [Revised: 05/19/2024] [Accepted: 06/07/2024] [Indexed: 08/10/2024] Open
Abstract
BACKGROUND Searching for web-based health-related information is frequently performed by the public and may affect public behavior regarding health decision-making. Particularly, it may result in anxiety, erroneous, and harmful self-diagnosis. Most searched health-related topics are cancer, cardiovascular diseases, and infectious diseases. A health-related web-based search may result in either formal or informal medical website, both of which may evoke feelings of fear and negativity. OBJECTIVE Our study aimed to assess whether there is a difference in fear and negativity levels between information appearing on formal and informal health-related websites. METHODS A web search was performed to retrieve the contents of websites containing symptoms of selected diseases, using selected common symptoms. Retrieved websites were classified into formal and informal websites. Fear and negativity of each content were evaluated using 3 transformer models. A fourth transformer model was fine-tuned using an existing emotion data set obtained from a web-based health community. For formal and informal websites, fear and negativity levels were aggregated. t tests were conducted to evaluate the differences in fear and negativity levels between formal and informal websites. RESULTS In this study, unique websites (N=1448) were collected, of which 534 were considered formal and 914 were considered informal. There were 1820 result pages from formal websites and 1494 result pages from informal websites. According to our findings, fear levels were statistically higher (t2753=3.331; P<.001) on formal websites (mean 0.388, SD 0.177) than on informal websites (mean 0.366, SD 0.168). The results also show that the level of negativity was statistically higher (t2753=2.726; P=.006) on formal websites (mean 0.657, SD 0.211) than on informal websites (mean 0.636, SD 0.201). CONCLUSIONS Positive texts may increase the credibility of formal health websites and increase their usage by the general public and the public's compliance to the recommendations. Increasing the usage of natural language processing tools before publishing health-related information to achieve a more positive and less stressful text to be disseminated to the public is recommended.
Collapse
Affiliation(s)
- Abigail Paradise Vit
- Department of Information Systems, The Max Stern Yezreel Valley College, Emek Yezreel, Israel
| | - Avi Magid
- Department of Information Systems, The Max Stern Yezreel Valley College, Emek Yezreel, Israel
- Management, Rambam Health Care Campus, Haifa, Israel
| |
Collapse
|
5
|
Bishal MM, Chowdory MRH, Das A, Kabir MA. COVIDHealth: A novel labeled dataset and machine learning-based web application for classifying COVID-19 discourses on Twitter. Heliyon 2024; 10:e34103. [PMID: 39100452 PMCID: PMC11295851 DOI: 10.1016/j.heliyon.2024.e34103] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2023] [Revised: 06/27/2024] [Accepted: 07/03/2024] [Indexed: 08/06/2024] Open
Abstract
The COVID-19 pandemic has sparked widespread health-related discussions on social media platforms like Twitter (now named 'X'). However, the lack of labeled Twitter data poses significant challenges for theme-based classification and tweet aggregation. To address this gap, we developed a machine learning-based web application that automatically classifies COVID-19 discourses into five categories: health risks, prevention, symptoms, transmission, and treatment. We collected and labeled 6,667 COVID-19-related tweets using the Twitter API, and applied various feature extraction methods to extract relevant features. We then compared the performance of seven classical machine learning algorithms (Decision Tree, Random Forest, Stochastic Gradient Descent, Adaboost, K-Nearest Neighbor, Logistic Regression, and Linear SVC) and four deep learning techniques (LSTM, CNN, RNN, and BERT) for classification. Our results show that the CNN achieved the highest precision (90.41%), recall (90.4%), F1 score (90.4%), and accuracy (90.4%). The Linear SVC algorithm exhibited the highest precision (85.71%), recall (86.94%), and F1 score (86.13%) among classical machine learning approaches. Our study advances the field of health-related data analysis and classification, and offers a publicly accessible web-based tool for public health researchers and practitioners. This tool has the potential to support addressing public health challenges and enhancing awareness during pandemics. The dataset and application are accessible at https://github.com/Bishal16/COVID19-Health-Related-Data-Classification-Website.
Collapse
Affiliation(s)
- Mahathir Mohammad Bishal
- Department of Computer Science and Engineering, Chittagong University of Engineering and Technology, Chattogram, 4349, Bangladesh
| | - Md. Rakibul Hassan Chowdory
- Department of Computer Science and Engineering, Chittagong University of Engineering and Technology, Chattogram, 4349, Bangladesh
| | - Anik Das
- Department of Computer Science, St. Francis Xavier University, Antigonish, B2G 2W5, NS, Canada
| | - Muhammad Ashad Kabir
- School of Computing, Mathematics, and Engineering, Charles Sturt University, Bathurst, 2795, NSW, Australia
| |
Collapse
|
6
|
Mirugwe A, Ashaba C, Namale A, Akello E, Bichetero E, Kansiime E, Nyirenda J. Sentiment Analysis of Social Media Data on Ebola Outbreak Using Deep Learning Classifiers. Life (Basel) 2024; 14:708. [PMID: 38929691 PMCID: PMC11204680 DOI: 10.3390/life14060708] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2024] [Revised: 05/13/2024] [Accepted: 05/15/2024] [Indexed: 06/28/2024] Open
Abstract
The Ebola virus disease (EVD) is an extremely contagious and fatal illness caused by the Ebola virus. Recently, Uganda witnessed an outbreak of EVD, which generated much attention on various social media platforms. To ensure effective communication and implementation of targeted health interventions, it is crucial for stakeholders to comprehend the sentiments expressed in the posts and discussions on these online platforms. In this study, we used deep learning techniques to analyse the sentiments expressed in Ebola-related tweets during the outbreak. We explored the application of three deep learning techniques to classify the sentiments in 8395 tweets as positive, neutral, or negative. The techniques examined included a 6-layer convolutional neural network (CNN), a 6-layer long short-term memory model (LSTM), and an 8-layer Bidirectional Encoder Representations from Transformers (BERT) model. The study found that the BERT model outperformed both the CNN and LSTM-based models across all the evaluation metrics, achieving a remarkable classification accuracy of 95%. These findings confirm the reported effectiveness of Transformer-based architectures in tasks related to natural language processing, such as sentiment analysis.
Collapse
Affiliation(s)
- Alex Mirugwe
- School of Public Health, Makerere University, Kampala P.O. Box 7072, Uganda
| | - Clare Ashaba
- School of Public Health, Makerere University, Kampala P.O. Box 7072, Uganda
| | - Alice Namale
- School of Public Health, Makerere University, Kampala P.O. Box 7072, Uganda
| | - Evelyn Akello
- School of Public Health, Makerere University, Kampala P.O. Box 7072, Uganda
| | - Edward Bichetero
- School of Public Health, Makerere University, Kampala P.O. Box 7072, Uganda
| | - Edgar Kansiime
- School of Public Health, Makerere University, Kampala P.O. Box 7072, Uganda
| | - Juwa Nyirenda
- Department of Statistical Science, University of Cape Town, Cape Town 7700, South Africa
| |
Collapse
|
7
|
Almazroi AA, Ayub N. Enhancing aspect-based multi-labeling with ensemble learning for ethical logistics. PLoS One 2024; 19:e0295248. [PMID: 38771789 PMCID: PMC11108219 DOI: 10.1371/journal.pone.0295248] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2023] [Accepted: 11/20/2023] [Indexed: 05/23/2024] Open
Abstract
In the dynamic domain of logistics, effective communication is essential for streamlined operations. Our innovative solution, the Multi-Labeling Ensemble (MLEn), tackles the intricate task of extracting multi-labeled data, employing advanced techniques for accurate preprocessing of textual data through the NLTK toolkit. This approach is carefully tailored to the prevailing language used in logistics communication. MLEn utilizes innovative methods, including sentiment intensity analysis, Word2Vec, and Doc2Vec, ensuring comprehensive feature extraction. This proves particularly suitable for logistics in e-commerce, capturing nuanced communication essential for efficient operations. Ethical considerations are a cornerstone in logistics communication, and MLEn plays a pivotal role in detecting and categorizing inappropriate language, aligning inherently with ethical norms. Leveraging Tf-IDF and Vader for feature enhancement, MLEn adeptly discerns and labels ethically sensitive content in logistics communication. Across diverse datasets, including Emotions, MLEn consistently achieves impressive accuracy levels ranging from 92% to 97%, establishing its superiority in the logistics context. Particularly, our proposed method, DenseNet-EHO, outperforms BERT by 8% and surpasses other techniques by a 15-25% efficiency. A comprehensive analysis, considering metrics such as precision, recall, F1-score, Ranking Loss, Jaccard Similarity, AUC-ROC, sensitivity, and time complexity, underscores DenseNet-EHO's efficiency, aligning with the practical demands within the logistics track. Our research significantly contributes to enhancing precision, diversity, and computational efficiency in aspect-based sentiment analysis within logistics. By integrating cutting-edge preprocessing, sentiment intensity analysis, and vectorization, MLEn emerges as a robust framework for multi-label datasets, consistently outperforming conventional approaches and giving outstanding precision, accuracy, and efficiency in the logistics field.
Collapse
Affiliation(s)
- Abdulwahab Ali Almazroi
- Department of Information Technology, College of Computing and Information Technology at Khulais, University of Jeddah, Jeddah, Saudi Arabia
| | - Nasir Ayub
- Department of Creative Technologies, Air University Islamabad, Islamabad, Pakistan
| |
Collapse
|
8
|
Al Sailawi ASA, Kangavari MR. Utilizing AI for extracting insights on post WHO's COVID-19 vaccination declaration from X (Twitter) social network. AIMS Public Health 2024; 11:349-378. [PMID: 39027386 PMCID: PMC11252579 DOI: 10.3934/publichealth.2024018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2024] [Revised: 02/27/2024] [Accepted: 03/12/2024] [Indexed: 07/20/2024] Open
Abstract
This study explores the use of artificial intelligence (AI) to analyze information from X (previously Twitter) feeds related to COVID-19, specifically focusing on the time following the World Health Organization's (WHO) vaccination announcement. This aspect of the pandemic has not been studied by other researchers focusing on vaccination news. By utilizing advanced AI algorithms, the research aims to examine a wealth of data, sentiments, and trends to enhance crisis management strategies effectively. Our methods involved collecting a dataset of tweets from December 2020 to July 2021. By using specific keywords strategically, we gathered a substantial 15.5 million tweets, focusing on important hashtags like #vaccine and #coronavirus while filtering out irrelevant replies and retweets. The assessment of three different machine learning models-BiLSTM, FFNN, and CNN - highlights the exceptional performance of BiLSTM, achieving an impressive F1-score of 0.84 on the test set, with Precision and Recall metrics at 0.85 and 0.83, respectively. The study provides a detailed visualization of global sentiments on COVID-19 topics, with a main goal of extracting insights to manage public health crises effectively. Sentiment labels were predicted using various classification models and categorized as positive, negative, and neutral for each country after adjusting for population differences. An important finding from the analysis is the variation in sentiments across regions, for instance, with Eastern European countries showing positive views on post-vaccination economic recovery, while China and the United States express negative opinions on the same topic.
Collapse
Affiliation(s)
- Ali S. Abed Al Sailawi
- School of Computer Engineering, Iran University of Science and Technology, Tehran, Iran
- College of Law, University of Misan, Amarah, Iraq
| | | |
Collapse
|
9
|
Lossio-Ventura JA, Weger R, Lee AY, Guinee EP, Chung J, Atlas L, Linos E, Pereira F. A Comparison of ChatGPT and Fine-Tuned Open Pre-Trained Transformers (OPT) Against Widely Used Sentiment Analysis Tools: Sentiment Analysis of COVID-19 Survey Data. JMIR Ment Health 2024; 11:e50150. [PMID: 38271138 PMCID: PMC10813836 DOI: 10.2196/50150] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/21/2023] [Revised: 11/16/2023] [Accepted: 11/17/2023] [Indexed: 01/27/2024] Open
Abstract
BACKGROUND Health care providers and health-related researchers face significant challenges when applying sentiment analysis tools to health-related free-text survey data. Most state-of-the-art applications were developed in domains such as social media, and their performance in the health care context remains relatively unknown. Moreover, existing studies indicate that these tools often lack accuracy and produce inconsistent results. OBJECTIVE This study aims to address the lack of comparative analysis on sentiment analysis tools applied to health-related free-text survey data in the context of COVID-19. The objective was to automatically predict sentence sentiment for 2 independent COVID-19 survey data sets from the National Institutes of Health and Stanford University. METHODS Gold standard labels were created for a subset of each data set using a panel of human raters. We compared 8 state-of-the-art sentiment analysis tools on both data sets to evaluate variability and disagreement across tools. In addition, few-shot learning was explored by fine-tuning Open Pre-Trained Transformers (OPT; a large language model [LLM] with publicly available weights) using a small annotated subset and zero-shot learning using ChatGPT (an LLM without available weights). RESULTS The comparison of sentiment analysis tools revealed high variability and disagreement across the evaluated tools when applied to health-related survey data. OPT and ChatGPT demonstrated superior performance, outperforming all other sentiment analysis tools. Moreover, ChatGPT outperformed OPT, exhibited higher accuracy by 6% and higher F-measure by 4% to 7%. CONCLUSIONS This study demonstrates the effectiveness of LLMs, particularly the few-shot learning and zero-shot learning approaches, in the sentiment analysis of health-related survey data. These results have implications for saving human labor and improving efficiency in sentiment analysis tasks, contributing to advancements in the field of automated sentiment analysis.
Collapse
Affiliation(s)
| | - Rachel Weger
- School of Medicine, University of Pittsburgh, Pittsburgh, PA, United States
| | - Angela Y Lee
- Department of Communication, Stanford University, Stanford, CA, United States
| | - Emily P Guinee
- National Institute of Mental Health, National Institutes of Health, Bethesda, MD, United States
| | - Joyce Chung
- National Institute of Mental Health, National Institutes of Health, Bethesda, MD, United States
| | - Lauren Atlas
- National Center For Complementary and Alternative Medicine, National Institutes of Health, Bethesda, MD, United States
| | - Eleni Linos
- School of Medicine, Stanford University, Stanford, CA, United States
| | - Francisco Pereira
- National Institute of Mental Health, National Institutes of Health, Bethesda, MD, United States
| |
Collapse
|
10
|
Khalid ET, Salah Khalefa M, Yassen W, Adil Yassin A. Omicron virus emotions understanding system based on deep learning architecture. JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING 2023; 14:9497-9507. [PMID: 37288131 PMCID: PMC10113983 DOI: 10.1007/s12652-023-04615-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/03/2022] [Accepted: 04/04/2023] [Indexed: 06/09/2023]
Abstract
Emotions understanding has acquired a significant interest in the last few years because it has introduced remarkable services in many aspects regarding public opinion mining and recognition in the field of marketing, seeking product reviews, reviews of movies, and healthcare issues based on sentiment understanding. This conducted research has utilized the issue of Omicron virus as a case study to implement a emotions analysis framework to explore the global attitude and sentiment toward Omicron variant as an expression of Positive feeling, Neutral, and Negative feeling. Because since December 2021. Omicron variant has gained obvious attention and wide discussions on social media platforms that revealed lots of fears and anxiety feeling, due to its rapid spreading and infection ability between humans that could exceed the Delta variant infection. Therefore, this paper proposes to develop a framework utilizes techniques of natural languages processing (NLP) in deep learning methods using neural network model of Bidirectional-Long-Short-Term-Memory (Bi-LSTM) and deep neural network (DNN) to achieve accurate results. This study utilizes textual data collected and pulled from the Twitter platform (users' tweets) for the time interval from 11-Dec.-2021 to 18-Dec.-2021. Consequently, the overall achieved accuracy for the developed model is 0.946%. The produced results from carrying out the proposed framework for sentiment understanding have recorded Negative sentiment at 42.3%, Positive sentiment at 35.8%, and Neutral sentiment at 21.9% of overall extracted tweets. The acquired accuracy using data of validation for the deployed model is 0.946%.
Collapse
Affiliation(s)
- Eman Thabet Khalid
- Department of Computer Sciences, College of Education for Pure Sciences, University of Basrah, Basrah, 6100 Iraq
| | - Mustafa Salah Khalefa
- Department of Computer Sciences, College of Education for Pure Sciences, University of Basrah, Basrah, 6100 Iraq
| | - Wijdan Yassen
- Department of Computer Sciences, College of Education for Pure Sciences, University of Basrah, Basrah, 6100 Iraq
| | - Ali Adil Yassin
- Department of Computer Sciences, College of Education for Pure Sciences, University of Basrah, Basrah, 6100 Iraq
| |
Collapse
|
11
|
Tiwari D, Nagpal B, Bhati BS, Mishra A, Kumar M. A systematic review of social network sentiment analysis with comparative study of ensemble-based techniques. Artif Intell Rev 2023; 56:1-55. [PMID: 37362894 PMCID: PMC10091348 DOI: 10.1007/s10462-023-10472-w] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/22/2023] [Indexed: 06/28/2023]
Abstract
Sentiment Analysis (SA) of text reviews is an emerging concern in Natural Language Processing (NLP). It is a broadly active method for analyzing and extracting opinions from text using individual or ensemble learning techniques. This field has unquestionable potential in the digital world and social media platforms. Therefore, we present a systematic survey that organizes and describes the current scenario of the SA and provides a structured overview of proposed approaches from traditional to advance. This work also discusses the SA-related challenges, feature engineering techniques, benchmark datasets, popular publication platforms, and best algorithms to advance the automatic SA. Furthermore, a comparative study has been conducted to assess the performance of bagging and boosting-based ensemble techniques for social network SA. Bagging and Boosting are two major approaches of ensemble learning that contain various ensemble algorithms to classify sentiment polarity. Recent studies recommend that ensemble learning techniques have the potential of applicability for sentiment classification. This analytical study examines the bagging and boosting-based ensemble techniques on four benchmark datasets to provide extensive knowledge regarding ensemble techniques for SA. The efficiency and accuracy of these techniques have been measured in terms of TPR, FPR, Weighted F-Score, Weighted Precision, Weighted Recall, Accuracy, ROC-AUC curve, and Run-Time. Moreover, comparative results reveal that bagging-based ensemble techniques outperformed boosting-based techniques for text classification. This extensive review aims to present benchmark information regarding social network SA that will be helpful for future research in this field.
Collapse
Affiliation(s)
- Dimple Tiwari
- Ambedkar Institute of Advanced Communication Technologies and Research (GGSIPU), Delhi, India
| | - Bharti Nagpal
- NSUT East Campus (Formerly Ambedkar Institute of Advanced Communication Technologies and Research), Delhi, India
| | | | - Ashutosh Mishra
- School of Integrated Technology, Yonsei University, Seoul, South Korea
- Department of Electronics & Communication Engineering, Graphic Era Deemed to be University, Dehradun, India
| | - Manoj Kumar
- Faculty of Engineering and Information Sciences, University of Wollongong in Dubai, Dubai Knowledge Park, Dubai, UAE
- MEU Research Unit, Faculty of Information Technology, Middle East University, Amman 11831, Jordan
| |
Collapse
|
12
|
Gupta K, Bajaj V. Deep learning models-based CT-scan image classification for automated screening of COVID-19. Biomed Signal Process Control 2023; 80:104268. [PMID: 36267466 PMCID: PMC9556167 DOI: 10.1016/j.bspc.2022.104268] [Citation(s) in RCA: 20] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2022] [Revised: 09/07/2022] [Accepted: 09/26/2022] [Indexed: 02/01/2023]
Abstract
COVID-19 is the most transmissible disease, caused by the SARS-CoV-2 virus that severely infects the lungs and the upper respiratory tract of the human body. This virus badly affected the lives and wellness of millions of people worldwide and spread widely. Early diagnosis, timely treatment, and proper confinement of the infected patients are some possible ways to control the spreading of coronavirus. Computed tomography (CT) scanning has proven useful in diagnosing several respiratory lung problems, including COVID-19 infections. Automated detection of COVID-19 using chest CT-scan images may reduce the clinician's load and save the lives of thousands of people. This study proposes a robust framework for the automated screening of COVID-19 using chest CT-scan images and deep learning-based techniques. In this work, a publically accessible CT-scan image dataset (contains the 1252 COVID-19 and 1230 non-COVID chest CT images), two pre-trained deep learning models (DLMs) namely, MobileNetV2 and DarkNet19, and a newly-designed lightweight DLM, are utilized for the automated screening of COVID-19. A repeated ten-fold holdout validation method is utilized for the training, validation, and testing of DLMs. The highest classification accuracy of 98.91% is achieved using transfer-learned DarkNet19. The proposed framework is ready to be tested with more CT images. The simulation results with the publicly available COVID-19 CT scan image dataset are included to show the effectiveness of the presented study.
Collapse
|
13
|
Aslan S, Kızıloluk S, Sert E. TSA-CNN-AOA: Twitter sentiment analysis using CNN optimized via arithmetic optimization algorithm. Neural Comput Appl 2023; 35:10311-10328. [PMID: 36714074 PMCID: PMC9867606 DOI: 10.1007/s00521-023-08236-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2022] [Accepted: 01/06/2023] [Indexed: 01/21/2023]
Abstract
COVID-19, a novel virus from the coronavirus family, broke out in Wuhan city of China and spread all over the world, killing more than 5.5 million people. The speed of spreading is still critical as an infectious disease, and it causes more and more deaths each passing day. COVID-19 pandemic has resulted in many different psychological effects on people's mental states, such as anxiety, fear, and similar complex feelings. Millions of people worldwide have shared their opinions on COVID-19 on several social media websites, particularly on Twitter. Therefore, it is likely to minimize the negative psychological impact of the disease on society by obtaining individuals' views on COVID-19 from social media platforms, making deductions from their statements, and identifying negative statements about the disease. In this respect, Twitter sentiment analysis (TSA), a recently popular research topic, is used to perform data analysis on social media platforms such as Twitter and reach certain conclusions. The present study, too, proposes TSA using convolutional neural network optimized via arithmetic optimization algorithm (TSA-CNN-AOA) approach. Firstly, using a designed API, 173,638 tweets about COVID-19 were extracted from Twitter between July 25, 2020, and August 30, 2020 to create a database. Later, significant information was extracted from this database using FastText Skip-gram. The proposed approach benefits from a designed convolutional neural network (CNN) model as a feature extractor. Thanks to arithmetic optimization algorithm (AOA), a feature selection process was also applied to the features obtained from CNN. Later, K-nearest neighbors (KNN), support vector machine, and decision tree were used to classify tweets as positive, negative, and neutral. In order to measure the TSA performance of the proposed method, it was compared with different approaches. The results demonstrated that TSA-CNN-AOA (KNN) achieved the highest tweet classification performance with an accuracy rate of 95.098. It is evident from the experimental studies that the proposed approach displayed a much higher TSA performance compared to other similar approaches in the existing literature.
Collapse
Affiliation(s)
- Serpil Aslan
- Department of Software Engineering, Faculty of Engineering and Natural Sciences, Malatya Turgut Ozal University, 44210 Malatya, Turkey
| | - Soner Kızıloluk
- Department of Computer Engineering, Faculty of Engineering and Natural Sciences, Malatya Turgut Ozal University, 44210 Malatya, Turkey
| | - Eser Sert
- Department of Computer Engineering, Faculty of Engineering and Natural Sciences, Malatya Turgut Ozal University, 44210 Malatya, Turkey
| |
Collapse
|
14
|
Swapnarekha H, Nayak J, Behera HS, Dash PB, Pelusi D. An optimistic firefly algorithm-based deep learning approach for sentiment analysis of COVID-19 tweets. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2023; 20:2382-2407. [PMID: 36899539 DOI: 10.3934/mbe.2023112] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]
Abstract
The unprecedented rise in the number of COVID-19 cases has drawn global attention, as it has caused an adverse impact on the lives of people all over the world. As of December 31, 2021, more than 2, 86, 901, 222 people have been infected with COVID-19. The rise in the number of COVID-19 cases and deaths across the world has caused fear, anxiety and depression among individuals. Social media is the most dominant tool that disturbed human life during this pandemic. Among the social media platforms, Twitter is one of the most prominent and trusted social media platforms. To control and monitor the COVID-19 infection, it is necessary to analyze the sentiments of people expressed on their social media platforms. In this study, we proposed a deep learning approach known as a long short-term memory (LSTM) model for the analysis of tweets related to COVID-19 as positive or negative sentiments. In addition, the proposed approach makes use of the firefly algorithm to enhance the overall performance of the model. Further, the performance of the proposed model, along with other state-of-the-art ensemble and machine learning models, has been evaluated by using performance metrics such as accuracy, precision, recall, the AUC-ROC and the F1-score. The experimental results reveal that the proposed LSTM + Firefly approach obtained a better accuracy of 99.59% when compared with the other state-of-the-art models.
Collapse
Affiliation(s)
- H Swapnarekha
- Department of Information Technology, Aditya Institute of Technology and Management (AITAM), Tekkali, Andhra Pradesh 532201, India
- Department of Information Technology, Veer Surendra Sai University of Technology, Burla 768018, India
| | - Janmenjoy Nayak
- Department of Computer Science, Maharaja Sriram Chandra Bhanja Deo University, Baripada, Odisha 757003, India
| | - H S Behera
- Department of Information Technology, Veer Surendra Sai University of Technology, Burla 768018, India
| | - Pandit Byomakesha Dash
- Department of Information Technology, Aditya Institute of Technology and Management (AITAM), Tekkali, Andhra Pradesh 532201, India
| | - Danilo Pelusi
- Communication Sciences, University of Teramo, Coste Sant'agostino Campus, Teramo 64100, Italy
| |
Collapse
|
15
|
Kour H, Gupta MK. AI Assisted Attention Mechanism for Hybrid Neural Model to Assess Online Attitudes About COVID-19. Neural Process Lett 2022; 55:1-40. [PMID: 36575702 PMCID: PMC9780630 DOI: 10.1007/s11063-022-11112-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/10/2022] [Indexed: 12/24/2022]
Abstract
COVID-19 is a novel virus that presents challenges due to a lack of consistent and in-depth research. The news of the COVID-19 spreads across the globe, resulting in a flood of posts on social media sites. Apart from health, social, and economic disturbances brought by the COVID-19 pandemic, another important consequence involves public mental health crises which is of greater concern. Data related to COVID-19 is a valuable asset for researchers in understanding people's feelings related to the pandemic. It is thus important to extract the early information evolving public sentiments on social platforms during the outbreak of COVID-19. The objective of this study is to look at people's perceptions of the COVID-19 pandemic who interact with each other and share tweets on the Twitter platform. COVIDSenti, a large-scale benchmark dataset comprising 90,000 COVID-19 tweets collected from February to March 2020, during the initial phases of the outbreak served as the foundation for our experiments. A pre-trained bidirectional encoder representations from transformers (BERT) model is fine-tuned and embeddings generated are combined with two long short-term memory networks to propose the residual encoder transformation network model. The proposed model is used for multiclass text classification on a large dataset labeled as positive, negative, and neutral. The experimental outcomes validate that: (1) the proposed model is the best performing model, with 98% accuracy and 96% F1-score; (2) It also outperforms conventional machine learning algorithms and different variants of BERT, and (3) the approach achieves better results as compared to state-of-the-art on different benchmark datasets.
Collapse
Affiliation(s)
- Harnain Kour
- Department of Computer Science and Engineering, Shri Mata Vaishno Devi University, Katra, India
| | - Manoj K. Gupta
- Department of Computer Science and Engineering, Shri Mata Vaishno Devi University, Katra, India
| |
Collapse
|
16
|
Kumari S, Pushphavathi TP. Intelligent lead-based bidirectional long short term memory for COVID-19 sentiment analysis. SOCIAL NETWORK ANALYSIS AND MINING 2022; 13:1. [PMID: 36532863 PMCID: PMC9734439 DOI: 10.1007/s13278-022-01005-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2022] [Revised: 11/12/2022] [Accepted: 11/19/2022] [Indexed: 12/12/2022]
Abstract
Social media is an online platform with millions of users and is utilized to spread news, information, world events, discuss ideas, etc. During the COVID-19 pandemic, information and ideas are shared by users both officially and by citizens. Here, the detection of useful content from social media is a challenging task. Hence, natural language processing (NLP) and deep learning are widely utilized for the analysis of the emotions of people during the COVID-19 pandemic. Hence, this research introduces a deep learning mechanism for identifying the sentiment of the people by considering the online Twitter data regarding COVID-19. The intelligent lead-based BiLSTM is utilized to analyze people's sentiments. Here, the loss of the classifier while learning the data is eliminated through the incorporation of the intelligent lead optimization. Hence, the loss is reduced, and a more accurate analysis is obtained. The intelligent lead optimization is devised by considering the role of the informer in identifying the enemy base to safeguard the territory from attack along with the Monarch's knowledge. The performance of the intelligent lead-based BiLSTM for the sentiment analysis is assessed using the metrics like accuracy, sensitivity, and specificity and obtained the values of 96.11, 99.22, and 95.35%, respectively, which are 14.24, 10.45, and 26.57% enhanced performance compared to the baseline KNN technique.
Collapse
Affiliation(s)
- Santoshi Kumari
- Computer Science and Engineering, M S Ramaiah University of Applied Sciences, No. 470P, 4th Phase, Peenya Industrial Area, Bangalore, 560058 India
| | - T. P. Pushphavathi
- Computer Science and Engineering, M S Ramaiah University of Applied Sciences, No. 470P, 4th Phase, Peenya Industrial Area, Bangalore, 560058 India
| |
Collapse
|
17
|
Ahmad W, Wang B, Martin P, Xu M, Xu H. Enhanced sentiment analysis regarding COVID-19 news from global channels. JOURNAL OF COMPUTATIONAL SOCIAL SCIENCE 2022; 6:19-57. [PMID: 36465148 PMCID: PMC9702932 DOI: 10.1007/s42001-022-00189-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/23/2022] [Accepted: 11/06/2022] [Indexed: 05/05/2023]
Abstract
For a healthy society to exist, it is crucial for the media to focus on disease-related issues so that more people are widely aware of them and reduce health risks. Recently, deep neural networks have become a popular tool for textual sentiment analysis, which can provide valuable insights and real-time monitoring and analysis regarding health issues. In this paper, as part of an effort to develop an effective model that can elicit public sentiment on COVID-19 news, we propose a novel approach Cov-Att-BiLSTM for sentiment analysis of COVID-19 news headlines using deep neural networks. We integrate attention mechanisms, embedding techniques, and semantic level data labeling into the prediction process to enhance the accuracy. To evaluate the proposed approach, we compared it to several deep and machine learning classifiers using various metrics of categorization efficiency and prediction quality, and the experimental results demonstrate its superiority with 0.931 testing accuracy. Furthermore, 73,138 pandemic-related tweets posted on six global channels were analyzed by the proposed approach, which accurately reflects global coverage of COVID-19 news and vaccination.
Collapse
Affiliation(s)
- Waseem Ahmad
- School of Electronic Information and Communications, Huazhong University of Science and Technology, Wuhan, China
| | - Bang Wang
- School of Electronic Information and Communications, Huazhong University of Science and Technology, Wuhan, China
| | - Philecia Martin
- School of Electronic Information and Communications, Huazhong University of Science and Technology, Wuhan, China
| | - Minghua Xu
- School of Journalism and Information Communication, Huazhong University of Science and Technology, Wuhan, China
| | - Han Xu
- School of Journalism and Information Communication, Huazhong University of Science and Technology, Wuhan, China
| |
Collapse
|
18
|
Qiu D, Yu Y, Chen L. Emotion Analysis of COVID-19 Vaccines Based on a Fuzzy Convolutional Neural Network. Cognit Comput 2022; 16:1-15. [PMID: 36406893 PMCID: PMC9666947 DOI: 10.1007/s12559-022-10068-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2021] [Accepted: 10/16/2022] [Indexed: 11/17/2022]
Abstract
COVID-19 created immense global challenges in 2020, and the world will live under its threat indefinitely. Much of the information on social media supported the government in addressing this major public health event. On January 9, to control the virus, the Chinese government announced universal vaccinations. However, due to a range of varied interpretations, people held different attitudes towards vaccination. Therefore, the success of the mass immunization strategy greatly depended on the public perception of the COVID-19 vaccine. This article explores the changes in people's emotional attitudes towards vaccines and the reasons behind them in the context of the global pandemic in an effort to help mankind overcome this ongoing crisis. For this article, microblogs from January to September containing Chinese people's responses to the COVID-19 vaccines were collected. Based on fuzzy logic and deep learning, we advance the hypothesis that fuzzy vector adaptive improvements will make it possible to better express language emotion and that fuzzy emotion vectors can be integrated into deep learning models, thus making these models more interpretable. Based on this assumption, we design a deep learning model with a fuzzy emotion vector. The experimental results show the positive effect of this model. By applying the model in analyses of people's attitudes towards vaccines, we can obtain people's attitudes towards vaccines in different time periods. We discovered that the most negative emotions about the vaccine appeared in April and that the most positive emotions about the vaccine appeared in February. Combined with word cloud technology and the LDA model, we can effectively explore the reasons for the changes in vaccine attitudes. Our findings show that people's negative emotions about the vaccine are always higher than their positive emotions about the vaccine and that people's attitudes towards the vaccine are closely related to the progress of the epidemic. There is also a certain relationship between people's attitudes towards the vaccine and those towards the vaccination.
Collapse
Affiliation(s)
- Dong Qiu
- School of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Nanan, Chongqing 400065 China
- College of Science, Chongqing University of Posts and Telecommunications, Nanan, Chongqing 400065 China
- School of Mathematics and Information Science, Guangxi University, Nanning, China
| | - Yang Yu
- School of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Nanan, Chongqing 400065 China
| | - Lei Chen
- School of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Nanan, Chongqing 400065 China
| |
Collapse
|
19
|
Public Health Policy Monitoring through Public Perceptions: A Case of COVID-19 Tweet Analysis. INFORMATION 2022. [DOI: 10.3390/info13110543] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Since the start of the COVID-19 pandemic, government authorities have responded by issuing new public health policies, many of which were intended to contain its spread but ended up limiting economic and social activities. The citizen responses to these policies are diverse, ranging from goodwill to fear and anger. It is challenging to determine whether or not these public health policies achieved the intended impact. This requires systematic data collection and scientific studies, which can be very time-consuming. To overcome such challenges, in this paper, we provide an alternative approach to continuously monitor and dynamically make sense of how public health policies impact citizens. Our approach is to continuously collect Twitter posts related to COVID-19 policies and to analyze the public reactions. We have developed a web-based system that collects tweets daily and generates timelines and geographical displays of citizens’ “concern levels”. Tracking the public reactions towards different policies can help government officials assess the policy impacts in a more dynamic and real-time manner. For this paper, we collected and analyzed over 16 million tweets related to ten policies over a 10-month period. We obtained several findings; for example, the “COVID-19 (General)” and ”Ventilators” policies engendered the highest concern levels, while the “Face Coverings” policy caused the lowest. Nine out of ten policies exhibited significant changes in concern levels during the observation period.
Collapse
|
20
|
Chen Y, Zhang Z. An easy numeric data augmentation method for early-stage COVID-19 tweets exploration of participatory dynamics of public attention and news coverage. Inf Process Manag 2022; 59:103073. [PMID: 36061343 PMCID: PMC9420706 DOI: 10.1016/j.ipm.2022.103073] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2022] [Revised: 08/21/2022] [Accepted: 08/25/2022] [Indexed: 11/05/2022]
Abstract
With the onset of COVID-19, the pandemic has aroused huge discussions on social media like Twitter, followed by many social media analyses concerning it. Despite such an abundance of studies, however, little work has been done on reactions from the public and officials on social networks and their associations, especially during the early outbreak stage. In this paper, a total of 9,259,861 COVID-19-related English tweets published from 31 December 2019 to 11 March 2020 are accumulated for exploring the participatory dynamics of public attention and news coverage during the early stage of the pandemic. An easy numeric data augmentation (ENDA) technique is proposed for generating new samples while preserving label validity. It attains superior performance on text classification tasks with deep models (BERT) than an easier data augmentation method. To demonstrate the efficacy of ENDA further, experiments and ablation studies have also been implemented on other benchmark datasets. The classification results of COVID-19 tweets show tweets peaks trigged by momentous events and a strong positive correlation between the daily number of personal narratives and news reports. We argue that there were three periods divided by the turning points on January 20 and February 23 and the low level of news coverage suggests the missed windows for government response in early January and February. Our study not only contributes to a deeper understanding of the dynamic patterns and relationships of public attention and news coverage on social media during the pandemic but also sheds light on early emergency management and government response on social media during global health crises.
Collapse
|
21
|
Jayachandran S, Dumala A. Recurrent neural network based sentiment analysis of social media data during corona pandemic under national lockdown. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2022. [DOI: 10.3233/jifs-221883] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
The Corona virus pandemic has affected the normal course of life. People all over the world take the social media to express their opinions and general emotions regarding this phenomenon. In a relatively short period of time, tweets about the new Corona virus increased by an amount never before seen on the social networking site Twitter. In this research work, Sentiment Analysis of Social Media Data to Identify the Feelings of Indians during Corona Pandemic under National Lockdown using recurrent neural network is proposed. The proposed method is analyzed using four steps: that is Data collection, data preparation, Building sentiment analysis model and Visualization of the results. For Data collection, the twitter dataset are collected from social networking platform twitter by application programming interface. For Data preparation, the input data set are pre-processed for removing URL links, removing unnecessary spaces, removing punctuations and numbers. After data cleaning or preprocessing entire particular characters and non-US characters from Standard Code for Information Interchange, apart from hash tag, are extracted as refined tweet text. In addition, entire behaviors less than three alphabets are not assumed at analysis of tweets, lastly, tokenization and derivation was carried out by Porter Stemmer to perform opinion mining. To authenticate the method, categorized the tweets linked to COVID-19 national lockdown. For categorization, recurrent neural method is used. RNN classify the sentiment classification as positive, negative and neutral sentiment scores. The efficiency of the proposed RNN based Sentimental analysis classification of COVID-19 is assessed various performances by evaluation metrics, like sensitivity, precision, recall, f-measure, specificity and accuracy. The proposed method attains 24.51%, 25.35%, 31.45% and 24.53% high accuracy, 43.51%, 52.35%, 21.45% and 28.53% high sensitivity than the existing methods.
Collapse
Affiliation(s)
- Shana Jayachandran
- Department of Computer Applications, Coimbatore Institute of Technology, Coimbatore, Tamilnadu, India
| | - Anveshini Dumala
- Department of Information Technology, Vignan’s Nirula Institute of Technology and Science for Women, Guntur, India
| |
Collapse
|
22
|
Fattoh IE, Kamal Alsheref F, Ead WM, Youssef AM. Semantic Sentiment Classification for COVID-19 Tweets Using Universal Sentence Encoder. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:6354543. [PMID: 36248924 PMCID: PMC9556213 DOI: 10.1155/2022/6354543] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/18/2022] [Revised: 08/30/2022] [Accepted: 09/23/2022] [Indexed: 11/17/2022]
Abstract
The spread of data on the web has increased in the last twenty years. One of the reasons is the appearance of social media. The data on social sites describe many real-life events in our daily lives. In the period of the COVID-19 pandemic, a lot of people and media organizations were writing and documenting their health status and the latest news about the coronavirus on social media. Using these tweets (sentiments) about the coronavirus and analyzing them in a computational model can help decision makers in measuring public opinion and yielding remarkable findings. In this research article, we introduce a deep learning sentiment analysis model based on Universal Sentence Encoder. The dataset used in this research was collected from Twitter, and it was classified as positive, neutral, and negative. The sentence embedding model determines the meaning of word sequences instead of individual words. The model divides the dataset into training and testing and depends on the sentence similarity in detecting sentiment class. The obtained accuracy results reached 78.062%, and this result outperforms many traditional ML classifiers based on TF-IDF applied on the same dataset and another model based on the CNN classifier.
Collapse
Affiliation(s)
- Ibrahim Eldesouky Fattoh
- Computer Science Department, Faculty of Computers and Artificial Intelligence, Beni-Suef University, Beni-Suef, Egypt
| | - Fahad Kamal Alsheref
- Information Systems Department, Faculty of Computers and Artificial Intelligence, Beni-Suef University, Beni-Suef, Egypt
| | - Waleed M. Ead
- Information Systems Department, Faculty of Computers and Artificial Intelligence, Beni-Suef University, Beni-Suef, Egypt
| | | |
Collapse
|
23
|
Zulfiker MS, Kabir N, Biswas AA, Zulfiker S, Uddin MS. Analyzing the public sentiment on COVID-19 vaccination in social media: Bangladesh context. ARRAY 2022; 15:100204. [PMID: 35722449 PMCID: PMC9188682 DOI: 10.1016/j.array.2022.100204] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2022] [Revised: 05/31/2022] [Accepted: 06/03/2022] [Indexed: 01/25/2023] Open
Abstract
Since December 2019, the world has been fighting against the COVID-19 pandemic. This epidemic has revealed a bitter truth that though humans have advanced to unprecedented heights in the last few decades in terms of technology, they are lagging far behind in the fields of medical science and health care. Several institutes and research organizations have stepped up to introduce different vaccines to combat the pandemic. Bangladesh government has also taken steps to provide widespread vaccinations from January 2021. The Bangladeshi netizens are frequently sharing their thoughts, emotions, and experiences about the COVID-19 vaccines and the vaccination process on different social media sites like Facebook, Twitter, etc. This study has analyzed the views and opinions that they have expressed on different social media platforms about the vaccines and the ongoing vaccination program. For performing this study, the reactions of the Bangladeshi netizens on social media have been collected. The Latent Dirichlet Allocation (LDA) model has been used to extract the most common topics expressed by the netizens regarding the vaccines and vaccination process in Bangladesh. Finally, this study has applied different deep learning as well as traditional machine learning algorithms to identify the sentiments and polarity of the opinions of the netizens. The performance of these models has been assessed using a variety of metrics such as accuracy, precision, sensitivity, specificity, and F1-score to identify the best one. Sentiment analysis lessons from these opinions can help the government to prepare itself for the future pandemic.
Collapse
Affiliation(s)
- Md Sabab Zulfiker
- Department of Computer Science and Engineering, Daffodil International University, Dhaka, Bangladesh
| | - Nasrin Kabir
- Department of Computer Science and Engineering, Jahangirnagar University, Dhaka, Bangladesh
| | - Al Amin Biswas
- Department of Computer Science and Engineering, Daffodil International University, Dhaka, Bangladesh
| | - Sunjare Zulfiker
- Department of Electrical and Computer Engineering, North South University, Dhaka, Bangladesh
| | - Mohammad Shorif Uddin
- Department of Computer Science and Engineering, Jahangirnagar University, Dhaka, Bangladesh
| |
Collapse
|
24
|
A machine learning approach in analysing the effect of hyperboles using negative sentiment tweets for sarcasm detection. JOURNAL OF KING SAUD UNIVERSITY - COMPUTER AND INFORMATION SCIENCES 2022. [DOI: 10.1016/j.jksuci.2022.01.008] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|
25
|
Ahmad A, Rustam F, Saad E, Siddique MA, Lee E, Mansilla AO, Díez IDLT, Ashraf I. Analyzing preventive precautions to limit spread of COVID-19. PLoS One 2022; 17:e0272350. [PMID: 36001556 PMCID: PMC9401132 DOI: 10.1371/journal.pone.0272350] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2022] [Accepted: 07/19/2022] [Indexed: 01/08/2023] Open
Abstract
With the global spread of COVID-19, the governments advised the public for adopting safety precautions to limit its spread. The virus spreads from people, contaminated places, and nozzle droplets that necessitate strict precautionary measures. Consequently, different safety precautions have been implemented to fight COVID-19 such as wearing a facemask, restriction of social gatherings, keeping 6 feet distance, etc. Despite the warnings, highlighted need for such measures, and the increasing severity of the pandemic situation, the expected number of people adopting these precautions is low. This study aims at assessing and understanding the public perception of COVID-19 safety precautions, especially the use of facemask. A unified framework of sentiment lexicon with the proposed ensemble EB-DT is devised to analyze sentiments regarding safety precautions. Extensive experiments are performed with a large dataset collected from Twitter. In addition, the factors leading to a negative perception of safety precautions are analyzed by performing topic analysis using the Latent Dirichlet allocation algorithm. The experimental results reveal that 12% of the tweets correspond to negative sentiments towards facemask precaution mainly by its discomfort. Analysis of change in peoples' sentiment over time indicates a gradual increase in the positive sentiments regarding COVID-19 restrictions.
Collapse
Affiliation(s)
- Ayaz Ahmad
- Department of Computer Science, Khawaja Fareed University of Engineering and Information Technology, Rahim Yar Khan, Pakistan
| | - Furqan Rustam
- Department of Software Engineering, School of Systems and Technology, University of Management and Technology Lahore, Lahore, Pakistan
| | - Eysha Saad
- Department of Computer Science, Khawaja Fareed University of Engineering and Information Technology, Rahim Yar Khan, Pakistan
| | - Muhammad Abubakar Siddique
- Department of Computer Science, Khawaja Fareed University of Engineering and Information Technology, Rahim Yar Khan, Pakistan
| | - Ernesto Lee
- Department of Computer Science, Broward College, Broward County, Florida, United States of America
| | - Arturo Ortega Mansilla
- European University of The Atlantic, Santander, Spain
- Iberoamerican International University, Campeche, Mexico
| | - Isabel de la Torre Díez
- Department of Signal Theory and Communications and Telematic Engineering, Unviersity of Valladolid, Valladolid, Spain
| | - Imran Ashraf
- Information and Communication Engineering, Yeungnam University, Gyeongsan, Korea
| |
Collapse
|
26
|
Gao Y, Zhao C, Sun B, Zhao W. Effects of investor sentiment on stock volatility: new evidences from multi-source data in China's green stock markets. FINANCIAL INNOVATION 2022; 8:77. [PMID: 36034681 PMCID: PMC9395953 DOI: 10.1186/s40854-022-00381-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/04/2022] [Accepted: 08/11/2022] [Indexed: 06/15/2023]
Abstract
The effect of investor sentiment on stock volatility is a highly attractive research question in both the academic field and the real financial industry. With the proposal of China's "dual carbon" target, green stocks have gradually become an essential branch of Chinese stock markets. Focusing on 106 stocks from the new energy, environmental protection, and carbon-neutral sectors, we construct two investor sentiment proxies using Internet text and stock trading data, respectively. The Internet sentiment is based on posts from Eastmoney Guba, and the trading sentiment comes from a variety of trading indicators. In addition, we divide the realized volatility into continuous and jump parts, and then investigate the effects of investor sentiment on different types of volatilities. Our empirical findings show that both sentiment indices impose significant positive impacts on realized, continuous, and jump volatilities, where trading sentiment is the main factor. We further explore the mediating effect of information asymmetry, measured by the volume-synchronized probability of informed trading (VPIN), on the path of investor sentiment affecting stock volatility. It is evidenced that investor sentiments are positively correlated with the VPIN, and they can affect volatilities through the VPIN. We then divide the total sample around the coronavirus disease 2019 (COVID-19) pandemic. The empirical results reveal that the market volatility after the COVID-19 pandemic is more susceptible to investor sentiments, especially to Internet sentiment. Our study is of great significance for maintaining the stability of green stock markets and reducing market volatility.
Collapse
Affiliation(s)
- Yang Gao
- School of Economics and Management, Beijing University of Technology, No. 100 Pingleyuan, Chaoyang District, Beijing, 100124 People’s Republic of China
| | - Chengjie Zhao
- School of Economics and Management, Beijing University of Technology, No. 100 Pingleyuan, Chaoyang District, Beijing, 100124 People’s Republic of China
| | - Bianxia Sun
- Department of Finance, Southern University of Science and Technology, Room 3#317, Wisdom Valley, No. 1088 Xueyuan Rd., Nanshan District, Shenzhen, 518055 People’s Republic of China
| | - Wandi Zhao
- School of Statistics, Capital University of Economics and Business, Fengtai District, Beijing, 100070 People’s Republic of China
| |
Collapse
|
27
|
Shukla D, Chandra G, Pandey B, Dwivedi SK. A comprehensive survey on sentiment analysis: Challenges and future insights. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2022. [DOI: 10.3233/jifs-213372] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
With the rise of social networks, people now express their sentiments more frequently and comfortably through their social media activities on different events, person, and every little thing surrounding them. This generates a lot of unstructured data; billions of users post tweets every day as a daily regime on Twitter itself. This has given rise to many texts classification and analysis tasks, Sentiment Analysis (SA) being one of them. Through SA, it is conferred whether the users have negative or positive orientations in their opinions; the results of this task are significantly useful for decision-makers in various fields. This paper presents various facets of SA, like the process followed in SA, levels, approaches, and sentences considered in SA. Aspects such as growth, techniques, the share of various platforms, and SA pipeline are also covered in this paper. At last, we have highlighted some major challenges in order to define future directions.
Collapse
Affiliation(s)
- Diksha Shukla
- Department of Computer Science, BBAU (A Central University), Lucknow
| | - Ganesh Chandra
- Department of Computer Science, BBAU (A Central University), Lucknow
| | - Babita Pandey
- Department of Computer Science, BBAU (A Central University), Lucknow
| | - Sanjay K. Dwivedi
- Department of Computer Science, BBAU (A Central University), Lucknow
| |
Collapse
|
28
|
Data-Driven Prediction of COVID-19 Daily New Cases through a Hybrid Approach of Machine Learning Unsupervised and Deep Learning. ATMOSPHERE 2022. [DOI: 10.3390/atmos13081205] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
Air pollution is associated with respiratory diseases and the transmission of infectious diseases. In this context, the association between meteorological factors and poor air quality possibly contributes to the transmission of COVID-19. Therefore, analyzing historical data of particulate matter (PM2.5, and PM10) and meteorological factors in indoor and outdoor environments to discover patterns that allow predicting future confirmed cases of COVID-19 is a challenge within a long pandemic. In this study, a hybrid approach based on machine learning and deep learning is proposed to predict confirmed cases of COVID-19. On the one hand, a clustering algorithm based on K-means allows the discovery of behavior patterns by forming groups with high cohesion. On the other hand, multivariate linear regression is implemented through a long short-term memory (LSTM) neural network, building a reliable predictive model in the training stage. The LSTM prediction model is evaluated through error metrics, achieving the highest performance and accuracy in predicting confirmed cases of COVID-19, using data of PM2.5 and PM10 concentrations and meteorological factors of the outdoor environment. The predictive model obtains a root-mean-square error (RMSE) of 0.0897, mean absolute error (MAE) of 0.0837, and mean absolute percentage error (MAPE) of 0.4229 in the testing stage. When using a dataset of PM2.5, PM10, and meteorological parameters collected inside 20 households from 27 May to 13 October 2021, the highest performance is obtained with an RMSE of 0.0892, MAE of 0.0592, and MAPE of 0.2061 in the testing stage. Moreover, in the validation stage, the predictive model obtains a very acceptable performance with values between 0.4152 and 3.9084 for RMSE, and a MAPE of less than 4.1%, using three different datasets with indoor environment values.
Collapse
|
29
|
Heidari A, Jafari Navimipour N, Unal M, Toumaj S. Machine learning applications for COVID-19 outbreak management. Neural Comput Appl 2022; 34:15313-15348. [PMID: 35702664 PMCID: PMC9186489 DOI: 10.1007/s00521-022-07424-w] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2021] [Accepted: 05/10/2022] [Indexed: 12/29/2022]
Abstract
Recently, the COVID-19 epidemic has resulted in millions of deaths and has impacted practically every area of human life. Several machine learning (ML) approaches are employed in the medical field in many applications, including detecting and monitoring patients, notably in COVID-19 management. Different medical imaging systems, such as computed tomography (CT) and X-ray, offer ML an excellent platform for combating the pandemic. Because of this need, a significant quantity of study has been carried out; thus, in this work, we employed a systematic literature review (SLR) to cover all aspects of outcomes from related papers. Imaging methods, survival analysis, forecasting, economic and geographical issues, monitoring methods, medication development, and hybrid apps are the seven key uses of applications employed in the COVID-19 pandemic. Conventional neural networks (CNNs), long short-term memory networks (LSTM), recurrent neural networks (RNNs), generative adversarial networks (GANs), autoencoders, random forest, and other ML techniques are frequently used in such scenarios. Next, cutting-edge applications related to ML techniques for pandemic medical issues are discussed. Various problems and challenges linked with ML applications for this pandemic were reviewed. It is expected that additional research will be conducted in the upcoming to limit the spread and catastrophe management. According to the data, most papers are evaluated mainly on characteristics such as flexibility and accuracy, while other factors such as safety are overlooked. Also, Keras was the most often used library in the research studied, accounting for 24.4 percent of the time. Furthermore, medical imaging systems are employed for diagnostic reasons in 20.4 percent of applications.
Collapse
Affiliation(s)
- Arash Heidari
- Department of Computer Engineering, Tabriz Branch, Islamic Azad University, Tabriz, Iran
- Department of Computer Engineering, Shabestar Branch, Islamic Azad University, Shabestar, Iran
| | | | - Mehmet Unal
- Department of Computer Engineering, Nisantasi University, Istanbul, Turkey
| | - Shiva Toumaj
- Urmia University of Medical Sciences, Urmia, Iran
| |
Collapse
|
30
|
Venugopalan M, Gupta D. An enhanced guided LDA model augmented with BERT based semantic strength for aspect term extraction in sentiment analysis. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2022.108668] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
|
31
|
Sunitha D, Patra RK, Babu NV, Suresh A, Gupta SC. Twitter sentiment analysis using ensemble based deep learning model towards COVID-19 in India and European countries. Pattern Recognit Lett 2022; 158:164-170. [PMID: 35464347 PMCID: PMC9014659 DOI: 10.1016/j.patrec.2022.04.027] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2021] [Revised: 04/06/2022] [Accepted: 04/16/2022] [Indexed: 11/22/2022]
Abstract
As of November 2021, more than 24.80 crore people are diagnosed with the coronavirus in that around 50.20 lakhs people lost their lives, because of this infectious disease. By understanding the people's sentiment's expressed in their social media (Facebook, Twitter, Instagram etc.) helps their governments in controlling, monitoring, and eradicating the coronavirus. Compared to other social media's, the twitter data are indispensable in the extraction of useful awareness information related to any crisis. In this article, a sentiment analysis model is proposed to analyze the real time tweets, which are related to coronavirus. Initially, around 3100 Indian and European people's tweets are collected between the time period of 23.03.2020 to 01.11.2021. Next, the data pre-processing and exploratory investigation are accomplished for better understanding of the collected data. Further, the feature extraction is performed using Term Frequency-Inverse Document Frequency (TF-IDF), GloVe, pre-trained Word2Vec, and fast text embedding's. The obtained feature vectors are fed to the ensemble classifier (Gated Recurrent Unit (GRU) and Capsule Neural Network (CapsNet)) for classifying the user's sentiment's as anger, sad, joy, and fear. The obtained experimental outcomes showed that the proposed model achieved 97.28% and 95.20% of prediction accuracy in classifying the both Indian and European people's sentiments.
Collapse
Affiliation(s)
- D Sunitha
- Department of Computer Science & Engineering, Kamala Institute of Technology & Science, Singapur, Telangana 505468, India
| | | | - N V Babu
- Department of Electrical and Electronics Engineering, SJB Institute of Technology, Bangalore, India
| | - A Suresh
- Department of Computer Science and Engineering, Veltech Rangarajan Dr. Sagunthala R&D Institute of Science and Technology, Chennai, India
| | - Suresh Chand Gupta
- Department of Computer Science & Engineering, Panipat Institute of Engineering and Technology, Panipat, Haryana, India
| |
Collapse
|
32
|
COVID-19 Tweets Classification Based on a Hybrid Word Embedding Method. BIG DATA AND COGNITIVE COMPUTING 2022. [DOI: 10.3390/bdcc6020058] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Abstract
In March 2020, the World Health Organisation declared that COVID-19 was a new pandemic. This deadly virus spread and affected many countries in the world. During the outbreak, social media platforms such as Twitter contributed valuable and massive amounts of data to better assess health-related decision making. Therefore, we propose that users’ sentiments could be analysed with the application of effective supervised machine learning approaches to predict disease prevalence and provide early warnings. The collected tweets were prepared for preprocessing and categorised into: negative, positive, and neutral. In the second phase, different features were extracted from the posts by applying several widely used techniques, such as TF-IDF, Word2Vec, Glove, and FastText to capture features’ datasets. The novelty of this study is based on hybrid features extraction, where we combined syntactic features (TF-IDF) with semantic features (FastText and Glove) to represent posts accurately, which helps in improving the classification process. Experimental results show that FastText combined with TF-IDF performed better with SVM than the other models. SVM outperformed the other models by 88.72%, as well as for XGBoost, with an 85.29% accuracy score. This study shows that the hybrid methods proved their capability of extracting features from the tweets and increasing the performance of classification.
Collapse
|
33
|
Alkhaldi NA, Asiri Y, Mashraqi AM, Halawani HT, Abdel-Khalek S, Mansour RF. Leveraging Tweets for Artificial Intelligence Driven Sentiment Analysis on the COVID-19 Pandemic. Healthcare (Basel) 2022; 10:910. [PMID: 35628045 PMCID: PMC9141128 DOI: 10.3390/healthcare10050910] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2022] [Revised: 05/09/2022] [Accepted: 05/10/2022] [Indexed: 01/25/2023] Open
Abstract
The COVID-19 pandemic has been a disastrous event that has elevated several psychological issues such as depression given abrupt social changes and lack of employment. At the same time, social scientists and psychologists have gained significant interest in understanding the way people express emotions and sentiments at the time of pandemics. During the rise in COVID-19 cases with stricter lockdowns, people expressed their sentiments on social media. This offers a deep understanding of human psychology during catastrophic events. By exploiting user-generated content on social media such as Twitter, people's thoughts and sentiments can be examined, which aids in introducing health intervention policies and awareness campaigns. The recent developments of natural language processing (NLP) and deep learning (DL) models have exposed noteworthy performance in sentiment analysis. With this in mind, this paper presents a new sunflower optimization with deep-learning-driven sentiment analysis and classification (SFODLD-SAC) on COVID-19 tweets. The presented SFODLD-SAC model focuses on the identification of people's sentiments during the COVID-19 pandemic. To accomplish this, the SFODLD-SAC model initially preprocesses the tweets in distinct ways such as stemming, removal of stopwords, usernames, link punctuations, and numerals. In addition, the TF-IDF model is applied for the useful extraction of features from the preprocessed data. Moreover, the cascaded recurrent neural network (CRNN) model is employed to analyze and classify sentiments. Finally, the SFO algorithm is utilized to optimally adjust the hyperparameters involved in the CRNN model. The design of the SFODLD-SAC technique with the inclusion of an SFO algorithm-based hyperparameter optimizer for analyzing people's sentiments on COVID-19 shows the novelty of this study. The simulation analysis of the SFODLD-SAC model is performed using a benchmark dataset from the Kaggle repository. Extensive, comparative results report the promising performance of the SFODLD-SAC model over recent state-of-the-art models with maximum accuracy of 99.65%.
Collapse
Affiliation(s)
- Nora A. Alkhaldi
- Department of Computer Science, College of Computer Sciences and Information Technology, King Faisal University, Al-Ahsa 31982, Saudi Arabia;
| | - Yousef Asiri
- Department of Computer Science, College of Computer Science and Information Systems, Najran Univesity, Najran 61441, Saudi Arabia; (Y.A.); (A.M.M.)
| | - Aisha M. Mashraqi
- Department of Computer Science, College of Computer Science and Information Systems, Najran Univesity, Najran 61441, Saudi Arabia; (Y.A.); (A.M.M.)
| | - Hanan T. Halawani
- Department of Computer Science, College of Computer Science and Information Systems, Najran Univesity, Najran 61441, Saudi Arabia; (Y.A.); (A.M.M.)
| | - Sayed Abdel-Khalek
- Department of Mathematics, College of Science, Taif University, Taif 21944, Saudi Arabia;
| | - Romany F. Mansour
- Department of Mathematics, Faculty of Science, New Valley University, El-Kharga 72511, Egypt;
| |
Collapse
|
34
|
Shahi TB, Sitaula C, Paudel N. A Hybrid Feature Extraction Method for Nepali COVID-19-Related Tweets Classification. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:5681574. [PMID: 35281187 PMCID: PMC8906125 DOI: 10.1155/2022/5681574] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/07/2021] [Accepted: 02/10/2022] [Indexed: 12/20/2022]
Abstract
COVID-19 is one of the deadliest viruses, which has killed millions of people around the world to this date. The reason for peoples' death is not only linked to its infection but also to peoples' mental states and sentiments triggered by the fear of the virus. People's sentiments, which are predominantly available in the form of posts/tweets on social media, can be interpreted using two kinds of information: syntactical and semantic. Herein, we propose to analyze peoples' sentiment using both kinds of information (syntactical and semantic) on the COVID-19-related twitter dataset available in the Nepali language. For this, we, first, use two widely used text representation methods: TF-IDF and FastText and then combine them to achieve the hybrid features to capture the highly discriminating features. Second, we implement nine widely used machine learning classifiers (Logistic Regression, Support Vector Machine, Naive Bayes, K-Nearest Neighbor, Decision Trees, Random Forest, Extreme Tree classifier, AdaBoost, and Multilayer Perceptron), based on the three feature representation methods: TF-IDF, FastText, and Hybrid. To evaluate our methods, we use a publicly available Nepali-COVID-19 tweets dataset, NepCov19Tweets, which consists of Nepali tweets categorized into three classes (Positive, Negative, and Neutral). The evaluation results on the NepCOV19Tweets show that the hybrid feature extraction method not only outperforms the other two individual feature extraction methods while using nine different machine learning algorithms but also provides excellent performance when compared with the state-of-the-art methods.
Collapse
Affiliation(s)
- T. B. Shahi
- Central Department of Computer Science and Information Technology, Tribhuvan University, 44600 Kathmandu, Nepal
- School of Engineering and Technology, Central Queensland University, Rockhampton 4701, QLD, Australia
| | - C. Sitaula
- Central Department of Computer Science and Information Technology, Tribhuvan University, 44600 Kathmandu, Nepal
- Department of Electrical and Computer Systems Engineering, Monash University, Clayton 3800, VIC, Australia
| | - N. Paudel
- Central Department of Computer Science and Information Technology, Tribhuvan University, 44600 Kathmandu, Nepal
| |
Collapse
|
35
|
Li X, Zhang J, Du Y, Zhu J, Fan Y, Chen X. A Novel Deep Learning-based Sentiment Analysis Method Enhanced with Emojis in Microblog Social Networks. ENTERP INF SYST-UK 2022. [DOI: 10.1080/17517575.2022.2037160] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Affiliation(s)
- Xianyong Li
- School of Computer and Software Engineering, Xihua University, Chengdu China
| | - Jiabo Zhang
- School of Computer and Software Engineering, Xihua University, Chengdu China
- Chengdu Tianfu International Airport, Chengdu China
| | - Yajun Du
- School of Computer and Software Engineering, Xihua University, Chengdu China
| | - Jian Zhu
- Department of Mathematics and Physics, Xinjiang Institute of Engineering, Urumqi, China
| | - Yongquan Fan
- School of Computer and Software Engineering, Xihua University, Chengdu China
| | - Xiaoliang Chen
- School of Computer and Software Engineering, Xihua University, Chengdu China
| |
Collapse
|
36
|
Integration of Semantic Patterns and Fuzzy Concepts to Reduce the Boundary Region in Three-way Decision-making. Inf Sci (N Y) 2022. [DOI: 10.1016/j.ins.2022.02.036] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
37
|
Wu H, Zhang Z, Shi S, Wu Q, Song H. Phrase dependency relational graph attention network for Aspect-based Sentiment Analysis. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2021.107736] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
|
38
|
Data-Driven Analytics Leveraging Artificial Intelligence in the Era of COVID-19: An Insightful Review of Recent Developments. Symmetry (Basel) 2021. [DOI: 10.3390/sym14010016] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023] Open
Abstract
This paper presents the role of artificial intelligence (AI) and other latest technologies that were employed to fight the recent pandemic (i.e., novel coronavirus disease-2019 (COVID-19)). These technologies assisted the early detection/diagnosis, trends analysis, intervention planning, healthcare burden forecasting, comorbidity analysis, and mitigation and control, to name a few. The key-enablers of these technologies was data that was obtained from heterogeneous sources (i.e., social networks (SN), internet of (medical) things (IoT/IoMT), cellular networks, transport usage, epidemiological investigations, and other digital/sensing platforms). To this end, we provide an insightful overview of the role of data-driven analytics leveraging AI in the era of COVID-19. Specifically, we discuss major services that AI can provide in the context of COVID-19 pandemic based on six grounds, (i) AI role in seven different epidemic containment strategies (a.k.a non-pharmaceutical interventions (NPIs)), (ii) AI role in data life cycle phases employed to control pandemic via digital solutions, (iii) AI role in performing analytics on heterogeneous types of data stemming from the COVID-19 pandemic, (iv) AI role in the healthcare sector in the context of COVID-19 pandemic, (v) general-purpose applications of AI in COVID-19 era, and (vi) AI role in drug design and repurposing (e.g., iteratively aligning protein spikes and applying three/four-fold symmetry to yield a low-resolution candidate template) against COVID-19. Further, we discuss the challenges involved in applying AI to the available data and privacy issues that can arise from personal data transitioning into cyberspace. We also provide a concise overview of other latest technologies that were increasingly applied to limit the spread of the ongoing pandemic. Finally, we discuss the avenues of future research in the respective area. This insightful review aims to highlight existing AI-based technological developments and future research dynamics in this area.
Collapse
|
39
|
Gudigar A, Raghavendra U, Nayak S, Ooi CP, Chan WY, Gangavarapu MR, Dharmik C, Samanth J, Kadri NA, Hasikin K, Barua PD, Chakraborty S, Ciaccio EJ, Acharya UR. Role of Artificial Intelligence in COVID-19 Detection. SENSORS (BASEL, SWITZERLAND) 2021; 21:8045. [PMID: 34884045 PMCID: PMC8659534 DOI: 10.3390/s21238045] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/05/2021] [Revised: 11/26/2021] [Accepted: 11/26/2021] [Indexed: 12/15/2022]
Abstract
The global pandemic of coronavirus disease (COVID-19) has caused millions of deaths and affected the livelihood of many more people. Early and rapid detection of COVID-19 is a challenging task for the medical community, but it is also crucial in stopping the spread of the SARS-CoV-2 virus. Prior substantiation of artificial intelligence (AI) in various fields of science has encouraged researchers to further address this problem. Various medical imaging modalities including X-ray, computed tomography (CT) and ultrasound (US) using AI techniques have greatly helped to curb the COVID-19 outbreak by assisting with early diagnosis. We carried out a systematic review on state-of-the-art AI techniques applied with X-ray, CT, and US images to detect COVID-19. In this paper, we discuss approaches used by various authors and the significance of these research efforts, the potential challenges, and future trends related to the implementation of an AI system for disease detection during the COVID-19 pandemic.
Collapse
Affiliation(s)
- Anjan Gudigar
- Department of Instrumentation and Control Engineering, Manipal Institute of Technology, Manipal Academy of Higher Education, Manipal 576104, India; (A.G.); (S.N.); (M.R.G.); (C.D.)
| | - U Raghavendra
- Department of Instrumentation and Control Engineering, Manipal Institute of Technology, Manipal Academy of Higher Education, Manipal 576104, India; (A.G.); (S.N.); (M.R.G.); (C.D.)
| | - Sneha Nayak
- Department of Instrumentation and Control Engineering, Manipal Institute of Technology, Manipal Academy of Higher Education, Manipal 576104, India; (A.G.); (S.N.); (M.R.G.); (C.D.)
| | - Chui Ping Ooi
- School of Science and Technology, Singapore University of Social Sciences, Singapore 599494, Singapore;
| | - Wai Yee Chan
- Department of Biomedical Imaging, Faculty of Medicine, University of Malaya, Kuala Lumpur 50603, Malaysia;
| | - Mokshagna Rohit Gangavarapu
- Department of Instrumentation and Control Engineering, Manipal Institute of Technology, Manipal Academy of Higher Education, Manipal 576104, India; (A.G.); (S.N.); (M.R.G.); (C.D.)
| | - Chinmay Dharmik
- Department of Instrumentation and Control Engineering, Manipal Institute of Technology, Manipal Academy of Higher Education, Manipal 576104, India; (A.G.); (S.N.); (M.R.G.); (C.D.)
| | - Jyothi Samanth
- Department of Cardiovascular Technology, Manipal College of Health Professions, Manipal Academy of Higher Education, Manipal 576104, India;
| | - Nahrizul Adib Kadri
- Department of Biomedical Engineering, Faculty of Engineering, University of Malaya, Kuala Lumpur 50603, Malaysia; (N.A.K.); (K.H.)
| | - Khairunnisa Hasikin
- Department of Biomedical Engineering, Faculty of Engineering, University of Malaya, Kuala Lumpur 50603, Malaysia; (N.A.K.); (K.H.)
| | - Prabal Datta Barua
- Cogninet Brain Team, Cogninet Australia, Sydney, NSW 2010, Australia;
- School of Business (Information Systems), Faculty of Business, Education, Law & Arts, University of Southern Queensland, Toowoomba, QLD 4350, Australia
- Faculty of Engineering and Information Technology, University of Technology Sydney, Sydney, NSW 2007, Australia;
| | - Subrata Chakraborty
- Faculty of Engineering and Information Technology, University of Technology Sydney, Sydney, NSW 2007, Australia;
- Faculty of Science, Agriculture, Business and Law, University of New England, Armidale, NSW 2351, Australia
| | - Edward J. Ciaccio
- Department of Medicine, Columbia University Medical Center, New York, NY 10032, USA;
| | - U. Rajendra Acharya
- School of Engineering, Ngee Ann Polytechnic, Singapore 599489, Singapore;
- Department of Biomedical Informatics and Medical Engineering, Asia University, Taichung 41354, Taiwan
- International Research Organization for Advanced Science and Technology (IROAST), Kumamoto University, Kumamoto 860-8555, Japan
| |
Collapse
|
40
|
Sitaula C, Basnet A, Mainali A, Shahi TB. Deep Learning-Based Methods for Sentiment Analysis on Nepali COVID-19-Related Tweets. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2021; 2021:2158184. [PMID: 34737773 PMCID: PMC8561567 DOI: 10.1155/2021/2158184] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/19/2021] [Revised: 10/09/2021] [Accepted: 10/18/2021] [Indexed: 11/27/2022]
Abstract
COVID-19 has claimed several human lives to this date. People are dying not only because of physical infection of the virus but also because of mental illness, which is linked to people's sentiments and psychologies. People's written texts/posts scattered on the web could help understand their psychology and the state they are in during this pandemic. In this paper, we analyze people's sentiment based on the classification of tweets collected from the social media platform, Twitter, in Nepal. For this, we, first, propose to use three different feature extraction methods-fastText-based (ft), domain-specific (ds), and domain-agnostic (da)-for the representation of tweets. Among these three methods, two methods ("ds" and "da") are the novel methods used in this study. Second, we propose three different convolution neural networks (CNNs) to implement the proposed features. Last, we ensemble such three CNNs models using ensemble CNN, which works in an end-to-end manner, to achieve the end results. For the evaluation of the proposed feature extraction methods and CNN models, we prepare a Nepali Twitter sentiment dataset, called NepCOV19Tweets, with 3 classes (positive, neutral, and negative). The experimental results on such dataset show that our proposed feature extraction methods possess the discriminating characteristics for the sentiment classification. Moreover, the proposed CNN models impart robust and stable performance on the proposed features. Also, our dataset can be used as a benchmark to study the COVID-19-related sentiment analysis in the Nepali language.
Collapse
Affiliation(s)
- C. Sitaula
- Department of Electrical and Computer Systems Engineering, Monash University, VIC, Clayton, 3800, Australia
- Central Department of Computer Science and Information Technology, Tribhuvan University, Kathmandu 44600, Nepal
| | | | - A. Mainali
- Aryan School of Engineering, Kathmandu, Nepal
| | - T. B. Shahi
- Central Department of Computer Science and Information Technology, Tribhuvan University, Kathmandu 44600, Nepal
- School of Engineering and Technology, Central Queensland University, Rockhampton, QLD 4701, Australia
| |
Collapse
|
41
|
YILDIRIM S, YILDIRIM G, ALATAS B. A new plant intelligence-based method for sentiment analysis: Chaotic sunflower optimization. COMPUTER SCIENCE 2021. [DOI: 10.53070/bbd.991715] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
|