1
|
Blanco G, Yáñez Martínez R, Lourenço A. Leveraging deep learning to detect stance in Spanish tweets on COVID-19 vaccination. JAMIA Open 2025; 8:ooaf007. [PMID: 40008184 PMCID: PMC11854073 DOI: 10.1093/jamiaopen/ooaf007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2024] [Revised: 01/11/2025] [Accepted: 01/27/2025] [Indexed: 02/27/2025] Open
Abstract
Objectives The automatic detection of stance on social media is an important task for public health applications, especially in the context of health crises. Unfortunately, existing models are typically trained on English corpora. Considering the benefits of extending research to other widely spoken languages, the goal of this study is to develop stance detection models for social media posts in Spanish. Materials and Methods A corpus of 6170 tweets about COVID-19 vaccination, posted between March 1, 2020 and January 4, 2022, was manually annotated by native speakers. Traditional predictive models were compared with deep learning models to ascertain a baseline performance for the detection of stance in Spanish tweets. The evaluation focused on the ability of multilingual and language-specific embeddings to contextualize the topic of those short texts adequately. Results The BERT-Multi+BiLSTM combination yielded the best results (macroaveraged F1 and Matthews correlation coefficient scores of 0.86 and 0.79, respectively; interpolated area under the receiver operating curve [AUC] of 0.95 for tweets against vaccination and 0.85 in favor of vaccination and a score of 0.97 for tweets containing no stance information), closely followed by the BETO+BiLSTM and RoBERTa BNE-LSTM Spanish models and the term frequency-inverse document frequency+SVM model (average AUC decrease of 0.01). The main differentiating factor among these models was the ability to predict tweets against vaccination. Discussion The BERT Multi+BILSTM model outperformed the other models in terms of per class prediction capacity. The main assumption is that language-specific embeddings do not outperform multilingual embeddings or TF-IDF features because of the context of the topic. The inherent context of BERT or RoBERTa embeddings is general. So, these embeddings are not familiar with the slang commonly used on Twitter and, more specifically, during the pandemic. Conclusion The best performing model detects tweet stance with performance high enough to ensure its usefulness for public health applications, namely awareness campaigns, misinformation detection and other early intervention and prevention actions seeking to improve an individual's well-being based on autoreported experiences and opinions. The dataset and code of the study are available on GitHub.
Collapse
Affiliation(s)
- Guillermo Blanco
- Universidade de Vigo, Department of Computer Science, ESEI-Escuela Superior de Ingeniería Informática, Edificio Politécnico, Campus Universitario As Lagoas s/n 32004 Ourense, Spain
- CINBIO, The Biomedical Research Centre, University of Vigo, Campus Univesitario Lagoas-Marcosende, Vigo 36310, Spain
- SING, Next Generation Computer Systems Group, Galicia Sur Health Research Institute, SERGAS-UVIGO, Vigo 36310, Spain
| | - Rubén Yáñez Martínez
- Universidade de Vigo, Department of Computer Science, ESEI-Escuela Superior de Ingeniería Informática, Edificio Politécnico, Campus Universitario As Lagoas s/n 32004 Ourense, Spain
| | - Anália Lourenço
- Universidade de Vigo, Department of Computer Science, ESEI-Escuela Superior de Ingeniería Informática, Edificio Politécnico, Campus Universitario As Lagoas s/n 32004 Ourense, Spain
- CINBIO, The Biomedical Research Centre, University of Vigo, Campus Univesitario Lagoas-Marcosende, Vigo 36310, Spain
- SING, Next Generation Computer Systems Group, Galicia Sur Health Research Institute, SERGAS-UVIGO, Vigo 36310, Spain
- CEB—Centre of Biological Engineering, University of Minho, Campus de Gualtar, Braga 4710-057, Portugal
| |
Collapse
|
2
|
Purwitasari D, Putra CBP, Raharjo AB. A stance dataset with aspect-based sentiment information from Indonesian COVID-19 vaccination-related tweets. Data Brief 2023; 47:108951. [PMID: 36776157 PMCID: PMC9897868 DOI: 10.1016/j.dib.2023.108951] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2022] [Accepted: 01/30/2023] [Indexed: 02/05/2023] Open
Abstract
As a platform of social media with high activity, Twitter has seen the discussion of many hot topics related to the COVID-19 pandemic. One such is the COVID-19 vaccination program, which has skeptics in several religious, ethnic, and socioeconomic groups, and Indonesia has one of the largest populations of various ethnicities and religions of countries worldwide. Diverse opinions based on skepticism about the effectiveness of vaccines can increase the number of people who refuse or delay vaccine acceptance. Therefore, it is important to analyze and monitor stances and public opinions on social media, especially on vaccine topics, as part of the long-term solution to the COVID-19 pandemic. This study presents the Indonesian COVID-19 vaccine-related tweets data set that contains stance and aspect-based sentiment information. The data were collected monthly from January to October 2021 using specific keywords. There are nine thousand tweets manually annotated by three independent analysts. We annotated each tweet with three labels of stance and seven predetermined aspects related to Indonesian COVID-19 vaccine-related tweets: services, implementation, apps, costs, participants, vaccine products, and general. The dataset is useful for many research purposes, including stance detection, aspect-based sentiment analysis, topic detection, and public opinion analysis on Twitter, especially on the policies regarding the prevention of pandemics.
Collapse
|
3
|
Alturayeif N, Luqman H, Ahmed M. A systematic review of machine learning techniques for stance detection and its applications. Neural Comput Appl 2023; 35:5113-5144. [PMID: 36743664 PMCID: PMC9884072 DOI: 10.1007/s00521-023-08285-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2022] [Accepted: 01/06/2023] [Indexed: 01/30/2023]
Abstract
Stance detection is an evolving opinion mining research area motivated by the vast increase in the variety and volume of user-generated content. In this regard, considerable research has been recently carried out in the area of stance detection. In this study, we review the different techniques proposed in the literature for stance detection as well as other applications such as rumor veracity detection. Particularly, we conducted a systematic literature review of empirical research on the machine learning (ML) models for stance detection that were published from January 2015 to October 2022. We analyzed 96 primary studies, which spanned eight categories of ML techniques. In this paper, we categorize the analyzed studies according to a taxonomy of six dimensions: approaches, target dependency, applications, modeling, language, and resources. We further classify and analyze the corresponding techniques from each dimension's perspective and highlight their strengths and weaknesses. The analysis reveals that deep learning models that adopt a mechanism of self-attention have been used more frequently than the other approaches. It is worth noting that emerging ML techniques such as few-shot learning and multitask learning have been used extensively for stance detection. A major conclusion of our analysis is that despite that ML models have shown to be promising in this field, the application of these models in the real world is still limited. Our analysis lists challenges and gaps to be addressed in future research. Furthermore, the taxonomy presented can assist researchers in developing and positioning new techniques for stance detection-related applications.
Collapse
Affiliation(s)
- Nora Alturayeif
- Information and Computer Science Department, King Fahd University of Petroleum and Minerals, Dhahran, 31261 Saudi Arabia
- Department of Computer Science, College of Computer Science and Information Technology, Imam Abdulrahman Bin Faisal University, Dammam, 31441 Saudi Arabia
| | - Hamzah Luqman
- Information and Computer Science Department, King Fahd University of Petroleum and Minerals, Dhahran, 31261 Saudi Arabia
- SDAIA-KFUPM Joint Research Center for Artificial Intelligence, KFUPM, Dhahran, Saudi Arabia
| | - Moataz Ahmed
- Information and Computer Science Department, King Fahd University of Petroleum and Minerals, Dhahran, 31261 Saudi Arabia
- Interdisciplinary Research Center of Intelligent Secure Systems (IRC-ISS), KFUPM, Dhahran, Saudi Arabia
| |
Collapse
|
4
|
Allaway E, McKeown K. Zero-shot stance detection: Paradigms and challenges. Front Artif Intell 2023; 5:1070429. [PMID: 36714207 PMCID: PMC9880531 DOI: 10.3389/frai.2022.1070429] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2022] [Accepted: 12/26/2022] [Indexed: 01/14/2023] Open
Abstract
A major challenge in stance detection is the large (potentially infinite) and diverse set of stance topics. Collecting data for such a set is unrealistic due to both the expense of annotation and the continuous creation of new real-world topics (e.g., a new politician runs for office). Furthermore, stancetaking occurs in a wide range of languages and genres (e.g., Twitter, news articles). While zero-shot stance detection in English, where evaluation is on topics not seen during training, has received increasing attention, we argue that this attention should be expanded to multilingual and multi-genre settings. We discuss two paradigms for English zero-shot stance detection evaluation, as well as recent work in this area. We then discuss recent work on multilingual and multi-genre stance detection, which has focused primarily on non-zero-shot settings. We argue that this work should be expanded to multilingual and multi-genre zero-shot stance detection and propose best practices to systematize and stimulate future work in this direction. While domain adaptation techniques are well-suited for work in these settings, we argue that increased care should be taken to improve model explainability and to conduct robust evaluations, considering not only empirical generalization ability but also the understanding of complex language and inferences.
Collapse
|
5
|
Gasparetto A, Zangari A, Marcuzzo M, Albarelli A. A survey on text classification: Practical perspectives on the Italian language. PLoS One 2022; 17:e0270904. [PMID: 35793328 PMCID: PMC9258888 DOI: 10.1371/journal.pone.0270904] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2022] [Accepted: 06/18/2022] [Indexed: 11/18/2022] Open
Abstract
Text Classification methods have been improving at an unparalleled speed in the last decade thanks to the success brought about by deep learning. Historically, state-of-the-art approaches have been developed for and benchmarked against English datasets, while other languages have had to catch up and deal with inevitable linguistic challenges. This paper offers a survey with practical and linguistic connotations, showcasing the complications and challenges tied to the application of modern Text Classification algorithms to languages other than English. We engage this subject from the perspective of the Italian language, and we discuss in detail issues related to the scarcity of task-specific datasets, as well as the issues posed by the computational expensiveness of modern approaches. We substantiate this by providing an extensively researched list of available datasets in Italian, comparing it with a similarly sought list for French, which we use for comparison. In order to simulate a real-world practical scenario, we apply a number of representative methods to custom-tailored multilabel classification datasets in Italian, French, and English. We conclude by discussing results, future challenges, and research directions from a linguistically inclusive perspective.
Collapse
Affiliation(s)
- Andrea Gasparetto
- Department of Management, Ca’ Foscari University, Venice, Italy
- * E-mail:
| | | | - Matteo Marcuzzo
- Department of Management, Ca’ Foscari University, Venice, Italy
| | - Andrea Albarelli
- Department of Environmental Sciences, Informatics and Statistics, Ca’ Foscari University, Venice, Italy
| |
Collapse
|
6
|
Bograd S, Chen B, Kavuluru R. Tracking sentiments toward fat acceptance over a decade on Twitter. Health Informatics J 2022; 28:14604582211065702. [PMID: 34986689 DOI: 10.1177/14604582211065702] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
The fat acceptance (FA) movement aims to counteract weight stigma and discrimination against individuals who are overweight/obese. We developed a supervised neural network model to classify sentiment toward the FA movement in tweets and identify links between FA sentiment and various Twitter user characteristics. We collected any tweet containing either "fat acceptance" or "#fatacceptance" from 2010-2019 and obtained 48,974 unique tweets. We independently labeled 2000 of them and implemented/trained an Average stochastic gradient descent Weight-Dropped Long Short-Term Memory (AWD-LSTM) neural network that incorporates transfer learning from language modeling to automatically identify each tweet's stance toward the FA movement. Our model achieved nearly 80% average precision and recall in classifying "supporting" and "opposing" tweets. Applying this model to the complete dataset, we observed that the majority of tweets at the beginning of the last decade supported FA, but sentiment trended downward until 2016, when support was at its lowest. Overall, public sentiment is negative across Twitter. Users who tweet more about FA or use FA-related hashtags are more supportive than general users. Our findings reveal both challenges to and strengths of the modern FA movement, with implications for those who wish to reduce societal weight stigma.
Collapse
Affiliation(s)
- Sadie Bograd
- 326741Paul Laurence Dunbar High School, Lexington, KY USA
| | - Benjamin Chen
- 326741Paul Laurence Dunbar High School, Lexington, KY, USA
| | - Ramakanth Kavuluru
- Department of Internal Medicine, 4530University of Kentucky, Lexington, KY, USA
| |
Collapse
|
7
|
|
8
|
Chiril P, Pamungkas EW, Benamara F, Moriceau V, Patti V. Emotionally Informed Hate Speech Detection: A Multi-target Perspective. Cognit Comput 2021; 14:322-352. [PMID: 34221180 PMCID: PMC8236572 DOI: 10.1007/s12559-021-09862-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2020] [Accepted: 01/12/2021] [Indexed: 11/11/2022]
Abstract
Hate Speech and harassment are widespread in online communication, due to users' freedom and anonymity and the lack of regulation provided by social media platforms. Hate speech is topically focused (misogyny, sexism, racism, xenophobia, homophobia, etc.), and each specific manifestation of hate speech targets different vulnerable groups based on characteristics such as gender (misogyny, sexism), ethnicity, race, religion (xenophobia, racism, Islamophobia), sexual orientation (homophobia), and so on. Most automatic hate speech detection approaches cast the problem into a binary classification task without addressing either the topical focus or the target-oriented nature of hate speech. In this paper, we propose to tackle, for the first time, hate speech detection from a multi-target perspective. We leverage manually annotated datasets, to investigate the problem of transferring knowledge from different datasets with different topical focuses and targets. Our contribution is threefold: (1) we explore the ability of hate speech detection models to capture common properties from topic-generic datasets and transfer this knowledge to recognize specific manifestations of hate speech; (2) we experiment with the development of models to detect both topics (racism, xenophobia, sexism, misogyny) and hate speech targets, going beyond standard binary classification, to investigate how to detect hate speech at a finer level of granularity and how to transfer knowledge across different topics and targets; and (3) we study the impact of affective knowledge encoded in sentic computing resources (SenticNet, EmoSenticNet) and in semantically structured hate lexicons (HurtLex) in determining specific manifestations of hate speech. We experimented with different neural models including multitask approaches. Our study shows that: (1) training a model on a combination of several (training sets from several) topic-specific datasets is more effective than training a model on a topic-generic dataset; (2) the multi-task approach outperforms a single-task model when detecting both the hatefulness of a tweet and its topical focus in the context of a multi-label classification approach; and (3) the models incorporating EmoSenticNet emotions, the first level emotions of SenticNet, a blend of SenticNet and EmoSenticNet emotions or affective features based on Hurtlex, obtained the best results. Our results demonstrate that multi-target hate speech detection from existing datasets is feasible, which is a first step towards hate speech detection for a specific topic/target when dedicated annotated data are missing. Moreover, we prove that domain-independent affective knowledge, injected into our models, helps finer-grained hate speech detection.
Collapse
Affiliation(s)
- Patricia Chiril
- IRIT, Université de Toulouse, Université Toulouse III - UPS, Toulouse, France
| | | | - Farah Benamara
- IRIT, Université de Toulouse, Université Toulouse III - UPS, Toulouse, France
| | - Véronique Moriceau
- IRIT, Université de Toulouse, Université Toulouse III - UPS, Toulouse, France
| | - Viviana Patti
- Dipartimento di Informatica, University of Turin, Turin, Italy
| |
Collapse
|
9
|
Ayyub K, Iqbal S, Nisar MW, Ahmad SG, Munir EU. Stance detection using diverse feature sets based on machine learning techniques. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2021. [DOI: 10.3233/jifs-202269] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Sentiment analysis is the field that analyzes sentiments, and opinions of people about entities such as products, businesses, and events. As opinions influence the people’s behaviors, it has numerous applications in real life such as marketing, politics, social media etc. Stance detection is the sub-field of sentiment analysis. The stance classification aims to automatically identify from the source text, whether the source is in favor, neutral, or opposed to the target. This research study proposed a framework to explore the performance of the conventional (NB, DT, SVM), ensemble learning (RF, AdaBoost) and deep learning-based (DBN, CNN-LSTM, and RNN) machine learning techniques. The proposed method is feature centric and extracted the (sentiment, content, tweet specific and part-of-speech) features from both datasets of SemEval2016 and SemEval2017. The proposed study has also explored the role of deep features such as GloVe and Word2Vec for stance classification which has not received attention yet for stance detection. Some base line features such as Bag of words, N-gram, TF-IDF are also extracted from both datasets to compare the proposed features along with deep features. The proposed features are ranked using feature ranking methods such as (information gain, gain ration and relief-f). Further, the results are evaluated using standard performance evaluation measures for stance classification with existing studies. The calculated results show that the proposed feature sets including sentiment, (part-of-speech, content, and tweet specific) are helpful for stance classification when applied with SVM and GloVe a deep feature has given the best results when applied with deep learning method RNN.
Collapse
Affiliation(s)
- Kashif Ayyub
- Department of Computer Science, COMSATS University Islamabad, Wah Campus, Wah Cantt, Pakistan
| | - Saqib Iqbal
- College of Engineering, Al Ain University, Al Ain, UAE
| | - Muhammad Wasif Nisar
- Department of Computer Science, COMSATS University Islamabad, Wah Campus, Wah Cantt, Pakistan
| | - Saima Gulzar Ahmad
- Department of Computer Science, COMSATS University Islamabad, Wah Campus, Wah Cantt, Pakistan
| | - Ehsan Ullah Munir
- Department of Computer Science, COMSATS University Islamabad, Wah Campus, Wah Cantt, Pakistan
| |
Collapse
|
10
|
Misleading information in Spanish: a survey. SOCIAL NETWORK ANALYSIS AND MINING 2021. [DOI: 10.1007/s13278-021-00746-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
11
|
Impact of Unreliable Content on Social Media Users during COVID-19 and Stance Detection System. ELECTRONICS 2020. [DOI: 10.3390/electronics10010005] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
The abundant dissemination of misinformation regarding coronavirus disease 2019 (COVID-19) presents another unprecedented issue to the world, along with the health crisis. Online social network (OSN) platforms intensify this problem by allowing their users to easily distort and fabricate the information and disseminate it farther and rapidly. In this paper, we study the impact of misinformation associated with a religious inflection on the psychology and behavior of the OSN users. The article presents a detailed study to understand the reaction of social media users when exposed to unverified content related to the Islamic community during the COVID-19 lockdown period in India. The analysis was carried out on Twitter users where the data were collected using three scraping packages, Tweepy, Selenium, and Beautiful Soup, to cover more users affected by this misinformation. A labeled dataset is prepared where each tweet is assigned one of the four reaction polarities, namely, E (endorse), D (deny), Q (question), and N (neutral). Analysis of collected data was carried out in five phases where we investigate the engagement of E, D, Q, and N users, tone of the tweets, and the consequence upon repeated exposure of such information. The evidence demonstrates that the circulation of such content during the pandemic and lockdown phase had made people more vulnerable in perceiving the unreliable tweets as fact. It was also observed that people absorbed the negativity of the online content, which induced a feeling of hatred, anger, distress, and fear among them. People with similar mindset form online groups and express their negative attitude to other groups based on their opinions, indicating the strong signals of social unrest and public tensions in society. The paper also presents a deep learning-based stance detection model as one of the automated mechanisms for tracking the news on Twitter as being potentially false. Stance classifier aims to predict the attitude of a tweet towards a news headline and thereby assists in determining the veracity of news by monitoring the distribution of different reactions of the users towards it. The proposed model, employing deep learning (convolutional neural network(CNN)) and sentence embedding (bidirectional encoder representations from transformers(BERT)) techniques, outperforms the existing systems. The performance is evaluated on the benchmark SemEval stance dataset. Furthermore, a newly annotated dataset is prepared and released with this study to help the research of this domain.
Collapse
|