1
|
Cheng Z, Jiang Z, Yin Y, Wang C, Ge S, Gu Q. A Consistent Dual-MRC Framework for Emotion-Cause Pair Extraction. ACM T INFORM SYST 2022. [DOI: 10.1145/3558548] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/01/2022]
Abstract
Emotion-cause pair extraction (ECPE) is a recently proposed task that aims to extract the potential clause pairs of emotions and its corresponding causes in a document. In this paper, we propose a new paradigm for the ECPE task. We cast the task as a two-turn machine reading comprehension (MRC) task, i.e., the extraction of emotions and causes is transformed to the task of identifying answer clauses from the input document specific to a query. This two-turn MRC formalization brings several key advantages: firstly, the QA manner provides an explicit pairing way to identify causes specific to the target emotion; secondly, it provides a natural way of jointly modeling the emotion extraction, the cause extraction, and the pairing of emotion and cause; and thirdly, it allows us to exploit the well developed MRC models. Based on the two-turn MRC formalization, we propose a dual-MRC framework to extract emotion-cause pairs in a dual-direction way, which enables a more comprehensive coverage of all pairing cases. Furthermore, we propose a consistent training strategy for the second-turn query, so that the model is able to filter the errors produced by the first turn at inference. Experiments on two benchmark datasets demonstrate that our method outperforms previous methods and achieves state-of-the-art performance. All the code and data of this work can be obtained at https://github.com/zifengcheng/CD-MRC.
Collapse
Affiliation(s)
| | | | | | | | | | - Qing Gu
- State Key Laboratory for Novel Software Technology, Nanjing University, China
| |
Collapse
|
2
|
Yang T, Hu L, Shi C, Ji H, Li X, Nie L. HGAT: Heterogeneous Graph Attention Networks for Semi-supervised Short Text Classification. ACM T INFORM SYST 2021. [DOI: 10.1145/3450352] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Abstract
Short text classification has been widely explored in news tagging to provide more efficient search strategies and more effective search results for information retrieval. However, most existing studies, concentrating on long text classification, deliver unsatisfactory performance on short texts due to the sparsity issue and the insufficiency of labeled data. In this article, we propose a novel heterogeneous graph neural network-based method for semi-supervised short text classification, leveraging full advantage of limited labeled data and large unlabeled data through information propagation along the graph. Specifically, we first present a flexible heterogeneous information network (HIN) framework for modeling short texts, which can integrate any type of additional information and meanwhile capture their relations to address the semantic sparsity. Then, we propose Heterogeneous Graph Attention networks (HGAT) to embed the HIN for short text classification based on a dual-level attention mechanism, including node-level and type-level attentions. To efficiently classify new coming texts that do not previously exist in the HIN, we extend our model HGAT for inductive learning, avoiding re-training the model on the evolving HIN. Extensive experiments on single-/multi-label classification demonstrates that our proposed model HGAT significantly outperforms state-of-the-art methods across the benchmark datasets under both transductive and inductive learning.
Collapse
Affiliation(s)
- Tianchi Yang
- Beijing University of Posts and Telecommunications, Beijing, China
| | - Linmei Hu
- Beijing University of Posts and Telecommunications, Beijing, China
| | - Chuan Shi
- Beijing University of Posts and Telecommunications, Beijing, China
| | - Houye Ji
- Beijing University of Posts and Telecommunications, Beijing, China
| | - Xiaoli Li
- Institute for Infocomm Research, Singapore
| | - Liqiang Nie
- Shan Dong University, Shandong Province, China
| |
Collapse
|
3
|
Zhang R, Mensah S, Kong F, Hu Z, Mao Y, Liu X. Pairwise Link Prediction Model for Out of Vocabulary Knowledge Base Entities. ACM T INFORM SYST 2020. [DOI: 10.1145/3406116] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
Abstract
Real-world knowledge bases such as DBPedia, Yago, and Freebase contain sparse linkage connectivity, which poses a severe challenge to link prediction between entities. To cope with such data scarcity issues, recent models have focused on learning interactions between entity pairs by means of relations that exist between them. However promising, some relations are associated with very few tail entities or head entities, resulting in poor estimation of the relation interaction between entities. In this article, we break the sole dependency of modeling relation interactions between entity pairs by associating a triple with pairwise embeddings, i.e., distributed vector representations for pairs of word-based entities and relation of a triple. We capture the interactions that exist between pairwise embeddings by means of a Pairwise Factorization Model that employs a factorization machine with relation attention. This approach allows parameters for related interactions to be estimated efficiently, ensuring that the pairwise embeddings are discriminative, providing strong supervisory signals for the decoding task of link prediction. The Pairwise Factorization Model we propose exploits a neural bag-of-words model as the encoder, which effectively encodes word-based entities into distributed vector representations for the decoder. The proposed model is simple and enjoys efficiency and capability, showing superior link prediction performance over state-of-the-art complex models on benchmark datasets DBPedia50K and FB15K-237.
Collapse
Affiliation(s)
- Richong Zhang
- Beijing Advanced Innovation Center for Big Data and Brain Computing, School of Computer Science and Engineering, Beihang University, Beijing, China
| | - Samuel Mensah
- Beijing Advanced Innovation Center for Big Data and Brain Computing, School of Computer Science and Engineering, Beihang University, Beijing, China
| | | | - Zhiyuan Hu
- Beijing Advanced Innovation Center for Big Data and Brain Computing, School of Computer Science and Engineering, Beihang University, Beijing, China
| | - Yongyi Mao
- School of Electrical Engineering and Computer Science, University of Ottawa, Ottawa, Canada
| | - Xudong Liu
- Beijing Advanced Innovation Center for Big Data and Brain Computing, School of Computer Science and Engineering, Beihang University, China
| |
Collapse
|
4
|
Singh LG, Singh SR. Empirical study of sentiment analysis tools and techniques on societal topics. J Intell Inf Syst 2020. [DOI: 10.1007/s10844-020-00616-7] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
|
5
|
Abstract
Opinion mining in outdoor images posted by users during different activities can provide valuable information to better understand urban areas. In this regard, we propose a framework to classify the sentiment of outdoor images shared by users on social networks. We compare the performance of state-of-the-art ConvNet architectures and one specifically designed for sentiment analysis. We also evaluate how the merging of deep features and semantic information derived from the scene attributes can improve classification and cross-dataset generalization performance. The evaluation explores a novel dataset—namely, OutdoorSent—and other publicly available datasets. We observe that the incorporation of knowledge about semantic attributes improves the accuracy of all ConvNet architectures studied. Besides, we found that exploring only images related to the context of the study—outdoor, in our case—is recommended, i.e., indoor images were not significantly helpful. Furthermore, we demonstrated the applicability of our results in the United States city of Chicago, Illinois, showing that they can help to improve the knowledge of subjective characteristics of different areas of the city. For instance, particular areas of the city tend to concentrate more images of a specific class of sentiment, which are also correlated with median income, opening up opportunities in different fields.
Collapse
Affiliation(s)
| | | | - Rodrigo Minetto
- Universidade Tecnológica Federal do Paraná - UTFPR, Curitiba, Brazil
| | | |
Collapse
|