Abstract
Researchers commonly perform sentiment analysis on large collections of short texts like tweets, Reddit posts or newspaper headlines that are all focused on a specific topic, theme or event. Usually, general-purpose sentiment analysis methods are used. These perform well on average but miss the variation in meaning that happens across different contexts, for example, the word “active” has a very different intention and valence in the phrase “active lifestyle” versus “active volcano”. This work presents a new approach, CIDER (Context Informed Dictionary and sEmantic Reasoner), which performs context-sensitive linguistic analysis, where the valence of sentiment-laden terms is inferred from the whole corpus before being used to score the individual texts. In this paper, we detail the CIDER algorithm and demonstrate that it outperforms state-of-the-art generalist unsupervised sentiment analysis techniques on a large collection of tweets about the weather. CIDER is also applicable to alternative (non-sentiment) linguistic scales. A case study on gender in the UK is presented, with the identification of highly gendered and sentiment-laden days. We have made our implementation of CIDER available as a Python package: https://pypi.org/project/ciderpolarity/.
Funder
Natural Environment Research Council
Engineering and Physical Sciences Research Council
Publisher
Public Library of Science (PLoS)
Reference65 articles.
1. Sentiment Analysis in Social Media and Its Application: Systematic Literature Review;Z Drus;Procedia Computer Science,2019
2. Domain-Specific Sentiment Analysis for Tweets during Hurricanes (DSSA-H): A Domain-Adversarial Neural-Network-Based Approach;F Yao;Computers, Environment and Urban Systems,2020
3. Lucy L, Tadimeti D, Bamman D. Discovering differences in the representation of people using contextualized semantic axes. arXiv preprint arXiv:221012170. 2022;.
4. Zhao C, Liu P, Yu D. From Polarity to Intensity: Mining Morality from Semantic Space. In: Proceedings of the 29th International Conference on Computational Linguistics; 2022. p. 1250–1262.
5. Bolukbasi T, Chang KW, Zou JY, Saligrama V, Kalai AT. Man is to computer programmer as woman is to homemaker? debiasing word embeddings. Advances in neural information processing systems. 2016;29.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献