Author:
Gutiérrez Espinoza Luis,Keith Norambuena Brian
Abstract
In this work, we evaluate the impact of changing the semantic text representation on the performance of the AR-SVS (extended association rules in semantic vector spaces) algorithm on the sentiment polarity classification task on a paper reviews dataset. To do this, we use natural language processing techniques in conjunction with machine learning classifiers. In particular, we report the classification performance using the F1 and accuracy metrics. The semantic representations that we used in our evaluation were chosen based on a systematic literature review, leading to an evaluation of AR-SVS with FastText, GloVe, and LDA2vec representations, with word2vec providing the baseline performance. The results of the experiments indicate that the choice of semantic text representation does not have major effects on the performance of AR-SVS for polarity classification. Furthermore, the results resemble those obtained in the original AR-SVS article, both in quantitative and qualitative terms. Thus, while direct improvements in classification performance were not found, we discuss other aspects and advantages of using different semantic representations.
Subject
Artificial Intelligence,Computer Vision and Pattern Recognition,Theoretical Computer Science
Reference55 articles.
1. Heurísticas para data augmentation en nlp: Aplicación a revisiones de artículos científicos;Acosta;RISTI-Revista Ibérica de Sistemas e Tecnologias de Informaç ao,2019
2. R. Agarwal, R. Srikant et al., Fast algorithms for mining association rules, in: Proc. of the 20th VLDB Conference, 1994, pp. 487–499.
3. R. Agrawal, T. Imieliński and A. Swami, Mining association rules between sets of items in large databases, in: Proceedings of the 1993 ACM SIGMOD International Conference on Management of Data, SIGMOD ’93, New York, NY, USA, Association for Computing Machinery, 1993, pp. 207–216.
4. A. Alateeq, M. Roantree and C. Gurrin, Voxento: A prototype voice-controlled interactive search engine for lifelogs, in: Proceedings of the Third Annual Workshop on Lifelog Search Challenge, 2020, pp. 77–81.
5. T. Alegre Sepúlveda and B. Keith Norambuena, Twitter sentiment analysis for the estimation of voting intention in the 2017 chilean elections, Intelligent Data Analysis 24(5) (2020), 1141–1160.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Data Classification Algorithm Based on Association Rules from the Perspective of Data Mining;2022 International Conference on Knowledge Engineering and Communication Systems (ICKES);2022-12-28