An experimental study on the performance of collaborative filtering based on user reviews for large-scale datasets-Reference-Cited by-同舟云学术

An experimental study on the performance of collaborative filtering based on user reviews for large-scale datasets

Published:2023-08-25 Issue: Volume:9 Page:e1525
ISSN:2376-5992
Container-title:PeerJ Computer Science
language:en
Short-container-title:

Author:

AL-Ghuribi Sumaia¹²,Mohd Noah Shahrul Azman¹,Mohammed Mawal³

Affiliation:

1. Center for Artificial Intelligence Technology, Universiti Kebangsaan Malaysia, Bangi, Selangor, Malaysia

2. Department of Computer Science, Faculty of Applied Sciences, Taiz University, Taiz, Yemen

3. Department of Software Engineering, Prince Sattam Bin Abdulaziz University, Alkharj, Saudi Arabia

Abstract

Collaborative filtering (CF) approaches generate user recommendations based on user similarities. These similarities are calculated based on the overall (explicit) user ratings. However, in some domains, such ratings may be sparse or unavailable. User reviews can play a significant role in such cases, as implicit ratings can be derived from the reviews using sentiment analysis, a natural language processing technique. However, most current studies calculate the implicit ratings by simply aggregating the scores of all sentiment words appearing in reviews and, thus, ignoring the elements of sentiment degrees and aspects of user reviews. This study addresses this issue by calculating the implicit rating differently, leveraging the rich information in user reviews by using both sentiment words and aspect–sentiment word pairs to enhance the CF performance. It proposes four methods to calculate the implicit ratings on large-scale datasets: the first considers the degree of sentiment words, while the second exploits the aspects by extracting aspect-sentiment word pairs to calculate the implicit ratings. The remaining two methods combine explicit ratings with the implicit ratings generated by the first two methods. The generated ratings are then incorporated into different CF rating prediction algorithms to evaluate their effectiveness in enhancing the CF performance. Evaluative experiments of the proposed methods are conducted on two large-scale datasets: Amazon and Yelp. Results of the experiments show that the proposed ratings improved the accuracy of CF rating prediction algorithms and outperformed the explicit ratings in terms of three predictive accuracy metrics.

Funder

The Universiti Kebangsaan Malaysia

Publisher

PeerJ

Subject

General Computer Science

Link

https://peerj.com/articles/cs-1525.pdf

Reference62 articles.

1. Boolean logic algebra driven similarity measure for text based applications;Abdalla;PeerJ Computer Science,2021

2. Informed recommender: basing recommendations on consumer product reviews;Aciar;IEEE Intelligent Systems,2007

3. Multi-criteria recommender systems;Adomavicius,2011

4. Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions;Adomavicius;IEEE Transactions on Knowledge and Data Engineering,2005

5. Aspect based sentiment oriented summarization of hotel reviews;Akhtar;Procedia Computer Science,2017

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Natural language processing for analyzing online customer reviews: a survey, taxonomy, and open research challenges;PeerJ Computer Science;2024-07-19

2. Max Explainability Score–A quantitative metric for explainability evaluation in knowledge graph-based recommendations;Computers and Electrical Engineering;2024-05