Numerical Similarity Measures Versus Jaccard for Collaborative Filtering
Source of Publication
Lecture Notes on Data Engineering and Communications Technologies
Collaborative filtering (CF) is an important method for recommendation systems, which are employed in many facets of our lives and are particularly prevalent in online-based commercial systems. The K-nearest neighbors (KNN) technique is a well-liked CF algorithm that uses similarity measurements to identify a user's closest neighbors in order to quantify the degree of dependency between the respective user and item pair. As a result, the CF approach is not only dependent on the choice of the similarity measure but also sensitive to it. However, some traditional “numerical” similarity measures, like cosine and Pearson, concentrate on the size of ratings, whereas Jaccard, one of the most frequently employed similarity measures for CF tasks, concerns the existence of ratings. Jaccard, in particular, is not a dominant measure, but it has long been demonstrated to be a key element in enhancing any measure. Therefore, this research focuses on presenting novel similarity measures by combining Jaccard with a multitude of numerical measures in our ongoing search for the most effective similarity measures for CF. Both existence and magnitude would benefit the combined measurements. Experimental results demonstrated that the combined measures are superior, surpassing all single measures across the considered assessment metrics.
Springer Nature Switzerland
Collaborating Filtering, Similarity Measure, Jaccard, K-Nearest Neighbor, Recommendation Systems
Abdalla, Hassan I.; Amer, Yasmeen A.; Nguyen, Loc; Amer, Ali A.; and Al-Maqaleh, Basheer Mohamad, "Numerical Similarity Measures Versus Jaccard for Collaborative Filtering" (2023). All Works. 6048.
Indexed in Scopus