Abstract:Collaborative filtering (CF) models offer users personalized recommendations by measuring the relevance between the active user and each individual candidate item. Following this idea, user-based collaborative filtering (UCF) usually selects the local popular items from the like-minded neighbor users. However, these traditional relevance-based models only consider the individuals (i.e., each neighbor user and candidate item) separately during neighbor set selection and recommendation set generation, thus usually incurring highly similar recommendations that lack diversity. While many researchers have recognized the importance of diversified recommendations, the proposed solutions either needed additional semantic information of items or decreased accuracy in this process. In this article, we describe how to generate both accurate and diversified recommendations from a new perspective. Along this line, we first introduce a simple measure of coverage that quantifies the usefulness of the whole set, that is, the neighbor userset and the recommended itemset as a complete entity. Then we propose a recommendation framework named REC that considers both traditional relevance-based scores and the new coverage measure based on UCF. Under REC, we further prove that the goals of maximizing relevance and coverage measures simultaneously in both the neighbor set selection step and the recommendation set generation step are NP-hard. Luckily, we can solve them effectively and efficiently by exploiting the inherent submodular property. Furthermore, we generalize the coverage notion and the REC framework from both a data perspective and an algorithm perspective. Finally, extensive experimental results on three real-world datasets show that the REC-based recommendation models can naturally generate more diversified recommendations without decreasing accuracy compared to some state-of-the-art models.

Finding an λ-representative subset from massive data

A heuristic approach for λ-representative information retrieval from large-scale data

Finding representative set from massive data

How “small” Reflects “large”?—representative Information Measurement and Extraction

A Combined Measure for Representative Information Retrieval in Enterprise Information Systems.

A Combined Measure for Representativeness on Information Retrieval in Web Search

Representative Selection Based on Sparse Modeling.

Extracting representative information to enhance flexible data queries.

Finding Representative and Diverse Vertices within Graphs

Discovering the Representative Subset with Low Redundancy for Hyperspectral Feature Selection

Extending Representative Information Extraction Based on Fuzzy Classification

A Unified Framework for Representation-Based Subspace Clustering of Out-of-Sample and Large-Scale Data.

Selecting a Representative Set of Diverse Quality Reviews Automatically.

Extracting a Diverse Information Subset by Considering Information Coverage and Redundancy Simultaneously

Continuously identifying representatives out of massive streams

Approximate Membership Localization within a Web-Based Join Framework

Will the Real Linda Please Stand up...to Large Language Models? Examining the Representativeness Heuristic in LLMs

Relevance Meets Coverage

Approximate membership localization (AML) for web-based join.

Continuously Extracting High-Quality Representative Set from Massive Data Streams.

Scalable Graph Representation Learning Via Locality-Sensitive Hashing