Towards Strong Regret Minimization Sets: Balancing Freshness and Diversity in Data Selection

Hongjie Guo, Jianzhong Li, Hong Gao
DOI: https://doi.org/10.1016/j.tcs.2024.114986
IF: 1.002
2024-01-01
Theoretical Computer Science
Abstract: Multi-criteria decision-making typically requires selecting a concise, representative set from large databases. Regret minimization set (RMS) queries have emerged as a way to avoid the need for a utility function in top-k queries and to address the large result sets produced by skyline queries. However, traditional RMS formulations guarantee only one result under any utility function and do not account for the diversity and freshness of results. This study introduces the concept of a strong regret minimization set (SRMS), which ensures the utility value accuracy of the k selected data points under any utility function while incorporating result diversity and freshness. We explore two new computational problems: the Minimum Size problem, which focuses on reducing the result set size with bounded utility error, and the Max-sum Diversity and Freshness problem, which aims to optimize the diversity and freshness of the selected set. Both problems are proven to be NP-hard, and we develop approximation algorithms for them. Experimental results on both real-world and synthetic data show the high efficiency and scalability of the proposed algorithms.