Abstract: Deep learning-based recommender systems (DRSs) are increasingly widely deployed in industry and bring significant convenience to people’s daily lives. However, recommender systems have also been shown to suffer from multiple issues, e.g., the echo chamber and the Matthew effect, in which the notion of “fairness” plays a core role. For instance, a system may be regarded as unfair to 1) a specific user, if that user gets worse recommendations than other users, or 2) an item (to be recommended), if the item is much less likely to be exposed to users than other items. While many fairness notions and corresponding fairness testing approaches have been developed for traditional deep classification models, they are hardly applicable to DRSs. One major challenge is that there is still no systematic understanding of, or mapping between, the existing fairness notions and the diverse testing requirements of deep recommender systems, let alone support for further testing or debugging activities. To address this gap, we propose FairRec, a unified framework that supports fairness testing of DRSs from multiple customized perspectives, e.g., model utility, item diversity, and item popularity. We also propose a novel, efficient search-based testing approach, a double-ended discrete particle swarm optimization (DPSO) algorithm, to effectively search for hidden fairness issues, in the form of certain disadvantaged groups, among a vast number of candidate groups. Given the testing report, we show that adopting a simple re-ranking mitigation strategy on the identified disadvantaged groups can significantly improve the fairness of DRSs. We conducted extensive experiments on multiple industry-level DRSs adopted by leading companies. The results confirm that FairRec is effective and efficient in identifying deeply hidden fairness issues, e.g., achieving about 95% testing accuracy in roughly one-half to one-eighth of the time.
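The user-side notion of unfairness mentioned in the abstract (a group of users receiving worse recommendations than others) can be illustrated with a minimal sketch. This is not FairRec's testing procedure or its DPSO search; it only shows, under stated assumptions, how a disadvantaged group might be identified once per-user utility scores are available. The function name `group_utility_gap`, the toy scores, and the group labels are all hypothetical, and the choice of "overall mean minus group mean" as the gap is an assumption made for illustration.

```python
from collections import defaultdict
from statistics import mean


def group_utility_gap(utilities, groups):
    """Return (worst_group, gap).

    gap is the overall mean utility minus the mean utility of the
    worst-served group (an illustrative measure, not the paper's metric).
    """
    by_group = defaultdict(list)
    for user, utility in utilities.items():
        by_group[groups[user]].append(utility)
    # Mean recommendation quality per group.
    group_means = {g: mean(vals) for g, vals in by_group.items()}
    overall = mean(utilities.values())
    # The group with the lowest mean utility is the candidate disadvantaged group.
    worst = min(group_means, key=group_means.get)
    return worst, overall - group_means[worst]


# Hypothetical per-user recommendation quality (e.g. a ranking metric in [0, 1])
# and hypothetical group labels.
utilities = {"u1": 0.75, "u2": 0.5, "u3": 0.25, "u4": 0.5}
groups = {"u1": "A", "u2": "A", "u3": "B", "u4": "B"}
worst_group, gap = group_utility_gap(utilities, groups)
print(worst_group, gap)
```

With only two groups an exhaustive comparison like this is trivial; the abstract's point is that the number of candidate groups is vast, which is why a search-based approach is proposed instead.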

Search results diversification for effective fair ranking in academic search

FairRec: Fairness Testing for Deep Recommender Systems

Directly Optimize Diversity Evaluation Measures: A New Approach to Search Result Diversification

Efficient Diversification of Web Search Results

Overview of the TREC 2020 Fair Ranking Track

User Fairness, Item Fairness, and Diversity for Rankings in Two-Sided Markets

Result Diversification in Search and Recommendation: A Survey

Recency Ranking by Diversification of Result Set

Intersectional fair ranking via subgroup divergence

Fairness in Ranking under Disparate Uncertainty

Fair ranking: a critical review, challenges, and future directions

Fairness-Aware Ranking in Search & Recommendation Systems with Application to LinkedIn Talent Search

Beyond Greedy Search: Pruned Exhaustive Search for Diversified Result Ranking

Representation Online Matters: Practical End-to-End Diversification in Search and Recommender Systems

Fairness in Ranking: A Survey

Adapting Markov Decision Process for Search Result Diversification

Revisiting The Evaluation Of Diversified Search Evaluation Metrics With User Preferences

Maximizing Marginal Fairness for Dynamic Learning to Rank

MA4DIV: Multi-Agent Reinforcement Learning for Search Result Diversification

FARA: Future-aware Ranking Algorithm for Fairness Optimization

Search Results Diversification Based on Swap Minimal Marginal Contribution