Abstract:We propose a preference-learning algorithm for uncovering Decision Makers’ (DMs’) contingent evaluation strategies in the context of multiple criteria sorting. We assume the preference information in the form of holistic assignment examples derived from the analysis of alternatives’ performance vectors and textual descriptions. We characterize the decision policies using a mixture of threshold-based, value-driven preference models and associated latent topics. The latter serve as the stimuli underlying the contingency in decision behavior. Such a probabilistic model is constructed by using a flexible and nonparametric Bayesian framework. The proposed method adopts a hierarchical Dirichlet process as the prior so that a group of DMs can share a countably infinite number of contingent models and topics. For all DMs, it automatically identifies the components representing their evaluation strategies adequately. The posterior is summarized by using the Hamiltonian Monte Carlo sampling method. We demonstrate the method’s practical usefulness in a real-world recruitment problem considered by a Chinese IT company. We also compare the approach with counterparts that use a single preference model, implement the parametric framework, or consider each DM’s preferences individually. The results indicate that our approach performs favorably in both interpreting DMs’ contingent decision behavior and recommending decisions on new alternatives. Furthermore, the approach’s performance and robustness are investigated through a computational experiment involving real-world data sets. History: Accepted by Ram Ramesh, Area Editor for Data Science and Machine Learning. Funding: J. Liu received financial support from the National Natural Science Foundation of China [Grants 72071155 and 71701160]. M. Kadziński received financial support from the Polish National Science Center under the SONATA BIS project [Grant DEC-2019/34/E/HS4/00045]. X. Liao received financial support from the National Natural Science Foundation of China [Grant 71872144]. Supplemental Material: The software that supports the findings of this study is available within the paper and its Supplemental Information ( https://pubsonline.informs.org/doi/suppl/10.1287/ijoc.2023.1292 ) as well as from the IJOC GitHub software repository ( https://github.com/INFORMSJoC/2021.0328 ) at ( http://dx.doi.org/10.5281/zenodo.7608750 ).

Learning to Select and Rank from Choice-Based Feedback: A Simple Nested Approach

Sequential Modeling of Hierarchical User Intention and Preference for Next-item Recommendation

Choosing to Rank

Dynamic Assortment Selection under the Nested Logit Models.

Dynamic Assortment with Online Learning under Threshold Multinomial Logit Model

Stop Relying on No-Choice and Do not Repeat the Moves: Optimal, Efficient and Practical Algorithms for Assortment Optimization

Ranking Inferences Based on the Top Choice of Multiway Comparisons

Invidious Comparisons: Ranking and Selection as Compound Decisions

Tiered Assortment: Optimization and Online Learning

Adaptively Learning to Select-Rank in Online Platforms

Revealed Preference at Scale: Learning Personalized Preferences from Assortment Choices

Bounding Consideration Probabilities in Consider-Then-Choose Ranking Models

Assortment Optimization with Position Effects under the Nested Logit Model

Asymptotically Optimal Sequential Design for Rank Aggregation

A rank-based selection with cardinal payoffs and a cost of choice

Ranking and Selection with Two-Stage Decision

Modeling Contingent Decision Behavior: A Bayesian Nonparametric Preference-Learning Approach

Best-item Learning in Random Utility Models with Subset Choices

A Discrete Choice Model for Subset Selection

Non-Myopic Knowledge Gradient Policy for Ranking and Selection.

A New Framework of Designing Sequential Ranking-and-selection Procedures.