Abstract: We propose a preference-learning algorithm for uncovering Decision Makers’ (DMs’) contingent evaluation strategies in the context of multiple criteria sorting. We assume the preference information in the form of holistic assignment examples derived from the analysis of alternatives’ performance vectors and textual descriptions. We characterize the decision policies using a mixture of threshold-based, value-driven preference models and associated latent topics. The latter serve as the stimuli underlying the contingency in decision behavior. Such a probabilistic model is constructed by using a flexible and nonparametric Bayesian framework. The proposed method adopts a hierarchical Dirichlet process as the prior so that a group of DMs can share a countably infinite number of contingent models and topics. For all DMs, it automatically identifies the components representing their evaluation strategies adequately. The posterior is summarized by using the Hamiltonian Monte Carlo sampling method. We demonstrate the method’s practical usefulness in a real-world recruitment problem considered by a Chinese IT company. We also compare the approach with counterparts that use a single preference model, implement the parametric framework, or consider each DM’s preferences individually. The results indicate that our approach performs favorably in both interpreting DMs’ contingent decision behavior and recommending decisions on new alternatives. Furthermore, the approach’s performance and robustness are investigated through a computational experiment involving real-world data sets.

History: Accepted by Ram Ramesh, Area Editor for Data Science and Machine Learning.

Funding: J. Liu received financial support from the National Natural Science Foundation of China [Grants 72071155 and 71701160]. M. Kadziński received financial support from the Polish National Science Center under the SONATA BIS project [Grant DEC-2019/34/E/HS4/00045]. X. Liao received financial support from the National Natural Science Foundation of China [Grant 71872144].

Supplemental Material: The software that supports the findings of this study is available within the paper and its Supplemental Information (https://pubsonline.informs.org/doi/suppl/10.1287/ijoc.2023.1292) as well as from the IJOC GitHub software repository (https://github.com/INFORMSJoC/2021.0328) at (http://dx.doi.org/10.5281/zenodo.7608750).
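
As a rough illustration of what a single threshold-based, value-driven sorting model in the abstract might look like, the sketch below scores an alternative with an additive value function and assigns it to an ordered class by comparing that score with class thresholds. This is not the paper's implementation: the linear weighted sum, the function names `additive_value` and `assign_class`, and all numbers are assumptions chosen for illustration, and the sketch omits the mixture of models, the latent topics, and the hierarchical Dirichlet process prior entirely.

```python
from bisect import bisect_right


def additive_value(performances, weights):
    """Weighted sum of criterion performances, assumed already scaled to [0, 1].

    Hypothetical simplification: a linear additive value function.
    """
    return sum(w * g for w, g in zip(weights, performances))


def assign_class(value, thresholds):
    """Return a 0-based class index for a comprehensive value.

    `thresholds` must be sorted in ascending order. The index equals the
    number of thresholds that the value reaches or exceeds.
    """
    return bisect_right(thresholds, value)


# Illustrative numbers only (not taken from the paper).
weights = [0.5, 0.3, 0.2]      # criterion weights summing to 1
thresholds = [0.4, 0.7]        # two thresholds separate three ordered classes
candidate = [0.9, 0.6, 0.5]    # one alternative's scaled performance vector

value = additive_value(candidate, weights)
label = assign_class(value, thresholds)
print(f"value={value:.2f}, class index={label}")
```

In a contingent setting as described in the abstract, a DM would hold several such models, each with its own weights and thresholds, and which one applies would depend on the latent topic associated with the alternative's textual description.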

A Stochastic Policy Search Model for Matching Behavior

A dynamical policy search model for matching law.

Algorithm of matching law based on optimal policy search model

Policy-Based Reinforcement Learning for Assortative Matching in Human Behavior Modeling

Matching provides efficient decisions

Operant matching as a Nash equilibrium of an intertemporal game

A Computational Model of Match Decision-Making Problem Using Spiking SHESN with Reward-Modulated Reinforcement Learning.

Behavioral psychology's matching law describes the allocation of covert attention: A choice rule for the mind

Matching-Based Policy Learning

Online Matching with Stochastic Rewards: Advanced Analyses Using Configuration Linear Programs

A two-sided matching decision-making approach based on prospect theory under the probabilistic linguistic environment

A Model about Love in Human Dynamics

An Approximate Dynamic Programming Approach to Dynamic Stochastic Matching

Nonstandard Choice In Matching Markets

Behavior Decision of Mobile Robot With a Neurophysiologically Motivated Reinforcement Learning Model

Matching Algorithms: Fundamentals, Applications and Challenges

Online Stochastic Matching: A Polytope Perspective

Learning in Multi-Stage Decentralized Matching Markets

Modeling Contingent Decision Behavior: A Bayesian Nonparametric Preference-Learning Approach

Learning Neural Strategy-Proof Matching Mechanism from Examples

Learning a Diffusion Model Policy from Rewards via Q-Score Matching