Extreme Multilabel Classification for Specialist Doctor Recommendation with Implicit Feedback and Limited Patient Metadata

Filipa Valdeira,Stevo Racković,Valeria Danalachi,Qiwei Han,Cláudia Soares
2023-08-22
Abstract:Recommendation Systems (RS) are often used to address the issue of medical doctor referrals. However, these systems require access to patient feedback and medical records, which may not always be available in real-world scenarios. Our research focuses on medical referrals and aims to predict recommendations in different specialties of physicians for both new patients and those with a consultation history. We use Extreme Multilabel Classification (XML), commonly employed in text-based classification tasks, to encode available features and explore different scenarios. While its potential for recommendation tasks has often been suggested, this has not been thoroughly explored in the literature. Motivated by the doctor referral case, we show how to recast a traditional recommender setting into a multilabel classification problem that current XML methods can solve. Further, we propose a unified model leveraging patient history across different specialties. Compared to state-of-the-art RS using the same features, our approach consistently improves standard recommendation metrics up to approximately $10\%$ for patients with a previous consultation history. For new patients, XML proves better at exploiting available features, outperforming the benchmark in favorable scenarios, with particular emphasis on recall metrics. Thus, our approach brings us one step closer to creating more effective and personalized doctor referral systems. Additionally, it highlights XML as a promising alternative to current hybrid or content-based RS, while identifying key aspects to take into account when using XML for recommendation tasks.
Machine Learning
What problem does this paper attempt to address?
### Problems the Paper Attempts to Solve This paper aims to address the issue of how to effectively recommend doctors in medical recommendation systems, particularly in the context of specialist doctor recommendations, using limited patient metadata (such as basic demographic information). Specifically, the researchers focus on: 1. **Cold Start Problem**: For new patients, traditional recommendation systems often struggle to provide accurate recommendations due to the lack of historical interaction data. 2. **Limited Patient Metadata**: In real medical scenarios, detailed patient medical records and personal feedback may not be available, necessitating recommendations based on limited information. 3. **Cross-Specialty Recommendation**: The recommendation system needs to handle recommendations for doctors across different specialties, not just a single specialty. To address these issues, the researchers propose a method based on Extreme Multilabel Classification (XML), redefining the traditional recommendation problem as a multilabel classification problem. This approach not only effectively handles the cold start problem but also provides more accurate recommendations with limited patient metadata. ### Main Contributions 1. **Proposed Solution**: Utilizing implicit data and limited patient metadata, a new recommendation method is proposed, suitable for the privacy protection needs in healthcare scenarios. 2. **Problem Redefinition**: The recommendation problem is redefined as an extreme multilabel classification problem, addressing two key challenges: encoding existing features and converting consultation history into appropriate labels. 3. **Performance Comparison**: Compared to existing state-of-the-art recommendation systems, this method shows better performance in predicting for both existing users and new users without historical interactions, especially in recall metrics. ### Method Overview 1. **Dataset and Problem Setup**: The dataset comes from patient-doctor consultation records in a private medical network in Europe, including basic information about patients and doctors, educational background, and specialty information. The goal is to predict the preferred doctor for each patient using this information. 2. **Recommendation Method**: Each patient-doctor interaction is treated as positive feedback, constructing a rating matrix and converting it into a label matrix. Through extreme multilabel classification methods, the most relevant doctors for each patient are predicted. 3. **Feature Extraction**: Four different feature groups are extracted from the interaction data, including baseline features, educational history, specialty information, and hospital location information. 4. **Experimental Results**: The effectiveness of the method is validated through various evaluation metrics (such as PS-nDCG@3 and Recall@10) and compared with benchmark methods like SVD, BiVAE, and LightFM. ### Conclusion This study successfully addresses the cold start problem and the issue of limited patient metadata in medical recommendation systems by redefining the traditional recommendation problem as an extreme multilabel classification problem. Experimental results show that this method performs excellently in recommendations for both existing and new users, significantly outperforming existing methods in recall metrics. This provides new insights for developing more effective and personalized doctor recommendation systems.