The explainable structure of deep neural network for recommendation systems

Mohammad Daryaie Zanjani,Mehdi Hosseinzadeh Aghdam
DOI: https://doi.org/10.1016/j.future.2024.05.036
IF: 7.307
2024-05-26
Future Generation Computer Systems
Abstract:Recommender systems (RS) play a pivotal role in establishing user trust by suggesting relevant items that meet their needs and enhance reliability. The recent trend involves integrating deep learning models into RS, enabling the creation of intricate, precise, and application-specific models. However, to ensure the dependability of these deep RS models, it is crucial to implement explainable systems that illuminate the internal mechanisms of the models and enhance user satisfaction. Additionally, in specific domains such as medicine and law, users require a comprehension of the model's operational processes. This is where explainable deep neural networks (xDNN) come into play, aiding in the analysis and refinement of complex models, ultimately resulting in increased user satisfaction. Furthermore, there are several challenges in current explainable models, such as non-general models, the need for pretraining models, constraints on input variations, and compromising accuracy. To tackle the challenge of comprehending which neurons in specific layers contribute to particular outputs, this paper introduces a novel approach known as xDNN based on shadow models (xDNN-SM) to maintain interpretability without sacrificing accuracy. xDNN-SM incorporates inputs, hidden layers, and outputs to reveal the relationship between inputs and their corresponding outputs through the utilization of shadow models and shadow inputs. A shadow model is a basic component of the network, and its inputs are determined by its location within the network and its connections. Shadow inputs are computed based on real inputs to enhance the explainability capabilities of the proposed model. By leveraging shadow models and inputs, the primary model becomes explainable, allowing for traceability along various paths. The xDNN-SM approach categorizes the latent feature vectors of each shadow model separately, determining the cluster to which test inputs belong. Each shadow model possesses distinct input features, making the role of each model quantifiable. Consequently, the connections between inputs and outputs become traceable. Furthermore, the proposed model identifies similar samples for each test instance through both local and global interpretation approaches. The effectiveness of xDNN-SM is demonstrated on benchmark datasets, specifically KDDcup 2012 and Yelp, highlighting the explanatory capabilities of the proposed model.
computer science, theory & methods
What problem does this paper attempt to address?