Unveiling Black-boxes: Explainable Deep Learning Models for Patent Classification

Md Shajalal,Sebastian Denef,Md. Rezaul Karim,Alexander Boden,Gunnar Stevens
DOI: https://doi.org/10.1007/978-3-031-44067-0_24
2023-10-31
Abstract:Recent technological advancements have led to a large number of patents in a diverse range of domains, making it challenging for human experts to analyze and manage. State-of-the-art methods for multi-label patent classification rely on deep neural networks (DNNs), which are complex and often considered black-boxes due to their opaque decision-making processes. In this paper, we propose a novel deep explainable patent classification framework by introducing layer-wise relevance propagation (LRP) to provide human-understandable explanations for predictions. We train several DNN models, including Bi-LSTM, CNN, and CNN-BiLSTM, and propagate the predictions backward from the output layer up to the input layer of the model to identify the relevance of words for individual predictions. Considering the relevance score, we then generate explanations by visualizing relevant words for the predicted patent class. Experimental results on two datasets comprising two-million patent texts demonstrate high performance in terms of various evaluation measures. The explanations generated for each prediction highlight important relevant words that align with the predicted class, making the prediction more understandable. Explainable systems have the potential to facilitate the adoption of complex AI-enabled methods for patent classification in real-world applications.
Artificial Intelligence
What problem does this paper attempt to address?
### Problems the Paper Aims to Solve This paper aims to address the black-box problem in patent classification tasks and proposes an interpretable patent classification framework based on Layer-wise Relevance Propagation (LRP). #### Specific Goals: 1. **Improve the transparency of patent classification**: By introducing LRP technology, provide human-understandable explanations for prediction results, making the model's decision process more transparent. 2. **Enhance model interpretability**: By highlighting keywords related to the predicted category, help users understand why the model made a specific classification. 3. **Improve classification performance**: Various deep neural network models (such as Bi-LSTM, CNN, and CNN-BiLSTM) were trained on two benchmark datasets, and experiments verified the effectiveness and consistency of these models. #### Main Contributions: 1. Proposed an LRP-based patent classification explanation method that can generate explanations for each predicted category. 2. The developed DNN models performed excellently on multiple evaluation metrics and showed better performance compared to existing works. ### Summary This paper addresses the black-box problem in patent classification by introducing LRP technology to improve the transparency and interpretability of the model, thereby enhancing user trust in the model. Additionally, experimental results show that this method outperforms existing patent classification methods on two benchmark datasets.