Weakly Supervised Reasoning by Neuro-Symbolic Approaches

Xianggen Liu, Zhengdong Lu, Lili Mou
DOI: https://doi.org/10.3233/FAIA230162
2023-09-19
Abstract: Deep learning has largely improved the performance of various natural language processing (NLP) tasks. However, most deep learning models are black-box machinery and lack explicit interpretation. In this chapter, we will introduce our recent progress on neuro-symbolic approaches to NLP, which combine different schools of AI, namely, symbolism and connectionism. Generally, we will design a neural system with symbolic latent structures for an NLP task, and apply reinforcement learning or its relaxation to perform weakly supervised reasoning in the downstream task. Our framework has been successfully applied to various tasks, including table query reasoning, syntactic structure reasoning, information extraction reasoning, and rule reasoning. For each application, we will introduce the background, our approach, and experimental results.
Computation and Language, Artificial Intelligence, Machine Learning
What problem does this paper attempt to address?
### The problems the paper attempts to solve

This paper addresses weakly supervised reasoning in natural language processing (NLP) tasks. Specifically, the authors focus on how to combine two artificial intelligence paradigms, symbolism and connectionism, into a neuro-symbolic framework that performs weakly supervised reasoning on complex tasks. The main objectives of the paper are:

1. **Explainability and interpretability**: Existing deep learning models are usually black boxes and lack explicit explanations. By introducing symbolic structures, the authors want the model to expose its intermediate reasoning steps when performing a task, thereby improving interpretability.
2. **Weakly supervised learning**: In many practical applications, detailed labeled data is very hard to obtain. The authors therefore propose a weakly supervised method: the training signal exists only for the final conclusion, while the intermediate reasoning steps are not directly supervised and must be inferred by the model itself.
3. **Multi-task applications**: The framework is applied to multiple NLP tasks, including table query reasoning, syntactic structure reasoning, information extraction reasoning, and rule reasoning. Each application has its own background, method, and experimental results.

### Specific problem example: Table query reasoning

In the table query reasoning task, the input is a table and a natural language query, and the output is the result obtained by SQL-like execution. For example, consider a table and the query "What is the duration of the competition with the largest host country area?". To answer it, the row with the largest value in the "area" column must first be found, and then the value in the "duration" column of that row is selected.

### Limitations of existing methods

1. **Traditional methods**: Traditional semantic parsing methods require a large amount of manual feature engineering and are difficult to extend to complex queries.
2. **End-to-end neural networks**: Although end-to-end neural networks can learn complex features, they lack interpretability and execute less efficiently than symbolic methods.
3. **Reinforcement learning**: Training with reinforcement learning is very sensitive to the choice of the initial policy and easily falls into local optima.

### Solutions in the paper

The authors propose a neuro-symbolic framework that combines the advantages of a distributed executor and a symbolic executor:

1. **Distributed executor**: A neural network handles the table query task, and all semantic units (including words in the query, entries in the table, and execution results) are represented as distributed real-valued vectors.
2. **Symbolic executor**: A set of basic operators is defined to perform specific query steps. The symbolic executor uses a machine learning model to predict the operator sequence and its arguments.
3. **Pre-training and reinforcement learning**: The intermediate execution results of the distributed executor are used to pre-train the symbolic executor, which yields a meaningful initial policy. The REINFORCE algorithm then further optimizes the policy.

### Experimental results

The experimental results show that the coupled distributed and symbolic executors significantly outperform either executor used alone, with higher accuracy, learning efficiency, and execution efficiency.

### Summary

This paper addresses weakly supervised reasoning in NLP tasks by combining symbolic and connectionist methods, and improves the interpretability and execution efficiency of the model.
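To make the table query example concrete, the following is a minimal Python sketch of how a symbolic executor might run the two steps described above (find the row with the largest "area", then read its "duration"), together with a reward that depends only on the final answer, as in the weakly supervised setting. It is an illustration under stated assumptions, not the authors' implementation: the operator names `argmax` and `select`, the helper functions, and the toy table values are all hypothetical.

```python
from typing import Callable, Dict, List, Tuple

Row = Dict[str, float]
Table = List[Row]
Program = List[Tuple[str, str]]  # sequence of (operator name, column name)


def op_argmax(rows: Table, column: str) -> Table:
    """Keep only the row with the largest value in `column`."""
    return [max(rows, key=lambda r: r[column])]


def op_select(rows: Table, column: str) -> List[float]:
    """Read `column` from the currently selected rows."""
    return [r[column] for r in rows]


# Hypothetical operator set; the paper defines its own primitive operators.
OPERATORS: Dict[str, Callable] = {"argmax": op_argmax, "select": op_select}


def execute(table: Table, program: Program):
    """Apply each (operator, column) step to the result of the previous step."""
    state = table
    for op_name, column in program:
        state = OPERATORS[op_name](state, column)
    return state


def reward(table: Table, program: Program, gold_answer) -> float:
    """Weak supervision: only the final answer is compared with the label.

    The intermediate steps of `program` receive no direct training signal.
    """
    return 1.0 if execute(table, program) == gold_answer else 0.0


# Toy data invented for illustration only.
TOY_TABLE: Table = [
    {"year": 2000, "area": 300.0, "duration": 20},
    {"year": 2004, "area": 950.0, "duration": 30},
    {"year": 2008, "area": 120.0, "duration": 25},
]

# "What is the duration of the competition with the largest host country area?"
PROGRAM: Program = [("argmax", "area"), ("select", "duration")]

answer = execute(TOY_TABLE, PROGRAM)
print(answer)  # [30]
```

In a learned system, the program would be predicted by a model instead of being written by hand, and a reward like the one sketched here could serve as the signal for a policy-gradient method such as REINFORCE.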