Meta-Interpretive LEarning with Reuse

Rong Wang, Jun Sun, Cong Tian, Zhenhua Duan
DOI: https://doi.org/10.3390/math12060916
IF: 2.4
2024-03-21
Mathematics
Abstract: Inductive Logic Programming (ILP) is a research field at the intersection between machine learning and logic programming, focusing on developing a formal framework for inductively learning relational descriptions in the form of logic programs from examples and background knowledge. As an emerging method of ILP, Meta-Interpretive Learning (MIL) leverages the specialization of a set of higher-order metarules to learn logic programs. In MIL, the input includes a set of examples, background knowledge, and a set of metarules, while the output is a logic program. MIL executes a depth-first traversal search, where its program search space expands polynomially with the number of predicates in the provided background knowledge and exponentially with the number of clauses in the program, sometimes even leading to search collapse. To address this challenge, this study introduces a strategy that employs the concept of reuse, specifically through the integration of auxiliary predicates, to reduce the number of clauses in programs and improve the learning efficiency. This approach focuses on the proactive identification and reuse of common program patterns. To operationalize this strategy, we introduce MILER, a novel method integrating a predicate generator, program learner, and program evaluator. MILER leverages frequent subgraph mining techniques to detect common patterns from a limited dataset of training samples, subsequently embedding these patterns as auxiliary predicates into the background knowledge. In our experiments involving two Visual Question Answering (VQA) tasks and one program synthesis task, we assessed MILER's approach to utilizing reusable program patterns as auxiliary predicates. The results indicate that, by incorporating these patterns, MILER identifies reusable program patterns, reduces program clauses, and directly decreases the likelihood of timeouts compared to traditional MIL. This leads to improved learning success rates by optimizing computational efforts.
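The growth claim in the abstract can be made concrete with a little arithmetic. The sketch below is not taken from the paper: it assumes each clause is built by picking one of `num_metarules` metarules and filling a fixed number of predicate slots (three, as in a chain metarule of the form `P(A,B) :- Q(A,C), R(C,B)`) from `num_predicates` symbols. Under that assumption the number of candidate programs is polynomial in the predicates and exponential in the clauses. The function name and the numbers are illustrative only.

```python
def hypothesis_space_size(num_predicates: int,
                          num_metarules: int,
                          num_clauses: int,
                          slots_per_metarule: int = 3) -> int:
    """Rough upper bound on the number of candidate programs.

    Assumption (not from the paper): every clause is one metarule whose
    `slots_per_metarule` predicate variables are each bound to one of
    `num_predicates` symbols, and a program is a sequence of such clauses.
    """
    # Number of distinct clauses: polynomial in the number of predicates.
    candidate_clauses = num_metarules * num_predicates ** slots_per_metarule
    # Number of programs: exponential in the number of clauses.
    return candidate_clauses ** num_clauses


# Illustrative numbers only: 10 background predicates, 2 metarules.
baseline = hypothesis_space_size(num_predicates=10, num_metarules=2, num_clauses=5)

# Adding 2 auxiliary predicates grows the base slightly, but if reuse lets
# the target program be expressed in 3 clauses instead of 5, the exponent drops.
with_reuse = hypothesis_space_size(num_predicates=12, num_metarules=2, num_clauses=3)

print(f"baseline:   {baseline:.3e}")
print(f"with reuse: {with_reuse:.3e}")
print(f"reduction:  {baseline / with_reuse:.1f}x")
```

In this toy setting the extra predicates enlarge the per-clause choice, yet the shorter program shrinks the overall space by several orders of magnitude, which is the trade-off the reuse strategy relies on.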
What problem does this paper attempt to address?
### Problems Addressed by the Paper

The paper primarily addresses the efficiency problems that Meta-Interpretive Learning (MIL) faces in large search spaces. MIL inductively learns logic programs from examples, background knowledge, and metarules, but its search space grows polynomially with the number of background predicates and exponentially with the number of clauses in the target program. For programs that need many clauses, learning becomes inefficient or fails outright.

To tackle this challenge, the authors propose a new method called MILER (Meta-Interpretive Learning with Reuse). The core idea of MILER is to extract reusable program patterns from a limited number of training samples and add these patterns to the background knowledge as auxiliary predicates. Programs can then be expressed with fewer clauses, which narrows the search space and improves learning efficiency and success rate.

### Main Contributions

1. **Utilization of Frequent Subgraph Mining Techniques**: Extracting reusable program patterns from a limited number of training samples.
2. **Expansion of Background Knowledge**: Enriching the existing background knowledge with auxiliary predicates that encode the mined patterns.
3. **Reduction of Timeout Instances**: Generating programs with fewer clauses to reduce timeouts during learning, thereby improving learning efficiency.

### Experimental Validation

The authors validated the method on two Visual Question Answering (VQA) tasks and one program synthesis task. The results show that, compared to traditional MIL, MILER identifies reusable program patterns, reduces the number of clauses in programs, and lowers the probability of timeouts, thereby increasing the learning success rate. Moreover, even with very few training samples (only a few dozen), MILER maintains robust learning efficiency.
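
The reuse idea summarized above (mine common patterns from a few sample programs, then expose them as auxiliary predicates) can be sketched in a few lines. This is a deliberately simplified stand-in, not MILER's algorithm: the paper uses frequent subgraph mining, whereas the sketch treats each sample program as a linear chain of primitive predicate names and mines frequent contiguous sub-chains. The predicate names (`filter_red`, `count`, and so on), the `aux_1` name, and both helper functions are hypothetical.

```python
from collections import Counter
from typing import Dict, List, Sequence, Tuple

Pattern = Tuple[str, ...]


def mine_frequent_chains(programs: Sequence[Sequence[str]],
                         min_support: int = 2,
                         max_len: int = 3) -> Dict[Pattern, int]:
    """Return contiguous sub-chains (length 2..max_len) that occur in at
    least `min_support` programs. A simplified substitute for frequent
    subgraph mining, valid only for chain-shaped programs."""
    support: Counter = Counter()
    for body in programs:
        seen = set()
        for length in range(2, max_len + 1):
            for start in range(len(body) - length + 1):
                seen.add(tuple(body[start:start + length]))
        # Count each pattern at most once per program (support, not frequency).
        support.update(seen)
    return {p: c for p, c in support.items() if c >= min_support}


def rewrite_with_auxiliary(body: Sequence[str],
                           pattern: Pattern,
                           aux_name: str) -> List[str]:
    """Replace every occurrence of `pattern` in `body` with one call to
    the auxiliary predicate `aux_name`."""
    out: List[str] = []
    i, k = 0, len(pattern)
    while i < len(body):
        if tuple(body[i:i + k]) == pattern:
            out.append(aux_name)
            i += k
        else:
            out.append(body[i])
            i += 1
    return out


# Hypothetical VQA-style sample programs, written as chains of primitives.
samples = [
    ["filter_red", "filter_cube", "count"],
    ["filter_red", "filter_cube", "exists"],
    ["filter_blue", "count"],
]

frequent = mine_frequent_chains(samples, min_support=2)
pattern = max(frequent, key=frequent.get)  # most widely shared pattern
shortened = rewrite_with_auxiliary(samples[0], pattern, "aux_1")
print(frequent)
print(shortened)
```

Once `aux_1` is added to the background knowledge, the first sample needs two body literals instead of three. The learner then searches for a shorter program, at the cost of one extra predicate in the background knowledge.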