On the hardness of learning queries from tree structured data

Xianmin Liu,Jianzhong Li
DOI: https://doi.org/10.1007/s10878-013-9609-9
2013-01-01
Journal of Combinatorial Optimization
Abstract:The problem of learning queries from tree structured data is studied by this paper. A tree structured data is modeled as a node-labeled tree T , and applying a query q on T will return a set q(T) which is a subset of nodes in T . For a tree-node pair (T,t) where t is a node in T , q is called to accept the pair if t∈q(T) , and reject the pair if t∉q(T) . For some query class ℒ , given tree-node pair sets E_p and E_n , the tree query learning problem is to find a query q∈ℒ such that (1) q rejects all pairs in E_n , and (2) the size of pairs in E_p accepted by q is maximized. On four different query classes 𝒬 ^ / , 𝒬 ^ /,* , 𝒬 ^ /,// and 𝒬 ^ /,[] , this paper studies the hardness of the corresponding tree query learning problems. For 𝒬 ^ / , a PTime algorithm is given. For 𝒬 ^ /,* and 𝒬 ^ /,// , the NP- complete results are shown. For 𝒬 ^ /,[] , the problem is shown to be NP- hard by considering two constrained fragments of 𝒬 ^ /,[] . Also, for 𝒬 ^ /,* , 𝒬 ^ /,[] and 𝒬 ^ /,// , it is shown that there are no n^1-ϵ -approximation algorithms for any ϵ >0 .
What problem does this paper attempt to address?