Supplementary Material for Learning Optimal Tree Models under Beam Search

Jingwei Zhuo,Ziru Xu,Wei Dai,Han Zhu,Han Li,Jian Xu,Kun Gai
2020-01-01
Abstract:A. Detailed Introduction of PLTs and TDMs A.1. Probabilistic Label Trees (PLTs) PLTs formulate tree modelsM(T , g) as hierarchical probability estimators for the marginal distribution p(yj |x) (Jain et al., 2016; Wydmuch et al., 2018). In PLTs, the pseudo target zn is defined as zn = I( ∑ n′∈L(n) yπ(n′) ≥ 1), which implies that zn = 1 if and only if there exists n′ ∈ C(n) such that zn′ = 1. In other words, zn = 1 implies zρ(n) = 1. As a result, for any n ∈ N , corresponding p(zn|x) can be decomposed as p(zn = 1|x) = ∏
What problem does this paper attempt to address?