Automatic vocabulary primitive prediction method and device

Sun Maosong,Xie Ruobing,Yuan Xingchi,Liu Zhiyuan
2017-01-01
Abstract:Embodiments of the invention disclose an automatic vocabulary primitive prediction method and device. The method comprises the following steps of: calculating a vector distance between each primitive-unknown vocabulary and each primitive-known vocabulary according to a word vector of each preset vocabulary; selecting at least one target primitive-known vocabulary as an alternative primitive set of each primitive-unknown vocabulary according to each vector distance and a distance threshold value; calculating a score of each primitive of each primitive-unknown vocabulary according to a primitive vector of each target primitive-known vocabulary in the alternative primitive set; and obtaining a first primitive vector of each primitive-unknown vocabulary according to a score threshold and the score of each primitive. The alternative primitive set of each primitive-unknown vocabulary is determined through the vector distance, the score of each primitive in the alternative primitive set is further calculated, and then the first primitive vector of each primitive-unknown vocabulary is obtained, so that correct primitive prediction can be automatically carried out on the primitive-unknown vocabularies, the pressure of manual labelling is lightened, and the possible variations caused to the result by labelling carried out by different persons are decreased.
What problem does this paper attempt to address?