User portrait extraction method based on text information

Yang Zhilin,Chen Yujun,Du Yulun,Zhang Yutao,Chen Xinmei,Xu Chao
2020-01-01
Abstract:The invention relates to the technical field of computer information processing, in particular to a text information-based user portrait extraction method, which mainly comprises the following steps of: 1, collecting text information; 2, judging whether the text information can be used for training and analysis of a computer for preliminary screening or not, and obtaining text sentences ; step 3,labeling the text sentences obtained in the step 2; 4, preprocessing the text sentences obtained in the step 3, extracting related data, and removing irrelevant words; and 5, constructing a text feature vector, and describing text information features through chi-square test and tfidf means. Compared with a traditional user portrait discovery system only based on rules, the human use efficiency can be effectively improved; the human cost of text information extraction is greatly reduced on the premise that the accuracy is guaranteed; and it can be guaranteed that the user portrait is obtainedefficiently on line. The purpose of efficiently and accurately extracting the user portrait in the text is achieved.
What problem does this paper attempt to address?