A Survey on Deep Text Matching
Liang PANG,Yan-Yan LAN,Jun XU,Jia-Feng GUO,Sheng-Xian WAN,Xue-Qi CHENG
DOI: https://doi.org/10.11897/SP.J.1016.2017.00985
2017-01-01
Chinese Journal of Computers
Abstract:Many problems in natural language processing,such as information retrieval,question answering,machine translation,dialog system,paraphrase identification and so on,can be treated as a problem of text matching.The past researches on text matching focused on defining artificial features and learning relation between two text features,thus the performance of the text matching model heavily relies on the features designing.Recently,affected by the idea of automatically feature extraction in deep learning,many text matching models based on deep learning,namely Deep Text Matching model,have been proposed.Comparing to the traditional methods,Deep Text Matching models can automatically learn relations among words from big data and make use of the information from phrase patterns and text hierarchical structures.Considering the different structures of Deep Text Matching models,we divide them into three categories:Single semantic document representation based deep matching model,Multiple semantic document representation based deep matching model and Matching pattern based deep matching model.We can see the progressive relationship among three kinds of models in modelling the interaction of texts,while which have their own merits and defects based on a specific task.Experiments were carried out on the typical datasets of paraphrase identification,question answering and information retrieval.We compare and explain the different performance of three kinds of deep text matching models.Finally,we give the key challenges and the future outlooks of the deep text matching models.