A Joint Multi-task Learning Model for Web Table-to-Knowledge Graph Matching

Jie Wu,Mengshu Hou
DOI: https://doi.org/10.1007/978-981-97-5492-2_31
2024-01-01
Abstract:The effective utilization and analysis of tabular data rely on the identification and comprehension of the semantics of table elements. The semantic types of columns and inter-column relations, which often lack proper annotations in many tables, are particularly crucial for tasks such as tabular data cleaning, integration, and retrieval. Most existing annotation models tend to treat each subtask as an independent task, potentially overlooking certain constraints and cues, leading to relational confusion and inference errors. In this study, we propose TabJML, a joint multi-task learning framework capable of accomplishing multiple table-to-knowledge graph matching tasks within a single model, including designed column named entity type recognition, target column type annotation and columns property annotation. This model makes full use of interdependencies between subtasks, facilitating communication of features and shared representations among tasks, thereby optimizing the performance of subtasks and enhancing generalization capabilities. It does not rely on additional contextual information or knowledge graphs, utilizing only the internal table content, yet achieves good performance even under short input scenarios. Experimental results demonstrate the superiority and potential of the proposed model.
What problem does this paper attempt to address?