A multimodal dual-fusion entity extraction model for large and complex devices

Weiming Tong, Xu Chu, Wenqi Jiang, Zhongwei Li
DOI: https://doi.org/10.1016/j.comcom.2023.07.026
IF: 5.047
2023-07-30
Computer Communications
Abstract: For large and complex devices operating in a multi-source, heterogeneous data environment, extracting entity information related to network device configuration from the diverse modalities of Internet of Things data is a crucial and fundamental step towards building a domain knowledge graph for global Zero Touch Provisioning of network devices. In this paper, we present a novel multimodal dual-fusion entity extraction model that serves as a foundation for an intelligent and efficient network device configuration process. First, the multimodal data is encoded and passed through the pre-trained ViLBERT model to obtain richer entity feature information for multimodal front-end fusion. Next, the attention weights of each modal feature are learned through a multilayer neural network classifier and a probabilistic graphical model, which enables multimodal back-end fusion and reduces information redundancy. Finally, entity recognition is performed by a cohesive memory module that extracts the essential parameters for device configuration. Simulation results show that the proposed model performs well on the public MSCOCO2017 dataset and the SFZK-Dev network device dataset, with F1 scores (the comprehensive evaluation index of model quality) of 94.65% and 96.94%, respectively, indicating high stability. Additionally, link prediction accuracy reached 58.21% and 68.04%, respectively.
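The back-end fusion step described in the abstract weights each modality's features by a learned attention weight. The following is only a minimal sketch of that general idea, not the authors' implementation: the function names `softmax` and `fuse`, the two-dimensional feature vectors, and the raw scores are all hypothetical, and the scores are supplied by hand here rather than learned by a classifier or probabilistic graphical model as in the paper.

```python
import math


def softmax(scores):
    """Turn raw per-modality scores into weights that sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def fuse(features, scores):
    """Attention-weighted sum of per-modality feature vectors.

    features: list of equal-length feature vectors, one per modality
    scores:   one raw attention score per modality (hypothetical values;
              in the paper these weights are learned, not hand-set)
    """
    if len(features) != len(scores):
        raise ValueError("need exactly one score per modality")
    weights = softmax(scores)
    dim = len(features[0])
    fused = [
        sum(w * vec[i] for w, vec in zip(weights, features))
        for i in range(dim)
    ]
    return fused, weights


# Toy example: a text feature and an image feature with equal scores.
text_feat = [1.0, 0.0]
image_feat = [0.0, 1.0]
fused, weights = fuse([text_feat, image_feat], [0.0, 0.0])
```

With equal scores, both modalities contribute equally; raising one score shifts the fused vector towards that modality, which is how a weighting of this kind can down-weight redundant inputs.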
computer science, information systems; telecommunications; engineering, electrical & electronic