MulmQA: Multimodal Question Answering for Database Alarm

Dequan Gao,Jiwei Li,Xuewei Ding,Bao Feng,Zhijie Ren,Linfeng Zhang
DOI: https://doi.org/10.1109/ictc61510.2024.10602092
2024-01-01
Abstract:In response to the dramatic increase in data volume and diversification of data types, traditional question-answering systems face significant challenges in understanding and processing complex data sources. This challenge becomes particularly acute when addressing urgent database alarms, where providing a rapid and accurate solution is crucial. To address this, we introduce MulmQA-a novel integrated model designed to effectively combine multimodal knowledge graphs with database alarm question-answering systems. MulmQA incorporates a variety of modal data, including text and images, into a unified knowledge graph. We employ cutting-edge techniques to process image and text data and have developed a specialized fusion algorithm to enhance contextual understanding and the accuracy of answers. Our framework provides a 3-4% improvement in BLUE and PPL performance metrics compared to SOAT approaches.
What problem does this paper attempt to address?