Survey of Research on Multimodal Semantic Communication

QIN Zhijin, ZHAO Tantan, LI Fan, TAO Xiaoming
DOI: https://doi.org/10.11959/j.issn.1000-436x.2023105
2023-01-01
Abstract: With the convergence of artificial intelligence and communications, technologies for processing multimodal data such as text, image, audio, and video are advancing rapidly. The semantic dimensions shared across modalities are being explored in depth, and the high abstraction, intelligence, and conciseness of multimodal semantic information are being fully exploited, which brings new ideas and methods to semantic communication. First, the fundamental theories and classifications of semantic communication are introduced, and the state of research on single-modal semantic communication is reviewed for text, image, audio, and video in turn. Then, the state of research on multimodal semantic communication is reviewed, and multimodal data fusion techniques and secure semantic communication are introduced. Finally, the challenges facing multimodal semantic communication are summarized.