Generative AI-aided Joint Training-free Secure Semantic Communications via Multi-modal Prompts

Hongyang Du, Guangyuan Liu, Dusit Niyato, Jiayi Zhang, Jiawen Kang, Zehui Xiong, Bo Ai, Dong In Kim
2023-09-06
Abstract: Semantic communication (SemCom) holds promise for reducing network resource consumption while achieving the communications goal. However, the computational overheads of jointly training semantic encoders and decoders, and of subsequently deploying them in network devices, are overlooked. Recent advances in generative artificial intelligence (GAI) offer a potential solution. The robust learning abilities of GAI models indicate that semantic decoders can reconstruct source messages using a limited amount of semantic information, e.g., prompts, without joint training with the semantic encoder. A notable challenge, however, is the instability introduced by GAI's diverse generation ability. This instability, evident in outputs like text-generated images, limits the direct application of GAI in scenarios demanding accurate message recovery, such as face image transmission. To solve the above problems, this paper proposes a GAI-aided SemCom system with multi-modal prompts for accurate content decoding. Moreover, in response to security concerns, we introduce the application of covert communications aided by a friendly jammer. The system jointly optimizes the diffusion step, jamming, and transmitting power with the aid of the generative diffusion models, enabling successful and secure transmission of the source messages.
Image and Video Processing, Machine Learning, Networking and Internet Architecture
What problem does this paper attempt to address?
This paper aims to solve two main problems in semantic communication (SemCom): computational overhead and data security. Specifically:

1. **Computational Overhead**: Traditional semantic communication systems require joint training of encoders and decoders, which is both computationally complex and energy-consuming. Moreover, the trained models are usually task-specific, so multiple encoder-decoder pairs must be trained and distributed separately for different Internet of Things (IoT) devices. Without such joint training, however, semantic communication systems struggle with tasks that require accurate message reconstruction, such as face image transmission.
2. **Data Security**: When multi-modal prompts are transmitted in an open wireless environment, data security is an important issue. Traditional physical-layer security methods rely on encryption techniques, whereas covert communication enhances data security by hiding the transmission itself, making it difficult for potential eavesdroppers or external attackers to detect.

To solve the above problems, this paper proposes a semantic communication system based on generative artificial intelligence (GAI), which uses multi-modal prompts to achieve accurate content decoding and introduces covert communication techniques to protect the transmission of those prompts. The specific contributions are as follows:

- **A New GAI-Assisted Semantic Communication Framework**: This framework does not require joint training, reducing computational complexity and energy consumption.
- **Multi-Modal Prompt Mechanism**: Combining text prompts and visual prompts improves the accuracy of message reconstruction and addresses the instability of GAI model outputs.
- **Covert Communication Technology**: With the aid of a friendly jammer, the system optimizes the diffusion steps, jamming power, and transmission power to ensure accurate image regeneration under energy constraints while keeping the communication covert.

Through these innovations, this paper provides a semantic communication solution with significant improvements in both computational efficiency and data security.
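
The joint choice of diffusion steps, jamming power, and transmit power described above can be illustrated with a toy example. The following is a minimal sketch only: the paper solves this problem with generative diffusion models, whereas this sketch swaps in an exhaustive grid search for readability. Every constant, the `quality` proxy, the `is_covert` rule, and the `energy` model are hypothetical placeholders chosen for illustration and are not taken from the paper.

```python
import itertools
import math

# All constants are illustrative assumptions, not values from the paper.
NOISE = 1.0            # noise power at the legitimate receiver
G_TX_RX = 1.0          # channel gain: transmitter -> receiver
G_JAM_RX = 0.1         # channel gain: friendly jammer -> receiver
G_TX_WARDEN = 0.5      # channel gain: transmitter -> warden
G_JAM_WARDEN = 1.0     # channel gain: friendly jammer -> warden
ENERGY_PER_STEP = 0.2  # energy spent per diffusion step
SLOT_TIME = 1.0        # duration of the transmission slot


def sinr(p_tx, p_jam):
    """Signal-to-interference-plus-noise ratio at the legitimate receiver."""
    return p_tx * G_TX_RX / (p_jam * G_JAM_RX + NOISE)


def quality(steps, p_tx, p_jam):
    """Hypothetical reconstruction-quality proxy in [0, 1).

    It grows with the number of diffusion steps and with the SINR of the
    received prompts; the functional form is an assumption.
    """
    return (1 - math.exp(-steps / 10)) * (1 - math.exp(-sinr(p_tx, p_jam)))


def is_covert(p_tx, p_jam, ratio=0.5):
    """Stand-in covertness rule (assumption): the signal power seen by the
    warden must not exceed `ratio` times the jamming power it sees."""
    return p_tx * G_TX_WARDEN <= ratio * p_jam * G_JAM_WARDEN


def energy(steps, p_tx, p_jam):
    """Total energy: diffusion computation plus transmit and jamming energy."""
    return steps * ENERGY_PER_STEP + (p_tx + p_jam) * SLOT_TIME


def grid_search(budget, step_grid, power_grid):
    """Return (quality, steps, p_tx, p_jam) of the best feasible point,
    or None if no point satisfies both constraints."""
    best = None
    for steps, p_tx, p_jam in itertools.product(step_grid, power_grid, power_grid):
        if energy(steps, p_tx, p_jam) > budget or not is_covert(p_tx, p_jam):
            continue
        q = quality(steps, p_tx, p_jam)
        if best is None or q > best[0]:
            best = (q, steps, p_tx, p_jam)
    return best


if __name__ == "__main__":
    powers = [0.5 * i for i in range(1, 11)]
    print(grid_search(10.0, range(5, 31, 5), powers))
```

The sketch makes the trade-off visible: more diffusion steps and more transmit power raise the quality proxy, but both consume the shared energy budget, and higher transmit power must be matched by more jamming power to stay under the covertness rule.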