Abstract: Semantic communication (SemCom) holds promise for reducing network resource consumption while achieving the communications goal. However, the computational overheads of jointly training semantic encoders and decoders, and of subsequently deploying them in network devices, are overlooked. Recent advances in generative artificial intelligence (GAI) offer a potential solution. The robust learning abilities of GAI models indicate that semantic decoders can reconstruct source messages using a limited amount of semantic information, e.g., prompts, without joint training with the semantic encoder. A notable challenge, however, is the instability introduced by GAI's diverse generation ability. This instability, evident in outputs like text-generated images, limits the direct application of GAI in scenarios demanding accurate message recovery, such as face image transmission. To solve the above problems, this paper proposes a GAI-aided SemCom system with multi-modal prompts for accurate content decoding. Moreover, in response to security concerns, we introduce the application of covert communications aided by a friendly jammer. The system jointly optimizes the diffusion step, jamming power, and transmit power with the aid of generative diffusion models, enabling successful and secure transmission of the source messages.

What problem does this paper attempt to address?

This paper aims to solve two main problems in semantic communication (SemCom): computational overhead and data security. Specifically:

1. **Computational Overhead**: Traditional semantic communication systems require joint training of encoders and decoders, which is both computationally complex and energy-consuming. Moreover, the trained models are usually task-specific, so multiple encoder-decoder pairs must be trained and distributed separately for different Internet of Things (IoT) devices. Replacing joint training with a generative decoder avoids this cost, but the diverse outputs of GAI models make reconstruction unstable, which is a problem in tasks that require accurate message recovery, such as face image transmission.
2. **Data Security**: When multi-modal prompts are transmitted in an open wireless environment, data security is an important issue. Traditional physical-layer security methods rely on encryption techniques, whereas covert communication enhances data security by hiding the transmission itself, making it difficult for potential eavesdroppers or external attackers to detect.

To solve the above problems, this paper proposes a semantic communication system based on generative artificial intelligence (GAI), which uses multi-modal prompts to achieve accurate content decoding and introduces covert communication techniques to protect the transmission of those prompts. The specific contributions are as follows:

- **A New GAI-Assisted Semantic Communication Framework**: This framework does not require joint training, reducing computational complexity and energy consumption.
- **Multi-Modal Prompt Mechanism**: Combining text prompts and visual prompts improves the accuracy of message reconstruction and mitigates the instability of GAI models.
- **Covert Communication Technology**: By jointly optimizing the diffusion steps, jamming power, and transmit power, it ensures accurate image regeneration under energy constraints while keeping the communication covert.

Through these innovations, this paper provides a semantic communication solution with significant improvements in both computational efficiency and data security.
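
The joint optimization mentioned above (diffusion steps, jamming power, and transmit power, subject to an energy budget and a covertness requirement) can be illustrated with a small toy sketch. Everything in it is an assumption made for illustration: the channel gains, the saturating quality model, the energy model (which ignores the jammer's own energy), the warden-SINR threshold used as a stand-in for covertness, and all function names are hypothetical and do not come from the paper. The paper solves its problem with generative diffusion models; this sketch swaps in a plain exhaustive grid search only to make the structure of the trade-off concrete.

```python
import itertools
import math

# Illustrative constants only; none of these values come from the paper.
NOISE = 1.0
G_RX, G_WARDEN = 1.0, 0.5            # transmitter -> receiver / warden gains
G_JAM_RX, G_JAM_WARDEN = 0.1, 1.0    # friendly jammer -> receiver / warden gains
ENERGY_PER_STEP = 0.2                # assumed energy cost of one diffusion step
SLOT = 1.0                           # assumed transmission duration
ENERGY_BUDGET = 12.0
COVERT_SINR_MAX = 0.3                # toy proxy for the covertness requirement


def quality(steps, p_tx, p_jam):
    """Toy reconstruction quality in (0, 1): improves with more diffusion
    steps and with a higher SINR at the legitimate receiver."""
    sinr = p_tx * G_RX / (p_jam * G_JAM_RX + NOISE)
    return (1 - math.exp(-steps / 10)) * (1 - math.exp(-sinr))


def feasible(steps, p_tx, p_jam):
    """Check the assumed energy budget and the toy covertness proxy."""
    energy = ENERGY_PER_STEP * steps + p_tx * SLOT
    warden_sinr = p_tx * G_WARDEN / (p_jam * G_JAM_WARDEN + NOISE)
    return energy <= ENERGY_BUDGET and warden_sinr <= COVERT_SINR_MAX


def grid_search():
    """Exhaustively search a coarse grid; returns (quality, steps, p_tx, p_jam)."""
    steps_grid = range(5, 55, 5)
    power_grid = [0.5 * k for k in range(1, 21)]  # 0.5 .. 10.0
    best = None
    for steps, p_tx, p_jam in itertools.product(steps_grid, power_grid, power_grid):
        if not feasible(steps, p_tx, p_jam):
            continue
        q = quality(steps, p_tx, p_jam)
        if best is None or q > best[0]:
            best = (q, steps, p_tx, p_jam)
    return best


if __name__ == "__main__":
    q, steps, p_tx, p_jam = grid_search()
    print(f"quality={q:.3f} steps={steps} p_tx={p_tx} p_jam={p_jam}")
```

In this toy model, raising the jamming power loosens the covertness constraint (the warden sees more interference) but also degrades the legitimate receiver slightly, while extra diffusion steps compete with transmit power for the same energy budget. That coupling is why the three variables have to be chosen together rather than one at a time.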

Generative AI-aided Joint Training-free Secure Semantic Communications via Multi-modal Prompts
