Abstract:The Transformer model has gained significant recognition for its remarkable computational capabilities and versatility, positioning itself as a fundamental component in numerous practical applications. However, the robustness of the Transformer model, specifically its stability and reliability under various types of adversarial attacks, is of utmost importance for its practical applicability. Furthermore, it offers valuable insights for the design of more efficient and secure models. In contrast with conventional investigations into adversarial robustness, our study focuses on the analysis of Positional Embeddings (PEs), a crucial component that sets the Transformer model apart from previous model architectures. Theoretical analysis of PEs has been limited due to previous predominantly empirical design, which includes features such as sinusoidal or linear patterns, learned or fixed characteristics, and absolute or relative measurements. Our investigation delves deep into potential vulnerabilities within PEs. Initially, we develop a set of input infection techniques that can be universally applied to exploit vulnerabilities present in the Transformer architecture and its variants. In addition, we propose a novel adversarial attack that manipulates the model by providing it with incorrect positional information, enabling an evasion attack. Significantly, in contrast to previous attacks that were limited to a single task, our conducted experiments involving time-series analysis, natural language processing, and computer vision indicate that the susceptibility of PEs could be universal and transferable. This finding serves as a significant warning for future Transformer-based model design, urging researchers to consider potential security risks inherent in the model’s structure.

Why Are Positional Encodings Nonessential for Deep Autoregressive Transformers? Revisiting a Petroglyph

Learning positional encodings in transformers depends on initialization

The Impact of Positional Encoding on Length Generalization in Transformers

Beyond position: how rotary embeddings shape representations and memory in autoregressive transfomers

Latent Positional Information is in the Self-Attention Variance of Transformer Language Models Without Positional Embeddings

Algebraic Positional Encodings

Bridging Graph Position Encodings for Transformers with Weighted Graph-Walking Automata

Round and Round We Go! What makes Rotary Positional Encodings useful?

PoPE: Legendre Orthogonal Polynomials Based Position Encoding for Large Language Models

Positional Encoding Helps Recurrent Neural Networks Handle a Large Vocabulary

Rethinking Positional Encoding in Language Pre-training

σ-GPTs: A New Approach to Autoregressive Models

Comparing Graph Transformers via Positional Encodings

Non-autoregressive Transformer by Position Learning

What Do Position Embeddings Learn? An Empirical Study of Pre-Trained Language Model Positional Encoding

Your Transformer May Not be as Powerful as You Expect

Explore Better Relative Position Embeddings from Encoding Perspective for Transformer Models.

PNAT: Non-autoregressive Transformer by Position Learning

PE-Attack: on the Universal Positional Embedding Vulnerability in Transformer-based Models

Theoretical Analysis of Hierarchical Language Recognition and Generation by Transformers without Positional Encoding

Rethinking Addressing in Language Models via Contexualized Equivariant Positional Encoding