Abstract:Privacy protection in computer communication is gaining attention because plaintext transmission without encryption can be eavesdropped on and intercepted. Accordingly, the use of encrypted communication protocols is on the rise, along with the number of cyberattacks exploiting them. Decryption is essential for preventing attacks, but it risks privacy infringement and incurs additional costs. Network fingerprinting techniques are among the best alternatives, but existing techniques are based on information from the TCP/IP stack. They are expected to be less effective because cloud-based and software-defined networks have ambiguous boundaries, and network configurations not dependent on existing IP address schemes increase. Herein, we investigate and analyze the Transport Layer Security (TLS) fingerprinting technique, a technology that can analyze and classify encrypted traffic without decryption while addressing the problems of existing network fingerprinting techniques. Background knowledge and analysis information for each TLS fingerprinting technique is presented herein. We discuss the pros and cons of two groups of techniques, fingerprint collection and artificial intelligence (AI)-based. Regarding fingerprint collection techniques, separate discussions on handshake messages ClientHello/ServerHello, statistics of handshake state transitions, and client responses are provided. For AI-based techniques, discussions on statistical, time series, and graph techniques according to feature engineering are presented. In addition, we discuss hybrid and miscellaneous techniques that combine fingerprint collection with AI techniques. Based on these discussions, we identify the need for a step-by-step analysis and control study of cryptographic traffic to effectively use each technique and present a blueprint.

Positional-Unigram Byte Models for Generalized TLS Fingerprinting

TLS fingerprint for encrypted malicious traffic detection with attributed graph kernel

Active TLS Stack Fingerprinting: Characterizing TLS Server Deployments at Scale

A novel TLS-based Fingerprinting approach that combines feature expansion and similarity mapping

A survey of methods for encrypted network traffic fingerprinting

Machine learning interpretability meets TLS fingerprinting

Adaptive Webpage Fingerprinting from TLS Traces

UTF:Undertrained Tokens as Fingerprints A Novel Approach to LLM Identification

Hey, That's My Model! Introducing Chain & Hash, An LLM Fingerprinting Technique

Large-scale Wireless Fingerprints Prediction for Cellular Network Positioning.

Large Language Models as Carriers of Hidden Messages

RF Fingerprints Prediction for Cellular Network Positioning: A Subspace Identification Approach

Clid: Identifying TLS Clients With Unsupervised Learning on Domain Names

TLSsem: A TLS Security-Enhanced Mechanism Against MITM Attacks in Public WiFis.

Deciphering Malware's use of TLS (without Decryption)

Remote Timing Attacks on Efficient Language Model Inference

LLMmap: Fingerprinting For Large Language Models

Formatted Stateful Greybox Fuzzing of TLS Server

Differential Fuzz Testing of TLS Implementations Based on Multi-Armed Bandit Variant

A Fingerprint Enhancement and Second-Order Markov Chain Based Malicious Encrypted Traffic Identification Scheme

Resistance of Orthogonal Gaussian Fingerprints to Collusion Attacks