Abstract: Financial market servers at exchanges maintain order books and provide real-time market data feeds to traders. Low-latency processing is in great demand in financial trading. Although software solutions offer the flexibility to express algorithms in high-level programming models and to recompile quickly, they are becoming increasingly uncompetitive because of their long and unpredictable response times. Field Programmable Gate Arrays (FPGAs) have become an established technology for achieving low and constant latency when processing streaming packets in hardware. However, maintaining order books on FPGAs involves organizing packets into gigabytes of structured data and running complicated routines (sort, insertion, deletion, etc.), which is extremely challenging for FPGA designs in terms of both design methodology and memory volume. Existing FPGA designs therefore often leave the post-processing part to the CPU, which largely cancels the latency gain of the network packet processing part. This paper proposes a CPU-FPGA hybrid list design that accelerates financial market servers so that they achieve microsecond-level latencies. The paper makes four main contributions. First, we design a CPU-FPGA hybrid list with two levels: a small cache list on the FPGA and a large master list on the CPU host. The two lists are kept sorted with different schemes: bitonic sort is applied to the cache list, while a balanced tree is used to maintain the master list. Second, to update the hybrid sorted list efficiently, we derive a complete set of low-latency routines, including insertion, deletion, selection, and sorting, each with a latency of only a few cycles. Third, we propose a non-blocking, on-demand synchronization strategy through which the cache list and the master list communicate with each other. Lastly, we integrate the hybrid list with other components, such as packet splitting, parsing, and processing, to form an industry-level financial market server. Our design is deployed in the environment of the China Financial Futures Exchange (CFFEX), where it has demonstrated its functionality and stability by running for 600+ hours with hundreds of millions of packets per day. Compared with the existing CPU-based solution at CFFEX, our system supports identical functionality while reducing the latency from 100+ microseconds to 2 microseconds, a speedup of about 50x.
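The two-level list described in the abstract can be illustrated with a small software model. The Python sketch below is only an illustration under stated assumptions, not the authors' FPGA design: the names `HybridList`, `bitonic_sort`, and `cache_size` are hypothetical, a sorted Python list maintained with `bisect` stands in for the balanced tree of the master list, and the cache and master lists are synchronized immediately rather than through the non-blocking, on-demand strategy the paper proposes.

```python
import bisect
import math

INF = math.inf  # padding value so the sorting network always sees a power-of-two length


def _bitonic_merge(values, ascending):
    """Merge a bitonic sequence (power-of-two length) into sorted order."""
    n = len(values)
    if n <= 1:
        return list(values)
    half = n // 2
    values = list(values)
    # One compare-and-swap stage; in hardware these comparisons run in parallel.
    for i in range(half):
        if (values[i] > values[i + half]) == ascending:
            values[i], values[i + half] = values[i + half], values[i]
    return (_bitonic_merge(values[:half], ascending)
            + _bitonic_merge(values[half:], ascending))


def bitonic_sort(values, ascending=True):
    """Sort a list whose length is a power of two with a bitonic network."""
    n = len(values)
    if n <= 1:
        return list(values)
    half = n // 2
    first = bitonic_sort(values[:half], True)     # ascending half
    second = bitonic_sort(values[half:], False)   # descending half
    return _bitonic_merge(first + second, ascending)


class HybridList:
    """Toy model: a small sorted cache (FPGA side) plus a large sorted master (CPU side)."""

    def __init__(self, cache_size=4):
        assert cache_size > 0 and cache_size & (cache_size - 1) == 0, "power of two"
        self.cache_size = cache_size
        self.cache = []   # the smallest keys, e.g. the best price levels
        self.master = []  # all remaining keys, kept sorted (stand-in for a balanced tree)

    def insert(self, key):
        # Keys that cannot enter a full cache go straight to the master list.
        if len(self.cache) == self.cache_size and key >= self.cache[-1]:
            bisect.insort(self.master, key)
            return
        padded = self.cache + [key]
        padded += [INF] * (2 * self.cache_size - len(padded))
        ordered = [k for k in bitonic_sort(padded) if k != INF]
        self.cache = ordered[:self.cache_size]
        for evicted in ordered[self.cache_size:]:  # overflow is pushed to the master list
            bisect.insort(self.master, evicted)

    def delete(self, key):
        if key in self.cache:
            self.cache.remove(key)
            if self.master:  # refill the cache from the head of the master list
                self.cache.append(self.master.pop(0))
            return True
        i = bisect.bisect_left(self.master, key)
        if i < len(self.master) and self.master[i] == key:
            self.master.pop(i)
            return True
        return False

    def best(self):
        return self.cache[0] if self.cache else None
```

In this model every key in the cache is no larger than any key in the master list, so the best entry is always available from the small cache, and the master list is touched only when an entry overflows from the cache or the cache needs a refill after a deletion.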

FPGA-Based Low-Latency Market Data Feed Handler

A Domain-Specific Accelerator for Ultralow Latency Market Data Distribution System

A Market Data Feeds Processing Accelerator Based on FPGA

Accelerating Financial Market Server Through Hybrid List Design (Abstract Only)

High speed video signal acquisition and processing system based on FPGA technique

A Hardware Structure for FAST Protocol Decoding Adapting to 40Gbps Bandwidth

FPGA-based Design of Real-time Video Processing Platform

Design and Implementation of High Speed Fixed-Point Fast Fourier Transform Processor

Design and Implementation of Hyper-Speed FFT Processor

The Hardware Measurement System for High-Speed Network Flow

FABLE-DTS: Hardware-Software Co-Design of a Fast and Stable Data Transmission System for FPGAs

NetStorageFPGA—A prototyping platform for building high-performance transmission and storage systems using Field Programmable Gate Array (FPGA) hardware

FASTeller: A Hardware Partial Aggregator for Accurate Flow Counting in Cloud Networks

AIOC: an All-in-One-Card Hardware Design for Financial Market Trading System

Fast Protocol Decoding in Parallel with FPGA Hardware

An FPGA-Based Interface for Recording High-Speed Data Stream

A Fast Approach for Generating Efficient Parsers on FPGAs

Astronomical Data Preprocessing Implementation Based on FPGA and Data Transformation Strategy for the FAST Telescope as a Giant CPS

An Improved FPGA Implementation of Sparse Fast Fourier Transform

The Design of a High Speed and Reliable Data Center Based on FPGA

High Speed Real-Time Signal Processing System