Abstract:More complicated computational tasks are posed to the network equipments, such as Deep packet inspection (DPI) for network security check and network coding to achieve efficient multicast, etc. These complicated applications need processors to process the whole packet payload, potentially causing low throughput and long latency due to the large access delay to external memories. The behind hint lies that we can get the packet-processor/thread pair binding information in advance from the front-end dispatching component before the packet will be actually processed by cores. This interesting observation enables us design a new architecture of memory access for packet processors instead of the traditional model. In this paper we explore to apply push model to packet processors. The push model makes the data being pushed into the local memory/on-chip L1 cache in an on-demand and fine granularity manner ahead of being asked by running instructions, making a core always feels getting its data from the local memory/L1 cache instead of fetching them from the external memory in pull model. In order to verify the effectiveness, we design and implement the push model with the Intel IXP2850, and then conduct experiments to show the performance of push model in the IXP2850 simulator compared with the pull model. Simulation results indicate that applying push model to packet processors could improve the system throughput and reduce the packet processing latency and reducing required number of hardware threads.

Architecture-Aware Session Lookup Design for Inline Deep Inspection on Network Processors

Fastpath-based VPN Gateway over Network Processor

Embedded network processor based parallel intrusion detection

Towards High-Performance Network Intrusion Prevention System on Multi-core Network Services Processor

Implementation of Network Processor-Based Content Filtering

A novel cross-layer framework for early-stage power delivery and architecture co-exploration.

Fast Path Session Creation on Network Processors

SANS: a scalable architecture for network intrusion prevention with stateful frontend.

Towards High-Performance Flow-Level Packet Processing on Multi-Core Network Processors

On the Extreme Parallelism Inside Next-Generation Network Processors

A Fast Multi-pattern Matching Algorithm for Deep Packet Inspection on a Network Processor

Adding Security to Network Via Network Processors

YACA: Yet Another Cluster-Based Architecture for Network Intrusion Prevention

Evaluating regular expression matching engines on network and general purpose processors

Adaptive Packet Classification Algorithm Based on Ixp2800 Network Processor

Experience on Applying Push Model to Packet Processors in High Performance Routers.

A Parallel Nids Pattern Matching Engine and Its Implementation on Network Processor

An Efficient Scheduling Mechanism With Flow-Based Packet Reordering In A High-Speed Network Processor

Network Processor-Based High Performance String Matching

Towards Efficient Security Policy Lookup on Many-Core Network Processing Platforms

A high performance ARP lookup system for gigabit ethernet