scispace - formally typeset
Proceedings ArticleDOI

Memory Aware Packet Matching Architecture for High-Speed Networks

Reads0
Chats0
TLDR
This paper proposes a novel parallel hardware architecture for hash-based exact match classification of multiple packets per clock cycle with reduced memory replication requirements and shows that the proposed approach can use memory very efficiently and scales exceptionally well with increased record capacities.
Abstract
Packet classification is a crucial operation for many different networking tasks ranging from switching or routing to monitoring and security devices like firewall or IDS. Generally, accelerated architectures implementing packet classification must be used to satisfy ever-growing demands of current high-speed networks. Furthermore, to keep up with the rising network throughputs, the accelerated architectures for FPGAs must be able to classify more than one packet in each clock cycle. This can be mainly achieved by utilization of multiple processing pipelines in parallel, what brings replication of FPGA logic and more importantly scarce on-chip memory resources. Therefore in this paper, we propose a novel parallel hardware architecture for hash-based exact match classification of multiple packets per clock cycle with reduced memory replication requirements. The basic idea is to leverage the fact that modern FPGAs offer hundreds of BlockRAM tiles that can be accessed (addressed) independently to maintain high throughput of matching even without fully replicated memory architecture. Our results show that the proposed approach can use memory very efficiently and scales exceptionally well with increased record capacities. For example, the designed architecture is able to achieve throughput of more than 2 Tbps (over 3 000 Mpps) with an effective capacity of more than 40 000 IPv4 flow records for the cost of only 366 BlockRAM tiles and around 57 000 LUTs.

read more

Citations
More filters
Proceedings ArticleDOI

Multi Buses: Theory and Practical Considerations of Data Bus Width Scaling in FPGAs

TL;DR: The key idea is to enable the processing of multiple transactions per clock cycle with very high and predictable effective throughput even in the worst-case, and to design FPGA cores for key operations in networking with sufficient throughputs for wire-speed packet processing.
References
More filters
Journal ArticleDOI

P4: programming protocol-independent packet processors

TL;DR: This paper proposes P4 as a strawman proposal for how OpenFlow should evolve in the future, and describes how to use P4 to configure a switch to add a new hierarchical label.
Proceedings ArticleDOI

High-speed policy-based packet forwarding using efficient multi-dimensional range matching

TL;DR: New packet classification schemes are presented that, with a worst-case and traffic-independent performance metric, can classify packets, by checking amongst a few thousand filtering rules, at rates of a million packets per second using range matches on more than 4 packet header fields.
Proceedings ArticleDOI

Fast and scalable layer four switching

TL;DR: Two new algorithms for solving the least cost matching filter problem at high speeds are described, based on a grid-of-tries construction and works optimally for processing filters consisting of two prefix fields using linear space.
Proceedings ArticleDOI

Packet classification using multidimensional cutting

TL;DR: This paper introduces a classification algorithm called phHyperCuts, which can provide an order of magnitude improvement over existing classification algorithms and can be fully pipelined to provide one classification result every packet arrival time, and also allows fast updates.
Proceedings ArticleDOI

A cloud-scale acceleration architecture

TL;DR: A new cloud architecture that uses reconfigurable logic to accelerate both network plane functions and applications, and is much more scalable than prior work which used secondary rack-scale networks for inter-FPGA communication.
Related Papers (5)