
Measuring Control Plane Latency in SDN-enabled Switches
Keqiang He, Junaid Khalid, Aaron Gember-Jacobson, Sourav Das, Chaithan Prakash, Aditya Akella, Li Erran Li*, Marina Thottan*
University of Wisconsin-Madison, *Bell Labs
ABSTRACT
Timely interaction between an SDN controller and switches is cru-
cial to many SDN applications—e.g., fast rerouting during link fail-
ure and fine-grained traffic engineering in data centers. However, it
is not well understood how the control plane in SDN switches im-
pacts these applications. To this end, we conduct a comprehensive
measurement study using four types of production SDN switches.
Our measurements show that control actions, such as rule instal-
lation, have surprisingly high latency, due to both software imple-
mentation inefficiencies and fundamental traits of switch hardware.
Categories and Subject Descriptors
C.2.0 [Computer-Communication Networks]: General; C.4 [Performance of Systems]: Metrics—performance measures
Keywords
Software-defined Networking (SDN); Latency; Measurement
1. INTRODUCTION
Software defined networking (SDN) advocates for the separation
of control and data planes in network devices, and provides a logi-
cally centralized platform to program data plane state [3, 14]. This
has opened the door to rich network control applications that can
adapt to changes in network topology or traffic patterns more flexi-
bly and more quickly than legacy control planes [2,6,7,9,10,13,16].
However, to optimally satisfy network objectives, many important
control applications require the ability to reprogram data plane state
at very fine time-scales. For instance, fine-grained data center traf-
fic engineering requires routes to be set up within a few hundred
milliseconds to leverage short-term traffic predictability [2]. Simi-
larly, setting up routes in cellular networks (when a device becomes
active, or during a handoff) must complete within 30-40ms to en-
sure users can interact with Web services in a timely fashion [10].
Timeliness is determined by: (1) the speed of control programs,
(2) the latency to/from the logically central controller, and (3) the
responsiveness of network switches in interacting with the controller—
specifically, in generating the necessary input messages for control
programs, and in modifying forwarding state as dictated by the
programs. Robust control software design and advances in distributed
controllers [12] have helped overcome the first two issues.
However, with the focus in current/upcoming generations of SDN
switches being on the flexibility benefits of SDN w.r.t. legacy
technology, the third issue has not gained much attention. Thus, it is
unknown whether SDN can provide sufficiently responsive control
to support the aforementioned applications.

Permission to make digital or hard copies of all or part of this work for
personal or classroom use is granted without fee provided that copies are not
made or distributed for profit or commercial advantage and that copies bear
this notice and the full citation on the first page. Copyrights for components
of this work owned by others than ACM must be honored. Abstracting with
credit is permitted. To copy otherwise, or republish, to post on servers or to
redistribute to lists, requires prior specific permission and/or a fee. Request
permissions from Permissions@acm.org.
SOSR 2015, June 17–18, 2015, Santa Clara, CA, USA.
Copyright 2015 ACM 978-1-4503-3451-8/15/06 ...$15.00
http://dx.doi.org/10.1145/2774993.2775069.
To this end, we present a thorough systematic exploration of la-
tencies in four types of production SDN switches from three dif-
ferent vendors—Broadcom, Intel, and IBM—using a variety of
workloads. We investigate the relationship between switch design
and observed latencies using greybox probes and feedback from
vendors. Key highlights from our measurements are as follows:
(1) We find that inbound latency, i.e., the latency involved in the
switch generating events (e.g., when a flow is seen for the first
time) can be high—8 ms per packet on average on Intel. The
delay is particularly high whenever the switch is simultaneously
processing forwarding rules received from the controller. (2) We
find that outbound latency, i.e., the latency involved in the switch
installing/modifying/deleting forwarding rules provided by control
applications, is also high—3ms and 30ms per rule for insertion and
modification, respectively, in Broadcom. The latency crucially de-
pends on the priority patterns of both the rules being inserted and
those already in a switch’s table. (3) We find significant differ-
ences in latency trends across switches with different chipsets and
firmware, pointing to different internal optimizations.
These observations highlight two important gaps in current switch
designs. First, some of our findings show that poor switch soft-
ware design contributes significantly to observed latencies (affirm-
ing [5,8, 17]). We believe near term work will address these issues;
our measurements with an early release of Broadcom’s OpenFlow
1.3 firmware exemplify this. More crucially, our measurements re-
veal latencies that appear to be fundamentally rooted in hardware
design: e.g., rules must be organized in switch hardware tables in
priority order, and simultaneous switch control actions must con-
tend for limited bus bandwidth between a switch’s CPU and ASIC.
Unless the hardware significantly changes—and our first-of-a-kind
in-depth measurement study may engender such changes—we be-
lieve these latencies will manifest even in next generation switches.
2. BACKGROUND
Instead of running a complex control plane on each switch, SDN
delegates network control to external applications running on a log-
ically central controller. Applications determine the routes traffic
should take, and they instruct the controller to update switches with
the appropriate forwarding state. These decisions may be based
on data packets that are received by switches and sent to the con-

[Figure omitted: schematic showing the switch ASIC (forwarding engine, hardware tables, lookup path, DMA over PCIe, switch fabric, 1G/10G ports) and the CPU board (memory, SDK, OpenFlow agent), the controller, the inbound steps I1–I3 on the packet_in path, and the outbound steps O1–O4 on the flow_mod path.]
Figure 1: Schematic of an OpenFlow switch. We also show the
factors contributing to inbound and outbound latency
troller. Such packet events and state update operations are enabled
by OpenFlow [14]—a standard API implemented by switches to fa-
cilitate communication with the controller. Although SDN moves
control plane logic from switches to a central controller, switches
must still perform several steps to generate packet events and up-
date forwarding state. We describe these steps below.
Packet Arrival. When a packet arrives, the switch ASIC first per-
forms a lookup in the switch’s hardware forwarding tables. If a
match is found, the packet is forwarded at line rate. Otherwise the
following steps occur (Figure 1): (I1) The ASIC sends the packet to
the switch’s CPU via the PCIe bus. (I2) An OS interrupt is raised, at
which point the ASIC SDK gets the packet and dispatches it to the
switch-side OpenFlow agent. (I3) The agent wakes up, processes
the packet, and sends to the controller a packet_in message con-
taining metadata and the first 128B of the packet. All three steps,
I1–I3, can impact the latency in generating a packet_in message.
We categorize this as inbound latency, since the controller receives
the message as input.
Forwarding Table Updates. The controller sends flow_mod mes-
sages to update a switch’s forwarding tables. A switch takes the
following steps to handle a flow_mod (Figure 1): (O1) The Open-
Flow agent running on the CPU parses the message. (O2) The agent
schedules the addition (or removal) of the forwarding rule in hard-
ware tables, typically TCAM. (O3) Depending on the nature of the
rule, the chip SDK may require existing rules in the tables to be
rearranged, e.g., to accommodate high priority rules. (O4) The rule
is inserted (or removed) in the hardware table. All four steps, O1–
O4, impact the total latency in executing a flow_mod action. We
categorize this as outbound latency, since the controller outputs a
flow_mod message.
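The inbound (I1–I3) and outbound (O1–O4) paths above can be sketched as a toy model. This is not a real switch API; all names, the dict-based match key, and the 128B packet_in payload handling are illustrative simplifications of the steps the section describes.

```python
# Toy model of the OpenFlow switch pipeline: hardware lookup on packet
# arrival with punt-to-controller on a miss, and flow_mod installation.
# All names are illustrative, not a real switch or controller API.

def handle_packet(hw_table, packet):
    """Packet arrival: hardware lookup; on a miss, build a packet_in."""
    key = (packet["dst_ip"],)          # simplified match key
    if key in hw_table:                # lookup hit: forward at line rate
        return ("forward", hw_table[key])
    # I1: ASIC sends the packet to the CPU via the PCIe bus
    # I2: an interrupt is raised; the SDK dispatches to the OF agent
    # I3: the agent builds a packet_in with metadata + first 128B
    return ("packet_in", {"metadata": {"in_port": packet["in_port"]},
                          "data": packet["raw"][:128]})

def handle_flow_mod(hw_table, flow_mod):
    """flow_mod handling: parse (O1), schedule (O2), rearrange if
    needed (O3), and write the rule into the hardware table (O4)."""
    key = (flow_mod["match_dst_ip"],)  # O1: parse the message
    # O2/O3: scheduling and TCAM rearrangement (modeled as free here)
    # are where most of the outbound latency measured below comes from
    hw_table[key] = flow_mod["out_port"]   # O4: install the rule
    return hw_table

table = {}
handle_flow_mod(table, {"match_dst_ip": "10.0.0.1", "out_port": 2})
hit = handle_packet(table, {"dst_ip": "10.0.0.1", "in_port": 1, "raw": b"x" * 200})
miss = handle_packet(table, {"dst_ip": "10.0.0.2", "in_port": 1, "raw": b"x" * 200})
```

The measurements that follow quantify how long the punt path (I1–I3) and the install path (O1–O4) actually take on production hardware.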
3. LATENCY MEASUREMENTS
In this section, we systematically measure in/outbound latencies
to understand what factors contribute to high latencies. We gen-
erate a variety of workloads to isolate specific factors, and we use
production switches from three vendors, running switch software
with support for OpenFlow 1.0 [14] or, if available, OpenFlow 1.3,
to highlight the generality of our observations and to understand
how software evolution impacts latencies.¹ Henceforth, we refer
to the four hardware and software combinations (Table 1) as Intel,
BCM-1.0, BCM-1.3, and IBM. To ensure we are experimenting in
the optimal regimes for each switch, we take into account factors
such as flow table capacity and support for packet_in.
3.1 Measurement Methodology
Figure 2 shows our measurement setup. The host has one 1Gbps
and two 10Gbps interfaces connected to the switch under test. The
eth0 interface is connected to the control port of the switch, and
¹When using OpenFlow 1.3 firmware, we only leverage features
also available in OpenFlow 1.0 for an apples-to-apples comparison.
Model              CPU    RAM   OF Ver.  Flow Table Size   Ifaces
Intel FM6000       2GHz   2GB   1.0      4096              40x10G + 4x40G
Broadcom 956846K   1GHz   1GB   1.0      896               14x10G + 4x40G
                                1.3      1792 (ACL tbl)
IBM G8264          ?      ?     1.0      750               48x10G + 4x40G
Table 1: Switch specifications
[Figure omitted: the host's eth0 carries the control channel to the OpenFlow switch; eth1 sends flows in; eth2 receives flows out.]
Figure 2: Measurement experiment setup
an SDN controller (POX for Intel, BCM-1.0, and IBM; RYU for
BCM-1.3) running on the host listens on this interface. The RTT
between switch and controller is negligible (0.1ms). We use the
controller to send a burst of OpenFlow flow_mod commands to the
switch. For Intel, BCM-1.0, and IBM, we install/modify/delete
rules in the single table supported by OpenFlow 1.0; for BCM-
1.3, we use the highest numbered table, which supports rules de-
fined over any L2, L3, or L4 header fields. The host’s eth1 and
eth2 interfaces are connected to data ports on the switch. We run
pktgen [15] in kernel space to generate traffic on eth1 at a rate of
600-1000Mbps using minimum Ethernet frame size.
Prior work notes that accurate execution times for OpenFlow
commands on commercial switches can only be observed in the
data plane [17]. Thus, we craft our experiments to ensure the la-
tency impact of various factors can be measured directly from the
data plane (at eth2 in Figure 2), with the exception of packet_in
generation latency. We run libpcap on our measurement host to ac-
curately timestamp the packet and rule processing events of each
flow. We first log the timestamps in memory, and when the exper-
imental run is complete, the results are dumped to disk and pro-
cessed. We use the timestamp of the first packet associated with
a particular flow as the finish time of the corresponding flow_mod
command; more details are provided later in this section.
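The first-packet bookkeeping described above can be sketched in a few lines. The record format (timestamp, flow id) is hypothetical; a real harness would derive flow ids from libpcap header fields.

```python
# Derive flow_mod finish times from timestamped data-plane packets:
# the first packet observed for a flow marks when its rule became
# active in hardware. The (timestamp, flow_id) record format is a
# stand-in for parsed libpcap captures.

def finish_times(packets):
    """packets: iterable of (timestamp_s, flow_id) in capture order.
    Returns {flow_id: timestamp of that flow's first packet}."""
    first = {}
    for ts, flow in packets:
        first.setdefault(flow, ts)   # keep only the earliest packet
    return first

log = [(0.0010, "f1"), (0.0012, "f1"), (0.0041, "f2"), (0.0043, "f2")]
t = finish_times(log)
```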
3.2 Dissecting Inbound Latency
To measure inbound latency, we empty the table at the switch,
and we generate traffic such that packet_in events are generated at
a certain rate (i.e., we create packets for new flows at a fixed rate).
To isolate the impact of packet_in processing from other message
processing, we perform two kinds of experiments: (1) the packet_in
will trigger corresponding flow_mod (insert simple OpenFlow rules
differing just in destination IP) and packet_out messages; (2) the
packet_in message is dropped silently by the controller.
We record the timestamp (t1) when each packet is transmitted on
the measurement host's eth1 interface (Figure 2). We also record
the timestamp (t2) when the host receives the corresponding packet_in
[Plots omitted: inbound delay (ms) vs. flow #; (a) with flow_mod/pkt_out; (b) w/o flow_mod/pkt_out.]
Figure 3: Inbound delay on Intel, flow arrival rate = 200/s

                          flow rate 100/s   flow rate 200/s
with flow_mod/pkt_out     15.7%             26.5%
w/o flow_mod/pkt_out      9.8%              14.4%
Table 2: CPU usage on Intel
message on eth0. The difference (t2 − t1) is the inbound latency.²
Representative results for an Intel switch are shown in Figure 3;
IBM has similar performance (5ms latency per packet_in on average).³
In the first experiment (Figure 3a), we see the inbound latency
is quite variable with a mean of 8.33ms, a median of 0.73ms,
and a standard deviation of 31.34ms. In the second experiment
(Figure 3b), the inbound delay is lower (mean of 1.72ms, median
of 0.67ms) and less variable (standard deviation of 6.09ms). We
also observe that inbound latency depends on the packet_in rate:
e.g. in the first experiment the mean is 3.32ms for 100 flows/s (not
shown) vs. 8.33ms for 200 flows/s (Figure 3a).
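The summary statistics above come straight from the (t1, t2) pairs. A minimal sketch, with made-up sample values chosen so that one long-tail sample drives the standard deviation above the mean, mirroring the heavy-tailed distribution reported for Figure 3a:

```python
# Summarize inbound latency samples (t2 - t1) as mean/median/stdev.
# The four sample pairs below are illustrative, not measured data;
# the single 40ms outlier mimics the long tail seen on Intel.
from statistics import mean, median, stdev

def inbound_stats(samples):
    """samples: list of (t1, t2) timestamp pairs in milliseconds."""
    d = [t2 - t1 for t1, t2 in samples]
    return mean(d), median(d), stdev(d)

pairs = [(0.0, 0.7), (1.0, 1.8), (2.0, 2.6), (3.0, 43.0)]
m, med, sd = inbound_stats(pairs)
```

Note how, as in the measured data, the median stays sub-millisecond while a few slow packet_in events inflate the mean and make the standard deviation several times larger than it.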
The only difference between the two experiments is that in the
former case the switch CPU must process flow_mod and packet_out
messages, and send forwarding entries and outbound packets across
the PCIe bus to the ASIC, in addition to generating packet_in mes-
sages. As such, we observe that the CPU usage is higher when the
switch is handling concurrent OpenFlow operations and generat-
ing more packet_in messages (Table 2). However, since the Intel
switch features a powerful CPU (Table 1), plenty of CPU capacity
remains. Our conversations with the switch vendor suggest that the
limited bus bandwidth between the ASIC and switch CPU is the
primary factor contributing to inbound latency.
3.3 Dissecting Outbound Delay
We now study the outbound latencies for three different flow_mod
operations: insertion, modification, and deletion. For each opera-
tion, we examine the latency impact of key factors, including table
occupancy and rule priority.
Before measuring outbound latency, we install a single default
low priority rule which instructs the switch to drop all traffic. We
then install a set of non-overlapping OpenFlow rules that output
traffic on the port connected to the eth2 interface of our measure-
ment host. For some experiments, we systematically vary the rule
priorities.
3.3.1 Insertion Latency
We first examine how different rule workloads impact insertion
latency. We insert a burst of B rules: r_1, ..., r_B. Let T(r_i) be
the time we observe the first packet matching r_i emerging from
the output port specified in the rule. We define per-rule insertion
latency as T(r_i) − T(r_{i-1}).
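Given the first-match times T(r_1)..T(r_B) recovered from the data plane, the per-rule latencies are just successive differences. A minimal sketch with illustrative integer timestamps (in ms):

```python
# Per-rule insertion latency as defined above: T(r_i) - T(r_{i-1}),
# computed from the first-match time of each rule in a burst.
# The timestamps below are illustrative, not measured values.

def per_rule_latency(T):
    """T: first-match times T(r_1)..T(r_B), in ms, in rule order."""
    return [b - a for a, b in zip(T, T[1:])]

# e.g. a burst of 4 rules whose first matching packets appeared at:
lat = per_rule_latency([10, 13, 16, 19])
```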
Rule Complexity. To understand the impact of rule complexity
(i.e., the number of header fields specified in a rule), we install
bursts of rules that specify either 2, 8, or 12 fields. In particular,
we specify destination IP and EtherType (others wildcarded) in the
2-field case; input port, EtherType, source and destination IPs, ToS,
protocol, and source and destination ports in the 8-field case; and
all supported header fields in the 12-field (exact match) case. We
use a burst size of 100 and all rules have the same priority.
We find that rule complexity does not impact insertion latency.
The mean per-rule insertion delay for 2-field, 8-field, and exact
match cases is 3.31ms, 3.44ms, and 3.26ms, respectively, for BCM-
1.0. Similarly, the mean per-rule insertion delay for Intel, IBM, and
BCM-1.3 is 1 ms irrespective of the number of fields. All exper-
iments that follow use rules with 2 fields.
²Our technique differs from [8], where the delay was captured from
the switch to the controller, which includes controller overhead.
³BCM-1.0 and BCM-1.3 do not support packet_in messages.
Table occupancy. To understand the impact of table occupancy,
we insert a burst of B rules into a switch that already has S rules
installed. All B + S rules have the same priority. We fix B and
vary S, ensuring B+S rules can be accommodated in each switch’s
hardware table.
We find that flow table occupancy does not impact insertion de-
lay if all rules have the same priority. Taking B = 400 as an exam-
ple, the mean per-rule insertion delay is 3.14ms, 1.09ms, 1.12ms,
and 1.11ms (standard deviation 2.14ms, 1.24ms, 1.53ms, and 0.18ms)
for BCM-1.0, BCM-1.3, IBM and Intel, respectively, regardless of
the value of S.
Rule priority. To understand the effect of rule priority on the in-
sertion operations, we conducted three different experiments each
covering different patterns of priorities. In each, we insert a burst
of B rules into an empty table (S = 0); we vary B. In the same
priority experiment, all rules have the same priority. In the increas-
ing and decreasing priority experiments, each rule has a different
priority and the rules are inserted in increasing/decreasing priority
order, respectively.
Representative results for same priority rules are shown in Fig-
ure 4a and 4b for B = 100 and B = 200, respectively; the switch
is BCM-1.0. For both burst sizes, the per-rule insertion delay is
similar, with medians of 3.12ms and 3.02ms, and standard devia-
tions of 1.70ms and 2.60ms for B = 100 and B = 200, respec-
tively. The same priority insertion delays on BCM-1.3, IBM, and
Intel are slightly lower, but still similar: mean per-rule insertion de-
lay is 1.09ms, 1.1ms, and 1.17ms, respectively, for B = 100. We
conclude that same priority rule insertion delay does not vary with
burst size.
In contrast, the per-rule insertion delay of increasing priority
rules increases linearly with the number of rules inserted for BCM-
1.0, BCM-1.3, and IBM. Figure 4c and 4d shows this effect for
B = 100 and B = 200, respectively, for BCM-1.0. Compared
with the same priority experiment, the average per-rule delay is
much larger: 9.47ms (17.66ms) vs. 3.12ms (3.02ms), for B = 100
(200). The results are similar for BCM-1.3 and IBM: the average
per-rule insertion delay is 7.75ms (16.81ms) and 10.14ms (18.63ms)
for B = 100 (200), respectively. We also observe the slope of the
latency increase is constant—for a given switch—regardless of B.
The increasing latency in BCM-1.0, BCM-1.3, and IBM stems
from the TCAM storing high priority rules at low (preferred) mem-
ory addresses. Each rule inserted in the increasing priority experi-
ments displaces all prior rules!
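This displacement behavior can be modeled with a list that keeps higher-priority rules at lower (preferred) addresses. The model is a deliberate simplification of the hardware described above: the count of shifted entries, not the insert itself, is what grows linearly and drives the latency trend in the increasing-priority experiments.

```python
# Model of a TCAM that stores higher-priority rules at lower
# (preferred) addresses, as on BCM-1.0, BCM-1.3, and IBM. insert()
# returns how many existing entries had to shift down one address;
# this count is the source of the linear per-rule latency growth.
# Hardware-level model only: it does not capture BCM-1.0's firmware
# reordering of decreasing-priority bursts discussed below.

def tcam_insert(tcam, priority):
    """tcam: list of priorities, highest first. Returns entries moved."""
    pos = 0
    while pos < len(tcam) and tcam[pos] >= priority:
        pos += 1                      # equal priorities append after
    moved = len(tcam) - pos           # everything below shifts down
    tcam.insert(pos, priority)
    return moved

tcam = []
incr = [tcam_insert(tcam, p) for p in range(1, 101)]   # increasing priority
tcam2 = []
same = [tcam_insert(tcam2, 5) for _ in range(100)]     # same priority
```

In this model an increasing-priority burst displaces every prior rule on each insert (0, 1, 2, ... moves), while a same-priority burst displaces nothing, matching the contrast between Figures 4a/4b and 4c/4d.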
Surprisingly, latency does not increase when increasing prior-
ity rules are inserted in Intel. As shown in Figure 5a, the median
per-rule insertion delay for Intel is 1.18ms (standard deviation of
1.08ms), even with B = 800! Results for other values of B are
similar. This shows that the Intel TCAM architecture is fundamen-
tally different from Broadcom and IBM. Rules are ordered in Intel’s
TCAM such that higher priority rules do not displace existing low
priority rules.
However, displacement does still occur in Intel. Figure 5b shows
per-rule insertion latencies for decreasing priority rules for B =
800. We see two effects: (1) the latencies alternate between two
modes at any given time, and (2) there is a step-function effect after
every 300 or so rules.
A likely explanation for the former is bus buffering. Since rule
insertion is part of the switch’s control path, it is not really opti-
mized for latency. The latter effect can be explained as follows: Ex-
amining the Intel switch architecture, we find that it has 24 slices,
A_1, ..., A_24, and each slice holds 300 flow entries. There exists a
consumption order (low-priority first) across all slices. Slice A_i
stores the i-th lowest priority rule group. If rules are inserted in de-

[Plots omitted: insertion delay (ms) vs. rule #; (a) burst size 100, same priority; (b) burst size 200, same priority; (c) burst size 100, incr. priority; (d) burst size 200, incr. priority.]
Figure 4: BCM-1.0 priority per-rule insert latency
[Plots omitted: insertion delay (ms) vs. rule #; (a) burst size 800, incr. priority; (b) burst size 800, decr. priority.]
Figure 5: Intel priority per-rule insert
creasing priority, A_1 is consumed first until it becomes full. When
the next low priority rule is inserted, this causes one rule to be
displaced from A_1 to A_2. This happens for each of the next 300
rules, after which cascaded displacements happen: A_1 → A_2 → A_3,
and so on. We confirmed this with Intel.
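The slice mechanism confirmed above reproduces the step function in Figure 5b. A minimal simulation, assuming 24 slices of 300 entries with lowest-priority rules kept in the lowest slice and the highest entry of an overflowing slice evicted upward (the eviction choice is our simplifying assumption):

```python
# Simulation of the Intel slice organization: 24 slices of 300
# entries, consumed lowest-priority-first, with cascaded displacement
# when a slice overflows. Bookkeeping is simplified; which entry a
# full slice evicts is an assumption, not vendor-confirmed detail.

SLICES, CAP = 24, 300

def insert_decreasing(slices, priority):
    """Insert a rule lower-priority than all existing ones.
    Returns how many entries were displaced into higher slices."""
    moves, carry, i = 0, priority, 0
    while True:
        s = slices[i]
        s.append(carry)
        s.sort()                      # keep ascending priority order
        if len(s) <= CAP:
            return moves
        carry = s.pop()               # evict the highest entry upward
        moves += 1
        i += 1

slices = [[] for _ in range(SLICES)]
# 700 rules inserted in strictly decreasing priority order
moves = [insert_decreasing(slices, p) for p in range(10000, 10000 - 700, -1)]
```

The displacement count is 0 for the first 300 rules, 1 for the next 300, then 2, and so on: every ~300 rules the per-insert work steps up, matching the step-function effect in Figure 5b.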
We observe different trends when inserting decreasing priority
rules in BCM-1.0, BCM-1.3, and Intel. With BCM-1.0, we find the
average per-rule insertion delay increases with burst size: 8.19ms
for B = 100 vs. 15.5ms for B = 200. Furthermore, we observe
that the burst of B rules is divided into several groups, and each
group is reordered and inserted in the TCAM in order of increasing
priority. This indicates that BCM-1.0 firmware reorders the rules
and prefers increasing priority insertion. In contrast, BCM-1.3’s
per-rule insertion delay for decreasing priority rules is similar to
same priority rule insertion: 1ms. Hence, the BCM-1.3 firmware
has been better optimized to handle decreasing priority rule inser-
tions. The same applies to Intel: per-rule insertion delay for de-
creasing priority rules is similar to same priority rule insertion:
1.1ms.
Priority and table occupancy combined effects. We now study
the combined impact of rule priority and table occupancy. We con-
duct two experiments: For the first experiment, the table starts with
S high priority rules, and we insert B low priority rules. For the
second experiment, the priorities are inverted. For both experi-
ments, we measure the total time to install all rules in the burst,
T(r_B) − T(r_1).
For BCM-1.0, BCM-1.3, and IBM, we expect that as long as
the same number of rules are displaced, the completion time for
different values of S should be the same. Indeed, from Figure 6a
(for BCM-1.0), we see that even with 400 high priority rules in the
table, the insertion delay for the first experiment is no different from
the setting with only 100 high priority rules in the table. In contrast,
in Figure 6b, newly inserted high priority rules will displace low
priority rules in the table, so when S = 400 the completion time
is about 3x higher than S = 100. For IBM (not shown), inserting
300 high priority rules into a table with 400 low priority rules takes
more than 20 seconds.
For Intel, the results are similar to same priority rule insertion.
This indicates that Intel uses different TCAM organization schemes
[Plots omitted: avg completion time (ms) vs. burst size, for initial table occupancy S = 100 and S = 400; (a) insert low priority rules into a table with high priority rules; (b) insert high priority rules into a table with low priority rules.]
Figure 6: Overall completion time on BCM-1.0. Initial table oc-
cupancy is S high (low) priority rules; insert a burst of low (high)
priority rules. Averaged over 5 runs.
than the Broadcom and IBM switches.
Summary and root causes. We observe that: (1) rule complex-
ity does not affect insertion delay; (2) same priority insertions in
BCM-1.0, BCM-1.3, Intel and IBM are fast and not affected by
flow table occupancy; and (3) priority insertion patterns can affect
insertion delay very differently. For Intel, increasing priority in-
sertion is similar to same priority insertion, but decreasing priority
incurs much higher delay. For BCM-1.3 and IBM the behavior is
inverted: decreasing priority insertion is similar to same priority
insertion and increasing priority insertion incurs higher delay. For
BCM-1.0, insertions with different priority patterns are all much
higher than insertions with same priority.
Key root causes for observed latencies are: (1) how rules are
organized in the TCAM, and (2) the number of slices. Both of
these are intrinsically tied to switch hardware. Even in the best
case (Intel), per-rule insertion latency of 1ms is higher than what
native TCAM hardware can support (100M updates/s [1]). Thus,
in addition to the above two causes, there appears to be an intrinsic
switch software overhead contributing to all latencies.
3.3.2 Modification Latency
We now study modification operations. As before, we use bursts
of rules and a similar definition of latency.
Table occupancy. To study the impact of table occupancy, we pre-
insert S rules into a switch, all with the same priority. We then
modify one rule at a time by changing the rule’s output port, send-
ing modification requests back to back.
Per-rule modification delay for BCM-1.0 when S = 100 and
S = 200 are shown in Figure 7a and 7b, respectively. We see that
the per-rule delay is more than 30 ms for S = 100. When we dou-
ble the number of rules, S = 200, latency doubles as well. It grows
linearly with S (not shown). Note that this latency is much higher
than the corresponding insertion latency (3.12ms per rule) (§3.3.1).
IBM’s per-rule modification latency is also affected significantly
by the table occupancy— the per-rule modification latencies for
S = 100 and S = 200 are 18.77ms and 37.13ms, respectively.

[Plots omitted: modification delay (ms) vs. rule #; (a) 100 rules in table; (b) 200 rules in table.]
Figure 7: BCM-1.0 per-rule mod. latency, same priority
[Plots omitted: modification delay (ms) vs. rule #; (a) burst size 100, incr. priority; (b) burst size 100, decr. priority.]
Figure 8: BCM-1.0 priority per-rule modification latency
In contrast, Intel and BCM-1.3 have lower modification delay,
and it does not vary with table occupancy. For Intel (BCM-1.3)
the per-rule modification delay for both S = 100 and S = 200 is
around 1 ms (2ms) for all modified rules, similar to (2X more than)
same priority insertion delay.
Rule Priority. We conduct two experiments on each switch to
study the impact of rule priority. In each experiment, we insert
B rules into an empty table (S = 0). In the increasing priority
experiments, the rules in the table each have a unique priority, and
we send back-to-back modification requests for rules in increasing
priority order. We do the opposite in the decreasing priority exper-
iment. We vary B.
Figure 8a and 8b show the results for the increasing and decreas-
ing priority experiments, respectively, for B = 100 on BCM-1.0.
In both cases, we see: (1) the per-rule modification delay is similar
across the rules, with a median of 25.10ms and a standard devia-
tion of 6.74ms, and (2) the latencies are identical across the experi-
ments. We similarly observe that priority does not affect modifica-
tion delay in BCM-1.3, Intel and IBM (not shown).
Summary and root causes. We conclude that the per-rule modi-
fication latency on BCM-1.0 and IBM is impacted purely by table
occupancy, not by rule priority structure. For BCM-1.3 and Intel,
the per-rule modification delay is independent of rule priority, table
occupancy, and burst size; BCM-1.3’s per-rule modification delay
is 2X higher than insertion.
Conversations with Broadcom indicated that TCAM modifica-
tion should ideally be fast and independent of table size, so the
underlying cause appears to be less optimized switch software in
BCM-1.0. Indeed, our measurements with BCM-1.3 show that this
issue has (at least partly) been fixed.
3.3.3 Deletion Latency
We now study the latency of rule deletions. We again use bursts
of operations. T(r_i) denotes the time we stop observing packets
matching rule r_i from the intended port of the rule action. We
define deletion latency as T(r_i) − T(r_{i-1}).
Table Occupancy. We pre-insert S rules into a switch, all with the
same priority. We then delete one rule at a time, sending deletion
requests back-to-back. The results for BCM-1.0 at S = 100 and
S = 200 are shown in Figure 9a and 9b, respectively. We see that
per-rule deletion delay decreases as the table occupancy drops. We
see a similar trend for Intel (Figure 10a and 10b), BCM-1.3, and
IBM (not shown).

[Plots omitted: deletion delay (ms) vs. rule #; (a) 100 rules in table; (b) 200 rules in table.]
Figure 9: BCM-1.0 per-rule del. latency, same priority

[Plots omitted: deletion delay (ms) vs. rule #; (a) 100 rules in table; (b) 200 rules in table.]
Figure 10: Intel per-rule del. latency, same priority

[Plots omitted: deletion delay (ms) vs. rule #; (a) increasing priority; (b) decreasing priority.]
Figure 11: BCM-1.0 priority per-rule del. latency, B=100

[Plots omitted: deletion delay (ms) vs. rule #; (a) increasing priority; (b) decreasing priority.]
Figure 12: Intel priority per-rule del. latency, B=100
Rule Priorities. We start with B existing rules in the switch, and
delete one rule at a time in increasing and decreasing priority order.
For all switches (BCM-1.0 and Intel shown in Figure 11 and 12,
respectively) deletion is not affected by the priorities of rules in the
table or the order of deletion.
Root cause. Since deletion delay decreases with rule number in
all cases, we conclude that deletion is incurring TCAM reordering.
We also observe that processing rule timeouts at the switch does
not noticeably impact flow_mod operations. Given these two ob-
servations, we recommend allowing rules to time out rather than
explicitly deleting them, if possible.
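The table-compaction root cause above can be illustrated with the same list-based TCAM model. A sketch under the simplifying assumption that deleting an entry compacts everything stored after it by one slot (the direction of compaction is our assumption, not a measured detail):

```python
# Model consistent with the deletion root cause: removing a rule from
# a densely packed TCAM compacts the entries stored after it, so the
# per-delete work shrinks as the table drains. Illustrative only; the
# compaction direction is an assumption about the hardware.

def tcam_delete_first(tcam):
    """Delete the entry at the lowest address; every remaining entry
    shifts up one slot. Returns the number of entries moved."""
    tcam.pop(0)
    return len(tcam)                  # all remaining entries compacted

tcam = list(range(100))               # 100 same-priority rules
moves = [tcam_delete_first(tcam) for _ in range(100)]
```

The per-delete move count falls from 99 toward 0 as the table empties, matching the downward trend in Figures 9 and 10, and motivating the recommendation to prefer timeouts over explicit deletes.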
3.4 Implications

Frequently Asked Questions (14)
Q1. What are the contributions in "Measuring control plane latency in sdn-enabled switches" ?

To this end, the authors conduct a comprehensive measurement study using four types of production SDN switches. 

Their conversations with the switch vendor suggest that the limited bus bandwidth between the ASIC and switch CPU is the primary factor contributing to inbound latency. 

The authors find that the underlying causes are linked to software inefficiencies, as well as pathological interactions between switch hardware properties (shared resources and how forwarding rules are organized) and the control operation workload (the order of operations, and concurrent switch activities). 

Conversations with Broadcom indicated that TCAM modification should ideally be fast and independent of table size, so the underlying cause appears to be less optimized switch software in BCM-1.0. 

(2) The authors find that outbound latency, i.e., the latency involved in the switch installing/modifying/deleting forwarding rules provided by control applications, is also high—3ms and 30ms per rule for insertion and modification, respectively, in Broadcom. 

In [8], the authors of that study measured three commercial switches (HP ProCurve, Fulcrum, Quanta) and found that delay distributions were distinct, mainly due to variable control delays. 

The authors observe that a burst of B rules is divided into several groups, and each group is reordered and inserted in the TCAM in order of increasing priority. 

The authors conclude that the per-rule modification latency on BCM-1.0 and IBM is impacted purely by table occupancy, not by rule priority structure. 

The authors see two effects: (1) the latencies alternate between two modes at any given time, and (2) there is a step-function effect after every 300 or so rules. 

Dionysus [11] optimally schedules a set of rule updates while maintaining desirable consistency properties (e.g., no loops and no blackholes). 

Given that software will continue to bridge the control and data planes in SDN switches, the authors remain skeptical that latencies will ever reach what the hardware can natively support. 

Even in the best case (Intel), per-rule insertion latency of 1ms is higher than what native TCAM hardware can support (100M updates/s [1]). 

For Intel, BCM-1.0, and IBM, the authors install/modify/delete rules in the single table supported by OpenFlow 1.0; for BCM-1.3, the authors use the highest-numbered table, which supports rules defined over any L2, L3, or L4 header fields. 

In contrast, in Figure 6b, newly inserted high priority rules will displace low priority rules in the table, so when S = 400 the completion time is about 3x higher than S = 100.
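The displacement behavior described in that excerpt can be illustrated with a toy model of a priority-sorted TCAM (our simplification, not the paper's model): rules live in an array sorted by decreasing priority, and inserting above existing entries shifts every lower entry down one slot, each shift costing one hardware write.

```python
import bisect

def tcam_insert_cost(priorities):
    """Count entry moves in a naive priority-sorted TCAM.
    The table keeps rules in decreasing-priority order; inserting a
    rule above existing entries shifts everything below it down one
    slot, and each shift is one write."""
    table = []   # rule priorities, kept in decreasing order
    moves = 0
    for p in priorities:
        # position of the new rule in the decreasing-order table
        pos = bisect.bisect_left([-q for q in table], -p)
        moves += len(table) - pos   # entries displaced downward
        table.insert(pos, p)
    return moves
```

Under this model, installing N rules in decreasing priority order costs zero moves, while increasing order costs N(N-1)/2, which matches the direction of the slowdown the measurements report for high-priority inserts into an occupied table.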