Improved Fast Rerouting Using Postprocessing
Summary (4 min read)
1 INTRODUCTION
- Communication networks have become a critical infrastructure of today's digital society: enterprises that outsource their IT infrastructure to the cloud, as well as many applications related to health monitoring, power grid management, or disaster response [1], depend on the uninterrupted availability of such networks.
- When encountering a failure, a packet is rerouted onto the next arborescence according to some pre-defined order.
- This paper presents an algorithmic framework for postprocessing state-of-the-art FRR mechanisms based on network decompositions, to improve the resilience, performance, and flexibility of fast rerouting.
- The authors show that focusing on arc-disjoint arborescence network decompositions is not a limitation, by proving that arborescence-based decompositions are as good as any deterministic local failover method.
2 IMPOSSIBILITY OF BEATING ARBORESCENCES
- The authors first motivate their focus on failover algorithms based on arborescence network decompositions, showing that this approach provides not only high resilience but also competitive route quality (in terms of path lengths).
- The additive stretch of the routing scheme is then the maximum stretch along all failover routes, i.e., from all v to t.
- The authors start with some definitions for arborescence-based re-routing.
- For those nodes, there is no shorter path to the destination after such a failure, and hence from a competitive point of view, their failover route is optimal.
- To this end, the authors will show that there are k-connected k-regular graphs where every deterministic local algorithm has to take large detours, even though short routes are available.
3 THE POSTPROCESSING FRAMEWORK
- This section presents their algorithmic framework to postprocess arborescence-based network decompositions for improved resilience and performance.
- The authors consider two classes of objectives in this paper and present two examples each.
- Of the above correctness conditions, (1), (4), and (5) are always satisfied, while (2) and (3) are irrelevant.
- Based on this arc-swap operation, the idea of their algorithmic framework is then to swap arcs only if it improves a certain objective function, see Algorithm 2.
- When exactly two arcs e, e′ are swapped in a given valid arborescence decomposition, both must be outgoing arcs of the same node v; otherwise at least one arborescence becomes disconnected, i.e., the decomposition is invalid.
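The arc-swap operation can be illustrated with a minimal sketch on a toy 3-connected graph (K4), assuming arborescences are stored as per-node next-hop maps toward a common root. The helper names (`is_valid`, `swap`) and the example decomposition are illustrative, not taken from the paper's code; the full framework would additionally accept a swap only if it improves the chosen objective function.

```python
# Sketch of the arc-swap operation: exchange node v's outgoing arcs between
# two arborescences and keep the result only if both trees remain valid.
ROOT = 0

def is_valid(tree):
    """Every non-root node must reach the root without cycles."""
    for v in tree:
        seen, cur = set(), v
        while cur != ROOT:
            if cur in seen:
                return False          # cycle: invalid arborescence
            seen.add(cur)
            cur = tree[cur]           # follow the unique outgoing arc
    return True

def swap(trees, i, j, v):
    """Exchange v's outgoing arcs between trees i < j; reject if either breaks."""
    ti, tj = dict(trees[i]), dict(trees[j])
    ti[v], tj[v] = trees[j][v], trees[i][v]
    if is_valid(ti) and is_valid(tj):
        return trees[:i] + [ti] + trees[i + 1:j] + [tj] + trees[j + 1:]
    return trees                      # swap would disconnect a tree

# Three arc-disjoint arborescences of K4 toward root 0 (next-hop per node).
trees = [{1: 0, 2: 1, 3: 2}, {1: 2, 2: 0, 3: 1}, {1: 3, 2: 3, 3: 0}]
swapped = swap(trees, 0, 1, 3)        # swap node 3's arcs between T1 and T2
print(swapped[0], swapped[1])
```

Note that arc-disjointness is preserved automatically, since the two arcs merely exchange trees.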
4 USE CASES AND EVALUATION
- The authors' framework for postprocessing a decomposition can be configured with different objective functions, depending on the specific needs.
- In the following, the authors discuss and evaluate different use cases, namely two traffic scenario optimization use cases (for stretch/load) and two pure network decomposition optimizations (SRLG and independent paths).
- For the experimental evaluation, the authors generate 100 instances of undirected (bi-directional) 5-regular random graphs on 100 nodes, using the NetworkX library implementation of Steger and Wormald's algorithm [16].
- The authors then compare the unoptimized and optimized arborescences by failing a fraction of the network links picked at random, and simulate a circular arborescence routing process on the resulting infrastructure.
- In the latter case, they continue on the next available arborescence, i.e., if a packet has used arborescence Ti up to the failed link, it will then follow arborescence Ti+1 provided that the corresponding outgoing link is available, or try arborescences Ti+2, . . . otherwise.
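The circular routing process described above can be sketched as follows, again on K4 with three hand-built arc-disjoint arborescences toward root 0. The setup is assumed for illustration; a failed undirected link removes both of its directed arcs, and a crude loop guard stands in for the scheme's real termination argument.

```python
# Minimal simulation of circular arborescence routing: follow tree i and,
# upon hitting a failed outgoing link, retry with the next tree at the
# current node (T_{i+1}, T_{i+2}, ...).
ROOT = 0
TREES = [{1: 0, 2: 1, 3: 2}, {1: 2, 2: 0, 3: 1}, {1: 3, 2: 3, 3: 0}]

def route(src, failed_links, trees=TREES):
    """Return the hop count of the failover route, or None if routing loops."""
    k = len(trees)
    node, i, hops, switches = src, 0, 0, 0
    while node != ROOT:
        nxt = trees[i % k][node]
        if frozenset((node, nxt)) in failed_links:
            i += 1                    # link failed: switch to the next tree
            switches += 1
            if switches > 4 * k:      # crude loop guard for the sketch
                return None
            continue
        node, hops = nxt, hops + 1
    return hops

failed = {frozenset((1, 0))}
print(route(1, failed))               # T1's arc 1->0 failed; T2 routes 1->2->0
```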
4.1 Impact of the Original Network Decomposition
- The authors first study the impact of the network arborescence decomposition algorithm (that is, the input of the optimization process) on the optimization efficiency, before analyzing the optimization scenario in more detail.
- Both of them are described next in more detail.
- Hence, as each of the O(|E|) arcs might get tested O(k) times, the construction finishes in O(|E|² k²) time.
- The greedy decomposition is analogous to the random decomposition and is used for the experimental evaluation in [17].
- First, one can observe that the Random arborescence decomposition (top) performs worse than the Greedy arborescence decomposition before optimization: for instance, facing x = 20 random link failures, the median stretch is 11 for Random and only 5 for Greedy, and 10% of the samples have a stretch above 22 for Random but only above 9 for Greedy.
4.2 Optimization Use Cases
- A first fundamental objective is to ensure that failover routes are short.
- Given a subset of nodes that are deemed crucial and need to send packets to some destination node (the root of the arborescence), as well as a set of links highly susceptible to failures, the packets should reach the destination with short detours even if all or a subset of these links fail.
- The next two metrics exhibit a mirrored trend compared to the stretch figure: optimizing load efficiently reduces the load in both the median and the 10% worst cases.
- When the number of SRLG links increases, the algorithm manages to put proportionally fewer such links in the last arborescences.
- Thus, paths are already independent with high probability (949/1000 on average), though this quantity varies considerably across networks (high dispersion of values).
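The load objective can be illustrated with a small sketch on the first K4 arborescence from before: every non-root node sends one unit of flow to the root along its tree route, and the load of an arc is the number of flows crossing it. The traffic model here is an assumption for illustration, not necessarily the paper's exact one.

```python
# Compute per-arc loads of an arborescence (next-hop map toward root 0)
# under unit all-to-root traffic; the objective would be the maximum load.
from collections import Counter

ROOT = 0
T1 = {1: 0, 2: 1, 3: 2}               # next-hop per node, toward root 0

def arc_loads(tree):
    loads = Counter()
    for src in tree:                  # one unit of flow from every non-root node
        cur = src
        while cur != ROOT:
            nxt = tree[cur]
            loads[(cur, nxt)] += 1    # each traversed arc carries this flow
            cur = nxt
    return loads

loads = arc_loads(T1)
print(loads[(1, 0)], max(loads.values()))   # the arc into the root carries all 3 flows
```

Under the swap framework, a candidate arc swap would then be accepted only if it lowers this maximum arc load.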
4.3 Runtime Analysis
- The authors now turn their attention to the runtime of their optimization framework.
- The single-threaded code is executed on a 24-core Intel Xeon E5-2620 platform with 32 GB of memory.
- Figure 9 presents the distribution of those results.
- It shows that optimizing stretch or load on an 80-node topology takes on average around 750 seconds.
- Quite surprisingly, connectivity only has a slight impact on runtime.
4.4 Optimizing Network Decomposition Heuristics
- So far in this section, the authors evaluated their postprocessing framework on network decomposition algorithms that always yield a valid output.
- Recent work [14] also proposed a heuristic called Bonsai that attempts to generate arborescences of small depth, with no guarantee that a valid output is produced.
- This is in contrast to the random and greedy schemes, which build arborescences sequentially.
- Even though the Bonsai round-robin scheme outperforms the greedy and random schemes regarding stretch quality in evaluations in [14], it has the downside that it might not produce a valid decomposition.
4.5 Experiments on Real World Graphs
- To complement their experiments on synthetic graphs, the authors also ran them on well-connected cores of network topologies, taken from the Topology Zoo data set [18].
- The authors trim the Topology Zoo graphs such that only the well-connected cores remain, as follows.
- Next, the authors replace each node of degree 3 with three edges between its three neighbors.
- The results of the experiments are very similar to the results on synthetic graphs.
- In all cases, the optimizations are computed quickly and yield improvements in the same percentage range as the authors have observed on synthetic graphs.
5 AN EXAMPLE ILP MODEL FOR THE CIRCULAR ROUTING SCHEME
- The existence of a valid circular routing scheme based on k arc-disjoint spanning arborescences in a given network graph containing a known set of failed links can also be analyzed with the aid of Integer Linear Programming (ILP) tools.
- To illustrate one of the possible approaches, the authors formulate an example mathematical model of the corresponding ILP optimization problem for path lengths and stretch below.
- The remaining terms in Formula (2) guarantee that the corresponding binary variables are set to 0, unless the positive value is required to satisfy the constraints.
- Then, the authors eliminate the forbidden combinations of used arborescences, which is enforced by the following groups of constraints: (16: Non-consecutive trees A) and (19: Prohibited rerouting B).
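A minimal sketch of the ILP's variable layout can clarify constraint group (1: Arc in one tree): binary variables x[arc, tree] indicate which arborescence an arc belongs to, and each arc may be assigned to at most one of the k trees. The encoding below is an assumption for illustration (no solver involved); it simply checks that the K4 decomposition used earlier is a feasible point for this constraint group.

```python
# Encode the K4 decomposition as binary ILP variables x[(arc, tree)] and
# verify constraint (1): each arc belongs to at most one arborescence.
from itertools import product

K = 3
ROOT = 0
TREES = [{1: 0, 2: 1, 3: 2}, {1: 2, 2: 0, 3: 1}, {1: 3, 2: 3, 3: 0}]
ARCS = [(u, v) for u, v in product(range(4), repeat=2) if u != v]

# Binary assignment induced by the decomposition (root 0 has no outgoing arc).
x = {(a, i): int(a[0] != ROOT and TREES[i].get(a[0]) == a[1])
     for a in ARCS for i in range(K)}

# Constraint group (1): sum over trees of x[(a, i)] is at most 1 for every arc a.
ok = all(sum(x[(a, i)] for i in range(K)) <= 1 for a in ARCS)
print(ok)
```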
7 CONCLUSION
- This paper was motivated by the computational challenges involved in computing network decompositions which do not only provide basic connectivity but also account for the quality of routes after failures.
- The authors proposed and evaluated a simple solution which improves an arbitrary network decomposition, using fast postprocessing, in terms of basic traffic engineering metrics such as route length and load.
- Furthermore, the authors showed that their framework can also be used to improve resiliency for shared risk link groups: an important extension in practice.
- Lastly, in order to guarantee reproducibility and facilitate other researchers to build upon their algorithms, their code is publicly available at https://gitlab.cs.univie.ac.at/ctpapers/fast-failover.
Frequently Asked Questions (14)
Q2. What have the authors stated for future works in "Improved fast rerouting using postprocessing" ?
The authors understand their work as the first step and believe that it opens several interesting avenues for future research. In particular, it will be interesting to study alternative postprocessing algorithms, and derive formal performance guarantees for them. It would also be interesting to study further use cases for their framework, beyond the ones given in this paper, e. g., for SRLGs combined with load and stretch.
Q3. What is the effect of optimizing for low load?
To achieve low load, some flows must take detours; hence, optimizing for low load generally leads to higher stretch, as the authors' subsequent experiments show.
Q4. What is the way to analyze arborescences?
The existence of a valid circular routing scheme based on k arc-disjoint spanning arborescences in a given network graph containing a known set of failed links can also be analyzed with the aid of Integer Linear Programming (ILP) tools.
Q5. What is the first group of constraints?
The first group of constraints (1: Arc in one tree) guarantees that each arc in the network graph belongs to at most one of k arc-disjoint spanning arborescences covering the graph.
Q6. What is the motivation behind this paper?
This paper was motivated by the computational challenges involved in computing network decompositions which do not only provide basic connectivity but also account for the quality of routes after failures.
Q7. What are the main drawbacks of static fast rerouting algorithms in the data plane?
The authors in this paper are interested in static fast rerouting algorithms in the data plane, which rely on precomputed failover rules and do not require packet header rewriting.
Q8. How many failures are there in Greedy arborescences?
Even under a high number of failures (e.g. 40), the median of routing failures is 0 for both optimized and unoptimized arborescences; only the 10% worst unoptimized arborescences rise to a low 5% failure rate.
Q9. What is the effect of stretch optimization on the routing failure rate?
One can first observe (top) that this optimization has an impact on the routing failure rate: before optimizing, some packets do not reach their destination, but after swapping, the failure rate is 0.
Q10. What is the objective of swapping edges?
Figure 8 (right) presents the results of swapping edges with the objective of increasing the number of independent paths from all nodes in all arborescence pairs.
Q11. How many arborescences can be swapped before an improvement of the objective function is required?
The authors note that their algorithmic framework can also be generalized to swap multiple (i.e., more than two) arcs before an improvement of the objective function is required, even from multiple nodes at once.
Q12. What is the shortest path between the source and the root node?
To minimize the maximum path stretch among all user demands d in a network graph containing failed links (arcs belonging to the set F), the authors first introduce additional virtual unit flows to find the shortest paths between the source nodes sd and the root node r, and then determine the maximum path stretch based on the difference in length between the actually used paths (circular routing) and the reference paths (the shortest paths avoiding the failed links).
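The stretch computation described in this answer can be sketched with plain BFS: the stretch of a route is the length of the path actually taken by circular routing minus the length of the shortest path that avoids the failed links. The toy graph and the used route length below are assumptions for illustration.

```python
# Stretch = (hops of the route actually taken) - (shortest surviving path).
from collections import deque

def bfs_dist(adj, src, dst, failed):
    """Shortest hop count from src to dst, avoiding failed (undirected) links."""
    dist, queue = {src: 0}, deque([src])
    while queue:
        u = queue.popleft()
        if u == dst:
            return dist[u]
        for v in adj[u]:
            if v not in dist and frozenset((u, v)) not in failed:
                dist[v] = dist[u] + 1
                queue.append(v)
    return None                       # destination unreachable

# K4 adjacency; link (1, 0) failed, and the failover route taken was 1->2->0.
adj = {0: [1, 2, 3], 1: [0, 2, 3], 2: [0, 1, 3], 3: [0, 1, 2]}
failed = {frozenset((1, 0))}
used_len = 2                          # hops of the route actually taken
stretch = used_len - bfs_dist(adj, 1, 0, failed)
print(stretch)                        # the shortest surviving path 1->2->0 also has 2 hops
```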
Q13. What is the problem with rewriting packet headers?
The problem is particularly challenging in scenarios where packet headers cannot be used to carry meta-information about encountered failures: such header rewriting is often undesired and introduces overhead (related to header rewriting itself, but also in terms of additional rules required at the routers to process such information).
Q14. Why are there no additional information about failure scenarios and failover objectives?
In particular, the authors are motivated by the observation that in practice, additional information about failure scenarios and failover objectives may be available, e.g., about shared risk link groups [11], [12], [13] or about critical flows for which it is important to be routed along short paths, even after failures.