scispace - formally typeset
Search or ask a question
Journal ArticleDOI

A Survey of Fast-Recovery Mechanisms in Packet-Switched Networks

TL;DR: This survey presents a systematic, tutorial-like overview of packet-based fast-recovery mechanisms in the data plane, focusing on concepts but structured around different networking technologies, from traditional link-layer and IP-based mechanisms, over BGP and MPLS to emerging software-defined networks and programmable data planes.
Abstract: In order to meet their stringent dependability requirements, most modern packet-switched communication networks support fast-recovery mechanisms in the data plane. While reactions to failures in the data plane can be significantly faster compared to control plane mechanisms, implementing fast recovery in the data plane is challenging, and has recently received much attention in the literature. This survey presents a systematic, tutorial-like overview of packet-based fast-recovery mechanisms in the data plane, focusing on concepts but structured around different networking technologies, from traditional link-layer and IP-based mechanisms, over BGP and MPLS to emerging software-defined networks and programmable data planes. We examine the evolution of fast-recovery standards and mechanisms over time, and identify and discuss the fundamental principles and algorithms underlying different mechanisms. We then present a taxonomy of the state of the art, summarize the main lessons learned, and propose a few concrete future directions.

Content maybe subject to copyright    Report

Citations
More filters
06 Mar 2006
TL;DR: In this paper, the authors specify additional information that can be inserted in IS-IS LSPs to convey link capabilities that may be useful in certain application cases, such as local protection, provided by a U-turn alternate, in the event of a node failure and/or node reconverging onto a new topology.
Abstract: This document specifies additional information that can inserted in IS-IS LSPs to convey link capabilities that may be useful in certain applications. In particular, an IS may convey that zero or more of its links are explicit marked and/or implicit U-turn recipient capable, which may be described as capable of identifying traffic as U-turn traffic and redirecting the traffic to a suitable alternate. The immediate applicability for these two link capabilities is in support of local protection, provided by a U-turn alternate, in the event of a link and/or node failure while the IS-IS area is reconverging onto a new topology.

70 citations

Journal ArticleDOI
TL;DR: The state-of-the-art literature on post-disaster wireless communication networks is reviewed, and insights for the future establishment of such networks are provided and several promising research directions are presented.
Abstract: The number of disasters has increased over the past decade where these calamities significantly affect the functionality of communication networks. In the context of 6G, airborne and spaceborne networks offer hope in disaster recovery to serve the underserved and to be resilient in calamities. Therefore, our paper reviews the state-of-the-art literature on post-disaster wireless communication networks and provides insights for the future establishment of such networks. In particular, we first give an overview of the works investigating the general procedures and strategies for facing any large-scale disaster. Then, we present technological solutions for post-disaster communications, such as the recovery of the terrestrial infrastructure, installing aerial networks, and using spaceborne networks. Afterwards, we shed light on the technological aspects of post-disaster networks, primarily the physical and networking issues. We present the literature on channel modeling, coverage and capacity, radio resource management, localization, and energy efficiency in the physical layer part, and discuss the integrated space-air-ground architectures, routing, delay-tolerant/software-defined networks, and edge computing in the networking layer part. This paper also includes interesting simulation results which can provide practical guidelines about the deployment of ad hoc network architectures in emergency scenarios. Finally, we present several promising research directions, namely backhauling, cache-enabled and intelligent reflective surface-enabled networks, placement optimization of aerial base stations (ABSs), and the mobility-related aspects that come into play when deploying aerial networks, such as planning their trajectories and the consequent handovers (HOs).

6 citations

Journal ArticleDOI
TL;DR: In this article , the authors review the state-of-the-art literature on post-disaster wireless communication networks and provide insights for the future establishment of such networks and present technological solutions for postdisaster communications, such as the recovery of the terrestrial infrastructure, installing aerial networks, and using spaceborne networks.
Abstract: The number of disasters has increased over the past decade where these calamities significantly affect the functionality of communication networks. In the context of 6G, airborne and spaceborne networks offer hope in disaster recovery to serve the underserved and to be resilient in calamities. Therefore, our paper reviews the state-of-the-art literature on post-disaster wireless communication networks and provides insights for the future establishment of such networks. In particular, we first give an overview of the works investigating the general procedures and strategies for facing any large-scale disaster. Then, we present technological solutions for post-disaster communications, such as the recovery of the terrestrial infrastructure, installing aerial networks, and using spaceborne networks. Afterwards, we shed light on the technological aspects of post-disaster networks, primarily the physical and networking issues. We present the literature on channel modeling, coverage and capacity, radio resource management, localization, and energy efficiency in the physical layer part, and discuss the integrated space-air-ground architectures, routing, delay-tolerant/software-defined networks, and edge computing in the networking layer part. This paper also includes interesting simulation results which can provide practical guidelines about the deployment of ad hoc network architectures in emergency scenarios. Finally, we present several promising research directions, namely backhauling, cache-enabled and intelligent reflective surface-enabled networks, placement optimization of aerial base stations (ABSs), and the mobility-related aspects that come into play when deploying aerial networks, such as planning their trajectories and the consequent handovers (HOs).

5 citations

Journal ArticleDOI
TL;DR: In this article , a hybrid fast path recovery algorithm (HFPR-A) is proposed based on the principle of simplicity, where in some cases, the shortest path or approximate shortest path between PMU and PDC can be recovered by adding only one edge to the original forwarding tree; while in other cases the shortest paths can be recover with
Abstract: In the wide area measurement system (WAMS), the end-to-end transmission delay between the phasor measurement unit (PMU) and phasor data concentrator (PDC) is strictly constrained for real-time monitoring and protection applications. When a communication link failure happens, fast path recovery is required to reduce the impact of measurement losses. In this work, the promising software-defined network (SDN) technique is leveraged to compute the re-routing path in a global view upon a single link failure. More specifically, a hybrid fast path recovery algorithm (HFPR-A) is proposed based on the principle of simplicity: in some cases, the shortest path or approximate shortest path between PMU and PDC can be recovered by adding only one edge to the original forwarding tree; while in the other cases, the shortest paths can be recovered with lower computational complexity than the traditional Dijkstra’s algorithm. The proposed HFPR-A is implemented on the Ryu + Mininet testbed, and the simulation results on different IEEE benchmark test power systems show that the proposed HFPR-A could find shorter re-routing paths than the existing methods with a low-enough response time.

4 citations

Journal ArticleDOI
TL;DR: In this article , the authors reviewed 188 papers from 2010 to 2021, selecting 70 articles that are highly relevant to our work and collected a large amount of evidence that will assist the industry and academic researchers in networking to address current research gaps in failure recovery solutions for the SDN data plane.

3 citations

References
More filters
Journal ArticleDOI
31 Mar 2008
TL;DR: This whitepaper proposes OpenFlow: a way for researchers to run experimental protocols in the networks they use every day, based on an Ethernet switch, with an internal flow-table, and a standardized interface to add and remove flow entries.
Abstract: This whitepaper proposes OpenFlow: a way for researchers to run experimental protocols in the networks they use every day. OpenFlow is based on an Ethernet switch, with an internal flow-table, and a standardized interface to add and remove flow entries. Our goal is to encourage networking vendors to add OpenFlow to their switch products for deployment in college campus backbones and wiring closets. We believe that OpenFlow is a pragmatic compromise: on one hand, it allows researchers to run experiments on heterogeneous switches in a uniform way at line-rate and with high port-density; while on the other hand, vendors do not need to expose the internal workings of their switches. In addition to allowing researchers to evaluate their ideas in real-world traffic settings, OpenFlow could serve as a useful campus component in proposed large-scale testbeds like GENI. Two buildings at Stanford University will soon run OpenFlow networks, using commercial Ethernet switches and routers. We will work to encourage deployment at other schools; and We encourage you to consider deploying OpenFlow in your university network too

9,138 citations


"A Survey of Fast-Recovery Mechanism..." refers methods in this paper

  • ...For example, many data centers use Equal Cost MultiPath (ECMP) [22] (a data plane algorithm that provides automatic failover to another shortest path), WAN networks leverage IP Fast Reroute [23]–[25] or MPLS Fast Reroute [26] to deal with failures on the data plane, SDNs provide FRR functionality in terms of OpenFlow fast-failover groups [27], and BGP relies on BGP-PIC [28] for quickly rerouting flows, to just name a few....

    [...]

  • ...FatTire’s compiler targets the OpenFlow fast-failover mechanism and facilitates simple reasoning about network programs even in the presence of failures....

    [...]

  • ...We then discuss the two main generations of programmable specifications, i.e., OpenFlow and the most recent P4....

    [...]

  • ...27 illustrates the OpenFlow model and its FRR mechanism: a controller (e.g., Floodlight) can populate the forwarding tables of the different switches from a logically centralized perspective....

    [...]

  • ...Programmable data planes can also be used to implement fast-recovery concepts proposed in the context of SDN, but requiring additional features currently not available on OpenFlow switch hardware....

    [...]

Journal ArticleDOI
TL;DR: The aim is to explicate a set of general concepts, of relevance across a wide range of situations and, therefore, helping communication and cooperation among a number of scientific and technical communities, including ones that are concentrating on particular types of system, of system failures, or of causes of systems failures.
Abstract: This paper gives the main definitions relating to dependability, a generic concept including a special case of such attributes as reliability, availability, safety, integrity, maintainability, etc. Security brings in concerns for confidentiality, in addition to availability and integrity. Basic definitions are given first. They are then commented upon, and supplemented by additional definitions, which address the threats to dependability and security (faults, errors, failures), their attributes, and the means for their achievement (fault prevention, fault tolerance, fault removal, fault forecasting). The aim is to explicate a set of general concepts, of relevance across a wide range of situations and, therefore, helping communication and cooperation among a number of scientific and technical communities, including ones that are concentrating on particular types of system, of system failures, or of causes of system failures.

4,695 citations


"A Survey of Fast-Recovery Mechanism..." refers background in this paper

  • ..., by availability, and integrity) as well as individual features including authorisability, auditability, confidentiality, and nonrepudiability [65]....

    [...]

01 Jan 2007
TL;DR: In this paper, the main definitions relating to dependability, a generic concept including a special case of such attributes as reliability, availability, safety, integrity, maintainability, etc.
Abstract: This paper gives the main definitions relating to dependability, a generic concept including a special case of such attributes as reliability, availability, safety, integrity, maintainability, etc. Security brings in concerns for confidentiality, in addition to availability and integrity. Basic definitions are given first. They are then commented upon, and supplemented by additional definitions, which address the threats to dependability and security (faults, errors, failures), their attributes, and the means for their achievement (fault prevention, fault tolerance, fault removal, fault forecasting). The aim is to explicate a set of general concepts, of relevance across a wide range of situations and, therefore, helping communication and cooperation among a number of scientific and technical communities, including ones that are concentrating on particular types of system, of system failures, or of causes of system failures.

4,335 citations

01 Jan 2001
TL;DR: This document specifies the architecture for Multiprotocol Label Switching (MPLS).
Abstract: This document specifies the architecture for Multiprotocol Label Switching (MPLS). [STANDARDS-TRACK]

3,354 citations


"A Survey of Fast-Recovery Mechanism..." refers background in this paper

  • ...The architecture of Multiprotocol Label Switching (MPLS), introduced in [116], relies on Label Switching Routers (LSRs) capable of forwarding packets along Label Switched Paths (LSPs) based on additional labels carried in a packet header....

    [...]

  • ...• Stack-Label Matching: Can messages be forwarded based on the label currently occupying the top position in a stack embedded in the message header? Stacklabel matching enables flexible forwarding along preestablished paths in the network, without performing additional routing table lookups based on the values of the primary fields describing the source and the destination of the message [26], [116], [117]....

    [...]

01 Jul 1994
TL;DR: This document, together with its companion document, "Application of the Border Gateway Protocol in the Internet", define an inter- autonomous system routing protocol for the Internet.
Abstract: This document, together with its companion document, "Application of the Border Gateway Protocol in the Internet", define an inter- autonomous system routing protocol for the Internet.

2,832 citations


"A Survey of Fast-Recovery Mechanism..." refers background in this paper

  • ...This in turn makes it challenging to assure high availability of network services, since routing protocols typically used in IP networks (such as Border Gateway Protocol — BGP [46] or Open Shortest Path First – OSPF [10]) are characterized by a slow process of a post-failure routing convergence which can even take tens of seconds [42], [47]....

    [...]