scispace - formally typeset
Search or ask a question

Showing papers on "Multipath routing published in 1992"


Proceedings ArticleDOI
01 Apr 1992
TL;DR: Simulations of partially adaptive and nonadaptive routing algorithms for 2D meshes and hypercubes show that which algorithm has the lowest latencies and highest sustainable throughput depends on the pattern of message traffic.
Abstract: We present a model for designing wormhole routing algorithms that are deadlock free, livelock free, minimal or nonminimal, and maximally adaptive. A unique feature of this model is that it is not based on adding physical or virtual channels to network topologies (though it can be applied to networks with extra channels). Instead, the model is based on analyzing the directions in which packets can turn in a network and the cycles that the turns can form. Prohibiting just enough turns to break all of the cycles produces routing algorithms that are deadlock free, livelock free, minimal or nonminimal, and maximally adaptive for the network. In this paper, we focus on the two most common network topologies for wormhole routing, n-dimensional mesh, just a quarter of the turns must be prohibited to prevent deadlock. The remaining three quarters of the turns permit partial adaptiveness in routing. Partially adaptive routing algorithms are described for 2D meshes, n-dimensional meshes, k-ary n-cubes, and hypercubes. Simulations of partially adaptive and nonadaptive routing algorithms for 2D meshes and hypercubes show that which algorithm has the lowest latencies and highest sustainable throughput depends on the pattern of message traffic. For nonuniform traffic, partially adaptive routing algorithms perform better than non-adaptive ones.

881 citations



Proceedings ArticleDOI
01 Apr 1992
TL;DR: This paper provides the simplest known support for deadlock-free adaptive routing in k-ary n-cubes of more than two dimensions (with k > 2) and restricts adaptivity reduces the hardware complexity, improving router speed or allowing additional performance-enhancing network features.
Abstract: Network throughput can be increased by allowing multipath, adaptive routing. Adaptive routing allows more freedom in the paths taken by messages, spreading load over physical channels more evenly. The flexibility of adaptive routing introduces new possibilities of deadlock. Previous deadlock avoidance schemes in k-ary n-cubes require an exponential number of virtual channels, independent of network size and dimension. Planar adaptive routing algorithms reduce the complexity of deadlock prevention by reducing the number of choices at each routing step. In the fault-free case, planar-adaptive networks are guaranteed to be deadlock-free. In the presence of network faults, the planar-adaptive router can be extended with misrouting to produce a working network which remains provably deadlock free and is provably livelock free. In addition, planar adaptive networks can simultaneously support both in-order and adaptive, out-of-order packet delivery.Planar-adaptive routing is of practical significance. It provides the simplest known support for deadlock-free adaptive routing in k-ary n-cubes of more than two dimensions (with k > 2). Restricting adaptivity reduces the hardware complexity, improving router speed or allowing additional performance-enhancing network features. The structure of planar-adaptive routers is amenable to efficient implementation.

464 citations


Journal ArticleDOI
TL;DR: Results show that the maximum throughput achievable with hot-potato routing can be as low as 25% of that for store-and-forward routing, and that the relative degradation increases as the number of nodes grows larger.
Abstract: The achievable aggregate capacity for a variant of the basic multihop approach in which minimum distance store-and-forward routing is replaced by a hot-potato routing algorithm is determined. With hot-potato routing, all packets simultaneously arriving at a given node and not intended for reception at that node are immediately placed onto the outbound links leaving that node; if two or more packets contend for the same outgoing link to achieve a minimum distance routing, then all but one will be misrouted to links which produce longer paths to the eventual destination. Attention is confined to the development of an analytical methodology for finding the probability distribution of the number of hops with hot potato routing for symmetric networks under uniform traffic load. Results show that the maximum throughput achievable with hot-potato routing can be as low as 25% of that for store-and-forward routing, and that the relative degradation increases as the number of nodes grows larger. This implies that the link speed up needed to produce a significant overall capacity advantage with hot potato should be at least a factor of 10. >

166 citations


Book ChapterDOI
26 Nov 1992

157 citations


Journal ArticleDOI
01 Apr 1992
TL;DR: In a dynamic network environment under heavy traffic load, shortest-path routing algorithms, particularly those that attempt to adapt to traffic changes, frequently exhibit oscillatory behaviors and cause performance degradation.
Abstract: In a dynamic network environment under heavy traffic load, shortest-path routing algorithms, particularly those that attempt to adapt to traffic changes, frequently exhibit oscillatory behaviors and cause performance degradation. In this paper we first examine the problems from the perspective of control theory and decision making, and then analyze the behaviors of the shortest-path routing algorithms in details.

126 citations


Proceedings ArticleDOI
01 Jun 1992
TL;DR: Two new algorithms for deadlock- and livelock-free wormhole routing in the torus network are presented and four worm-hole routing techniques for the two-dimensional torus are experimentally evaluated using a dynamic message injection model and different tr&c patterns and message lengths.
Abstract: Luis Gravano, Gustavo D. PifarrC, Pablo E. Berman, and Jorge L. C. Sanz, Fellow, ZEEE Abstract-This paper consists of two parts. In the first part, two new algorithms for deadlock- and livelock-free wormhole routing in the torus network are presented. The first algorithm, called *-Channels, is for the n-dimensional torus network. This technique is fully-adaptive minimal, that is, all paths with a minimal number of hops from source to destination are available for routing, and needs only five virtual channels per bidirectional link, the lowest channel requirement known in the literature for fully-adaptive minimal worm-hole routing. In addition, this result also yields the lowest buffer require- ment known in the literature for packet-switched fully-adaptive minimal routing. The second algorithm, called 4-Classes, is for the bidimensional torus network. This technique is fully-adaptive minimal and requires only eight virtual channels per bidirectional link. Also, it allows for a highly parallel implementation of its associated routing node. In the second part of this paper, four worm-hole routing techniques for the two-dimensional torus are experimentally evaluate'd using a dynamic message injection model and different tr&c patterns and message lengths.

119 citations


Proceedings ArticleDOI
01 May 1992
TL;DR: The authors propose a burst level bandwidth reservation scheme which is appropriate for an asynchronous transfer mode (ATM) network that supports the transfer of high-speed burst traffic and it is shown that the proposed scheme works well if the round-trip propagation delay is less than about 10% of the burst duration.
Abstract: The authors propose a burst level bandwidth reservation scheme which is appropriate for an asynchronous transfer mode (ATM) network that supports the transfer of high-speed burst traffic. The proposed scheme utilizes multiple links and multiple paths in a network to transfer individual bursts, thereby increasing network efficiency. As the link and path selection is implemented on a burst-by-burst basis, instead of the cell-by-cell basis used in the conventional multilink approach, the proposed scheme does not require a cell-by-cell distributor and out-of-order cell delivery is avoided, which was the disadvantage of the conventional cell-by-cell multilink approach. Performance evaluation showed that the delay and blocking probability characteristics are greatly improved, as the number of links between each pair of adjacent nodes and the number of source-destination paths increase. For example, around 40% link utilizations are achieved with 16 multiple links, while the conventional single-link schemes yield only to 1% link utilization to guarantee the same quality requirements. It is shown that the proposed scheme works well if the round-trip propagation delay is less than about 10% of the burst duration. >

113 citations


Proceedings ArticleDOI
01 Oct 1992
TL;DR: A dynamic multi-path routing scheme that has been considered for connection oriented homogeneous high speed networks to bridge the gap between routing and congestion control as the network becomes congested is described.
Abstract: In this paper we describe briefly a dynamic multi-path routing scheme that has been considered for connection oriented homogeneous high speed networks. The fundamental objective of the scheme is to bridge the gap between routing and congestion control as the network becomes congested. Because propagation delay far out shadows queueing and transmission delay in high speed networks, the proposed routing scheme works as a shortest path (minimum hop) first algorithm under light traffic conditions. However as the shortest path becomes congested, the source node uses multiple paths when and if available in order to distribute the load and reduce packet loss. The scheme is a cross between Alternate Path routing and Trunk Reservation.We compare the performance of the proposed scheme with the Shortest Path Only algorithm, the Alternate Path routing algorithm, the Random Routing algorithm, and the Trunk Reservation scheme. The throughput and packet loss performance are compared via simulations. These have been carried out concentrating on a 5 node network with varying traffic patterns, the intention being to gain insight into the strengths and weaknesses of the various schemes.

105 citations


Patent
09 Apr 1992
TL;DR: In this paper, a new routing process for a single-instruction-multiple-data (SIMD) multi-level hierarchical network or nodes, (N111-N113, N121-N123, N131-N133, N141-N143, N211, N212, N221, N222, N311 and N312), which are arranged in clusters and interconnected by dual, unidirectional channels, are used to send data packets including routing address information during a succession of routing cycles from transmitting ones to receiving ones of a larger number of parallel
Abstract: A new routing process for a single-instruction-multiple-data (SIMD) multi-level hierarchical network or nodes, (N111-N113, N121-N123, N131-N133, N141-N143, N211, N212, N221, N222, N311 and N312), which are arranged in clusters and interconnected by dual, unidirectional channels, are used to send data packets including routing address information during a succession of routing cycles from transmitting ones to receiving ones of a larger number of parallel processors (e.g., 4096 processors arranged in a hierarchy of 8 cabinets, each of which contains a cluster of 8 circuits boards, with each circuit board containing a cluster of 64 processors). Each of the nodes (N111-N113, N121-N123, N131-N133, N211, N212, N221, N222, N311 and N312) includes a storage buffer having a capacity equal to one more than the number of channels at that level. This new routine process guarantees prevention of deadlock between levels and buffer overflow, and offers high-speed, low-cost interprocessor communication for SIMD computer.

100 citations



Journal ArticleDOI
TL;DR: In this paper, the authors investigate dynamic routing in queueing networks and show that there is a heavy traffic limiting regime in which a network model based on Brownian motion can be used to approximate and solve an optimal control problem for a queueing network with multiple customer types.
Abstract: In this paper we investigate dynamic routing in queueing networks. We show that there is a heavy traffic limiting regime in which a network model based on Brownian motion can be used to approximate and solve an optimal control problem for a queueing network with multiple customer types. Under the solution of this approximating problem the network behaves as if the service-stations of the original system are combined to form a single pooled resource. This resource pooling is a result of dynamic routing, it can be achieved by a form of shortest expected delay routing, and we find that dynamic routing can offer substantial improvements in comparison with less responsive routing strategies.

Proceedings ArticleDOI
09 Jun 1992
TL;DR: It is shown that wormhole routing in mesh-connected networks can be deadlock free and adaptive without the addition of channels to the basic topology.
Abstract: It is shown that wormhole routing in mesh-connected networks can be deadlock free and adaptive without the addition of channels to the basic topology. Several partially adaptive routing algorithms for 2-D and 3-D meshes are described and simulated for a variety of conditions. Simulations of policies for selecting input channels show that transmitting extra information in the header flits can reduce communication latencies at high network throughputs. Simulations of policies for selecting output channels show that avoiding turns reduces latencies at high throughputs. Unrestricted nonminimal routing is found to reduce latencies slightly at low throughputs but increase latencies significantly at high throughputs. For nonuniform traffic patterns, a partially adaptive routing algorithm performs better than a nonadaptive one. >

Patent
Dave Dunning1
01 Jun 1992
TL;DR: In this paper, a self-timed message routing chip is coupled to each processor node, thereby forming a two dimensional mesh of message routing chips, and data can broadcast through the mesh routing chips to a row, a column, or a matrix of nodes.
Abstract: A method and apparatus providing for data broadcasting in a two dimensional mesh of processor nodes is disclosed. In accordance with the present invention, a self-timed message routing chip is coupled to each processor node, thereby forming a two dimensional mesh of message routing chips. Broadcasting originates from a corner node, and data can broadcast through the mesh routing chips to a row, a column, or a matrix of nodes. The mesh routing chips, together, form a self-timed pipeline with each individual message routing chip having broadcasting hardware which provides for the forking of a message within that particular message routing chip. The self-timed forking of a message within individual message routing chips directly supports data broadcasting within the two dimensional mesh.

Journal ArticleDOI
TL;DR: Simulation results show that the family of state-dependent routing schemes, of which separable routing is a member, is very attractive in terms of blocking rate, built-in network management features, and behavior in the presence of traffic forecast error.
Abstract: Separable routing is the first of a number of routing schemes for circuit switched telephone traffic invented at Bellcore. These routing schemes are state dependent, in the sense that, for each call attempt, a routing decision is made on the basis of the state of the network (defined in terms of the numbers of busy and idle trunks in the various trunk groups at the moment of the call attempt). In this paper, we describe separable routing and its mathematical background. Simulation results we have presented elsewhere show that the family of state-dependent routing schemes, of which separable routing is a member, is very attractive in terms of blocking rate, built-in network management features, and behavior in the presence of traffic forecast error.

Journal ArticleDOI
TL;DR: This paper presents an optimal algorithm for packet routing on a mesh-connected computer and shows how a single step of inter-processor communication can be thought of as a packet routing task.

Proceedings ArticleDOI
01 Jun 1992
TL;DR: It is shown, using a recently developed network simulator, MaRS, that a newly proposed distance-vector algorithm (ExBF) performs as well as SPF, suggesting that distance- vector algorithms are appropriate for very large wide-area networks.
Abstract: There are two approaches to adaptive routing protocols for wide-area store-and-forward networks: distance-vector and link-state. Distance-vector algorithms use O(N x e) storage at each node, whereas link-state algorithms use O(N2), where N is the number of nodes in the network and e is the average degree of a node. The ARPANET started with a distance-vector algorithm (Distributed Bellman-Ford), but because of long-lived loops, changed to a link-state algorithm (SPF). We show, using a recently developed network simulator, MaRS, that a newly proposed distance-vector algorithm (ExBF) performs as well as SPF. This suggests that distance-vector algorithms are appropriate for very large wide-area networks.

Proceedings ArticleDOI
06 Dec 1992
TL;DR: An approach to adaptive routing in multirate networks using a Markov decision theoretic framework which maintains low computational complexity while still providing quite accurate routing information is proposed.
Abstract: An approach to adaptive routing in multirate networks using a Markov decision theoretic framework which maintains low computational complexity while still providing quite accurate routing information is proposed. In this approach, each link is modeled as a birth-death process to reduce the state space size and a policy iteration applied to achieve better network performance. The results show that routing algorithms based on this approach yield better performance than least-load path routing (LLP) without incurring any significant increase in computational complexity. >

Journal ArticleDOI
TL;DR: A solution approach is developed using a shortest path based formulation with additional routing constraints imposed to refine the routing in response to minimum frequency constraints and a local improvement heuristic is presented which manipulates the routing constraints.
Abstract: Routing shipments efficiently on less-than-truckload trucking networks represents an important subproblem of the general network design problem that arises when designing a service network. The objective of the LTL shipment routing problem is to minimize the total transportation and handling costs subject to two key constraints: (i) service between two terminals must always satisfy a given minimum frequency (measured in trailers per week) and (ii) the paths from all origins into a destination should form a tree. This second constraint reflects a practical limitation on the types of instructions that can be implemented in the field. A solution approach is developed using a shortest path based formulation with additional routing constraints imposed to refine the routing in response to minimum frequency constraints. A local improvement heuristic is presented which manipulates the routing constraints. A separate set of primal-dual algorithms are also developed which provide both upper and lower bounds. Numeri...

Patent
22 Jun 1992
TL;DR: In this article, a distributed and adaptive method and system for correcting routing errors due to packet deflections in a dual-shuffle exchange network (DSN) is provided, which can be used either as the basis of a switch architecture of a ultra high-speed local-area network or metropolitan area network.
Abstract: A distributed and adaptive method and system are provided for correcting routing errors due to packet deflections in a dual-shuffle exchange network (DSN). The DSN includes a shuffle-exchange network (SN) and an unshuffle-exchange network (USN). A packet that cannot be routed correctly will be temporarily "deflected" to a wrong route. This deflection, or routing error, is then registered and encoded in a routing tag of the header of the packet. Using this information, an error-correcting routing algorithm is then used to correct the error at a different part of the network. The method and system can be used either as the basis of a switch architecture of a ultra high-speed local-area network or metropolitan area network. Also, the method and system can be used in circuit switching. The DSN can achieve the Shannon's lower bound N log N on switch complexity with arbitrarily small packet-loss probability.

Proceedings ArticleDOI
10 May 1992
TL;DR: The authors propose a provably good performance-driven global routing algorithm based on a new bounded-radius minimum routing free formulation, which simultaneously minimizes both routing cost and the longest interconnection path, so that both are bounded by small constant factors away from optimal.
Abstract: The authors propose a provably good performance-driven global routing algorithm for both cell-based and building block design. The approach is based on a new bounded-radius minimum routing free formulation, which simultaneously minimizes both routing cost and the longest interconnection path, so that both are bounded by small constant factors away from optimal. The authors' method generalizes to Steiner global routing in arbitrary weighted graphs, and to the case where varying wirelength bounds are prescribed for different source-sink paths. Extensive simulations confirmed that the approach gave very good performance, and exhibited a smooth tradeoff between the competing requirements of minimum delay and minimum total wirelength. >

Proceedings ArticleDOI
01 May 1992
TL;DR: The authors propose a new scheme for optical signal routing within linear lightwave network (LLN) subnets, based on a new optical agent signal routing scheme, called rooted routing, and shows that when the link and excess losses are small, the optimal power budget is close to thatWhen the original, shortest path routing scheme is used.
Abstract: The authors propose a new scheme for optical signal routing within linear lightwave network (LLN) subnets. They study the synchronization problem that exists in these subnets which prevents efficient implementation of time division multiple access schemes for sharing a common broadcast medium. A solution for this problem is proposed, based on a new optical agent signal routing scheme, called rooted routing. The impact of the rooted routing on power losses is analyzed, and an approach for optimizing the power losses in LLNs with rooted routing is presented. It is shown that when the link and excess losses are small, the optimal power budget is close to that when the original, shortest path routing scheme is used. It is also shown that the power budget in LLNs with rooted routing can be significantly improved using a single optical amplifier. >

Proceedings ArticleDOI
28 May 1992
TL;DR: The authors propose a multicast wormhole routing algorithm for 2D-torus multicomputers that is based on a path-like routing scheme and offers a deadlock-free routing.
Abstract: More recently, multicast communication has been included in multicomputer systems. Multicast communication refers to the delivery of the same message from a source node to an arbitrary number of destination nodes. The performance of multicomputers is highly dependent on the message routing scheme and the communication mechanism. Wormhole routing is the most promising switching technique used in the new generation multicomputers. The authors propose a multicast wormhole routing algorithm for 2D-torus multicomputers. The multicast routing algorithm is based on a path-like routing scheme and offers a deadlock-free routing. Comparing with the 2D-mesh multicomputers, the authors simulation show that the multipath routing algorithm for 2D-torus multicomputers can reduce the communication traffic and the maximum distance in multicast communication. >

Proceedings ArticleDOI
01 Mar 1992
TL;DR: This work introduces and evaluates the multipath E-cube algorithm (MECA) for adaptive and fault-tolerant wormhole routing in k-ary n-cubes and highlights the advantages of MECA over other circuit-switched routing techniques.
Abstract: Introduces and evaluates the multipath E-cube algorithm (MECA) for adaptive and fault-tolerant wormhole routing in k-ary n-cubes. This algorithm provides multiple shortest paths between any source and destination node of a multicomputer using only a constant number of virtual channels for every physical channel, and with little added overhead at the router. Consequently, it is able to improve network throughput and reduce latency while maintaining the deadlock-free property that is critical for circuit-switched routing mechanisms. MECA also allows multiple broadcast trees to be used concurrently without incurring deadlock. Simulation results highlight the advantages of MECA over other circuit-switched routing techniques. >

Proceedings ArticleDOI
01 Jul 1992
TL;DR: A new three-layer over-the-cell channel routing algorithm, WILMA3 (wire length minimization algorithm 3), for high-performance circuits that not only minimizes the channel height by OTC routing but also minimized the net lengths.
Abstract: The authors present a new three-layer over-the-cell channel routing algorithm, WILMA3 (wire length minimization algorithm 3), for high-performance circuits. This router not only minimizes the channel height by OTC routing but also minimizes the net lengths. They have also developed a two-layer router, WILMA2. Both WILMA2 and WILMA3 achieve reduction in channel height while maintaining the net length greater than its length in the two-layer channel routing. Experimental results with WILMA2 indicated that WILMA2 can produce routings that are comparable to existing two-layer over-the-cell routers. Both WILMA2 and WILMA3 achieved these results, while maintaining the net length bound. >

Proceedings ArticleDOI
01 Jun 1992
TL;DR: This paper presents new on-line and off-line routing algorithms, all of which are optimal to within a small additive term, for permutation routing in two and three-dimensional mesh-connected processor arrays.
Abstract: In this paper, we consider the problem of permutation routing in twoand three-dimensional mesh-connected processor arrays. We present new on-line and off-line routing algorithms, all of which are optimal to within a small additive term. In particular, our results include the following: 1. An off-line algorithm for routing a permutation in an n x n mesh in time 2n – 1 using buffers of size 4. 2. An off-line algorithm for routing a permutation in an n x n x n mesh in time 372– 1 using buffers of size 10. 3. An on-line randomized algorithm for routing a permutation in an n x n x n mesh in time 3n + O(log n) time using constant sized buffers, ●Supported in part by NSF Grant NSF-CCR-87-04513. This work was done while visiting NEC Research Institute. E-mail: kakl@harvard.harvard .edu t This work was done while visiting N EC Research Institute. E-mail: krizanc@cs. rochester.edu t E-mail: satish@research .nj.nec.com Permission to copy without fee all or part of this material is granted provided that the copies are not made or distributed for direct commercial advantage, the ACM copyright notice and the title of the publication and its date appear, and notice is given that copying is by permission of the Association for Computing Machinety. To copy otherwise, or to republish, requires a fee and/or speeitic permission. SF’AA ’9261921CA @1992 ACM O-89791-484-8192K)O0610023. . . . . .. $1.50 23 4. An on-line randomized algorithm for routing a permutation in an n x n torus in time n + O(log n ) time using constant sized buffers. Also, while optimal algorithms for routing on two-dimensional meshes were known, we give new algorithms that are simpler and/or require smaller buffers.

Proceedings ArticleDOI
01 May 1992
TL;DR: A new routing algorithm called the labeled distance-vector routing algorithm (LDR), that is loop-free at every instant, eliminates the counting-to-infinity problem of the distributed Bellman-Ford (DBF) algorithm, operates with arbitrary link and node delays, and provides shortest paths a finite time after the occurrence of an arbitrary sequence of topological changes.
Abstract: The author presents, verifies, and analyzes a new routing algorithm called the labeled distance-vector routing algorithm (LDR), that is loop-free at every instant, eliminates the counting-to-infinity problem of the distributed Bellman-Ford (DBF) algorithm, operates with arbitrary link and node delays, and provides shortest paths a finite time after the occurrence of an arbitrary sequence of topological changes. In contrast to previous successful approaches to loop-free routing, LDR maintains DBF's row-independence property and does not require internodal coordination spanning multiple loops. The new algorithm is shown to be loop-free and to converge in a finite time after an arbitrary sequence of topological changes. Its performance is compared with the performance of other distributed routing algorithms. >

Journal ArticleDOI
TL;DR: This approach successfully solves the net ordering and congestion prediction problems which other approaches suffer and achieves up to 11% area reduction compared to the previous global routing package used in the DATools system and obtains up to 17% reduction in the total channel densityCompared to the Timberwolf 4.2 package.

Proceedings ArticleDOI
11 Oct 1992
TL;DR: The interactions between the CAD tools that are used to configure the routing resources of a field-programmable gate array (FPGA) and the design of the routing architecture itself are examined and it is demonstrated that the fewest routing switches are required when each logical pin appears on only one side of the logic cell rather than two or more.
Abstract: The interactions between the CAD tools that are used to configure the routing resources of a field-programmable gate array (FPGA) and the design of the routing architecture itself are examined. Such an understanding is used to determine where to reduce the number of routing switches in the FPGA while maintaining routability. Experiments are used to study a switch block that was previously thought to have unacceptably low flexibility. It is shown that the performance of this switch block can be improved by adapting the global router to require less flexibility in the architecture, and by careful placement of physical pins on the logic blocks. It is demonstrated that the fewest routing switches are required when each logical pin appears on only one side of the logic cell rather than two or more. >

ReportDOI
01 Jun 1992
TL;DR: Simulation results are presented that show dimension-order routing achieves both higher throughput and lower latency than adaptive routing, refuting claims that adaptive routing performs better than dimension- order routing.
Abstract: This report refutes claims that adaptive routing performs better than dimension-order routing. Simulation results are presented that show dimension-order routing achieves both higher throughput and lower latency than adaptive routing. Specious claims for the advantages of adaptive routing are critiqued.