
Data Center Networking with Multipath TCP
Costin Raiciu, Christopher Pluntke, Sebastien Barre, Adam Greenhalgh,
Damon Wischik, Mark Handley
University College London, Université Catholique de Louvain
ABSTRACT
Recently new data center topologies have been proposed that
offer higher aggregate bandwidth and location independence
by creating multiple paths in the core of the network. To ef-
fectively use this bandwidth requires ensuring different flows
take different paths, which poses a challenge.
Plainly put, there is a mismatch between single-path trans-
port and the multitude of available network paths. We pro-
pose a natural evolution of data center transport from TCP
to multipath TCP. We show that multipath TCP can effec-
tively and seamlessly use available bandwidth, providing im-
proved throughput and better fairness in these new topolo-
gies when compared to single path TCP and randomized
flow-level load balancing. We also show that multipath TCP
outperforms laggy centralized flow scheduling without need-
ing centralized control or additional infrastructure.
Categories and Subject Descriptors
C.2.2 [Computer-Communication Networks]: Network Protocols
Keywords: Multipath TCP, Data Center Networks
1. INTRODUCTION
Recent growth in cloud applications from companies such
as Google, Microsoft, and Amazon has resulted in the con-
struction of data centers of unprecedented size. These appli-
cations are written to be distributed across machines num-
bering in the tens of thousands, but in so doing, they stress
the networking fabric within the data center: distributed file
systems such as GFS transfer huge quantities of data be-
tween end-systems (a point-to-point traffic pattern) while data
processing applications such as MapReduce, BigTable or
Dryad shuffle a significant amount of data between many
machines. To allow maximum flexibility when rolling out
new applications, it is important that any machine can play
any role without creating hot-spots in the network fabric.
Data center networking has become a focus of attention recently; in part this is because data centers are now important
enough to be considered as special cases in their own right,
but perhaps equally importantly, they are one of the few
cases where researchers can dictate both the physical topol-
ogy and the routing of traffic simultaneously. New topolo-
gies such as FatTree[1] and VL2[5] propose much denser
interconnects than have traditionally been implemented, so
as to allow operators to deploy application functionality in
a location independent manner. However, while such dense
interconnects can in principle support the full cross-sectional
bandwidth of every host communicating flat out simultane-
ously, the denseness of interconnection poses a difficult chal-
lenge for routing. How can we ensure that no matter the traf-
fic pattern, the load is distributed between the many possible
parallel paths as evenly as possible?
The current wisdom seems to be to use randomised load
balancing (RLB) to randomly choose a path for each flow
from among the possible parallel paths. However, RLB can-
not achieve full bisectional bandwidth because some flows
will randomly choose the same path while other links ran-
domly fail to be selected. Thus, RLB tends to be supple-
mented by centralized flow-scheduling for large flows.
In this paper we propose an alternative and simpler ap-
proach; the end systems in the data center should simply
use multipath TCP (MPTCP), as currently under consider-
ation in the IETF[4], to utilize multiple parallel paths for
each TCP connection. The great advantage of this approach
is that the linked congestion controller in each MPTCP end
system can act on very short timescales to move its own traf-
fic from paths it observes to be more congested, onto paths
it observes to be less congested. Theory suggests that such
behavior can be stable, and can also serve to load balance
the entire network.
We evaluate how effective MPTCP is in comparison to
alternative scheduling mechanisms across a range of differ-
ent proposed data center topologies. We use a combination
of large scale simulation and smaller scale data center ex-
perimentation for evaluation. Our conclusion is that for all
the workloads and topologies we considered, MPTCP either
matches or in many cases exceeds the performance a central-
ized scheduler can achieve, and is more robust.
Further, we show that single-path TCP cannot fully utilize
capacity for certain topologies and traffic matrices, while
multipath can. There is a close connection between topol-
ogy, path selection, and transport in data centers; this hints
at possible benefits from designing topologies for MPTCP.

2. DATA CENTER NETWORKING
From a high-level perspective, there are four main com-
ponents to a data center networking architecture:
Physical topology
Routing over the topology
Selection between the paths supplied by routing
Congestion control of traffic on the selected paths
These are not independent; the performance of one will
depend on the choices made by those preceding it in the list,
and in some cases by those after it in the list. We will discuss
each in turn, but it is worth noting now that MPTCP spans
both path selection and congestion control, which is why it
is able to offer benefits that cannot otherwise be obtained.
2.1 Topology
Traditionally data centers have been built using hierar-
chical topologies: racks of hosts connect to a top-of-rack
switch; these switches connect to aggregation switches; in
turn these are connected to a core switch. Such topologies
make sense if most of the traffic flows into or out of the data
center. However, if most of the traffic is intra-datacenter, as
is increasingly the trend, then there is a very uneven distri-
bution of bandwidth. Unless traffic is localized to racks, the
higher levels of the topology become a serious bottleneck.
Recent proposals address these limitations. VL2 and Fat-
Tree are Clos[3] topologies that use multiple core switches
to provide full bandwidth between any pair of hosts in the
network. They differ in that FatTree uses larger quantities of
lower speed (1Gb/s) links between switches, whereas VL2
uses fewer faster (10Gb/s) links. In contrast, in BCube[6],
the hierarchy is abandoned in favor of a hypercube-like topol-
ogy, using hosts themselves to relay traffic.
All three proposals solve the traffic concentration prob-
lem at the physical level: there is enough capacity for ev-
ery host to be able to transmit flat-out to another randomly
chosen host. However the denseness of interconnection they
provide poses its own problems when it comes to determin-
ing how traffic should be routed.
2.2 Routing
Dense interconnection topologies provide many possible
parallel paths between each pair of hosts. We cannot ex-
pect the host itself to know which of these paths is the least
loaded, so the routing system itself must spread traffic across
these paths. The simplest solution is to use randomized load
balancing, where each flow is assigned a random path from
the set of possible paths.
In practice there are multiple ways to implement random-
ized load balancing in today’s switches. For example, if each
switch uses a link-state routing protocol to provide ECMP
forwarding then, based on a hash of the five-tuple in each
packet, flows will be split roughly equally across equal length
paths. VL2 provides just such a mechanism over a virtual
layer 2 infrastructure.
However, in topologies such as BCube, paths vary in length,
and simple ECMP cannot access many of these paths be-
cause it only hashes between the shortest paths. A simple
alternative is to use multiple static VLANs to provide mul-
tiple paths that expose all the underlying network paths[8].
Either the host or the first hop switch can then hash the five-
tuple to determine which path is used.
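As a rough illustration of flow-level hashing (a hypothetical sketch, not any particular switch's implementation), the fragment below maps a flow's five-tuple onto one of the available parallel paths; the function name and hash are illustrative only:

    import hashlib

    def select_path(src_ip, dst_ip, src_port, dst_port, proto, num_paths):
        # map the flow's five-tuple onto one of num_paths parallel paths;
        # the hash itself is illustrative, not what any real switch uses
        key = f"{src_ip}|{dst_ip}|{src_port}|{dst_port}|{proto}".encode()
        digest = hashlib.sha1(key).digest()
        return int.from_bytes(digest[:4], "big") % num_paths

    # every packet of a flow hashes to the same path (so no reordering), but two
    # unrelated flows can still collide on one path, which is the root of the
    # hot-spot problem discussed in Section 2.3
    select_path("10.0.1.2", "10.0.3.4", 45678, 80, 6, num_paths=8)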
In our simulations, we do not model dynamic routing; in-
stead we assume that all the paths between a pair of end-
points are available for selection, whatever mechanism actu-
ally does the selection. For our experiments in Section 4, we
use the VLAN-based routing solution.
2.3 Path Selection
Solutions such as ECMP or multiple VLANs provide the
basis for randomised load balancing as the default path se-
lection mechanism. However, as others have shown, ran-
domised load balancing cannot achieve the full cross-sectional
bandwidth in most topologies, nor is it especially fair. The
problem, quite simply, is that often a random selection causes
hot-spots to develop, where an unlucky combination of ran-
dom path selection causes a few links to be overloaded and
links elsewhere to have little or no load.
To address these issues, the use of a centralized flow sched-
uler has been proposed. Large flows are assigned to lightly
loaded paths and existing flows may be reassigned to maxi-
mize overall throughput[2]. The scheduler does a good job
if flows are network-limited, with exponentially distributed
sizes and Poisson arrivals, as shown in Hedera [2]. The in-
tuition is that if we only schedule the big flows we can fully
utilize all the bandwidth, and yet have a small scheduling
cost, as dictated by the small number of flows.
However, data center traffic analysis shows that flow dis-
tributions are not Pareto distributed [5]. In such cases, the
scheduler has to run frequently (100ms or faster) to keep up
with the flow arrivals. Yet, the scheduler is fundamentally
limited in its reaction time as it has to retrieve statistics, com-
pute placements and instantiate them, all in this scheduling
period. We show through simulation that a scheduler run-
ning every 500ms has similar performance to randomised
load balancing when these assumptions do not hold.
2.4 Congestion Control
Most applications use singlepath TCP, and inherit TCP’s
congestion control mechanism which does a fair job of match-
ing offered load to available capacity on whichever path was
selected. Recent research has shown there are benefits from
tuning TCP for data center use, such as by reducing the min-
imum retransmit timeout[10], but the problem TCP solves
remains unchanged.
In proposing the use of MPTCP, we change the partition-
ing of the problem. MPTCP can establish multiple subflows
across different paths between the same pair of endpoints for
a single TCP connection. The key point is that by linking
the congestion control dynamics on these multiple subflows,

Figure 1: Throughput (% of optimal) for long running connections using a permutation traffic matrix, for RLB and varying numbers of MPTCP subflows: (a) FatTree (8192 hosts); (b) VL2 (11520 hosts); (c) BCube (1024 hosts).
Figure 2: Subflows needed to reach 90% network utilization vs. number of servers, for FatTree, VL2 and BCube.
MPTCP can explicitly move traffic away from the more con-
gested paths and place it on the less congested paths.
The algorithm currently under discussion in the IETF is
called “linked increases” because the slope of the additive in-
crease part of the TCP sawtooth is determined by that flow’s
fraction of the total window of traffic in flight. The faster
a flow goes, the larger its fraction of the total, and so the
faster it can increase. This algorithm makes MPTCP incre-
mentally deployable, as it is designed to be fair to competing
singlepath TCP traffic, unlike simply running multiple reg-
ular TCP flows between the same endpoints. In addition it
moves more traffic off congested paths than multiple regular
TCP flows would.
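A minimal sketch of this coupling, assuming per-ACK window updates and ignoring the refinements of the actual IETF algorithm: each subflow's additive increase is divided by the total window across all subflows, so a subflow's per-RTT increase equals its share of the traffic in flight, while the multiplicative decrease stays per-subflow. All names here are illustrative.

    class MPTCPConnection:
        def __init__(self, num_subflows, init_cwnd=2.0):
            self.cwnd = [init_cwnd] * num_subflows   # one window (in packets) per subflow

        def on_ack(self, i):
            # per ACK on subflow i, increase by 1/total instead of TCP's 1/cwnd[i]:
            # the per-RTT increase is then cwnd[i]/total, i.e. the slope of the
            # additive increase is this subflow's share of the traffic in flight
            self.cwnd[i] += 1.0 / sum(self.cwnd)

        def on_loss(self, i):
            # the multiplicative decrease stays per-subflow, as in regular TCP,
            # so a subflow on a congested path shrinks and carries less traffic there
            self.cwnd[i] = max(1.0, self.cwnd[i] / 2.0)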
Our hypothesis is that given sufficiently many randomly
chosen paths, MPTCP will find at least one good unloaded
path, and move most of its traffic that way. In so doing it will
relieve congestion on links that got more than their fair share
of RLB-balanced flows. This in turn will allow those com-
peting flows to achieve their full potential, maximizing the
cross-sectional bandwidth of the network and also improv-
ing fairness. Fairness is not an abstract concept for many
distributed applications; for example, when a search appli-
cation is distributed across many machines, the overall com-
pletion time is determined by the slowest machine. Hence
worst-case performance matters significantly.
3. ANALYSIS
To validate our hypothesis, we must examine how MPTCP
performs in a range of topologies and with a varying num-
ber of subflows. We must also show how well it performs
against alternative systems. To perform such an analysis is
itself challenging - we really want to know how well such de-
ployments will perform at large scale with real-world trans-
port protocol implementations and with reasonable traffic
patterns. Lacking a huge data center to play with, we have
to address these issues independently, using different tools:
Flow-level simulation to examine idealized large-scale behavior.
Packet-level simulation to examine more detailed medium-scale behavior.
Real-world implementation to examine practical limitations at small-scale.
3.1 Large scale analysis
First we wish to understand the potential benefits of MPTCP
with respect to the three major topologies in the literature:
FatTree, VL2 and BCube. The baseline for comparison is
randomized load balancing (RLB) using singlepath TCP.
MPTCP adds additional randomly chosen paths, but then the
linked congestion control moves the traffic within each con-
nection to the least congested subflows.
We use an iterative flow-level simulator to analyze topolo-
gies of up to 10,000 servers (the exact number is determined
by the need for a regular topology). In each iteration the simulator
computes the loss rates for each link based on the offered
load, and adjusts the load accordingly. When the offered
load and loss rate stabilize, the simulator finishes. This sim-
ulator does not model flow startup behavior and other packet
level effects, but is scalable to very large topologies.
Fig. 1 shows the total throughput of all flows when we
use a random permutation matrix where each host sends flat
out (as determined by the TCP response function) to a sin-
gle other host. In all three topologies, with the right path
selection this traffic pattern should just be able to load the
network to full capacity but no more.
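The sketch below is our own illustration of the general shape of such a flow-level model, not the simulator used for these results: build a random permutation traffic matrix, then iterate between per-link loss rates and per-flow rates given by a simple TCP response function until they settle. The link capacity, RTT, and loss model are placeholder assumptions.

    import math, random

    def permutation_traffic(n):
        # each host sends flat out to exactly one other host (no self-flows)
        dst = list(range(n))
        while any(i == d for i, d in enumerate(dst)):
            random.shuffle(dst)
        return dst            # dst[i] is the receiver for host i

    def tcp_rate(p, mss_bits=12000, rtt=1e-4, cap=1e9):
        # simplified TCP response function, capped at the 1 Gb/s interface speed
        return cap if p <= 0 else min(cap, (mss_bits / rtt) * math.sqrt(1.5 / p))

    def flow_level_fixed_point(flow_paths, link_cap=1e9, iters=50):
        # flow_paths: one routed path (a list of link ids) per flow; how the
        # permutation above maps to paths is topology-specific and omitted here
        rate = [link_cap] * len(flow_paths)
        for _ in range(iters):
            load = {}
            for f, path in enumerate(flow_paths):
                for l in path:
                    load[l] = load.get(l, 0.0) + rate[f]
            # loss grows with overload on each link; a flow sees the combined
            # loss of the links along its path, which sets its new rate
            loss = {l: max(0.0, (v - link_cap) / v) for l, v in load.items()}
            rate = [tcp_rate(1.0 - math.prod(1.0 - loss[l] for l in path))
                    for path in flow_paths]
        return rate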
What we observe is that RLB is unable to fill any of these
networks. It performs best in the VL2 topology, where it
achieves 77% throughput, but performs much worse in Fat-
Tree and BCube. The intuition is simple: to achieve 100%
capacity with RLB, no two flows should ever traverse the
same link. Obviously RLB cannot do this, but how badly
it suffers depends on how overloaded links become. With
FatTree, when two TCP flows that could potentially send at
1Gb/s end up on the same 1Gb/s link, each backs off by 50%,
leaving other links underutilized. With VL2, when eleven
1Gb/s TCP flows end up on the same 10Gb/s link, the effect
is much less drastic, hence the reduced performance penalty.
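A quick back-of-the-envelope check of this intuition, assuming the collided link is shared equally:

    # FatTree: two flows that could each send at 1 Gb/s collide on one 1 Gb/s link
    fattree_share = 1.0 / 2           # 0.5 Gb/s each, i.e. 50% of their potential
    # VL2: eleven 1 Gb/s-limited flows collide on one 10 Gb/s core link
    vl2_share = min(1.0, 10.0 / 11)   # ~0.91 Gb/s each, i.e. only a ~9% penalty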
The benefits of MPTCP are clear; as additional subflows
are added, the overall throughput increases. How many sub-
flows are needed depends on the topology; the intuition is
as before - we need enough subflows to overcome the traf-
fic concentration effects of the random path allocation. One
might think that the power of two choices[7] might apply
here, providing good load balancing with very few subflows.
However it does not because the paths are not disjoint. Each

Figure 3: Minimum flow throughput ("Min Rate") and Jain fairness index for the flows in Fig. 1: (a) FatTree and (b) VL2 as % of optimal; (c) BCube as % of interface rate.
Figure 4: Throughput (% of max) for the First Fit centralized scheduler at scheduling intervals of 1s, 500ms, 100ms and 10ms, compared to RLB and MPTCP.
subflow can encounter a congested bottleneck on a single
link along its path, causing the other links along the path to
be underutilized. Although such bottleneck links are load-
balanced, with FatTree in particular, other links cannot be
fully utilized, and it takes more than two subflows to spread
load across sufficient paths to fully utilize the network.
This raises the question of how the number of subflows
needed scales with the size of the network. We chose an ar-
bitrary utilization target of 90% of the cross sectional band-
width. For different network sizes we then progressively in-
creased the number of subflows used. Fig. 2 shows the min-
imum number of subflows that can achieve 90% utilization
for each size of network. The result is encouraging: be-
yond a certain size, the number of subflows needed does not
increase significantly with network size. For VL2, two sub-
flows are needed. For FatTree, eight are needed. This might
seem like quite a high number, but for an 8192-node FatTree
network there are 256 distinct paths between each host pair,
so only a small fraction of the paths are needed to achieve
full utilization. From the host point of view, eight subflows
is not a great overhead.
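These numbers are consistent with the standard k-ary FatTree construction (k^3/4 hosts and (k/2)^2 core switches, hence (k/2)^2 distinct shortest paths between hosts in different pods); a small check:

    k = 32                        # an 8192-host FatTree is built from 32-port switches
    hosts = k ** 3 // 4           # = 8192 hosts
    core_paths = (k // 2) ** 2    # = 256 distinct shortest paths between two pods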
We also care that the capacity is allocated fairly between
connections, especially for applications where the final re-
sult can only be returned when the last node running a part
of a calculation returns its results. Fig. 3 shows the through-
put of the lowest speed flow (as a percentage of what should
be achievable) and Jain’s fairness index for the three topolo-
gies. Multipath always improves fairness, even for the VL2
topology which performed relatively well if we only exam-
ine throughput.
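Jain's fairness index used here is the standard definition: for throughputs x_1, ..., x_n it is (sum x_i)^2 / (n * sum x_i^2), equal to 1 when all flows receive the same rate and 1/n when a single flow gets everything. A minimal helper:

    def jain_index(rates):
        # 1.0 = perfectly fair; 1/n = a single flow receives all the capacity
        n, s = len(rates), sum(rates)
        return (s * s) / (n * sum(r * r for r in rates)) if s > 0 else 1.0

    jain_index([1.0, 1.0, 1.0, 0.1])   # ~0.80: one starved flow drags the index down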
We have also run experiments in our packet-level simu-
lator with a wide range of load levels. At very light load,
there are few collisions, so MPTCP gives little benefit over
RLB on FatTree or VL2 topologies. However on BCube,
MPTCP excels because a single flow can use all the host
interfaces simultaneously.
At the other extreme, under overload conditions even RLB
manages to fill the network, but MPTCP still gives better
fairness. Fig. 5 shows the throughput of each individual flow
in just such an overload scenario.
The results above use a permutation traffic matrix, which
is useful as a benchmark because it enables a network de-
signed for full bisection bandwidth to be loaded to 100%
utilization with the right traffic distribution scheme. In prac-
tice less regular traffic and both lighter and heavier loads are
of interest. Fig. 6 shows results when the source and desti-
nation are chosen randomly for varying numbers of flows.
FatTree shows substantial improvements over single-path
RLB, even for very light or very heavy loads. This shows
the performance benefits of MPTCP are robust across a wide
range of conditions.
The improvements for BCube are even greater at lower
traffic loads. This is because BCube hosts have multiple in-
terfaces, and MPTCP can use them all for a single flow - at
light loads the bottlenecks are the hosts themselves.
The results for VL2 were a surprise, given that Fig. 1
shows improvements for this topology with the permutation
matrix. MPTCP gives improvements over RLB of less than
1% for all loads we studied. On closer examination, it turns
out that the host interface is almost always the bottleneck for
VL2. Many flows collide on either the sending or receiving
host, and MPTCP has no path diversity here. The 10Gb/s
links are then not the bottleneck for the remaining flows un-
der these load levels.
3.2 Scheduling and Dynamic Flow Arrivals
With single-path TCP it is clear that RLB does not per-
form sufficiently well unless the topology has been specif-
ically tailored for it, as with VL2. Even with VL2, fluid
simulations show that MPTCP can increase fairness and per-
formance significantly.
RLB however is not the only singlepath path selection
algorithm; Hedera proposes using a centralized scheduler
to supplement RLB, with the goal of explicitly allocating
large flows to paths. Specifically, Hedera flows start off
using RLB, but are measured by the centralized scheduler.
If, during a scheduling period, a flow’s average throughput
is greater than 10% of the interface speed, it is explicitly
scheduled. How well does MPTCP compare with central-
ized scheduling?
This evaluation is more difficult; the performance of a
scheduler can depend on lag in flow measurement, path con-
figuration, and TCP’s response to path reconfiguration. Sim-
ilarly the performance of MPTCP can depend on how quickly

Figure 5: Flow rates for an overloaded FatTree (128 hosts).
Figure 6: Random connections: improvement vs. load.
Figure 7: MPTCP vs. multiple independent TCP flows: (a) network loss rates (mean and max, linked vs. independent, for 1, 2, 4 and 8 subflows); (b) retransmit timeouts (linked vs. independent).
new subflows can slowstart. None of these effects can be
captured in a fluid flow model, so we have to resort to full
packet-level simulation.
For our experiments we modified htsim[9], which was built
from the ground up to support high speeds and large numbers of
flows. It models TCP very similarly to ns2, but performance
is much better and simulation time scales approximately lin-
early with total bandwidth simulated.
For space reasons, we only examine the FatTree topology
with 128 servers and a total maximum bandwidth of 128Gb/s.
We use a permutation traffic matrix with closed loop flow
arrivals (one flow finishes, another different one starts), and
flow sizes distributed according to the VL2 dataset. We mea-
sure throughputs over 20 seconds of simulated time for RLB,
MPTCP (8 subflows), and a centralized scheduler using the
First Fit heuristic, as in Hedera [2] (we chose First Fit be-
cause it runs much faster than the Simulated Annealing heuris-
tic; execution speed is really important to get benefits with
centralized scheduling).
The average bisectional bandwidth achieved is shown in
Fig. 4. Again, MPTCP significantly outperforms RLB. Cen-
tralized scheduler performance depends on how frequently
it is run. In the Hedera paper it is run every 5 seconds. Our
results show it needs to run every 100ms to approach the per-
formance of MPTCP; if it runs only every 500ms
there is little benefit because in the high bandwidth data cen-
ter environment even large flows only take around a second
to complete.
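For reference, a hedged sketch of the First Fit placement step; the data structures (the set of measured large flows, their demand estimates, the candidate path lists) are invented here for illustration, and Hedera's demand estimation and route installation are omitted:

    def first_fit(big_flows, candidate_paths, spare):
        # big_flows: (flow_id, estimated_demand) pairs whose measured rate
        #            exceeded the scheduling threshold in the last period
        # candidate_paths[flow_id]: the parallel paths, each a list of link ids
        # spare[link_id]: spare capacity currently left on each link
        placement = {}
        for flow, demand in big_flows:
            for path in candidate_paths[flow]:
                if all(spare[l] >= demand for l in path):
                    for l in path:
                        spare[l] -= demand
                    placement[flow] = path   # re-route the flow onto this path
                    break                    # first fit: take the first path that fits
        return placement
    # The scheduler re-runs this every scheduling period with fresh measurements,
    # which is where the lag discussed above comes from.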
Host-limited Flows
Hedera’s flow scheduling algorithm is based on the assump-
tion that long-lived flows contribute most of the bytes and
therefore it only needs to schedule those flows. Other flows
are treated as background noise. It also assumes that flows
which it schedules onto unused links are capable of increas-
ing their transmit rate to fill that link.
Both assumptions can be violated by flows which are end-
host limited and so cannot increase their transmission rate.
For example, network bandwidth can easily exceed disk per-
formance for certain workloads. Host-limited flows can be
long lived and transfer a great deal of data, but never exceed
the scheduling threshold. These flows are essentially invis-
ible to the scheduler and can collide with scheduled flows.
Perhaps worse, a host-limited flow might just exceed the
threshold for scheduling and be assigned to an empty path
which it cannot utilize, wasting capacity.
We ran simulations using a permutation matrix where each
host sends two flows; one is host-limited and the other is
not. When the host-limited flows have throughput just below
the 10% scheduling threshold, Hedera’s throughput drops
20%. When the same flows are just above the threshold for
scheduling it costs Hedera 17%.
Scheduling App-Limited Flows
Threshold    Over-Threshold    Under-Threshold
5%           -21%              -22%
10%          -17%              -21%
20%          -22%              -23%
50%          -51%              -45%
The table shows the 10% threshold is a sweet spot; chang-
ing it either causes too few flows to be scheduled, or causes
even more problems when a scheduled flow cannot expand
to fill capacity.
In contrast, MPTCP makes no such assumptions. It re-
sponds correctly to competing host-limited flows, consis-
tently obtaining high throughput.
MPTCP vs. Multiple TCP Connections
Using multiple subflows clearly has significant benefits. How-
ever, MPTCP is not the only possible solution. Could we not
simply use multiple TCP connections in parallel, and stripe
at the application level?
From a network performance point of view, this is equiv-
alent to asking what the effect is of the congestion control
linkage within MPTCP. If, instead of using MPTCP’s “linked
increases” algorithm, we use regular TCP congestion control
independently for each subflow, this will have the same ef-
fect on the network.
To test this, we use again the permutation traffic matrix
and create 20 long running flows from each host. We mea-
sure network loss rates for MPTCP with Linked Increases
and compare against running independent TCP congestion
control on each subflow.
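In terms of the earlier sketch, the only difference between the two schemes is the per-ACK increase rule (illustrative):

    def on_ack_linked(cwnd, i):         # MPTCP "linked increases"
        cwnd[i] += 1.0 / sum(cwnd)      # increase is coupled across subflows

    def on_ack_independent(cwnd, i):    # k parallel regular TCP connections
        cwnd[i] += 1.0 / cwnd[i]        # each subflow increases on its own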
The results in Fig. 7(a) show that MPTCP does not in-
crease network load, as measured by either mean or max loss
rate. In contrast, independent congestion control for each
subflow increases both the mean and maximum loss rates and
causes far more retransmit timeouts (Fig. 7(b)).

References
[1] M. Al-Fares, A. Loukissas, and A. Vahdat. A scalable, commodity data center network architecture. SIGCOMM 2008.
[2] M. Al-Fares et al. Hedera: dynamic flow scheduling for data center networks. NSDI 2010.
[3] C. Clos. A study of non-blocking switching networks. Bell System Technical Journal, 1953.
[5] A. Greenberg et al. VL2: a scalable and flexible data center network. SIGCOMM 2009.
[6] C. Guo et al. BCube: a high performance, server-centric network architecture for modular data centers. SIGCOMM 2009.