
Updates in Highly Unreliable, Replicated Peer-to-Peer Systems
Anwitaman Datta, Manfred Hauswirth, Karl Aberer
Swiss Federal Institute of Technology (EPFL), Lausanne, Switzerland
{anwitaman.datta, manfred.hauswirth, karl.aberer}@epfl.ch
Abstract
This paper studies the problem of updates in decentralised and self-organising P2P systems in which peers have low online probabilities and only local knowledge. The update strategy we propose for this environment is based on a hybrid push/pull rumor spreading algorithm and provides a fully decentralised, efficient and robust communication scheme which offers probabilistic guarantees rather than ensuring strict consistency. We describe a generic analytical model to investigate the utility of our hybrid update propagation scheme from the perspective of communication overhead.
1. Introduction
In most peer-to-peer (P2P) systems data is assumed to be rather static and updates occur very infrequently. For application domains beyond mere file sharing, for example trust management [2] or peer commerce, such assumptions do not hold and updates in fact may occur frequently. Other typical applications where new data items are added, deleted, or updated frequently by multiple users are bulletin-board systems, shared calendars or address books, e-commerce catalogues, and project management information.
To improve fault-tolerance and response time, data is heavily replicated in most P2P systems, and the system must take into account that peers are autonomous and may be offline frequently and that no global knowledge of the system exists. This is especially relevant for upcoming mobile environments. Thus we are operating in a decentralised setting without global control. This is the setting we assume for updates in our P-Grid P2P system [1, 3] described in this paper. To meet the challenges imposed by a high replication factor, lack of global knowledge, and peers being online only with a very low probability, we exploit epidemic algorithms under the assumption that probabilistic guarantees instead of strict consistency are sufficient and that such an approach can indeed be used in a decentralised and self-organising environment.
Our proposed update algorithm is based on rumor spreading. We modify existing message flooding algorithms to achieve lower communication overheads but provide similar probabilistic guarantees and low latency. Since we assume that peers are mostly offline, we propose a hybrid push/pull algorithm so that offline peers can inquire for updates that they missed when they become online again. In the push phase the algorithm uses a new mechanism, apart from traditional feedback and probabilistic methods to propagate a rumor, to avoid many duplicate messages by propagating a partial list of peers to which a particular message has already been sent. It also employs this list, in conjunction with the number of duplicate messages received at a particular node, as a local metric to estimate the extent to which a message has spread globally, and thereby tunes the probabilistic parameters of the generic algorithm locally, which is a novel contribution. As will be discussed later, the current taxonomy of epidemic algorithms does not exploit the advantages of speculation (feed-forward) for a significant reduction of communication overhead.

The work presented in this paper was supported (in part) by the National Competence Center in Research on Mobile Information and Communication Systems (NCCR-MICS), a center supported by the Swiss National Science Foundation under grant number 5005-67322.
Further, the algorithm deals with logical connectivity (knowledge) and is disentangled from the underlying network/physical connectivity. Consequently, the propagation of messages in the physical network, and thus the implementation of applications, is an orthogonal issue and may employ any point-to-point/multicast/ad-hoc communication mechanism (for example, [4, 13]). Though this work was initially motivated by P-Grid, the algorithm is generic and can be applied in other systems too. Our modifications to existing message flooding algorithms and other results may as well be applied to other search/update algorithms or broadcast/multicast schemes which employ flooding.

Another significant contribution of this paper is an analytical model of the gossiping algorithm, unlike most of the literature, which relies on simulation results. Since our algorithm is generic, as argued above, the analytical model is valid for many of the other variants of flooding algorithms, and so are the results of our analysis.
2. Motivation and problem statement
Various global storage systems have been proposed, for example, Freenet [7], OceanStore [24], Pastry [26], and Farsite [5]. Their main goal is to provide distributed storage that scales to very large numbers of users and data sets. Additionally, they may exhibit certain specialisations that stem from their intended application domains. For example, Freenet wants to support free speech and anonymity on the Internet, whereas OceanStore focuses on distributed archival storage, which requires special system support.

From the viewpoint of data management these systems should address two critical areas:
1. Efficient, scalable data access, which is provided more or less by all approaches, and
2. Updates to the data stored, especially with respect to replication and low online probabilities.
Many of the access schemes are based on some mechanism that associates peers logically with a partition of the search space by means of a distributed, scalable index structure (P-Grid, OceanStore) and use replication to improve responsiveness and fault-tolerance. Some of the systems support updates. For example, OceanStore uses classical schemes for updating replicas and assumes high availability of servers, whereas in the systems we envision the peers are fairly unreliable. Our assumptions are:
- Peers have low online probabilities and quorums cannot be assumed.
- Eventual consistency is sufficient.
- Since we do not target database systems, update conflicts are rare and their resolution is not necessary in general.
- Probabilistic success guarantees for search are sufficient.
- Consecutive updates are distributed sparsely.
- The required communication overhead is the critical measure for the performance of the approach.
- The typical number of replicas is substantially higher than assumed normally for distributed databases but substantially lower than the total network size.
- The connectivity among replicas is high and the connectivity graph is random.
The last two points require further discussion: We may expect a few hundred to a thousand replicas to be maintained. An intuitive explanation for such numbers is that if we need a 99.9% success guarantee for a search and only 10% of the replicas are online on average, then a serial search will need about 65 attempts (since $(1-0.1)^{65} \approx 0.001$). This does not account for load balancing among the available replicas, which may account for another factor of 5 to 10 times of replication, particularly for "hot" items. Statistics from some of the music file sharing systems show that on average 200 to 250 replicas of the same files are available, not counting the replicas that are not shared [30]. This requires devising a scheme which scales at least beyond the hundreds to thousands.
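For reference, the 65-attempt figure follows from the standard calculation below (the symbols $q$, $\epsilon$ and $k$ are ours, not the paper's): with per-attempt success probability $q$ and target failure probability $\epsilon$, the number of independent serial attempts $k$ must satisfy

$(1-q)^{k} \le \epsilon \iff k \ge \frac{\ln \epsilon}{\ln (1-q)}, \qquad \text{so for } q = 0.1,\ \epsilon = 10^{-3}:\quad k \ge \frac{\ln 10^{-3}}{\ln 0.9} \approx 65.6,$

i.e. roughly the 65 attempts quoted above.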
We assume that the replicas within a logical partition of the data space are connected among each other and each replica knows a minimal fraction of the complete set of replicas. If not enough replicas are known, they can be efficiently obtained by randomized search. Additionally, replicas become known through the update mechanism discussed in this paper. Even when peers potentially know a large fraction of the complete replica population, the use of rumor spreading bears a number of advantages as compared to immediately contacting the complete neighborhood for updates: better load balancing, reduced delay due to parallel propagation, improved robustness against changes in the peer network, and the fact that only partial knowledge of the neighborhood is required.
3. System model
In the analysis of the update algorithm we focus on the amount of communication required to achieve quasi-consistency and provide probabilistic guarantees for successful and appropriate results for queries. As observed in [22], we assume a very low rate of conflicts. Indeed, many applications, for example music file sharing or news dissemination, have such a profile where, if data is altered, it may be treated as distinct and coexist as different versions. Similarly, deletions may use conventional tombstones and death certificates. These issues are relatively orthogonal to the communication mechanism used to convey the updates among the replicas. Further, in a decentralised system such as P-Grid, the "data" may indeed be knowledge regarding the system's topology, for example the routing tables used in P-Grid [1]. Most of these systems operate with a relatively high degree of imperfect knowledge, which is why a probabilistic guarantee of information dissemination is sufficient in such application scenarios.
Our system assumes an infrastructure-less peer-to-peer system, i.e., all peers are equal and no specialised infrastructure, e.g., a hierarchy, exists. No peer has a global view of the system; each bases its behaviour on local knowledge, i.e., its routing tables, replica list, etc. The peers can go offline at any time according to a random process that models peers' online behaviour. Physical connectivity and topology are ignored. We assume that if two peers are online, a communication channel may be established between them. This is not a serious relaxation, since if two peers cannot communicate with each other, they will simply perceive each other to be offline. It is primarily the erratic behaviour of online availability and the complete lack of global knowledge, as well as the absence of any centralisation, and thus the need for self-organisation, which prompts us to call this environment unreliable. The limited resources, particularly bandwidth (and power in wireless/mobile environments), and the varying degree of tolerance for latency make the environment even more challenging.
Our update propagation scheme has a push phase and a pull phase which are consecutive but may overlap in time. A new update is pushed by the initiator to a subset of responsible peers it knows, which in turn propagate it to responsible peers they know, similar to a constrained flooding scheme. By "responsible" we denote peers that are affected by the update because they hold the original version of the data item. In our analysis of the push phase in the next section we assume a synchronous model, which is a standard model for analysing epidemic algorithms [18].
Peers that have been disconnected (offline, disruption of communication) and get connected again, peers that do not receive updates for a long time (locally determined), or peers that receive a pull request but are not sure to have the latest update, enter the pull phase to synchronise and reconcile. The pull scheme is similar to anti-entropy [9], in the sense that the pulling party tries to synchronise itself with the pulled party. Since the pulled party itself may be out of sync, it is preferable to contact multiple peers and choose the most up-to-date peer(s) among them.

Push phase of the update algorithm: When a peer $p$ receives an update request $Push(U, V, R_f, t)$ from a peer $p'$, where $U$ is the updated data item, $V$ its version¹, and $t$ is a counter which counts the number of push rounds that have already been executed for the update, it also receives the partial (flooding) list $R_f$ of peers to which the same update has already been sent (though not necessarily received by all peers in $R_f$). Then $p$ chooses a random set $R_p$ of its replicating peers and forwards the request $Push(U, V, R_f \cup R_p, t+1)$ with a probability $P_F(t)$ to the set $R_p \setminus R_f$. $P_F(t)$ can be any function and is a self-tuning parameter, determined locally by $p$. Another benefit of $R_f$ is that $p$ possibly discovers replicas previously unknown to it.
/* At replica 'p' (push phase) */
IF received(Push(U, V, R_f, t)) THEN
    /* process the update only if it has not been processed yet */
    IF ProcessedUpdate(U, V, R_f, t) == FALSE THEN
        Select a random subset R_p of replicas with |R_p| = R * f_r;
        With probability P_F(t):
            Push(U, V, R_f union R_p, t+1) to R_p \ R_f;
            /* P_F(t): deterministic or self-tuning function */
        ProcessedUpdate(U, V, R_f, t) = TRUE
Since any replica pushes the update at most once, the termination decision is trivial, and the number of push rounds gives the latency of propagating the update to all online replicas with high probability (arbitrarily close to 1).
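To make the push step concrete, the following is a minimal, self-contained C sketch of the pseudocode above. The message layout, the bitmap representation of replica lists, the constants and the helper names (push_msg, pf, send_push, on_push) are our own illustrative assumptions, not the paper's implementation.

/* Illustrative C sketch of one push step, mirroring the pseudocode above. */
#include <stdbool.h>
#include <stdio.h>
#include <stdlib.h>

#define R   200      /* total number of replicas (example value) */
#define F_R 0.05     /* fanout fraction f_r (example value)      */

typedef struct {
    int  item;           /* updated data item U (reduced to an id)           */
    int  version;        /* version V (a scalar here; a vector in the paper) */
    bool flood_list[R];  /* partial flooding list R_f as a bitmap            */
    int  round;          /* push round counter t                             */
} push_msg;

static bool processed[R][1024];   /* ProcessedUpdate(U, V), per replica */

/* Forwarding probability P_F(t); a simple decreasing example. */
static double pf(int t) { return t < 5 ? 1.0 : 0.5; }

/* Stand-in for the network send. */
static void send_push(int from, int to, const push_msg *m) {
    printf("replica %d -> replica %d (round %d)\n", from, to, m->round);
}

/* One push step at replica 'me' upon receiving message 'in'. */
void on_push(int me, const push_msg *in) {
    int v = in->version % 1024;              /* bounded index for the sketch */
    if (processed[me][v]) return;            /* already processed            */

    push_msg out = *in;
    out.round = in->round + 1;

    bool rp[R] = { false };                  /* random subset R_p            */
    for (int k = 0; k < (int)(R * F_R); k++)
        rp[rand() % R] = true;

    for (int i = 0; i < R; i++)              /* outgoing list: R_f union R_p */
        if (rp[i]) out.flood_list[i] = true;

    if ((double)rand() / RAND_MAX < pf(in->round))
        for (int i = 0; i < R; i++)
            if (rp[i] && !in->flood_list[i] && i != me)   /* R_p \ R_f       */
                send_push(me, i, &out);

    processed[me][v] = true;
}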
Pull phase of the update algorithm: When a peer gets connected again because it was offline or suffered from a communication disruption, received no update for some time, or receives a pull request but is not sure whether it is in sync, then it enters the pull phase and inquires for missed updates.

/* At replica 'p' (pull phase) */
IF online_again OR no_updates_since(t) OR
   (received_pull AND not_confident) THEN
    /* not_confident is true: no update received within time T */
    Contact online replicas;
    Inquire for missed updates based on version vectors;
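The paper leaves the reconciliation step abstract ("inquire for missed updates based on version vectors"). A minimal sketch of one possible realisation is given below, treating version identifiers as opaque values as in the footnote of Section 4.1; the names knows, reconcile and fetch_update are illustrative assumptions, not part of the paper.

/* Illustrative sketch (not from the paper) of the pull-phase inquiry:
   the puller compares the version identifiers it knows against those
   reported by a contacted online replica and fetches what it misses.
   fetch_update stands for the actual data transfer. */
#include <stdbool.h>
#include <stddef.h>

typedef unsigned long version_id;   /* opaque identifier (e.g. a hash) */

static bool knows(const version_id *known, size_t n, version_id v) {
    for (size_t i = 0; i < n; i++)
        if (known[i] == v) return true;
    return false;
}

void reconcile(const version_id *known, size_t n_known,
               const version_id *reported, size_t n_reported,
               void (*fetch_update)(version_id)) {
    for (size_t i = 0; i < n_reported; i++)
        if (!knows(known, n_known, reported[i]))
            fetch_update(reported[i]);
}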
4. Analysis
4.1. Setup and notation for the analysis
The goal of our update algorithm is not to achieve complete consistency but rather to know the probability of a correct answer given certain model parameters. We assume that every peer knows a subset of all replicas that replicate the same data. We consider the replica network to be a small P2P network itself, but with no internal structuring. It handles updates/requests for a partition of the data space.
In the analysis we start from a completely consistent state, analyse a single update request, and evaluate the number of messages and time (rounds of message exchange) required to reach a consistent state again. Since most of the replicas are offline most of the time, our notion of consistent state is more related to the online population $O_n(t_0)$ at a given time $t_0$ rather than to the whole set of $R$ replicas. Though our analysis is generic, we evaluate the algorithm for realistic scenarios: the availability of the peers is a random process with an expected value of being online between 10% and 30%. The replication factor is assumed to be between 100 and 1000, and though scalability is not a major issue for such small numbers, larger replication factors have also been investigated. Table 1 shows the notation used in our analysis.

¹ This actually is a vector of version identifiers of the form $(version_1, version_2, \ldots, version_k)$. Version identifiers are universally unique identifiers computed locally by applying a cryptographically secure hash function to the concatenated values of the current date and time, the current IP address, and a large random number.
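As an illustration of the footnote, a version identifier could be generated as sketched below. The helper name make_version_id and the choice of SHA-1 via OpenSSL are our own assumptions; the paper only requires a cryptographically secure hash of the date and time, the IP address and a random number.

/* Sketch of the version-identifier construction described in the footnote. */
#include <openssl/sha.h>
#include <stdio.h>
#include <stdlib.h>
#include <string.h>
#include <time.h>

/* Hash the concatenation of the current date/time, the peer's IP address
   and a random number into a (statistically) unique identifier. */
void make_version_id(const char *ip, unsigned char digest[SHA_DIGEST_LENGTH]) {
    char buf[128];
    snprintf(buf, sizeof(buf), "%ld|%s|%d", (long)time(NULL), ip, rand());
    SHA1((const unsigned char *)buf, strlen(buf), digest);
}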
When an update $U$ is initiated for a set of replicas with cardinality $R$, in general the online population in push round $t$ will be

$O_n(t) = O_n(t-1)\,\alpha + (R - O_n(t-1))\,\beta$

where $\alpha = 1 - p_{off}$ and $\beta = p_{on}$, and $O_n(0) = O_n(t_0)$ if the update starts at time $t_0$. Here $p_{off}$ is the probability of an online peer going offline in one push round and $p_{on}$ is the probability of an offline peer coming online in a push round. These values are typically small and may vary in different push rounds. For the sake of simplification, we will initially ignore the effect of replicas coming online, and will further assume a constant $\alpha$; hence we have $O_n(t) = O_n(t-1)\,\alpha$. Neglecting the effect of a positive $p_{on}$ is justified because peers coming online need to execute a pull anyway, and thus do not contribute to the push phase. Thus, even if some peers come online during a push phase and receive the update through a push, it will not make any major difference to the whole system's behaviour or to its analysis. The assumption of a very small $p_{off}$ is justified because a single push round takes a very small time (the network delay for a single message), and unless there is some kind of catastrophic failure, only a very small number of peers will suddenly decide to go offline. Further, we choose a discrete time model for the rumor spreading algorithm, just like most other rumor algorithms. This in itself does not mean that we need synchronous rounds. It is indeed possible that, because of variation in network latency, messages of different push rounds live in the network at the same instant of time. Thus, instead of treating $t$ strictly as time, it needs to be interpreted as the round number, and the replicas which get infected by that round are effectively replicas that eventually get the update from this round. Thus $t$ does not in itself necessarily define an ordered chronology of receiving updates among all peers in the system.
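Under the simplifying assumptions above (constant $\alpha$, peers coming online ignored), the recursion for the online population unrolls to the closed form

$O_n(t) = O_n(0)\,\alpha^{t},$

a consequence stated here for convenience when evaluating the round-by-round expressions below.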
Typically the parameters, such as $f_r$, $\alpha$, $P_F$ and $O_n$, may vary over time. But for the purpose of the analysis we may assume that they remain constant throughout a single update push phase. In Section 6 we will give some indication of how the parameters can adapt over time to varying network topologies.
The choice of two parameters, $P_F$ (the probability of forwarding an update) and $f_r$ (the fraction of total replicas to which peers initially decide to forward an update), rather than a single parameter which couples both of them together, is because we wanted to study the effects of both these factors in limited flooding algorithms. For example, a protocol like Gnutella [8] uses flooding with a fixed fanout but has no notion of $P_F$. Actually, its use of a TTL effectively means that $P_F$ is 1 for TTL rounds and 0 after that. Some other systems, for example one using gossip for ad-hoc routing [13], on the other hand use the probability of forwarding rumors as a design parameter. In order for our analysis to be general enough, such that all these variations of limited flooding can be reduced to special cases of our model, we included the notion of both a fanout and a probability of forwarding.

  $R$                    : Cardinality of the set of replicas
  $t$                    : Number of the push round for a particular update
  $U$                    : The update message or its size (notation depends on the context)
  $M_s(t)$               : Size of messages in round $t$
  $l(t)$                 : Normalised size of the partial list of replicas which have the update in round $t$; equal to the number of entries in the list divided by $R$
  $O_n(t)$               : Number of replicas online in round $t$
  $\alpha$               : Probability that a peer stays online in the next push round
  $f_r$                  : Fraction of replicas to which peers initially decide to forward the update message
  $new_{upd}(t)$         : Number of new replicas receiving the update in round $t$
  $M(t)$                 : Number of messages in round $t$, including messages to offline replicas
  $\delta f_{aware}(t)$  : Increment in the fraction of online replicas which are aware of the update after round $t$
  $f_{aware}(t)$         : Total fraction of online replicas which are aware of the update at the beginning of round $t$
  $P_F(t)$               : Probability that a peer pushes an update in round $t$ if it received it in round $t-1$
  $c$                    : Size of the data required to describe one replica (e.g., 10 bytes)
Table 1. Notation used in the analysis
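Stated in our notation (a restatement of the examples above, not an addition from the cited papers), these special cases correspond to

$P_F(t) = \begin{cases} 1, & t \le \mathrm{TTL} \\ 0, & t > \mathrm{TTL} \end{cases} \quad \text{(Gnutella-style flooding with fixed fanout } R f_r\text{)}, \qquad P_F(t) \equiv p \quad \text{(gossip with a fixed forwarding probability).}$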
4.2. Analysis of the push phase
Round 0
The replica initiating the update propagation sends $U$ to an $f_r$ fraction of the replicas. Thus we obtain a total number of messages $M(0) = R f_r$ (including messages to offline replicas). The number of new replicas which receive the update is $new_{upd}(0) = O_n(0)\,f_r$. The number of online replicas without the update is $O_n(0)(1 - f_r)$. The message length in this round is $M_s(0) = U + c\,R f_r$.
Round 1
Assuming message flooding, where every replica which received an update message decides with probability $P_F(1)$ to forward it to $R f_r$ replicas, we have:

$M(1) = O_n(0)\,\alpha\,P_F(1)\,R f_r^{2}\,(1 - f_r)$

The expression may be explained as follows. $O_n(0) f_r$ of the online population received the update in the previous round, a fraction $\alpha$ of these replicas continue to stay online in the present round, and a $P_F(1)$ fraction of these replicas decide to forward the message. Each of the $O_n(0)\,f_r\,\alpha\,P_F(1)$ peers that decide to push the update forwards it to $R(f_r - f_r^{2})$ replicas, since it knows that the update has already been sent to an expected $R f_r^{2}$ of its $R f_r$ randomly chosen replicas. Actually, in case a replica receives the update information from more than one replica, it can use the list of 'updated replicas' in each of those messages, and hence the number of messages can be trimmed further, at an additional computational cost.

$new_{upd}(1) = O_n(0)\,\alpha\,(1 - f_r)\left[1 - (1 - f_r)^{O_n(0)\,f_r\,\alpha\,P_F(1)}\right]$

The expression may be explained as follows: Of the $O_n(0)\,\alpha\,(1 - f_r)$ uninformed online peers, a fraction $(1 - f_r)^{O_n(0)\,f_r\,\alpha\,P_F(1)}$ continue to stay uninformed when each of the $O_n(0)\,f_r\,\alpha\,P_F(1)$ informed peers forwards (pushes) to an $f_r$ fraction of random peers. The others receive the update after this round. For the message length we have:

$M_s(1) = U + c\left(R f_r + R f_r(1 - f_r)\right) = U + c\,R\left(1 - (1 - f_r)^{2}\right)$
Round $t \ge 2$
The results may be generalised as follows:

$new_{upd}(t) = O_n(t-1)\left(1 - f_{aware}(t-1)\right)\alpha\left[1 - (1 - f_r)^{O_n(t-1)\,\delta f_{aware}(t-1)\,\alpha\,P_F(t)}\right]$

Thus we obtain the fractions $\delta f_{aware}(t)$ and $f_{aware}(t)$ as:

$\delta f_{aware}(t) = \left(1 - f_{aware}(t)\right)\left[1 - (1 - f_r)^{O_n(t-1)\,\delta f_{aware}(t-1)\,\alpha\,P_F(t)}\right]$

Then,

$f_{aware}(t) = f_{aware}(t-1) + \delta f_{aware}(t-1) = 1 - \left(1 - f_{aware}(t-1)\right)(1 - f_r)^{O_n(t-2)\,\delta f_{aware}(t-2)\,\alpha\,P_F(t-1)}$

Note that this is a recursive relationship and $f_{aware}$ rapidly grows to 1. The expression for $f_{aware}$ may exceed the value of 1, but that has no physical relevance, and thus the value needs to be capped at 1, and $\delta f_{aware}$ too needs to be re-evaluated accordingly in the final push round. Also note that $P_F(t)$ can be any arbitrary function of $t$, which individual nodes can define in an ad-hoc manner, and we will see that this will be a self-tuning parameter for our push phase algorithm.
It is subtle to determine the number of messages and the length of these messages. If the partial list of replicas to which the update has already been transmitted along with the update information $U$ is ignored, we have

$M(t) = O_n(t-1)\,\delta f_{aware}(t-1)\,\alpha\,P_F(t)\,R f_r$

since each of the $O_n(t-1)\,\delta f_{aware}(t-1)\,\alpha\,P_F(t)$ replicas (these replicas received the update in the previous round, continued to stay online, and decided to forward it) forwards the update to $R f_r$ replicas. If the partial list of replicas is accounted for, then the number of messages decreases to

$M(t) = O_n(t-1)\,\delta f_{aware}(t-1)\,\alpha\,P_F(t)\,R f_r\,(1 - f_r)^{t}$

and the length of each message in round $t$ is given as

$M_s(t) = U + c\,R\left(1 - (1 - f_r)^{t+1}\right)$
We now prove the two equations above by induction. Let the normalised length of the partial list of replicas in a message be denoted by $l(t)$. The normalised length of the partial list is the fraction of the total replicas that the partial list contains. Then $M_s(t) = U + c\,R\,l(t)$.

Induction hypothesis: $l(t) = 1 - (1 - f_r)^{t+1}$

Now $l(t+1) = f_r + l(t) - f_r\,l(t)$, since the $R f_r$ replicas chosen randomly are independent of the replicas in the partial list. Now, if our hypothesis is true, then

$l(t+1) = f_r + 1 - (1 - f_r)^{t+1} - f_r\left(1 - (1 - f_r)^{t+1}\right) = 1 - (1 - f_r)^{t+1} + f_r(1 - f_r)^{t+1} = 1 - (1 - f_r)^{t+2}$

Thus the hypothesis is consistent. Since the hypothesis is true for $t = 0$, using induction we conclude that $l(t) = 1 - (1 - f_r)^{t+1}$, where $M_s(t) = U + c\,R\,l(t)$. Thus,

$M(t) = O_n(t-1)\,\delta f_{aware}(t-1)\,\alpha\,P_F(t)\,R f_r\left(1 - l(t-1)\right) = O_n(t-1)\,\delta f_{aware}(t-1)\,\alpha\,P_F(t)\,R f_r\,(1 - f_r)^{t}$
As may be observed, $l(t)$ increases with the round number, and a legitimate question to ask is its effect on the resources (memory/CPU/bandwidth/power) available at each of the replicas. A way to deal with the increasing $l(t)$ may be to choose a normalised threshold length $l_{thr}(t)$ such that $l(t) = \min\left(l_{thr}(t),\, l'(t)\right)$ where $l'(t) = l(t-1) + f_r - l(t-1)\,f_r$. This can be achieved by discarding either random entries or the head or tail of the partial list. In this case,

$M(t+1) = O_n(t)\,\delta f_{aware}(t)\,\alpha\,P_F(t+1)\,R f_r\left(1 - l_{thr}(t)\right)$

$\delta f_{aware}$ and $f_{aware}$ stay unchanged, since the extra messages generated by reducing $l(t)$ are all duplicate messages. Thus the nodes which push the update in the next round pay the penalty of forwarding extra messages. Note that the case where $l_{thr}(t)$ is zero for all replicas corresponds to the case where no list is propagated, and will increase the number of duplicate messages without any improvement in the coverage of unreached replicas.
4.3. Analysis of the pull phase
If a replica $p$ comes online at a random time (after the push phase is over), then it will (very likely) find the update information at any of its online replicas. The underlying assumption for such optimism is that any replica that came online in the meantime must have pulled the update information by the time the concerned peer $p$ came online. This justifies the eagerness of the update pull algorithm. What is more interesting is what happens if $p$ comes online while the push of an update is underway. If an $f_{aware}$ fraction of the online replicas $O_n$ are already aware of the update, the probability of a replica $p$ getting the update in $k$ pull attempts is

$1 - \left(1 - \frac{O_n\,f_{aware}}{R}\right)^{k}$

which implies that a constant number of pull attempts should give the update information with high probability. Since updates are propagated by push as well, the above term gives a worst-case estimate. Indeed, if a $\delta f_{aware}$ fraction of the online replicas (refer to the push phase analysis) received the update in the previous push round ($t-1$), then (if they continue pushing) the probability of getting a push is

$1 - \left(1 - f_r\left(1 - l(t)\right)\right)^{O_n(t-1)\,\delta f_{aware}(t-1)\,\alpha\,P_F(t)}$
4.4. Query (request)
Servicing requests under (possibly relatively frequent) updates is similar to the pull phase of updates. For simple servicing of requests, we may indeed use the same analysis as in the pull section. Since requests are more sensitive (updates can be lazy, and strong consistency is not our goal, but we intend to return a correct and most recent result for any query), we may define some majority logic, or use a versioning scheme for identifying the latest updates, or a hybrid of the two.
5. Analytical results
Based on the analytical model developed in the previous section, we investigated the performance of the push phase of the propagation of a single update for various environmental parameters. For the evaluation of the recursive analytical functions a C program has been developed.
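The paper does not reproduce that program; the following is a minimal sketch of how the recursion of Section 4.2 can be evaluated. The parameter values, the constant choice of $P_F$, and the stopping rule are illustrative assumptions, not results from the paper.

/* Minimal evaluator for the recursive push-phase model of Section 4.2.
   The symbols follow Table 1. */
#include <math.h>
#include <stdio.h>

int main(void) {
    const double R     = 500.0;  /* number of replicas                      */
    const double f_r   = 0.05;   /* initial forwarding fraction f_r         */
    const double alpha = 0.99;   /* probability of staying online per round */
    const double PF    = 0.8;    /* forwarding probability P_F (constant)   */

    double On     = 0.2 * R;     /* O_n(0): 20% of the replicas online      */
    double df     = f_r;         /* delta f_aware(0): fraction of online
                                    peers newly informed in round 0         */
    double faware = f_r;         /* f_aware(1)                              */
    double msgs   = R * f_r;     /* M(0)                                    */

    printf("t=0  M=%.1f  f_aware=%.3f\n", msgs, faware);

    for (int t = 1; t <= 20 && faware < 0.999; t++) {
        double On_prev = On;
        On = On_prev * alpha;                    /* O_n(t) = O_n(t-1)*alpha  */

        /* replicas that got the update in round t-1, stayed online and
           decided to push in round t */
        double pushers = On_prev * df * alpha * PF;

        /* M(t) with the partial list accounted for */
        double M = pushers * R * f_r * pow(1.0 - f_r, t);

        /* delta f_aware(t): newly informed fraction of the online peers */
        double df_new = (1.0 - faware) * (1.0 - pow(1.0 - f_r, pushers));
        if (faware + df_new > 1.0) df_new = 1.0 - faware;   /* cap at 1     */

        msgs   += M;
        faware += df_new;
        df      = df_new;

        printf("t=%d  M=%.1f  f_aware=%.3f  total=%.1f\n", t, M, faware, msgs);
    }
    return 0;
}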
Our performance criterion for this analysis is primarily the number of messages that are generated as part of a single update, compared to the extent to which the update propagates among the online population. As a simplifying (and, for fixed networks, realistic) assumption we ignore message size, as single messages can accommodate the messages of maximal size that can occur in our setting.

In the following result plots (e.g., Fig. 1) we will show on the y-axis the number of messages generated per member of the initial online population. As assumed in the previous section, peers coming online are not participating in the push phase.

References (partial list)

- Pastry: Scalable, Decentralized Object Location, and Routing for Large-Scale Peer-to-Peer Systems
- Epidemic algorithms for replicated database maintenance
- Freenet: a distributed anonymous information storage and retrieval system
- The dangers of replication and a solution
- Managing update conflicts in Bayou, a weakly connected replicated storage system
Frequently Asked Questions (19)
Q1. What contributions have the authors mentioned in the paper "Updates in highly unreliable, replicated peer-to-peer systems"?

This paper studies the problem of updates in decentralised and self-organising P2P systems in which peers have low online probabilities and only local knowledge. The update strategy the authors propose for this environment is based on a hybrid push/pull rumor spreading algorithm and provides a fully decentralised, efficient and robust communication scheme which offers probabilistic guarantees rather than ensuring strict consistency. The authors describe a generic analytical model to investigate the utility of their hybrid update propagation scheme from the perspective of communication overhead. 

In this paper, the authors have used heuristics to find proper parameters, but they plan to explore the possibility of both feedback and feed-forward to evolve a proper mechanism of parameter tuning using local knowledge. To verify the correctness of the analysis if some of the simplifying assumptions are relaxed, the authors plan to use simulations, which will also help them investigate whether there is bimodal behavior [4, 13] even in the assumed environment of very low peer presence. The authors plan to use their P-Grid peer-to-peer system as a testbed for the implementation and practical tests of the algorithm. In such a scenario a relatively reliable network backbone would exist and thus would make possible further performance improvements.

The novelty of their work lies in the use of push/pull in the context of replicas being offline long and frequently, and in the significant reduction of message overhead in the push phase.

The number of duplicate messages received by a replica also provides an essential, locally available metric that may be utilised to tune the parameters $P_F(t)$ and $f_r$.

Another major advantage of the algorithm is that it is totally decentralised and uses no global knowledge but exploits local knowledge instead. 

Other economic paradigms to maintain distributed data replicas include [15, 16, 21] where a primary copy model is used to provide one-copy serializability. 

Tuning the push phase may not only be done through feedback mechanisms (to determine when to stop pushing), but also by a speculative (feed-forward) mechanism. 

The first category is defined by whether nodes use feedback from other nodes (for example, whether they already know the rumor or not) and thus decide on their future course, or not (generally called “blind” then). 

This strategy will only be effective for short time intervals, since over a period of time every replica is expected to be online according to the same random distribution.

Fig. 4 indicates that the best strategy is to reduce the probability of forwarding updates with the increase in number of push rounds, which eliminates many unnecessary messages.