Proceedings Article•DOI•

On the Trade-Off between Relationship Anonymity and Communication Overhead in Anonymity Networks

Ognjen Vuković, Gyorgy Dan, Gunnar Karlsson

05 Jun 2011-pp 1-6

TL;DR: The results show that, contrary to expectations, increased overhead does not always improve anonymity and the proposed anonymity network, Minstrels, achieves close to optimal anonymity under certain conditions.

read less

Abstract: Motivated by applications in industrial communication networks, in this paper we consider the trade-off between relationship anonymity and communication overhead in anonymity networks. We consider two anonymity networks; Crowds that provides unbounded communication delay and Minstrels, proposed in this paper, that provides bounded communication delay. While Crowds hides the sender's identity only, Minstrels aims to hide the receiver's identity as well. However, to achieve bounded message delay it has to expose the sender's identity to a greater extent than Crowds. We derive exact and approximate analytical expressions for the relationship anonymity for these systems. While Minstrels achieves close to optimal anonymity under certain conditions, our results show that, contrary to expectations, increased overhead does not always improve anonymity.

...read moreread less

Summary (1 min read)

Jump to: [Introduction] – [III. MINSTRELS SYSTEM DESCRIPTION] – [B. Relationship Anonymity Against Inside Attackers] and [V. NUMERICAL RESULTS]

Introduction

Many communication systems, for example modern industrial networks [1], [2], require high availability between a fixed set of nodes on a pairwise basis.
Due to the often long life-cycles of industrial systems software corruption is a threat, and the complexity of the code-base makes corruption hard to detect.
Corrupted nodes that are part of the mix network can perform inside attacks to determine the senderreceiver pair for messages that are relayed through them.
Anonymity networks can provide some level of relationship anonymity against inside attackers (e.g., [5], [6]) by hiding the sender or the receiver from the relay nodes.

III. MINSTRELS SYSTEM DESCRIPTION

Minstrels, described below, uses nodes as message relays in the same way as Crowds [6] with the difference that the number of nodes visited by a message is bounded.
The message, or part of it, is encrypted with the receiver’s public key.
These initialized nodes are considered as visited so that the message can not be relayed to them.
Fig. 1 shows another case when the list is initialized with the sender and node C, and the message is forwarded to node B. Node B adds itself to the list and decides to which of the remaining nodes (D,E) to forward the message.

B. Relationship Anonymity Against Inside Attackers

The authors consider attackers without any a priori knowledge of the system traffic matrix.
For a given attacker on the path, P(I|H1+) is the probability that the attacker’s predecessor is the sender.
Let us now turn to the calculation of the probabilities that the attacker correctly identifies the sender-receiver pair (s,r) used in (7).
The attacker can receive a message with only one node in the list of visited nodes (||L ||= 1), in which case the node in the list is the predecessor.

V. NUMERICAL RESULTS

In the following the authors use the analytical models described above to get insight into the overhead-anonymity trade-off.
Hence, for C = 3 the probability that the attacker can assign to the sender decreases faster than the probability P(H1+) of having an attacker on the path increases.
Figs. 2, 3, 4, and 5 also show the lower bounds for the probabilities Prel(s,r) for Crowds and for Minstrels.
In general, the best possible relationship anonymity might not be provided by the highest allowable overhead.

Did you find this useful? Give us your feedback

Figures (8)

TABLE IV P(Ωr,Ωs, ||L ||> 1,MC > 0,H1+|S(s),R(r))

TABLE II P(Ωr,Ωs, ||L ||= 1,MC = 0,H1+|S(s),R(r))

Fig. 2. Relationship anonymity vs. overhead for N = 10, C = 1 2 3 4 5 6 7 8 9 10 0

TABLE V P(Ωr,Ωs, ||L ||= 0,MC = 0,H1+|S(a),R(b))

TABLE VI P(Ωr,Ωs, ||L ||= 1,MC = 0,H1+|S(a),R(b))

Fig. 1. A simple example of Minstrels with five nodes.

TABLE I P(Ωr,Ωs, ||L ||= 0,MC = 0,H1+|S(s),R(r))

TABLE III P(Ωr,Ωs, ||L ||> 1,MC = 0,H1+|S(s),R(r))

Content maybe subject to copyright Report

including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or

lists, or reuse of any copyrighted component of this work in other works. The deﬁnitive version of this paper is published in Proc. of IEEE ICC, Jun 2011.

On the Trade-off between Relationship Anonymity

and Communication Overhead in Anonymity

Networks

Ognjen Vukovi

School of Electrical Engineering

KTH, Royal Institute of Technology,

Stockholm, Sweden

Email: vukovic@ee.kth.se

orgy D

School of Electrical Engineering

KTH, Royal Institute of Technology,

Stockholm, Sweden

Email: gyuri@ee.kth.se

Gunnar Karlsson

School of Electrical Engineering

KTH, Royal Institute of Technology,

Stockholm, Sweden

Email: gk@kth.se

Abstract—Motivated by protection and privacy in industrial

communication networks, in this paper we consider the trade-

off between relationship anonymity and communication over-

head. We consider two anonymity networks: Crowds, which

has unbounded communication delay and Minstrels, proposed in

this paper, which provides bounded communication delay. While

Crowds hides the sender’s identity only, Minstrels aims at hiding

the receiver’s identity as well. However, to achieve bounded

communication delay it has to expose the sender’s identity to

a greater extent than Crowds. We derive exact and approximate

analytical expressions for the relationship anonymity for these

systems. While Minstrels achieves close to optimal anonymity

under certain conditions, our results show that, contrary to expec-

tations, increased overhead does not always improve anonymity.

I. INTRODUCTION

Many communication systems, for example modern indus-

trial networks [1], [2], require high availability between a

ﬁxed set of nodes on a pairwise basis. The nodes can be the

subsidiaries of an enterprise connected by a virtual private

network over the public Internet, or they can be sensors,

actuators and operation centres in a wide area industrial control

system, e.g., in a supervisory control and data acquisition

(SCADA) network. Cryptography may provide authentication,

conﬁdentiality and data integrity for the communication, but

source and destination addresses could still be visible to an

outside attacker who is able to observe one or more network

links. The outside attacker may identify trafﬁc patterns: who

is communicating with whom, when and how often. Using

this information the attacker can infer the importance of the

messages, and may perform targeted attacks on the communi-

cation between any two nodes. These targeted attacks might

be hard to detect and can lead to incorrect system operation.

Mix networks [3] are a way to mitigate outside attacks

by providing relationship anonymity, i.e., by making it un-

traceable who communicates with whom [4]. Nodes in a mix

network relay and delay messages such that an outside attacker

cannot trace the route of the individual messages through the

mix. While relaying renders outside attacks more difﬁcult, it

introduces the possibility of inside attacks. Due to the often

long life-cycles of industrial systems software corruption is a

threat, and the complexity of the code-base makes corruption

hard to detect. Corrupted nodes that are part of the mix

network can perform inside attacks to determine the sender-

receiver pair for messages that are relayed through them.

Anonymity networks can provide some level of relationship

anonymity against inside attackers (e.g., [5], [6]) by hiding the

sender or the receiver from the relay nodes. Good sender (or

receiver) anonymity in itself does not necessarily lead to good

relationship anonymity [8], hence we focus on relationship

anonymity in this paper.

The relationship anonymity provided by mix networks and

anonymity networks comes at the price of delay and commu-

nication overhead. Excessive delays can negatively impact the

system performance, while overhead leads to high resource

requirements, so that in practice both have to be kept low.

Our goal in this paper is to investigate the trade-off between

the communication overhead introduced and the level of

relationship anonymity provided by anonymity networks.

Intuition says that increased overhead should result in

increased anonymity. In this paper we show that this is not

necessarily the case. We use two anonymity networks for

our study. First, Crowds, proposed in [6], which hides the

sender by introducing unbounded message delivery delay (it

still exposes the receiver’s identity). Crowds was shown to

provide optimal sender anonymity for given overhead [7], i.e.,

path length. Second, Minstrels, described in this paper, which

provides both sender and receiver anonymity, i.e., relationship

anonymity. Minstrels has bounded message delivery delay. We

do not consider long term intersection attacks, such as [8],

[9], [10], which exploit cases when the sender’s anonymity is

not beyond suspicion, i.e., the sender is distinguishable from

other nodes. These attacks consider that the receiver is outside

the anonymity network, and they exploit the distribution of

message destinations to decrease the relationship anonymity.

In our system the receiver is part of the anonymity network,

and message destinations can have an arbitrary distribution;

but an attacker does not have a-priori knowledge of the trafﬁc

matrix.

The rest of the paper is organized as follows. Section II

describes our system model and the anonymity metrics. Sec-

tion III provides a description of the Minstrels anonymity

network. In Section IV we develop analytical models of the

relationship anonymity provided by Crowds and Minstrels, and

we show numerical results based on the models in Section V.

Section VI concludes the paper.

II. SYSTEM MODEL AND METRICS

We consider an anonymity network with N nodes. The

nodes act as sources, destinations and as relay nodes for each

others’ messages. The underlying communication network is

a complete graph. The inside attacker is in control of C nodes,

and can observe the messages traversing those nodes and

the protocol speciﬁc information contained in the messages.

Its goal is to identify the source and the destination of the

messages that it observes.

We quantify the relationship anonymity by the probability

rel

(s,r) that the attacker assigns to a sender-receiver pair

(s,r) for a message. In general, the relationship anonymity

depends on two factors. First, on the probability of having

an attacker on the path. Second, on the probability that the

attacker assigns to the sender (that it sent the message) and to

the receiver (that it is the destination) when it gets the message.

These probabilities are a function of the anonymity protocol,

the number of nodes N and the number C of inside attackers

rel

(s,r) =

∞

∑

i=1

S(s),

R(r)|H

,S(s),R(r))P(H

|S(s),R(r)),

(1)

where S(s) and R(r) denote the events that the sender is

node s and the receiver is node r, respectively;

S(s) and

R(r) denote the events that the attacker correctly identiﬁes

node s as the sender and node r as the receiver, respectively;

P(H

|S(s),R(r)) is the probability that the position of the ﬁrst

attacker on the path is i given that (s,r) is the sender-receiver

pair, and P(

S(s),

R(r)|H

,S(s),R(r)) is the probability that the

attacker identiﬁes (s,r) as the sender-receiver pair given its

position on the path.

Finally, we deﬁne the overhead of the anonymity network

as the average path length (number of relay hops) E[K] of the

messages.

III. MINSTRELS SYSTEM DESCRIPTION

Minstrels, described below, uses nodes as message relays

in the same way as Crowds [6] with the difference that the

number of nodes visited by a message is bounded.

Consider the system described in Section II. When a node s

wants to send a message to a node r it picks a node uniformly

at random among the other N − 1 nodes (excluding s) and

forwards the message. The next node forwards the message

to one of the other N − 2 nodes (excluding itself and the

sender node s) chosen uniformly at random. Every subsequent

forwarder picks one of the non-visited nodes to forward the

message. When node r receives the message, it will send the

message further in order to improve the receiver anonymity.

The path ends when all N nodes have been visited.

Fig. 1. A simple example of Minstrels with ﬁve nodes.

The message, or part of it, is encrypted with the receiver’s

public key. When a node receives the message, it checks if it

is the receiver by trying to decrypt the encrypted part of the

message. If the decrypted part of the message represents valid

data, the node is the receiver. Note that a node does not know

who is the receiver, it can only check whether it is the receiver

itself (unlike in Crowds).

To bound the path length, the messages record a list of

the visited nodes in the header. The list can be implemented,

for example, using a Bloom ﬁlter, to keep its size small.

When a relaying node receives a message, it will relay the

message only to non-visited nodes. To control the maximum

path length (i.e., delay) the sender can initialize the list of

visited nodes with a number M ∈ {0,...,N −1} of the nodes in

the system. These initialized nodes are considered as visited

so that the message can not be relayed to them. Hence, a

message traverses all nodes except for the initialized nodes in

the list. The sender picks the number of initialized nodes at

random: it initializes the list with M nodes with probability

P(M), where

∑

N−1

M=0

P(M) = 1. For M = 0 the list is empty,

for M = 1 the list is initialized only with the sender and for

M > 1 the list is initialized with the sender and M − 1 other

nodes. The sender must not initialize the list with the receiver.

The distribution of P(M) is a system parameter, and we use it

to explore the anonymity-overhead trade-off. Fig. 1 shows two

simple examples with ﬁve nodes, node A as sender and node D

as receiver. Fig. 1 (left) shows a case when the list is initialized

with the sender node A and the message is forwarded to node

C. Node C checks if it is the receiver, puts itself in the list

and chooses the next hop uniformly at random among nodes

(B,D,E). The next hop, node D, follows the same procedure

with only two forwarding options (B,E). Fig. 1 (right) shows

another case when the list is initialized with the sender and

node C, and the message is forwarded to node B. Node B

adds itself to the list and decides to which of the remaining

nodes (D,E) to forward the message. Node C is considered as

already visited.

IV. OVERHEAD AND ANONYMITY

In the following we derive expressions for the communi-

cation overhead and the anonymity provided against inside

attackers for Crowds and for Minstrels.

A. Communication Overhead

We start with calculating the communication overhead of

Crowds and Minstrels. The mean number of hops for Crowds

is the expected value of a geometric distribution with success

probability 1 − p

, i.e.,

E[K] =

1 − p

+ 2 (2)

where p

is the probability that a node will relay a message.

For Minstrels for a given number M of initialized nodes in the

list the path length is equal to K = N −M. The mean number

of hops depends on the distribution P(M) and can be expressed

E[K] =

N−1

∑

M=0

P(M)(N − M). (3)

B. Relationship Anonymity Against Inside Attackers

We consider attackers without any a priori knowledge of the

system trafﬁc matrix. All nodes are equally likely to be senders

or receivers. The attacker can only decrease the relationship

anonymity by knowing the protocol and by observing trafﬁc

that goes over the nodes it controls. In order to calculate

the relationship anonymity in the following we express the

probabilities in (1) for Crowds and for Minstrels.

1) Crowds: For Crowds the ﬁrst attacker is on position i if

the message is ﬁrst relayed i − 1 times through trusted nodes

but the last hop is an attacker. We denote this event by H

The probability P(H

|S(s),R(r)) can be expressed as

P(H

|S(s),R(r)) = P(H

) = p

i−1



N −C − 1

N − 1



i−1

N − 1

. (4)

Let I denote the event that the ﬁrst attacker on the path is

immediately preceded on the path by the sender. Note that

⇒ I but the opposite is not true since the sender may appear

multiple times on the path. For a given attacker on the path,

P(I|H

) is the probability that the attacker’s predecessor is

the sender. P(

I|H

) is the probability that another node (i.e.,

not the predecessor) is the sender. The probability that the

attacker assigns to the actual sender of the message can be

expressed as

S(s)|H

,S(s),R(r)) = P(I|H

)P(I|H

) + P(

I|H

)P(

I|H

(5)

where P(I|H

) is the probability that for a given position i of an

attacker on the path the sender appears as the predecessor (on

position i − 1). For i = 1 we have P(I|H

) = 1 while for i > 1

we have P(I|H

) = P(I|H

) =

N−C−1

. Intuitively, P(

I|H

) is

the probability that for a given position i of an attacker on the

path, some other node, a relay, appears as the predecessor. For

i = 1 we have P(

I|H

) = 0, while for i > 1 we have P(

I|H

) =

N−C−2

N−C−1

The expression for P(I|H

) is given in [6] for the case

when there are n possible relays (including the sender). Since

in our case there are n = N −1 possible relays the expression

for P(I|H

) becomes P(I|H

) =

N−1−p

(N−C−2)

N−1

. P(

I|H

)

can be expressed as P(

I|H

) =

1−P(I|H

)

N−C−2

The receiver is exposed in Crowds, hence

S(s),

R(r)|H

,S(s),R(r)) = P(

S(s)|H

,S(s),R(r)).

2) Minstrels: For Minstrels we rewrite (1) as

rel

(s,r) = P(

S(s),

R(r)|H

,S(s),R(r))P(H

|S(s),R(r)),

(6)

where P(H

|S(s),R(r)) is the probability of having an

attacker on the path for sender-receiver pair (s,r), and

S(s),

R(r)|H

,S(s),R(r)) is the probability that the attacker

identiﬁes (s, r) as the sender-receiver pair. We consider coor-

dinated attackers that keep track of the received messages, so

that every attacker knows whether a particular message was

already received by an attacker. Hence, when the ﬁrst attacker

on the path gets the message, it knows the number m

attackers that the list of visited nodes was initialized with by

the sender. m

is a realization of the random variable M

whose distribution depends on the value of M.

In Minstrels the probability that the attacker assigns to a

sender-receiver pair does not only depend on the node that

the message is received from, i.e., the predecessor p, but also

on the contents of the list of visited nodes (L ) that the message

carries. Consequently, the attacker distinguishes between three

disjoint sets of nodes: the predecessor node ({p}), nodes in

the list of visited nodes except the predecessor (L \{p}), and

nodes not in the list of visited nodes (L ∪ {p}). These sets

form a partition of the set of all trusted nodes in the system,

and nodes belonging to the same set are equally likely to be

the sender (and the receiver). As a shorthand for the universe

of distinguishable events we use the notation Ω

= {s = p,s ∈

L \ {p}, s ∈ L ∪ {p}}, where, for example, s = p is the event

that the predecessor is the sender. Similarly, we deﬁne Ω

{r = p,r ∈ L \{p},r ∈ L ∪ {p}} for the distinguishable events

regarding the receiver.

Given the information on L , m

, and p available to the

attacker, we can use the law of total probability to expand (6)

conditional on the list length ||L || = l, ω

∈ Ω

, ω

∈ Ω

, and

= m

rel

(s,r) =

∑

S(s),

R(r)|ω

,ω

,l,S(s),R(r)) (7)

·P(ω

,ω

,l|S(s),R(r)). (8)

The summands in (7) are the probabilities that the attacker

correctly identiﬁes the sender-receiver pair of the message that

contains the information (||L || = l, ω

∈ Ω

, ω

∈ Ω

, and

= m

), and are independent of S(s),R(r). Eq. (8) is the

probability that a message with (s, r) as sender-receiver pair is

received by an attacker and carries particular information.

Before we turn to the calculation of the probabil-

ity P(ω

,ω

,l,m

|S(s),R(r)) we introduce the notation

H(l, m

|M) for the joint event ||L|| = l, H

, and M

= m

for a given number of initialized nodes M. Clearly, l ≥ M. The

probability of this event can be expressed as

P(H(l, m

|M)) =

N−1

l = 0,M = 0

P(M

= 0|M)

N−C−1

N−1

N−l

∏

l−1

z=1

N−C−z

N−z

l ≥ 1,M = 0

P(M

= m

|M)

C−m

N−l

∏

l−1

z=M

N−C+m

−z

N−z

l ≥ 1,M > 0,

(9)

TABLE I

P(Ω

,Ω

,||L || = 0,M

= 0, H

|S(s),R(r))

Ω

,Ω

s = p, r ∈ L ∪{p} P(M = 0)P(H(0, 0|M = 0))

where P(M

|M) is the probability that the list of visited nodes

is initialized with M

attacker nodes, given that it is initialized

with M nodes by the sender. Due to the rules of preﬁlling,

∈ {max(0,M −1−(N −2−C)),min(M −1,C)}. For M = 0

and M = 1 there cannot be any initialized attackers, hence

P(M

= 0|M ∈ {0,1}) = 1 and P(M

> 0|M ∈ {0,1}) = 0.

For M > 1 we have

P(M

|M) =



M − 1



∏

M−M

k=2

(N −C − k)

∏

−1

k=0

(C − k)

∏

k=2

(N − k)

. (10)

We now turn to the calculation of the probability

P(ω

,ω

,l,m

|S(s),R(r)), i.e., the probability that the

attacker would receive a particular message sent by s to r. If

the sender is the predecessor (s = p) the receiver cannot be the

predecessor, hence P(r = p, s = p, l, m

|S(s),R(r)) = 0.

For the rest of the cases we show the probabilities in a tabular

form to improve readability.

For ||L || = 0 and ||L|| = 1 there can be no attackers in

the list of visited nodes (when received by the ﬁrst attacker),

because if the sender preﬁlls the list of visited nodes it has

to include itself in the list. Hence, for ||L || = 0 and ||L || = 1

we have M

> 0 with probability 0. Furthermore, for ||L || = 0

the sender must be the predecessor (s = p) and the receiver

cannot be in the list of visited nodes (r ∈ L ∪ {p}), every other

tuple in {(ω

,ω

) : ω

∈ Ω

,ω

∈ Ω

} has probability 0. Table I

shows the corresponding probability, i.e., the probability that

the sender initializes the message with an empty list, and

chooses the attacker as next hop. For ||L || = 1 the sender

and the receiver cannot both be in the list of visited nodes.

Furthermore, if the sender or the receiver is in the list of

visited nodes, it must be the predecessor, hence s ∈ L \{p} and

r ∈ L \{p} have probability 0. Table II shows the probabilities

for the remaining cases for ||L || = 1. As an example, the

second row in the table is the probability that the sender

initializes the list empty, forwards the message to the receiver,

which then forwards the message to the attacker.

For ||L || > 1 there may or may not be attackers in the list of

initialized nodes. Table III shows the probabilities for ||L || > 1

when there are no attackers in the list of initialized nodes

= 0). When there are attackers in the list of initialized

nodes (M

> 0), the sender has to be in the list of visited

nodes. Furthermore, if the sender is the predecessor (s = p)

then the receiver cannot be in the list of visited nodes (r ∈

L \ {p}), because this could only happen if the sender had

preﬁlled the list of visited nodes with the receiver, but then the

receiver would never receive the message. The corresponding

probabilities for ||L|| > 1 and M

> 0 are shown in Table IV.

Let us now turn to the calculation of the probabilities that

TABLE II

P(Ω

,Ω

,||L || = 1,M

= 0, H

|S(s),R(r))

Ω

,Ω

s = p, r ∈ L ∪{p} P(M = 1)P(H(1, 0|M = 1))

s ∈ L ∪ {p}, r = p P(M = 0)P(H(1,0|M = 0))

N−C−1

s ∈ L ∪ {p}, r ∈ L ∪ {p} P(M = 0)P(H(1, 0|M = 0))

N−C−2

N−C−1

TABLE III

P(Ω

,Ω

,||L || > 1,M

= 0, H

|S(s),R(r))

Ω

,Ω

s = p, r ∈ L \{p} P(M = 0)P(H(l,0|M = 0))

l−1

(N−C−1)

s = p, P(M = 0)P(H(l,0|M = 0))

(N−C−l)

(N−C−1)

r ∈ L ∪ {p} +P(M = l)P(H(l,0|M = l))

s ∈ L \ {p}, P(M = 0)P(H(l,0|M = 0))

l−2

(N−C−1)

r = p +

∑

l−1

k=1

P(M = k)P(H(l, 0|M = k))

N−C−k

s ∈ L \ {p}, P(M = 0)P(H(l,0|M = 0))

(l−2)

(N−C−1)

r ∈ L \ {p} +

∑

l−2

k=1

P(M = k)P(H(l, 0|M = k))

l−k−1

N−C−k

s ∈ L \ {p}, P(M = 0)P(H(l,0|M = 0))

(N−C−l)(l−2)

(N−C−1)

r ∈ L ∪ {p} +

∑

l−1

k=1

P(M = k)P(H(l, 0|M = k))

N−C−l

N−C−k

s ∈ L ∪ {p}, r = p P(M = 0)P(H(l,0|M = 0))

(N−C−l)

(N−C−1)

s ∈ L ∪ {p}, r ∈ L \ {p} P(M = 0)P(H(l,0|M = 0))

(l−1)(N−C−l)

(N−C−1)

s ∈ L ∪ {p}, r ∈ L ∪ {p} P(M = 0)P(H(l, 0|M = 0))

(N−C−l)(N−C−l−1)

(N−C−1)

TABLE IV

P(Ω

,Ω

,||L || > 1,M

> 0, H

|S(s),R(r))

Ω

,Ω

s = p, r ∈ L ∪{p} P(M = l)P(H(l,m

|M = l))

s ∈ L \ {p}, r = p

∑

l−1

k=m

P(M = k)P(H(l, m

|M = k))

N−C+m

−k

s ∈ L \ {p},

∑

l−2

k=m

P(M = k)P(H(l, m

|M = k))

l−k−1

N−C+m

−k

r ∈ L \ {p}

s ∈ L \ {p},

∑

l−1

k=m

P(M = k)P(H(l, m

|M = k))

N−C+m

−l

N−C+m

−k

r ∈ L ∪ {p}

the attacker correctly identiﬁes the sender-receiver pair (s, r)

used in (7). Given a message received by an attacker that

contains information (||L || = l, ω

∈ Ω

, ω

∈ Ω

, and M

) the attacker would identify (s,r) as the sender-receiver

pair with probability

R(r),

S(s)|ω

,ω

,l) =

P(ω

,ω

,l,m

|S(s),R(r))· P(R(r)|S(s))· P(S(s))

∑

(a,b)

P(ω

,ω

,l,m

|S(a),R(b)) · P(R(b)|S(a)) · P(S(a))

(11)

where the summation in the denominator is over all possible

non-attacker sender-receiver pairs (a,b). P(S(s)) is the (a pri-

ory) probability that node s sends a message, and P(R(r)|S (s))

is the probability that node s selects node r as the destination

of a message. Since the trafﬁc matrix is homogeneous and

attackers are informed about each other, all trusted nodes are

equally likely to be the sender, P(S(s)) =

N−C

, and any trusted

node (except the sender) is equally likely to be chosen as

the receiver, i.e., with probability P(R(r)|S(s)) =

N−C−1

. The

same observation holds for P(S(a)) and P(R(b)), so that these

probabilities cancel out each other in (11).

We already calculated the numerator of (11), so in or-

der to ﬁnish our calculations we only have to express

TABLE V

P(Ω

,Ω

,||L || = 0,M

= 0, H

|S(a),R(b))

Ω

,Ω

,a,b

s = p, r ∈ L ∪{p}, a = s, ∀b P(M = 0)P(H(0,0|M = 0))

TABLE VI

P(Ω

,Ω

,||L || = 1,M

= 0, H

|S(a),R(b))

Ω

,Ω

,a,b

s = p, r ∈ L ∪{p}, a = s, ∀b P(M = 1)P(H(1, 0|M = 1))

s = p, r ∈ L ∪{p}, a 6= s, ∀b P(M = 0)P(H(1, 0|M = 0))

N−C−1

s ∈ L ∪ {p}, r = p, a = r, ∀b P(M = 1)P(H(1,0|M = 1))

s ∈ L ∪ {p}, r = p, a 6= r, ∀b P(M = 0)P(H(1,0|M = 0))

N−C−1

s ∈ L ∪ {p}, r ∈ L ∪ {p}, P(M = 0)P(H(1, 0|M = 0))

N−C−2

N−C−1

a ∈ {s,r}, ∀b

s ∈ L ∪ {p}, r ∈ L ∪ {p}, P(M = 0)P(H(1, 0|M = 0))

N−C−3

N−C−1

a /∈ {s, r}, ∀b +P(M = 1)P(H(1,0|M = 1))

P(ω

,ω

,l,m

|S(a),R(b)) and only for the cases when

the numerator of (11) is non-zero, and when a 6= s or b 6= r.

The attacker can receive a message with an empty list of vis-

ited nodes (||L || = 0,M

= 0) only if the sender is the prede-

cessor, hence, P(ω

,ω

,||L || = 0,M

= 0,H

|S(a),R(b)) > 0

only for a = s. Nevertheless, the receiver of the message can

be any trusted node b 6= s (we use ∀b as a shorthand nota-

tion). The corresponding probability P(Ω

,Ω

,||L || = 0, M

0,H

|S(a),R(b)) is given in Table V.

The attacker can receive a message with only one node in

the list of visited nodes (||L || = 1), in which case the node in

the list is the predecessor. The list could have been sent by the

predecessor (a = p) or by a node not in the list (a ∈ L ∪ {p}),

but in either case there cannot be any attacker node preﬁlled in

the list (M

= 0). The receiver could be any other node (∀b).

The probability of receiving such a message P(Ω

,Ω

,||L || =

1,M

= 0,H

|S(a),R(b)) is given in Table VI.

For brevity, we omit the calculation of the probabilities for

||L || > 1, they can be obtained following a similar reasoning,

and can be found in [11].

3) A Bound For Relationship Anonymity: In order to obtain

a lower bound of the probability assigned to a sender-receiver

pair we use (1) for Crowds and (6) for Minstrels. If there is

an attacker on the path, it would assume that any of the N −C

trusted nodes is equally likely to be the sender, and any other

trusted node is equally likely to be the receiver,

S(s),

R(r)|H

) =P(

S(s),

R(r)|H

) =

(N −C)(N −C − 1)

(12)

The probability P(H

), from (6), is expressed as

P(H

) =

N−1

∑

M=0

N−M

∑

i=0

min(max(0,M−1),C)

∑

P(H

,M)P(M

|M)P(M),

(13)

where for M = M

= 0 we have P(H

) =

N−1

and

P(H

,M) =

(N −C − 1)C

(N − 1)(N − i + 1)

i−2

∏

k=1

N −C − k

N − k

(14)

for i > 1, and for M > 0 we have

P(H

,M) =

C − M

N − M − i + 1

i−1

∏

k=1

N − M −C + M

− k + 1

N − M − k + 1

(15)

We use these bounds in the following as a baseline for com-

parison for the relationship anonymity provided by Crowds

and by Minstrels.

V. NUMERICAL RESULTS

In the following we use the analytical models described

above to get insight into the overhead-anonymity trade-off.

To explore the trade-off we use p

∈ (0, 1) for Crowds,

and various uniform and binomial distributions for P(M) for

Minstrels.

Fig. 2 shows the probability P

rel

(s,r) assigned to a sender-

receiver pair as a function of the overhead (i.e., the mean

path length) for C = 1 and N = 10. A higher value of

rel

(s,r) means that the sender-receiver pair is more exposed,

i.e., has less relationship anonymity. One would expect that

high overhead provides good relationship anonymity (i.e., low

assigned probability), but surprisingly this is not the case.

Above a certain point more overhead (more relaying) has a

negative effect on anonymity for both anonymity networks.

The reason is that as the number of relays increases the

probability P(H

) of having an attacker on the path increases

faster than the certainty of the attacker about the identity of

the sender-receiver pair decreases.

Fig. 3 shows results obtained with N = 10 nodes and C =

3 attackers. Interestingly, while for Minstrels the relationship

anonymity decreases above a certain level of overhead, for

Crowds the relationship anonymity improves monotonically.

Hence, for C = 3 the probability that the attacker can assign

to the sender decreases faster than the probability P(H

) of

having an attacker on the path increases.

Fig. 4 shows results for N = 50 and C = 1. The ﬁgure has

a logarithmic scale on the vertical axis to make the small

probabilities easily distinguishable. For this scenario, in which

the system size is bigger than in Fig. 2 but the number of

attackers is smaller than in Fig. 3, it is now Crowds for which

relationship anonymity deteriorates above a certain overhead.

For Minstrels the probability P

rel

(s,r) decreases monotonically

with increasing overhead. The reason is that for N = 50 the

attacker appears later on the path than for N = 10 so the sender

does not appear as predecessor that often. Hence the attacker

assigns the same probability to the sender as to any other node

in the list. This does not apply to Crowds. The sender can be

revisited and may appear as predecessor at any position on

a path and the predecessor is always more likely to be the

sender than any other node [6].

Finally, Fig. 5 shows results for N = 50 nodes and C = 5

attackers. It is only the results shown in this ﬁgure that coin-

cide with what one would expect, that is, increased overhead

provides better relationship anonymity.

Figs. 2, 3, 4, and 5 also show the lower bounds for the

probabilities P

rel

(s,r) for Crowds and for Minstrels. The lower

HTML Viewer

Frequently Asked Questions (15)

Q1. What contributions have the authors mentioned in the paper "On the trade-off between relationship anonymity and communication overhead in anonymity networks" ?

Motivated by protection and privacy in industrial communication networks, in this paper the authors consider the tradeoff between relationship anonymity and communication overhead. The authors consider two anonymity networks: Crowds, which has unbounded communication delay and Minstrels, proposed in this paper, which provides bounded communication delay. While Crowds hides the sender ’ s identity only, Minstrels aims at hiding the receiver ’ s identity as well.

Q2. What future works have the authors mentioned in the paper "On the trade-off between relationship anonymity and communication overhead in anonymity networks" ?

It is subject of their future work to provide a more complete characterization of the overhead-anonymity trade-off for anonymity networks, including networks that provide probabilistic message delivery.

Q3. How does the sender control the maximum length of the message?

To control the maximum path length (i.e., delay) the sender can initialize the list of visited nodes with a number M ∈ {0, ...,N−1} of the nodes in the system.

Q4. What is the probability of a sender being selected as the receiver?

Since the traffic matrix is homogeneous and attackers are informed about each other, all trusted nodes are equally likely to be the sender, P(S(s)) = 1N−C , and any trusted node (except the sender) is equally likely to be chosen as the receiver, i.e., with probability P(R(r)|S(s)) = 1N−C−1 .

Q5. What is the reason for the probability of having an attacker on the path?

The reason is that as the number of relays increases the probability P(H1+) of having an attacker on the path increases faster than the certainty of the attacker about the identity of the sender-receiver pair decreases.

Q6. What is the probability of a relationship anonymity for a minstrel?

while for Minstrels the relationship anonymity decreases above a certain level of overhead, for Crowds the relationship anonymity improves monotonically.

Q7. How can an attacker reduce the anonymity of the relationship?

The attacker can only decrease the relationship anonymity by knowing the protocol and by observing traffic that goes over the nodes it controls.

Q8. Why does the attacker appear later on the path than for N = 10?

The reason is that for N = 50 the attacker appears later on the path than for N = 10 so the sender does not appear as predecessor that often.

Q9. What is the probability that the attacker assigns to a sender-receiver?

In Minstrels the probability that the attacker assigns to a sender-receiver pair does not only depend on the node that the message is received from, i.e., the predecessor p, but also on the contents of the list of visited nodes (L) that the message carries.

Q10. What is the relationship anonymity vs overhead for Crowds?

5. Relationship anonymity vs. overhead for N = 50, C = 5bounds converge to an asymptote, which corresponds to the case when there is always an attacker on the path (P(H1+) = 1), and the attacker assigns Prel(s,r) = 1(N−C)(N−C−1) to every possible sender-receiver pair.

Q11. What is the probability that a message is sent by an attacker?

Given a message received by an attacker that contains information (||L || = l, ωs ∈ Ωs, ωr ∈ Ωr, and MC = mC) the attacker would identify (s,r) as the sender-receiver pair with probabilityP(R̂(r), Ŝ(s)|ωr,ωs,mC,H1+, l) = P(ωr,ωs, l,mC,H1+|S(s),R(r)) ·P(R(r)|S(s)) ·P(S(s)) ∑(a,b) P(ωr,ωs, l,mC,H1+|S(a),R(b)) ·P(R(b)|S(a)) ·P(S(a)) (11)where the summation in the denominator is over all possible non-attacker sender-receiver pairs (a,b). P(S(s)) is the (a priory) probability that node s sends a message, and P(R(r)|S(s)) is the probability that node s selects node r as the destination of a message.

Q12. Why is the relationship anonymity provided by Crowds worse than the lower bound?

The relationship anonymity provided by Crowds is significantly worse than the lower bound, which is primarily due to the lack of receiver anonymity.

Q13. What is the probability of a node relaying a message?

The mean number of hops for Crowdsis the expected value of a geometric distribution with success probability 1− p f , i.e.,E[K] = p f1− p f +2 (2)where p f is the probability that a node will relay a message.

Q14. What is the probability that the sender is the predecessor of the receiver?

if the sender is the predecessor (s = p) then the receiver cannot be in the list of visited nodes (r ∈ L \\ {p}), because this could only happen if the sender had prefilled the list of visited nodes with the receiver, but then the receiver would never receive the message.

Q15. What is the probability that the first attacker is on position i?

Crowds: For Crowds the first attacker is on position i if the message is first relayed i−1 times through trusted nodes but the last hop is an attacker.

On the Trade-Off between Relationship Anonymity and Communication Overhead in Anonymity Networks

Summary (1 min read)

Introduction

III. MINSTRELS SYSTEM DESCRIPTION

B. Relationship Anonymity Against Inside Attackers

V. NUMERICAL RESULTS

Figures (8)

Citations

Cites background from "On the Trade-Off between Relationsh..."

Cites background from "On the Trade-Off between Relationsh..."

References

"On the Trade-Off between Relationsh..." refers background in this paper

"On the Trade-Off between Relationsh..." refers background in this paper

"On the Trade-Off between Relationsh..." refers background in this paper

Related Papers (5)

Frequently Asked Questions (15)

Q1. What contributions have the authors mentioned in the paper "On the trade-off between relationship anonymity and communication overhead in anonymity networks" ?

Q2. What future works have the authors mentioned in the paper "On the trade-off between relationship anonymity and communication overhead in anonymity networks" ?

Q3. How does the sender control the maximum length of the message?

Q4. What is the probability of a sender being selected as the receiver?

Q5. What is the reason for the probability of having an attacker on the path?

Q6. What is the probability of a relationship anonymity for a minstrel?

Q7. How can an attacker reduce the anonymity of the relationship?

Q8. Why does the attacker appear later on the path than for N = 10?

Q9. What is the probability that the attacker assigns to a sender-receiver?

Q10. What is the relationship anonymity vs overhead for Crowds?

Q11. What is the probability that a message is sent by an attacker?

Q12. Why is the relationship anonymity provided by Crowds worse than the lower bound?

Q13. What is the probability of a node relaying a message?

Q14. What is the probability that the sender is the predecessor of the receiver?

Q15. What is the probability that the first attacker is on position i?