Riding the saddle point: asymptotics of the capacity-achieving simple decoder for bias-based traitor tracing

Ibrahimi, Sarah; Škorić, Boris; Oosterwijk, Jan-Jaap

doi:10.1186/s13635-014-0012-6

Research
Open access
Published: 15 August 2014

Riding the saddle point: asymptotics of the capacity-achieving simple decoder for bias-based traitor tracing

Sarah Ibrahimi¹,
Boris Škorić¹ &
Jan-Jaap Oosterwijk¹

EURASIP Journal on Information Security volume 2014, Article number: 12 (2014) Cite this article

2185 Accesses
2 Citations
Metrics details

Abstract

We study the asymptotic-capacity-achieving score function that was recently proposed by Oosterwijk et al. for bias-based traitor tracing codes. For the bias function, we choose the Dirichlet distribution with a cutoff. Using Bernstein’s inequality and Bennett’s inequality, we upper bound the false-positive and false-negative error probabilities. From these bounds we derive sufficient conditions for the scheme parameters. We solve these conditions in the limit of large coalition size c₀ and obtain asymptotic solutions for the cutoff, the sufficient code length, and the corresponding accusation threshold. We find that the code length converges to its asymptote approximately as $c_{0}^{- 1 / 2}$ , which is faster than the $c_{0}^{- 1 / 3}$ of Tardos’ score function.

MSC

94B60

Introduction

1.1 Traitor tracing

Forensic watermarking is a means for tracing unauthorized redistribution of digital content. Before distribution, the content is modified by embedding an imperceptible watermark, which plays the role of a personalized identifier. When an unauthorized copy of the content is found, a tracing algorithm outputs a list of suspicious users, based on the watermark detected in this copy.

The most powerful attacks against watermarking are collusion attacks, in which multiple attackers (the ‘coalition’) combine their differently watermarked versions of the same content; the observed differences point to the locations of the hidden marks and allow for a targeted attack.

Collusion-resistant codes have been specifically designed as a defense against collusion attacks: when codewords from such a code are embedded into the content, the surviving parts of the watermark, after the collusion attack, still contain enough information to identify (some of the) attackers, provided that the coalition is not too large.

In the past two decades, several types of collusion-resistant codes have been developed. The most popular type in the recent literature is the class of bias-based codes. These were introduced by G. Tardos in 2003. The code construction consists of two steps: first, a sequence of biases is generated, one for each position in the content; then, the watermark symbols for each user are randomly drawn according to these biases. The original paper [1] was followed by a flurry of activity, e.g., improved analyses [2]-[7], code modifications [8]-[10], decoder modifications [11]-[14], and various generalizations [15]-[18]. The advantage of bias-based versus deterministic codes is that they can achieve the asymptotically optimal relationship $ℓ \propto c_{0}^{2}$ between the sufficient code length ℓ and the coalition size c₀ to be resisted.

1.2 Capacity-achieving simple decoder

Two kinds of tracing algorithm can be distinguished: (i) simple decoders, which assign a score to single users independent of the watermarks of other users, and (ii) joint decoders[11]-[13], which assign scores to sets of users and are typically more powerful but also require more computational resources. Efficient joint decoders typically employ a simple decoder as a bootstrapping step.

The performance of a traitor tracing code is often measured by looking at the sufficient code length ℓ as a function of the coalition size c₀ to be resisted and the imposed low error rate. Equivalently, one can look at the fingerprinting rate, which is defined as the fraction $\frac{\underset{q}{log} n}{ℓ}$ , where q is the size of the alphabet and n is the number of users. The numerator corresponds to the number of q-ary symbols needed to point out one of the n users; the denominator is the number of symbols used to convey this ‘message.’ Hence, the fingerprinting rate has a natural interpretation as the fraction of codeword symbols that actually encodes the ‘message,’ i.e., the identifying information that allows for tracing. The fingerprinting rate is a figure of merit that can be used to fairly compare codes which have different alphabet sizes. The fingerprinting capacity, which can be computed information-theoretically, is an upper bound on the fingerprinting rate that can be achieved against colluders who employ an optimal strategy against the tracing scheme. It was found by Boesten and Škorić [19] that the asymptotic^a capacity is given by

C = \frac{q - 1}{2 c_{0}^{2} ln q} .

(1)

Huang and Moulin [20] found the location of the corresponding asymptotic saddlepoint: the strongest attack is the so-called interleaving attack, and the best bias distribution is the Dirichlet distribution with concentration parameter one half. (See Section 2.) For the colluders as well as the tracer, it is bad to depart from the saddlepoint. If the colluders move away from it, the tracer can achieve a higher fingerprinting rate; if the tracer moves away, the colluders can launch a stronger attack which reduces the rate.

Oosterwijk et al. [21] devised a simple decoder that reaches asymptotic capacity. The possibility of such an achievement was foreseen in [20], where it was shown that the simple decoder capacity becomes equal to the joint decoder capacity as c₀ goes to infinity.

1.3 Contributions and outline

In this paper we analyze the performance of the capacity-achieving simple decoder of [21] in the Restricted Digit Model:

• Following the approach of [22], we use Bernstein’s inequality and Bennett’s inequality to upper bound the false-positive and false-negative error probability, respectively. From these bounds, we derive conditions on the code parameters (code length, cutoff, threshold) such that the error probabilities are sufficiently low.

• We determine the asymptotics of the sufficient code length in the direct vicinity of the saddlepoint.

• We find that the optimal choice for the cutoff τ is given by $τ \propto c_{0}^{- γ}$ , with γ slightly larger than one half. With this choice, the code length approaches its saddlepoint value with a correction term of order $c_{0}^{γ - 1} \approx c_{0}^{- 1 / 2}$ . Thus, convergence to the limit is faster than in the case of the binary Tardos score, where the correction is of order $c_{0}^{- 1 / 3}$ [5].

• Our analysis yields a recipe for placing the accusation threshold as a function of the innocent user score variance. This differs from the case of the Tardos score function [1],[16], where the threshold is fixed.

In Section 2 we briefly review bias-based traitor tracing, the asymptotic saddlepoint, and the asymptotic-capacity-achieving score function. We also list the inequalities of Bernstein and Bennett. In Section 3 we study the statistical properties of an innocent user’s score and the coalition’s collective score. In Section 4 we derive the bounds on the error rates and the sufficient conditions on the code parameters. The asymptotics of the sufficient code length are treated in Section 5.

Preliminaries

2.1 Bias-based tracing using the asymptotically optimal simple decoder

2.1.1 Notation

The number of users is denoted as n, and the code length (the number of positions in the content) as ℓ. We define [ n]={1,…,n}. The alphabet is $Q$ , with size $| Q | = q$ . The symbols in the alphabet have no natural ordering. The bias in position i is denoted as p⁽ⁱ⁾. The bias is a q-dimensional vector, with components $p_{α}^{(i)} \in [τ, 1 - (q - 1) τ]$ , $α \in Q$ . The parameter τ≪1 is called the cutoff. For each i the bias satisfies |p⁽ⁱ⁾|=1, where |⋯| denotes the 1-norm, i.e., $\sum_{α \in Q} p_{α}^{(i)} = 1$ . We will often use multi-index notation: for a scalar z, the notation p^z stands for $\prod_{α \in Q} p_{α}^{z}$ ; for a vector m, the notation p^m stands for $\prod_{α \in Q} p_{α}^{m_{α}}$ . We introduce the q-component vector 1_q=(1,1,…,1). The notation δ_{x
y} stands for the Kronecker delta.

2.1.2 Code generation

The bias vectors p⁽ⁱ⁾ are drawn independently from a (truncated) Dirichlet distribution F with concentration parameter κ>0,

\begin{array}{lcr} F (p) & = & p^{- 1 + κ} / B_{τ} (κ 1_{q}) \end{array}

(2)

\begin{array}{lcr} B_{τ} (κ 1_{q}) & = & \int_{τ}^{1 - (q - 1) τ} d^{q} p δ (1 - | p |) p^{- 1 + κ} . \end{array}

(3)

The δ in the integral is a Dirac delta function; it ensures that the condition |p|=1 is enforced. The τ is called the cutoff parameter. Note that p_α∈[ τ,1−(q−1)τ]. Therefore, τ≤1/q must hold, for otherwise the interval is empty (and we would get |p|>1).

For τ=0 the normalization constant (3) evaluates to a generalized beta function. Let z∈(0,∞)^q be a vector; then the beta function B(z) is defined as $B (z) = [\prod_{α} Γ] / Γ (\sum_{β} z_{β})$ , where Γ is the gamma function. Hence B₀(κ 1_q)=B(κ 1_q)=[ Γ(κ)]^q/Γ(q κ).

In the asymptotic saddlepoint, it holds that τ=0 and κ=1/2. For large but finite c₀, the saddlepoint lies close to the asymptotic saddlepoint, but it is not known exactly where. It is known that for finite c₀, the optimal bias distribution is a discrete distribution [8],[10],[23], with a number of discrete p_α values proportional to c₀. In spite of this, we will use the continuous probability density (2). Our motivation is that we only investigate asymptotics. The cutoff τ will depend on c₀.

The code word assigned to user j is denoted as a row vector X_j=(X_{j 1},…,X_{j
ℓ}). The set of codewords is arranged in a code matrix X. The elements of the code matrix are independently generated according to the biases p⁽¹⁾,…,p^(ℓ) as follows: $Pr [X_{ji} = α] = p_{α}^{(i)}$ .

2.1.3 Collusion attack

The coalition is a subset $C \subset [n]$ of users, with size $| C | = c$ . We explicitly make the distinction between the actual coalition size c and the parameter c₀ in the code construction, which is the maximum coalition size that can be resisted. The colluders see a submatrix $X_{C}$ of X. The symbol ‘tallies’ are defined as follows:

\begin{array}{lcr} m^{(i)} = {(m_{α}^{(i)})}_{α \in Q} & ; & m_{α}^{(i)} = | {j \in C : X_{ji} = α} | . \end{array}

(4)

In words, $m_{α}^{(i)}$ is the number of colluders that received symbol α in position i. Based on $X_{C}$ , the colluders produce an output y=(y₁,…,y_ℓ). For our analysis we adopt the Restricted Digit Model as the attack model: for any i∈[ℓ], the output y_i is only allowed to be a symbol that the colluders have observed in position i. The strategy for choosing an output is allowed to be probabilistic. We adopt a number of frequently made assumptions about the attack strategy:

1.
Symbol symmetry. The strategy is invariant under permutation of the alphabet for each position independently. This assumption is motivated by the lack of a natural ordering of the alphabet.
2.
Colluder symmetry. The strategy is invariant under permutation of the colluders. (In other words, the colluders equally share the risk.) This assumption is motivated by the fact that breaking colluder symmetry will make it easier for the tracer to find at least one colluder.
3.
Position symmetry. The same strategy is applied in each position i∈[ℓ], and it does not depend on any X _{j
k} values with k≠i. Motivation: asymptotically the optimal attack must be position-symmetric [24].

When assumptions 2 and 3 hold, the strategy can be parametrized by a set of probabilities that depend only on the ‘local’ tallies: in position i, the probability of outputting symbol y_i is a function of only m⁽ⁱ⁾. Omitting the position index, this is denoted as

θ_{y | m} = Pr [colluders output y | the tally is m] .

(5)

Furthermore, if assumption 1 holds as well, it is possible [6] to re-parametrize this as

Ψ_{b} (x) = θ_{y | m} for {m_{y} = b, and m without the y component is x} .

(6)

In other words, Ψ_b(x) is the coalition’s probability of outputting a symbol given that it has tally b and that the other tallies are x. The probability Ψ_b(x) is invariant under permutation of x.

2.1.4 Simple decoder

The tracer notices the pirated copy with watermark sequence y ‘in the wild’. Based on y and X, he tries to find at least one colluder. The asymptotic-capacity-achieving simple decoder of [21] works as follows: for each user j∈[ n], a score $S_{j} = \sum_{i \in [ℓ]} S_{j}^{(i)}$ is computed, where

S_{j}^{(i)} = h (X_{ji}, y_{i}, p^{(i)}) with h (x, y, p) = \frac{δ_{xy}}{p_{y}} - 1 .

(7)

Note that we normalized the function h differently from [21], by a factor $\sqrt{q - 1}$ , for notational brevity. The score function (7) has the special property of being ‘strongly centered’: for any p and y (we are omitting the position index), the expected score of an innocent user is zero.

{\tilde{μ}}_{inn} = \sum_{x \in Q} p_{x} h (x, y, p) = \frac{p_{y}}{p_{y}} - \sum_{x \in Q} p_{x} = 0 .

(8)

The collective score of the coalition is written as $S_{C}$ ,

S_{C} = \sum_{j \in C} S_{j} .

(9)

The tracer makes a list $ℒ$ of ‘suspicious’ users, whose score exceeds a threshold Z,

ℒ = {j \in [n] : S_{j} > Z} .

(10)

Whereas the Tardos scheme uses a fixed threshold, the score function h leads to a more complicated scheme where Z must be chosen as a function of the biases and the observed tallies and colluder outputs (see Section 3.1).

2.1.5 Measuring the performance

Two types of error can occur: a false-positive, with P_FP defined as the probability that a fixed innocent user gets added to $ℒ$ , and a false-negative, with P_FN defined as the probability that none of the colluders is found:

\begin{array}{lcr} P_{FP} = Pr [j \in ℒ] for fixed innocent j & ; & P_{FN} = Pr [C \cap ℒ = \emptyset] \end{array}

(11)

The tracer demands that P_FP≤ε₁ and P_FN≤ε₂, where ε₁ and ε₂ are constants, typically with ε₁≪ε₂.

The code length ℓ and threshold Z are often parametrized as

\begin{array}{lcr} ℓ = A c_{0}^{2} ln \frac{1}{ε_{1}} & ; & Z = B c_{0} ln \frac{1}{ε_{1}} . \end{array}

(12)

This parametrization is motivated by the fact that asymptotically, for the Tardos code, A and B can be considered as constants. The relationship between the code length parametrization (12) and the fingerprinting rate is as follows. The rate is $R = (\underset{q}{log} n) / ℓ = (ln n) / (A c_{0}^{2} ln q ln ε_{1}^{- 1})$ . Let $η = Pr [ℒ ∖ C \neq \emptyset]$ , i.e., the probability that at least one innocent user ends up in the list $ℒ$ . The η is a fixed small number (e.g., 10⁻⁶) that does not depend on n. It can be shown (Lemma 6 in [22]) for n≫1, c≪n that ε₁≈η/n. Then, $ln ε_{1}^{- 1} \approx ln n - ln η \approx ln n$ . (In the last approximation, we used that η is fixed.) Asymptotically, the rate satisfies $R \sim 1 / (A c_{0}^{2} ln q)$ .

Definition 1.

The variance of an innocent user’s score and the average and variance of the coalition score are written as

\begin{array}{lcr} {\tilde{σ}}_{inn}^{2} & = & \frac{1}{ℓ} \sum_{i} 𝔼 {(S_{j}^{(i)})}^{2} - {\tilde{μ}}_{inn}^{2} for arbitrary j \notin C \end{array}

(13)

\begin{array}{lcr} \tilde{μ} & = & \frac{1}{ℓ} \sum_{i} 𝔼 S_{C}^{(i)} \end{array}

(14)

\begin{array}{lcr} {\tilde{σ}}^{2} & = & \frac{1}{ℓ} \sum_{i} 𝔼 {(S_{C}^{(i)})}^{2} - {\tilde{μ}}^{2} . \end{array}

(15)

Here stands for the expectation over all the probabilistic degrees of freedom: the biases p⁽ⁱ⁾, the code matrix X, and the coalition output y. (The ‘tilde’ notation indicates that there is an average over positions.) Note that ${\tilde{μ}}_{inn} = 0$ , as shown in (8).

Remark If assumption 3 holds (position symmetry, Section 2.1.3) then in Definition 1 the average over the positions is not necessary; in every position $𝔼 [\dots]$ has the same value. In this paper, we introduce a rescaled version (β) of the threshold parameter B,

B = β {\tilde{σ}}_{inn} .

(16)

It will turn out that it is more natural to use the quantity β than B.

Asymptotically, the first and second moments completely determine the shape of the probability distribution of the score, for an innocent user as well as for the coalition score. (The distribution becomes Gaussian in accordance with the central limit theorem.) It was found [7] that the code length parameter (and hence the fingerprinting rate) then depends on $\tilde{μ}$ and ${\tilde{σ}}_{inn}$ as follows:

\begin{array}{lcr} A \sim \frac{2 {\tilde{σ}}_{inn}^{2}}{{\tilde{μ}}^{2}} & ; & R \sim \frac{{\tilde{μ}}^{2}}{{\tilde{σ}}_{inn}^{2}} \cdot \frac{1}{2 c_{0}^{2} ln q} . \end{array}

(17)

In the asymptotic saddlepoint, the tracer uses the bias distribution (2) with τ=0, while the coalition strategy is the interleaving attack, θ_y|m=m_y/c. In the asymptotic saddlepoint, it holds [21] that ${\tilde{μ}}^{2} / {\tilde{σ}}_{inn}^{2} = q - 1$ .

2.2 Computing expectations

Following the previous work [6],[16],[22], we define (conditional) expectations as shown below. We omit the position index and write x as shorthand for X_{j
i} for a fixed innocent user $j \notin C$ .

\begin{array}{lcr} 𝔼_{p} [r (p)] & = & \int_{τ}^{1 - (q - 1) τ} d^{q} p δ (1 - | p |) F (p) r (p) \end{array}

(18)

\begin{array}{lcr} 𝔼_{x | p} [r (x)] & = & \sum_{x \in Q} p_{x} r (x) \end{array}

(19)

\begin{array}{lcr} 𝔼_{m | p} [r (m)] & = & \sum_{m \geq 0 : | m | = c} (\binom{c}{m}) p^{m} r (m) \end{array}

(20)

\begin{array}{lcr} 𝔼_{y | m} [r (y)] & = & \sum_{y \in Q} θ_{y | m} r (y) \end{array}

(21)

\begin{array}{lcr} 𝔼_{y | p} [r (y)] & = & 𝔼_{m | p} 𝔼_{y | m} [r (y)] = \sum_{y \in Q} \{\sum_{m \geq 0 : | m | = c} (\binom{c}{m}) p^{m} θ_{y | m}\} r (y) \end{array}

(22)

\begin{array}{lcr} 𝔼_{m} [r (m)] & = & \sum_{m \geq 0 : | m | = c} (\binom{c}{m}) \frac{B_{τ} (κ 1_{q} + m)}{B_{τ} (κ 1_{q})} r (m) \end{array}

(23)

\begin{array}{lcr} 𝔼_{m_{α}} [r (m_{α})] & = & \sum_{b = 0}^{c} P_{1} (b) r (b) = \sum_{b = 0}^{c} (\binom{c}{b}) \frac{B_{τ} (κ + b, [q - 1] κ + c - b)}{B_{τ} (κ, [q - 1] κ)} r (b) \end{array}

(24)

\begin{array}{lcr} K_{b} = 𝔼_{x | b} Ψ_{b} (x) = \sum_{x \geq 0 : | x | = c - b} (\binom{c - b}{x}) \frac{B (κ 1_{q - 1} + x)}{B (κ 1_{q - 1})} Ψ_{b} (x) . \end{array}

(25)

Here P₁(b) is a marginal probability for a single fixed symbol to have tally b. The quantity K_b is the probability, given that a certain symbol has tally b, for the colluders to output that symbol; i.e., for arbitrary fixed α, we have K_b= Pr[y=α|m_α=b]. The sum rule $\sum_{b} P_{1} (b) K_{b} = 1 / q$ holds [6], since the overall probability of outputting y=α is 1/q.

2.3 Concentration inequalities

Lemma 1 (Bernstein’s inequality [25]).

Let a>0 be a constant. Let U₁,…,U_ℓ be independent zero-mean random variables, with |U_i|≤a for all i. Let Z≥0. Then,

Pr [\sum_{i = 1}^{ℓ} U_{i} > Z] \leq exp (- \frac{Z^{2} / 2}{\sum_{i = 1}^{ℓ} 𝔼 [U_{i}^{2}] + aZ / 3}) .

(26)

Lemma 2 (Bennett’s inequality [26]).

Let b>0 be a constant. Let Y₁,…,Y_ℓ be independent zero-mean random variables, with |Y_i|≤b for all i. Let $s^{2} = \frac{1}{ℓ} \sum_{i = 1}^{ℓ} 𝔼 [Y_{i}^{2}]$ . Let the function ξ be defined as

ξ (v) = \int_{0}^{v} d x ln (1 + x) = (v + 1) ln (v + 1) - v.

(27)

Let T≥0. Then,

Pr [\sum_{i = 1}^{ℓ} Y_{i} > T] \leq exp (- \frac{ℓ s^{2}}{b^{2}} ξ (\frac{b}{ℓ s^{2}} T)) .

(28)

Property 1.

The function ξ in Lemma 2 can be lower bounded as

v > 0 \Rightarrow ξ (v) > v ln \frac{v}{e} .

(29)

Proof.

For v>0, we have $ξ (v) = \int_{0}^{v} d x ln (1 + x) > \int_{0}^{v} d x ln x = v ln \frac{v}{e}$ .

Lemma 3 (weaker form of Bennett’s inequality).

Let b>0 be a constant. Let Y₁,…,Y_ℓ be independent zero-mean random variables, with |Y_i|≤b for all i. Let $s^{2} = \frac{1}{ℓ} \sum_{i = 1}^{ℓ} 𝔼 [Y_{i}^{2}]$ . Let T>0. Then

Pr [\sum_{i = 1}^{ℓ} Y_{i} > T] \leq exp (- \frac{T}{b} ln \frac{bT}{eℓ s^{2}}) .

(30)

Proof.

We substitute Property 1 in Lemma 2. This is allowed since the argument of ξ is positive.

Statistics of the innocent score and coalition score

We study the moments of the innocent score and coalition score in two cases: (i) interleaving attack and arbitrary bias distribution and (ii) the bias distribution is the Dirichlet distribution with τ=0 and arbitrary concentration parameter κ; the attack is arbitrary.

These two scenarios represent two different ways of departing from the asymptotic saddlepoint. In the first one, the bias distribution is varied. In the second one, not only the attack is varied but also a limited change of the bias distribution is allowed (κ).

The results of this section do not all contribute directly to the analysis of the sufficient code length in Section 5, but they are important in their own right since they elucidate how the score moments behave in a variety of circumstances.

3.1 General result for the moments

We investigate the first and second moments of an innocent user’s score and of the coalition score. We begin with a general result for position-symmetric colluder strategies. Then, we look more specifically at the interleaving attack.

Lemma 4.

If the coalition is employing a position-symmetric strategy, then

\begin{array}{lcr} {\tilde{σ}}_{inn}^{2} & = & - 1 + 𝔼 \frac{1}{p_{y}} \end{array}

(31)

\begin{array}{lcr} \tilde{μ} & = & - c + 𝔼 \frac{m_{y}}{p_{y}} \end{array}

(32)

\begin{array}{lcr} {\tilde{μ}}^{2} + {\tilde{σ}}^{2} & = & 𝔼 \frac{{(m_{y} - {cp}_{y})}^{2}}{p_{y}^{2}} . \end{array}

(33)

Proof.

We start from Definition 1. In all three definitions, the summation over i merely yields a factor ℓ which cancels against the factor 1/ℓ in front of the summation. Thus, for ${\tilde{σ}}_{inn}^{2}$ we can write, for arbitrary index i, and recalling that ${\tilde{μ}}_{inn} = 0$ , ${\tilde{σ}}_{inn}^{2} = 𝔼 {(S_{j}^{(i)})}^{2} = 𝔼_{p} 𝔼_{y | p} 𝔼_{x | p} {(- 1 + δ_{xy} / p_{y})}^{2}$ $= 𝔼_{p} 𝔼_{y | p} 𝔼_{x | p} (1 - 2 δ_{xy} / p_{y} + δ_{xy} / p_{y}^{2})$ $= 1 - 2 𝔼_{p} 𝔼_{y | p} 1 + 𝔼_{p} 𝔼_{y | p} 1 / p_{y}$ $= - 1 + 𝔼 1 / p_{y}$ . The results for $\tilde{μ}$ and $\tilde{σ}$ follow directly from the fact that $S_{C}^{(i)} = (m_{y} / p_{y} - c) = (m_{y} - {cp}_{y}) / p_{y}$ .

Note that Lemma 4 allows the tracer to obtain an estimate of the score moments: he can replace the by an empirical average over the codeword positions.

3.2 The case of the interleaving attack

Lemma 5.

If the coalition is using the interleaving attack, then

\begin{array}{lcr} {\tilde{μ}}_{Int} = q - 1; & {({\tilde{σ}}_{inn}^{2})}_{Int} = q - 1; & {\tilde{μ}}_{Int}^{2} + {\tilde{σ}}_{Int}^{2} = c (q - 1) - 3 q + 2 + q 𝔼_{p} \frac{1}{p_{α}} . \end{array}

(34)

where $α \in Q$ is arbitrary.

Proof.

For the interleaving attack, we have $𝔼 [\dots] = 𝔼_{p} 𝔼_{m | p} \sum_{y} (m_{y} / c) [\dots] = \sum_{y} 𝔼_{p} 𝔼_{m | p} (\frac{m_{y} - {cp}_{y}}{c} + p_{y}) [\dots]$ .We will make use of the binomial properties $𝔼_{m | p} m_{α} = {cp}_{α}$ , $𝔼_{m | p} {(m_{α} - {cp}_{α})}^{2} = {cp}_{α} (1 - p_{α})$ and $𝔼_{m | p} {(m_{α} - {cp}_{α})}^{3} = {cp}_{α} (1 - p_{α}) (1 - 2 p_{α})$ .For $\tilde{μ}$ this gives $\tilde{μ} = \sum_{y} 𝔼_{p} 𝔼_{m | p} [\frac{{(m_{y} - {cp}_{y})}^{2}}{{cp}_{y}} + m_{y} - {cp}_{y}]$ $= 𝔼_{p} \sum_{y} (1 - p_{y}) + 0 = q - 1$ .Furthermore, ${\tilde{σ}}_{inn}^{2} = - 1 + 𝔼_{p} \sum_{y} 𝔼_{m | p} [\frac{m_{y}}{{cp}_{y}}] = - 1 + 𝔼_{p} \sum_{y} 1 = q - 1$ .Finally, ${\tilde{μ}}^{2} + {\tilde{σ}}^{2} = \sum_{y} 𝔼_{p} 𝔼_{m | p} [\frac{{(m_{y} - {cp}_{y})}^{3}}{c p_{y}^{2}} + \frac{{(m_{y} - {cp}_{y})}^{2}}{p_{y}}]$ $= 𝔼_{p} \sum_{y} [\frac{(1 - p_{y}) (1 - 2 p_{y})}{p_{y}} + c (1 - p_{y})]$ $= c (q - 1) - 3 q + 2 + \sum_{y} 𝔼_{p} \frac{1}{p_{y}}$ .

Remark 1.

Part of Lemma 5 ( $\tilde{μ}$ and ${\tilde{σ}}_{inn}$ ) was already done in [21]. We show the proof again because of our modified normalization of the score function.

Remark 2.

The result for ${\tilde{μ}}_{Int}$ and ${({\tilde{σ}}_{inn}^{2})}_{Int}$ does not depend on the bias distribution F, but ${\tilde{σ}}_{Int}$ does.

Remark 3.

In the large-c limit, the variance of the coalition score tends to be large due to the c(q−1) term as well as the expression $𝔼 [1 / p_{α}]$ which blows up when τ becomes small.

3.3 Taking the Dirichlet distribution with cutoff τ=0

Lemma 6.

Let τ=0. Let the coalition use a strategy that is colluder-symmetric and position-symmetric. Then the quantities $\tilde{μ}$ and ${\tilde{σ}}_{inn}$ can be written as

\begin{array}{lcr} {\tilde{μ}}_{τ = 0} & = & - c + (qκ + c - 1) 𝔼_{m} \sum_{y \in Q} θ_{y | m} [1 + \frac{1 - κ}{κ + m_{y} - 1}] \end{array}

(35)

\begin{array}{lcr} {({\tilde{σ}}_{inn}^{2})}_{τ = 0} & = & - 1 + (qκ + c - 1) 𝔼_{m} \sum_{y \in Q} θ_{y | m} \frac{1}{κ + m_{y} - 1} \end{array}

(36)

Furthermore, if the colluder strategy is also symbol-symmetric, then

\begin{array}{lcr} {\tilde{μ}}_{τ = 0} & = & - c + (qκ + c - 1) q \sum_{b = 1}^{c} P_{1} (b) K_{b} \frac{b}{κ + b - 1}, \end{array}

(37)

\begin{array}{lcr} {({\tilde{σ}}_{inn}^{2})}_{τ = 0} & = & - 1 + (qκ + c - 1) q \sum_{b = 1}^{c} P_{1} (b) K_{b} \frac{1}{κ + b - 1} . \end{array}

(38)

Proof.

We start from the expressions $\tilde{μ} = - c + 𝔼 [m_{y} / p_{y}]$ and ${\tilde{σ}}_{inn}^{2} = - 1 + 𝔼 [1 / p_{y}]$ . For any function J(m_y), we can write $𝔼 [J (m_{y}) / p_{y}] = 𝔼_{p} \sum_{m} (\binom{c}{m}) p^{m} \sum_{y} θ_{y | m} \frac{J (m_{y})}{p_{y}}$ $= \sum_{m} (\binom{c}{m}) \sum_{y} θ_{y | m} J (m_{y}) 𝔼_{p} p^{m} / p_{y}$ . For τ=0, we have

𝔼_{p} \frac{p^{m}}{p_{y}} = \frac{B (κ 1_{q} + m - e_{y})}{B (κ 1_{q})} = \frac{qκ + c - 1}{κ + m_{y} - 1} \cdot \frac{B (κ 1_{q} + m)}{B (κ 1_{q})} = \frac{qκ + c - 1}{κ + m_{y} - 1} 𝔼_{p} p^{m} .

(39)

Setting J(m_y)=m_y for $\tilde{μ}$ and J(m_y)=1 for ${\tilde{σ}}_{inn}^{2}$ yield (35) and (36). The final step is to notice that $𝔼 [J (m_{y}) / p_{y}] = 𝔼_{m} 𝔼_{y | m} [\frac{qκ + c - 1}{κ + m_{y} - 1} J (m_{y})]$ which can be rewritten as $q \sum_{b} P_{1} (b) K_{b} \frac{qκ + c - 1}{κ + b - 1} J (b)$ if the strategy is symbol-symmetric.

Theorem 1.

Let c≫1 and κ∈(0,1). Let the coalition use a strategy that is colluder-symmetric and position-symmetric. Then, both quantities $\tilde{μ}$ and ${\tilde{σ}}_{inn}$ are maximized by the minority voting attack and minimized by the majority voting attack.

Proof.

For c≫1, we can use the τ=0 approximation for $\tilde{μ}$ and ${\tilde{σ}}_{inn}$ , i.e., Lemma 6. In (35) and (36), the θ_y|m in the y summation multiplies a decreasing function of m_y. Hence, the summand is maximized by outputting a symbol y with tally m_y as small as possible (but nonzero because of the marking assumption) and, vice versa, minimized by outputting the symbol with the largest tally.

Theorem 1 gives insight into the trade-offs that the colluders have to deal with. They want to minimize $\tilde{μ}$ and to maximize ${\tilde{σ}}_{inn}$ , since this leads to high error rates. However, the strategy that optimizes $\tilde{μ}$ for them is the worst possible strategy regarding ${\tilde{σ}}_{inn}$ and vice versa. The interleaving attack at the saddlepoint is ‘in the middle’ between minority voting and majority voting.

Lemma 7.

Let τ=0. Let the coalition use a strategy that is colluder-symmetric and position-symmetric. Then $\tilde{μ}$ and ${\tilde{σ}}_{inn}$ can be bounded as

\begin{array}{lcr} \frac{cκ (q - 1)}{c - 1 + κ} \leq & {\tilde{μ}}_{τ = 0} & \leq c (\frac{1}{κ} - 1) + q - \frac{1}{κ} \end{array}

(40)

\begin{array}{lcr} \frac{κ (q - 1)}{c - 1 + κ} \leq & {({\tilde{σ}}_{inn}^{2})}_{τ = 0} & \leq \frac{c}{κ} + q - 1 - \frac{1}{κ} . \end{array}

(41)

Proof.

For m_y∈{1,…,c}, we have $\frac{1}{κ + c - 1} \leq \frac{1}{κ + m_{y} - 1} \leq \frac{1}{κ}$ . We substitute these inequalities into (35) and (36). Finally, we use $\sum_{y} θ_{y | m} = 1$ .

Remark It is possible to obtain a tighter upper bound by treating the m_y=c term separately in (35),(36), since then θ_y|m=1. However, the improvement of the tightness is minimal.

Bounding the error probabilities

We use Bernstein’s inequality and Bennett’s inequality to upper bound the false-positive and false-negative error probability, respectively.

4.1 Bounding the false-positive probability

Theorem 2.

Let q≥2. Let the coalition use any attack strategy. Then the false-positive probability for a fixed innocent user can be bounded as

P_{FP} \leq exp [(ln ε_{1}) \frac{β^{2}}{2 A} {(1 + \frac{β}{3 A c_{0} τ {\tilde{σ}}_{inn}})}^{- 1}] .

(42)

Proof.

For any coalition strategy, even one that breaks the position symmetry, the single-position scores $S_{j}^{(i)}$ for the innocent user are mutually independent [1]. Hence, we are allowed to use Bernstein’s inequality. In Lemma 1 we set $U_{i} = S_{j}^{(i)}$ for the innocent user. This is allowed since $S_{j}^{(i)}$ has zero expectation value. We have

| U_{i} | \leq max \{\frac{1}{p_{min}} - 1, | - 1 |\} = max \{\frac{1}{τ} - 1, 1\} = \frac{1}{τ} - 1 < \frac{1}{τ} .

(43)

In the last equality, we used τ≤1/q (see Section 2.1.2). Thus, we are allowed to set a=1/τ in Lemma 1. Furthermore, we note that by definition $𝔼 [U_{i}^{2}] = {\tilde{σ}}_{inn}^{2}$ for all i. Lemma 1 then gives

Pr [S_{j} > Z] \leq exp (\frac{- Z^{2} / 2}{ℓ {\tilde{σ}}_{inn}^{2} + aZ / 3}) = exp (\frac{- Z^{2}}{2 ℓ {\tilde{σ}}_{inn}^{2}} \cdot \frac{1}{1 + aZ / (3 ℓ {\tilde{σ}}_{inn}^{2})}) .

(44)

Substituting a=1/τ, $ℓ = A c_{0}^{2} ln \frac{1}{ε_{1}}$ and $Z = β {\tilde{σ}}_{inn} c_{0} ln \frac{1}{ε_{1}}$ finish the proof.

Remark In (42), we see that the bound on P_FP is a decreasing function of the product c₀τ. Hence, it is advantageous to set τ such that c₀τ≫1.

Corollary 1.

Let q≥2 and τ≤1/2. Let the coalition use any attack strategy. Then, it holds that

A \leq \frac{1}{2} β^{2} - \frac{β}{3 c_{0} τ {\tilde{σ}}_{inn}} \Rightarrow P_{FP} \leq ε_{1} .

(45)

Proof.

The proof follows directly from Theorem 2.

4.2 Bounding the false-negative probability

Theorem 3.

Let q≥2. Let the coalition employ a position-symmetric strategy. Let $\tilde{μ} A c_{0} - {\tilde{σ}}_{inn} βc > 0$ . Let τ satisfy

τ \leq c / (c + \tilde{μ}) .

(46)

Then the false-negative probability can be bounded as

P_{FN} \leq exp [(ln ε_{1}) \frac{c_{0} τ}{c} [\tilde{μ} {Ac}_{0} - {\tilde{σ}}_{inn} βc] ln \frac{\tilde{μ} {Ac}_{0} - {\tilde{σ}}_{inn} βc}{e ({\tilde{σ}}^{2} / c) A c_{0} τ}] .

(47)

Proof.

We start from

\begin{array}{lcr} P_{FN} & = & Pr [\forall_{j \in C} S_{j} < Z] < Pr [S_{C} < cZ] = Pr [ℓ \tilde{μ} - S_{C} > ℓ \tilde{μ} - cZ] \\ = & Pr [\sum_{i = 1}^{ℓ} (\tilde{μ} - S_{C}^{(i)}) > ℓ \tilde{μ} - cZ] . \end{array}

(48)

Because of the assumption that the collusion attack is position-symmetric, the random variables $S_{C}^{(i)}$ are mutually independent. We are then allowed to use Bennett’s inequality (we take the weaker form, Lemma 3), which we do with the following parameters: $Y_{i} = \tilde{μ} - S_{C}^{(i)}$ ; $T = ℓ \tilde{μ} - cZ = (\tilde{μ} {Ac}_{0} - {\tilde{σ}}_{inn} βc) c_{0} ln \frac{1}{ε_{1}}$ ; $s^{2} = {\tilde{σ}}^{2}$ ; b=c/τ. The choice for b follows from

| Y_{i} | = | S_{C}^{(i)} - \tilde{μ} | \leq max \{c (\frac{1}{τ} - 1) - \tilde{μ}, \tilde{μ} + c\} \leq max \{\frac{c}{τ}, \tilde{μ} + c\} = \frac{c}{τ},

(49)

where the last equality is a consequence of the assumption (46). We can see that the T is positive from the assumption $\tilde{μ} A c_{0} - {\tilde{σ}}_{inn} βc > 0$ .

Notice that at c≫c₀ Theorem 3 no longer applies, because the condition $\tilde{μ} A c_{0} - {\tilde{σ}}_{inn} βc > 0$ cannot be satisfied. In practical terms, this means that for c>c₀, the FN probability is no longer under control, and the colluders may evade detection with high probability.

Theorem 4.

Let q≥2. Let the coalition employ a position-symmetric strategy. Let 2≤c≤c₀. Let $\tilde{μ} A - {\tilde{σ}}_{inn} β > 0$ . Let $τ \leq 2 / (2 + \tilde{μ})$ . Then the false-negative probability can be bounded as

P_{FN} \leq exp [(ln ε_{1}) c_{0} τ [\tilde{μ} A - {\tilde{σ}}_{inn} β] ln \frac{\tilde{μ} A - {\tilde{σ}}_{inn} β}{e ({\tilde{σ}}^{2} / c_{0}) Aτ}] .

(50)

Proof.

We start from Theorem 3. Due to the conditions c≤c₀ and $\tilde{μ} A - {\tilde{σ}}_{inn} β > 0$ , the condition $\tilde{μ} A c_{0} - {\tilde{σ}}_{inn} βc > 0$ in Theorem 3 holds. Due to c≥2 and $τ < 2 / (2 + \tilde{μ})$ , the condition (46) holds. Since all the conditions are satisfied, we are allowed to apply Theorem 3. Finally, we make use of the fact that the expression (47) is an increasing function of c for c≤c₀.

Corollary 2.

Let q≥2. Let the coalition employ a position-symmetric strategy. Let 2≤c≤c₀. Let $\tilde{μ} A - {\tilde{σ}}_{inn} β > 0$ . Let $τ \leq 2 / (2 + \tilde{μ})$ . Then it holds that

c_{0} τ [\tilde{} μA - {\tilde{σ}}_{inn} β] ln \frac{\tilde{μ} A - {\tilde{σ}}_{inn} β}{e ({\tilde{σ}}^{2} / c_{0}) Aτ} \geq \frac{ln ε_{2}}{ln ε_{1}} \Rightarrow P_{FN} \leq ε_{2} .

(51)

Proof.

Follows directly from Theorem 4.

Asymptotics of the sufficient code length

The main aim of this paper is to determine the performance of the score system (7) at large but finite c₀. The performance at ‘ c₀=∞’ is known: the saddlepoint is given by the interleaving attack, combined with the $κ = \frac{1}{2}$ Dirichlet distribution (with τ=0) as the bias distribution; in this saddlepoint, the rate of the score system is equal to capacity. What we want to know is how the fingerprinting rate approaches capacity and how to optimally choose the cutoff τ as a function of c₀.

5.1 Sufficient code length

We aim for an analysis in the (unknown!) large-but-finite- c₀ saddlepoint:

- The saddlepoint (‘SP’) of the mutual information minimax game [20] is close to the asymptotic saddlepoint. The unknown strategy θ^SP is close to interleaving. The unknown bias distribution F^SP(p) is some discrete distribution close to the Dirichlet distribution. We approximate F by the continuous Dirichlet distribution with cutoff τ because this is the only available constructive approach that we know of.

- A practical tracing system that uses the score function (7) cannot have a fixed threshold Z like the Tardos scheme, since the score statistics strongly depend on the colluder strategy. The threshold has to be chosen as a function of estimated values for ${\tilde{σ}}_{inn}$ and $\tilde{μ}$ . (See Section 3.1 for the estimation method.) When attacking this tracing system, the best choice for the colluders is to use θ^SP as their strategy, for otherwise they get caught faster. We will assume that the colluders use θ^SP, which in the analysis leads to a ‘fixed’ threshold Z that only has meaning in this context.

- Hence, we analyze the tracing system consisting of the bias distribution (2) and the score system (7), when pitted against an unknown attack close to interleaving. Our starting point will be the ‘sufficient’ conditions given by Corollaries 1 and 2. We know that ${\tilde{μ}}^{SP} = q - 1 - △ \tilde{μ}$ and ${({\tilde{σ}}_{inn}^{2})}^{SP} = q - 1 + △ {\tilde{σ}}_{inn}^{2}$ , and we have to carefully deal with the corrections $△ \tilde{μ}$ and $△ {\tilde{σ}}_{inn}^{2}$ . On the other hand, the $\tilde{σ}$ appears only in the logarithm in (51) and hence any corrections with respect to Lemma 5 can be neglected.

Corollary 1 and the condition $\tilde{μ} A - {\tilde{σ}}_{inn} β > 0$ together define an interval for the sufficient code length parameter ‘ A_suff,’

A_{suff} \in (\frac{{\tilde{σ}}_{inn}}{\tilde{μ}} β, \frac{1}{2} β^{2} - \frac{β}{3 c_{0} τ {\tilde{σ}}_{inn}}] .

(52)

This interval exists only if

β > 2 \frac{{\tilde{σ}}_{inn}}{\tilde{μ}} + \frac{2}{3 c_{0} τ {\tilde{σ}}_{inn}},

(53)

which yields

A_{suff} > \frac{2 {\tilde{σ}}_{inn}^{2}}{{\tilde{μ}}^{2}} + \frac{2}{3 c_{0} τ \tilde{μ}} .

(54)

We must try to bring β and A as close as possible to the bounds (53, 54) while still satisfying the condition in the left hand side of (51). We introduce the following shorthand notation:

\begin{array}{lcr} \frac{{\tilde{σ}}_{inn}}{\tilde{μ}} = \frac{1}{\sqrt{q - 1}} (1 + w), & ψ = \tilde{μ} A - {\tilde{σ}}_{inn} β, & \frac{{\tilde{σ}}^{2}}{c} = q - 1 + r, \end{array}

(55)

where w≪1, ψ≪1, r≪1. The w will be studied in the next section. The ψ we will solve approximately. The fact that r is small follows from Lemma 5. The expression $𝔼 [1 / p_{α}]$ in (34) is of order τ^κ−1; this leads to a contribution to ${\tilde{σ}}^{2} / c$ of order τ^κ/(c₀τ), which is negligible compared to (q−1) since c₀τ≫1 (see Section 4.1).

Theorem 5.

Let c₀τ≫1 and c₀τ²≪1. Let the attackers employ a position-symmetric strategy close to interleaving. Let 2≤c≤c₀. Then the following combination of a code length parameter A and threshold parameter β is sufficient to achieve P_FP≤ε₁ and P_FN≤ε₂.

\begin{array}{lcr} β_{suff} & = & \frac{2}{\sqrt{q - 1}} [1 + w + \frac{1}{3 c_{0} τ} + O (\frac{w}{c_{0} τ})] \end{array}

(56)

\begin{array}{lcr} A_{suff} & = & \frac{2}{q - 1} [1 + 2 w + \frac{1}{3 c_{0} τ} + \frac{ln ε_{2} / ln ε_{1}}{2 c_{0} τ ln \frac{1}{c_{0} τ^{2}}} + O (w^{2}) + O (\frac{w}{c_{0} τ})] . \end{array}

(57)

Proof.

Using the parametrization (55), the condition in (51) can be written compactly as

c_{0} τψ ln \frac{ψ}{e (q - 1 + r) Aτ} \geq \frac{ln ε_{2}}{ln ε_{1}} .

(58)

Taking the equal sign and solving for ψ gives (we denote the solution as ψ₀)

\begin{array}{lcr} ψ_{0} & = & \frac{ln ε_{2}}{ln ε_{1}} \cdot \frac{1}{c_{0} τ} \cdot \frac{1}{ln [\frac{1}{e (q - 1 + r) Aτ} \cdot \frac{ln ε_{2}}{ln ε_{1}} \cdot \frac{1}{c_{0} τ} ln \frac{ψ_{0}}{e (q - 1 + r) Aτ}]} \\ = & \frac{ln ε_{2}}{ln ε_{1}} \cdot \frac{1}{c_{0} τ} \cdot \frac{1}{ln [\frac{1}{c_{0} τ^{2}}] + ln [\frac{1}{e (q - 1) A} \frac{ln ε_{2}}{ln ε_{1}}] - O (r) + O (ln ln \frac{ψ_{0}}{τ})} \\ = & \frac{ln ε_{2}}{ln ε_{1}} \cdot \frac{1}{c_{0} τ ln \frac{1}{c_{0} τ^{2}}} [1 - O (\frac{ln ln \frac{1}{c_{0} τ^{2}}}{ln \frac{1}{c_{0} τ^{2}}})] \\ < \frac{ln ε_{2}}{ln ε_{1}} \cdot \frac{1}{c_{0} τ ln \frac{1}{c_{0} τ^{2}}} . \end{array}

(59)

We take $ψ = \frac{ln ε_{2}}{ln ε_{1}} \cdot \frac{1}{c_{0} τ ln \frac{1}{c_{0} τ^{2}}}$ (last line of (59)), since it is a compact analytical expression that satisfies (58). We can now find the sufficient A and β. We write $β_{suff} = 2 \frac{{\tilde{σ}}_{inn}}{\tilde{μ}} + \frac{2}{3 c_{0} τ {\tilde{σ}}_{inn}} + λ$ , with λ arbitrarily close to zero. Solving A from β and ψ gives

\begin{array}{lcr} A_{suff} & = & β_{suff} \frac{{\tilde{σ}}_{inn}}{\tilde{μ}} + \frac{ψ}{\tilde{μ}} \\ = & \frac{2}{q - 1} [1 + 2 w + \frac{1}{3 c_{0} τ} + \frac{ln ε_{2} / ln ε_{1}}{2 c_{0} τ ln \frac{1}{c_{0} τ^{2}}} + O (w^{2}) + O (λ) + O (\frac{w}{c_{0} τ})], \end{array}

(60)

where we have used that $△ \tilde{μ}$ and $△ {\tilde{σ}}_{inn}$ are of order w. Finally, we note that λ is much smaller than the other high-order correction terms.

Note that the condition c₀τ²≪1 is required in the above proof in order to make sure that the argument of the logarithm is well-behaved, i.e., larger than 1. Hence, when choosing τ we have to satisfy

\begin{align} Condition 1 c_{0} τ ≫ 1 . \\ Condition 2 c_{0} τ^{2} ≪ 1 . \end{align}

One way of satisfying these conditions is to set

\begin{array}{lcr} τ \propto c_{0}^{- γ} & with & γ \in (\frac{1}{2}, 1) . \end{array}

(61)

5.2 Optimization of the cutoff τ as a function of c₀

Lemma 8 (adapted from [21]).

Let △θ_y|m=θ y|m SP−m_y/c. The first-order and second-order correction terms to $\tilde{μ}$ and ${\tilde{σ}}_{inn}^{2}$ in the vicinity of the saddle point are given by

\begin{array}{lcr} {\tilde{μ}}_{(1)} & = & \sum_{m} (\binom{c}{m}) \sum_{y \in Q} △ θ_{y | m} m_{y} \frac{B (κ 1_{q} + m - e_{y})}{B (κ 1_{q})} \\ = & 𝔼_{m} \sum_{y \in Q} △ θ_{y | m} (1 - κ) \frac{c + qκ - 1}{m_{y} - (1 - κ)} \\ {[{\tilde{σ}}_{inn}^{2}]}_{(1)} & = & \sum_{m} (\binom{c}{m}) \sum_{y \in Q} △ θ_{y | m} \frac{B (κ 1_{q} + m - e_{y})}{B (κ 1_{q})} = 𝔼_{m} \sum_{y \in Q} △ θ_{y | m} \frac{c + qκ - 1}{m_{y} - (1 - κ)} \\ {\tilde{μ}}_{(2)} & = & \sum_{m} (\binom{c}{m}) \sum_{y \in Q} △ θ_{y | m} m_{y} [\frac{B (κ 1_{q} + m - e_{y})}{B (κ 1_{q})} - \frac{B_{τ} (κ 1_{q} + m - e_{y})}{B_{τ} (κ 1_{q})}] \\ {[{\tilde{σ}}_{inn}^{2}]}_{(2)} & = & \sum_{m} (\binom{c}{m}) \sum_{y \in Q} △ θ_{y | m} [\frac{B (κ 1_{q} + m - e_{y})}{B (κ 1_{q})} - \frac{B_{τ} (κ 1_{q} + m - e_{y})}{B_{τ} (κ 1_{q})}] . \end{array}

(62)

The first-order correction to ${\tilde{μ}}^{2} / {\tilde{σ}}_{inn}^{2}$ is zero because of the saddlepoint. The second-order correction to ${\tilde{μ}}^{2} / {\tilde{σ}}_{inn}^{2}$ is given by

\begin{array}{lcr} {[\frac{{\tilde{μ}}^{2}}{{\tilde{σ}}_{inn}^{2}}]}_{(2)} & = & 2 {\tilde{μ}}_{(2)} - {[{\tilde{σ}}_{inn}^{2}]}_{(2)} + \frac{1}{q - 1} {({\tilde{μ}}_{(1)} - {[{\tilde{σ}}_{inn}^{2}]}_{(1)})}^{2} \end{array}

(63)

\begin{array}{lcr} = & - \sum_{m} (\binom{c}{m}) \sum_{y \in Q} △ θ_{y | m} (2 m_{y} - 1) \frac{B_{τ} (κ 1_{q} + m - e_{y})}{B_{τ} (κ 1_{q})} + \frac{κ^{2}}{q - 1} {({[{\tilde{σ}}_{inn}^{2}]}_{(1)})}^{2} . \end{array}

(64)

Proof.

Equations 62 and 63 are a slight adaptation of the saddlepoint formulas in [21], where we have substituted the saddlepoint values $\tilde{μ} = q - 1$ and ${\tilde{σ}}_{inn}^{2} = q - 1$ . Note again that we have normalized the score function differently from [21] by a factor $\sqrt{q - 1}$ . Equation 64 follows from Equation 63 by using Equation 62.

Proposition 1.

The correction w is negligible compared to $\frac{1}{c_{0} τ}$ .

Argumentation.

The w is proportional to (63) or, differently expressed, (64). In (64) we have the ${({[{\tilde{σ}}_{inn}^{2}]}_{(1)})}^{2}$ term which is of order (△θ)². The order of magnitude of the $\sum_{m}$ contribution is more difficult to determine because the incomplete Dirichlet integral B_τ(κ 1_q+m−e_y) is difficult to bound;^b however, no matter how B_τ(κ 1_q+m−e_y) is behaved, the $\sum_{m}$ contribution is at most of order △θ. Huang and Moulin [20] conjectured that $△ θ = O (\frac{1}{c})$ , and this turned out to be consistent with their asymptotic saddlepoint analysis. If their conjecture is true, we have $w \propto \frac{1}{c_{0}} ≪ \frac{1}{c_{0} τ}$ . Even if their conjecture is not true and △θ scales as, for instance, $1 / \sqrt{c}$ , then, $w \propto 1 / \sqrt{c_{0}} ≪ \frac{1}{c_{0} τ}$ , i.e., w is still negligible. (The latter holds because τ scales as $c_{0}^{- γ}$ with $γ > \frac{1}{2}$ .)

The consequences of Proposition 1 are the following: The optimal choice for the cutoff is to set

γ_{opt} = \frac{1}{2} + ν

(65)

where ν denotes a very small positive number. The sufficient code length is then given by

A_{suff} = \frac{2}{q - 1} [1 + O (c_{0}^{- 1 / 2 + ν})] .

(66)

Note that the correction term is smaller than the $O (c_{0}^{- 1 / 3})$ that was found [5] for Tardos’s score function at q=2.

Conclusions

We have studied a q-ary bias-based collusion-resistant scheme where the score function (7) of Oosterwijk et al. [21] is used in combination with the Dirichlet distribution with a cutoff. We have used Bernstein’s inequality and Bennett’s inequality to upper bound the error rates. For large c₀, this leads to a sufficient code length as specified in Theorem 5.

Then we adopted a conjecture (based on a conjecture by Huang and Moulin) that △θ, the difference in strategy between the finite-c and infinite-c saddlepoint, is of order $O (1 / \sqrt{c})$ . This leads to an optimal cutoff choice $τ = 1 / (λ c_{0}^{1 / 2 + ν})$ , where λ>0 is a constant and ν is a very small positive constant. The sufficient code length is then

ℓ_{suff} = \frac{2}{q - 1} [1 + λ c_{0}^{- \frac{1}{2} + ν} (\frac{1}{3} + \frac{1}{4} \frac{ln ε_{2}}{ln ε_{1}} \frac{1}{ln (c_{0}^{ν} λ)}) + \dots] c_{0}^{2} ln ε_{1}^{- 1},

(67)

and the corresponding accusation threshold is

Z = 2 [1 + \frac{1}{3} λ c_{0}^{- \frac{1}{2} + ν} + \dots] c_{0} ln ε_{1}^{- 1} .

(68)

From previous work on provable bounds for bias-based codes, it is clear that the bounds obtained from concentration inequalities (Markov, Bernstein, Bennett) are not tight.

As topics for future work, we mention the following: (i) obtaining tighter bounds - the CSE method [6] or similar techniques may yield more precise information about the error rates. (ii) Studying the performance of the score function (7) further away from the asymptotic saddlepoint. This would require locating (by numerical techniques) the saddlepoint for large but finite c. (iii) Applying the analysis in this paper in the context of dynamic traitor tracing, similar to the work in [27].

Endnotes

^a Throughout this paper, the term asymptotic refers to the limit of large coalition size.

^b The correction to the normalization factor is known. In [22] it was found that $B_{τ} (κ 1_{q}) = B (κ 1_{q}) [1 - O (τ^{κ})]$ .

References

G Tardos, in Proceedings of the 35th Annual ACM Symposium on Theory of Computing (STOC). Optimal probabilistic fingerprint codes, (2003), pp. 116–125.
Google Scholar
Blayer O, Tassa T: Improved versions of Tardos’ fingerprinting scheme. Des Codes Cryptography 2008, 48(1):79-103.
Article MATH MathSciNet Google Scholar
T Furon, A Guyader, F Cérou, in Information Hiding, Lecture Notes in Computer Science, 5284. On the design and optimization of Tardos probabilistic fingerprinting codes (Springer, 2008), pp. 341–356.
Chapter Google Scholar
Furon T, Pérez-Freire L, Guyader A, Cérou F: Estimating the minimal length of Tardos code. In Information Hiding, LNCS. Springer, Heidelberg; 2009:176-190.
Chapter Google Scholar
Laarhoven T, de Weger BMM: Optimal symmetric Tardos traitor tracing schemes. Designs Codes Cryoptography 2011, 71: 83-103.
Article MathSciNet Google Scholar
Simone A: Accusation probabilities in Tardos codes: beyond the Gaussian approximation. Des Codes Cryptography 2012, 63(3):379-412.
Article MATH MathSciNet Google Scholar
Vladimirova TU, Celik MU, Talstra JC: Tardos fingerprinting is better than we thought. IEEE Trans. Inform. Theor 2008, 54(8):3663-3676.
Article MathSciNet Google Scholar
YW Huang, P Moulin, in IEEE Workshop on Information Forensics and Security (WIFS). Capacity-achieving fingerprint decoding (London, 6–9 December 2009), pp. 51–55.
Google Scholar
K Nuida, in Information Hiding, LNCS, 6387. Short collusion-secure fingerprint codes against three pirates (Springer, 2010), pp. 86–102.
Chapter Google Scholar
Nuida K, Fujitsu S, Hagiwara M, Kitagawa T, Watanabe H, Ogawa K, Imai H: An improvement of discrete Tardos fingerprinting codes. Des Codes Cryptography 2009, 52(3):339-362.
Article MATH MathSciNet Google Scholar
E Amiri, G Tardos, in Proceedings of the 20th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA). High rate fingerprinting codes and the fingerprinting capacity (New York, 4–6 January 2009), pp. 336–345.
Chapter Google Scholar
A Charpentier, F Xie, C Fontaine, T Furon, in SPIE Proceedings on Media Forensics and Security, 7254. Expectation maximization decoding of Tardos probabilistic fingerprinting code (SPIE, 2009), p. 72540.
Chapter Google Scholar
P Meerwald, T Furon, in Information Hiding, LNCS, 6958. Towards joint Tardos decoding: the ‘Don Quixote’ algorithm (Springer, 2011), pp. 28–42.
Chapter Google Scholar
J-J Oosterwijk, Škorić B, J Doumen, in Information Hiding & Multimedia Security 2013. Optimal suspicion functions for Tardos traitor tracing schemes (Montpellier, 17–19 June 2013).
Google Scholar
A Charpentier, C Fontaine, T Furon, IJ Cox, in Information Hiding, LNCS, 6958. An asymmetric fingerprinting scheme based on Tardos codes (Springer, 2011), pp. 43–58.
Chapter Google Scholar
Katzenbeisser S, Celik MU: Symmetric Tardos fingerprinting codes for arbitrary alphabet sizes. Des Codes Cryptography 2008, 46(2):137-166.
Article MathSciNet Google Scholar
Katzenbeisser S, Schaathun HG, Celik MU: Tardos fingerprinting codes in the combined digit model. IEEE Trans. Inf. Forensics Secur 2011, 6(3):906-919.
Article Google Scholar
F Xie, T Furon, C Fontaine, in Proceedings of the 10th Workshop on Multimedia & Security (MM&Sec). On-off keying modulation and Tardos fingerprinting (ACM, 2008), pp. 101–106.
Google Scholar
D Boesten, Škorić B, in Information Hiding 2011, LNCS, 6958. Asymptotic fingerprinting capacity for non-binary alphabets (Springer, 2011), pp. 1–13.
Google Scholar
Y-W Huang, P Moulin, in IEEE International Symposium on Information Theory (ISIT) 2012. On fingerprinting capacity games for arbitrary alphabets and their asymptotics (Cambridge, 1–6 July 2012), pp. 2571–2575.
Chapter Google Scholar
J-J Oosterwijk, Škorić B, J Doumen, A capacity-achieving simple decoder for bias-based traitor tracing schemes (2013). http://eprint.iacr.org/2013/389 Accessed 5 August 2014.
Google Scholar
Škorić B, J-J Oosterwijk, Binary and q-ary Tardos codes, revisited. Designs, Codes, and Cryptography (2012). http://eprint.iacr.org/2012/249 Accessed 5 August 2014.
Google Scholar
T Laarhoven, BMM de Weger, in Information Hiding & Multimedia Security 2013 Discrete Distributions in the Tardos Scheme, Revisited, (2013).
Google Scholar
P Moulin, Universal fingerprinting: capacity and random-coding exponents (2008). http://arxiv.org/abs/0801.3837.
Google Scholar
SN Bernstein, Theory of Probability, (1927).
Google Scholar
Bennett G: Probability inequalities for the sum of independent random variables. J. Am. Stat. Assoc 1962, 57(297):33-45.
Article MATH Google Scholar
T Laarhoven, J Doumen, P Roelse, Škorić B, B de Weger, Dynamic Tardos traitor tracing schemes. IEEE Trans. Inf. Theory. 59(7), 4230–4242.

Download references

Acknowledgments

We thank Benne de Weger, Jeroen Doumen, and Thijs Laarhoven for useful discussions. Part of this work was supported by STW (project 10518).

Author information

Authors and Affiliations

Eindhoven University of Technology, Eindhoven, 5612 AZ, Netherlands
Sarah Ibrahimi, Boris Škorić & Jan-Jaap Oosterwijk

Authors

Sarah Ibrahimi
View author publications
You can also search for this author in PubMed Google Scholar
Boris Škorić
View author publications
You can also search for this author in PubMed Google Scholar
Jan-Jaap Oosterwijk
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Boris Škorić.

Additional information

Competing interests

The authors declare that they have no competing interests.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Ibrahimi, S., Škorić, B. & Oosterwijk, JJ. Riding the saddle point: asymptotics of the capacity-achieving simple decoder for bias-based traitor tracing. EURASIP J. on Info. Security 2014, 12 (2014). https://doi.org/10.1186/s13635-014-0012-6

Download citation

Received: 27 January 2014
Accepted: 28 July 2014
Published: 15 August 2014
DOI: https://doi.org/10.1186/s13635-014-0012-6

Keywords

Traitor tracing; Fingerprinting

Riding the saddle point: asymptotics of the capacity-achieving simple decoder for bias-based traitor tracing

Abstract

MSC

Introduction

1.1 Traitor tracing

1.2 Capacity-achieving simple decoder

1.3 Contributions and outline

Preliminaries

2.1 Bias-based tracing using the asymptotically optimal simple decoder

2.1.1 Notation

2.1.2 Code generation

2.1.3 Collusion attack

2.1.4 Simple decoder

2.1.5 Measuring the performance

Definition 1.

2.2 Computing expectations

2.3 Concentration inequalities

Lemma 1 (Bernstein’s inequality [25]).

Lemma 2 (Bennett’s inequality [26]).

Property 1.

Proof.

Lemma 3 (weaker form of Bennett’s inequality).

Proof.

Statistics of the innocent score and coalition score

3.1 General result for the moments

Lemma 4.

Proof.

3.2 The case of the interleaving attack

Lemma 5.

Proof.

Remark 1.

Remark 2.

Remark 3.

3.3 Taking the Dirichlet distribution with cutoff τ=0

Lemma 6.

Proof.

Theorem 1.

Proof.

Lemma 7.

Proof.

Bounding the error probabilities

4.1 Bounding the false-positive probability

Theorem 2.

Proof.

Corollary 1.

Proof.

4.2 Bounding the false-negative probability

Theorem 3.

Proof.

Theorem 4.

Proof.

Corollary 2.

Proof.

Asymptotics of the sufficient code length

5.1 Sufficient code length

Theorem 5.

Proof.

5.2 Optimization of the cutoff τ as a function of c 0

Lemma 8 (adapted from [21]).

Proof.

Proposition 1.

Argumentation.

Conclusions

Endnotes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Rights and permissions

About this article

Cite this article

Share this article

Keywords

5.2 Optimization of the cutoff τ as a function of c₀