Analysis of Non-Aligned Double JPEG Artifacts for the Localization of Image Forgeries

T. Bianchi and A. Piva
Dept. of Electronics and Telecommunications, University of Florence, Via S. Marta 3, 50139 Firenze, Italy
National Inter-University Consortium for Telecommunications, Via S. Marta 3, 50139 Firenze, Italy
tiziano.bianchi@unifi.it, alessandro.piva@unifi.it
Abstract—In this paper, we present a forensic algorithm to discriminate between original and forged regions in JPEG images, under the hypothesis that the tampered image presents a non-aligned double JPEG compression (NA-JPEG). Unlike previous approaches, the proposed algorithm does not need to manually select a suspect region to test the presence or the absence of NA-JPEG artifacts. Based on a new statistical model, the probability for each 8 × 8 DCT block to be forged is automatically derived. Experimental results, considering different forensic scenarios, demonstrate the validity of the proposed approach.
I. INTRODUCTION
Easy-to-use image processing tools that allow the content of digital images to be modified are now so widely available that the diffusion of fake content through the digital world is becoming increasingly worrying. Such a possibility raises several problems in all the fields in which the credibility of images should be guaranteed before using them as sources of information, such as insurance, law enforcement, journalism, and medical applications.
In recent years, many image forensic techniques have been proposed as a means of revealing the presence of forgeries in
digital images through the analysis of statistical and geometri-
cal features, JPEG quantization artifacts, interpolation effects,
demosaicing traces, feature inconsistencies, etc. [1].
Since the majority of digital images are stored in JPEG format, several forensic tools have been designed to detect the presence of tampering in this class of images. The forgery is revealed by analyzing the artifacts introduced by the JPEG recompression that occurs when the forged image is generated;
in particular, such artifacts can be categorized into two classes,
according to whether the second JPEG compression uses a
DCT grid aligned with the first compression or not. The
first case will be referred to as aligned double JPEG (A-
DJPG) compression, whereas the second case will be referred
to as non-aligned double JPEG (NA-DJPG) compression.
Approaches belonging to the first category include [2], where
the author proposes to detect areas which have undergone
a double JPEG compression by recompressing the image at
different quality levels and looking for the presence of so-
called ghosts, and [3],[4], where double JPEG compression
is detected by analyzing the statistics of blockwise DCT coefficients. The presence of non-aligned double JPEG compression has been investigated in [5], [6], and [7], which detect particular
distortions in blocking artifacts, in [8], where the shift of
the primary JPEG compression is determined via a demixing
approach, and in [9], where the periodicity of blockwise DCT
coefficients is studied.
However, the above algorithms rely on the hypothesis that the location of the forged area is known in advance, for example by applying a segmentation of the image under test before the forensic analysis [6], or they are just designed to decide whether the whole image has been doubly JPEG compressed [5][7],
so that the correct localization of the forgery in a tampered
image is still an open issue. To the best of our knowledge, only
some forensic algorithms designed to work in the presence
of aligned double JPEG compression are able to localize a
tampered area: in [3] and [4] two methods are proposed for
the automatic localization of tampered regions with a fine-
grained scale of 8 × 8 blocks.
In this paper, we therefore propose the first forensic tool that, differently from previous works, can reveal tampering at a local level, without any prior information about the location of the manipulated area, in the presence of non-aligned double JPEG compression. The output of the algorithm is a map that gives the probability, or the likelihood, of each 8 × 8 image block being tampered with. The proposed algorithm can be applied
in different forensic scenarios, in which either the presence
or the absence of NA-JPEG artifacts at a local level can be
interpreted as evidence of tampering.
II. FORENSIC SCENARIOS
In order to correctly interpret the presence or the absence of
artifacts due to double compression, in the following analysis
we will consider two different scenarios.
A first scenario is that in which an original JPEG image,
after some localized forgery, is saved again in JPEG format.
We can assume that the forger disrupts the JPEG compression statistics in the tampered area: examples could be a cut-and-paste from either a non-compressed image or a resized image, or the insertion of computer-generated content. In this case,
DCT coefficients of unmodified areas will undergo a double
JPEG compression thus exhibiting double quantization (DQ)
artifacts, while DCT coefficients of forged areas will result

from a single compression and will likely present no DQ
artifacts. In the following, we will refer to this case as the
single compression forgery (SCF) hypothesis.
A second scenario is that of image splicing. In this kind
of forgery, it is assumed that a region from a JPEG image
is pasted onto a host image that does not exhibit JPEG
compression statistics, and that the resulting image is JPEG
recompressed. In this case, the forged region will exhibit
double compression artifacts, whereas the non manipulated
region will present no such artifacts. In the following, we will
refer to this second case as the double compression forgery
(DCF) hypothesis.
Under the SCF hypothesis, NA-DJPG artifacts will be
present if the original image is randomly cropped before being
recompressed in JPEG format. Under the DCF hypothesis,
assuming that the forged region is randomly pasted in the new
image, there is a probability of 63/64 that the 8×8 block grids
of the host image and of the pasted region will be misaligned,
and thus that the forged region will exhibit NA-DJPG artifacts.
III. SINGLE AND DOUBLE JPEG COMPRESSION MODELS
In this section, we will describe the statistical model used to
characterize NA-DJPG artifacts. We will also introduce some
simplifications that will be useful in defining the proposed
detection algorithm, as well as some modifications needed to
take into account the effects of rounding and truncation errors
between the first compression and the second compression.
A. JPEG Compression Model
The JPEG compression algorithm can be modeled by three
basic steps [10]: 8 × 8 block DCT of the image pixels,
uniform quantization of DCT coefficients with a quantization
matrix whose values depend on a quality factor QF , entropy
encoding of the quantized values. The image resulting from
decompression will be obtained by the inverse of each step
in reverse order: entropy decoding, dequantization, inverse
block DCT. In the following analysis, we will consider that
quantization is achieved by dividing each DCT coefficient by
a proper quantization step Q and rounding the result to the
nearest integer, whereas dequantization is achieved by simply
multiplying by Q.
Let us then assume that an original uncompressed image
I is JPEG compressed with a quality factor QF , and then
decompressed. Since entropy encoding is perfectly reversible,
the image obtained after JPEG decompression can be modeled
as follows:
I_1 = D_00^{-1} D(Q(D_00 I)) + E_1 = I + R_1.   (1)
In the above equation, D_00 models an 8 × 8 block DCT with the grid aligned with the upper left corner of the image, Q(·) and D(·) model quantization and dequantization processes, respectively, and E_1 is the error introduced by rounding and truncating the output values to eight bit integers. The last quantity R_1 can be thought of as the overall approximation error introduced by JPEG compression with respect to the original image. In the above chain, if we neglect rounding/truncation (R/T) errors, the only operation causing a loss of information is the quantization process Q(·).
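To make the above chain concrete, the following Python sketch (an illustrative aside, not code from the paper) reproduces (1) on a single channel, using one quantization step Q for all frequencies in place of a quality-factor-dependent quantization matrix.

import numpy as np
from scipy.fft import dctn, idctn

def blockwise(img, func, block=8):
    """Apply func to each non-overlapping block x block tile of img."""
    out = np.empty(img.shape, dtype=float)
    for i in range(0, img.shape[0], block):
        for j in range(0, img.shape[1], block):
            out[i:i+block, j:j+block] = func(img[i:i+block, j:j+block])
    return out

def jpeg_like(img, Q):
    """Model I1 = D00^{-1} D(Q(D00 I)) + E1 with a single step Q for all frequencies."""
    coeffs = blockwise(img.astype(float), lambda blk: dctn(blk, norm='ortho'))   # D00 I
    dequantized = np.round(coeffs / Q) * Q                                       # D(Q(.))
    recon = blockwise(dequantized, lambda blk: idctn(blk, norm='ortho'))         # D00^{-1}
    I1 = np.clip(np.round(recon), 0, 255)                                        # R/T to 8-bit range
    return I1, I1 - img                                                          # I1 and R1 = I1 - I

# example: overall approximation error R1 on a random test image
I = np.random.randint(0, 256, (64, 64)).astype(float)
I1, R1 = jpeg_like(I, Q=10)
print(R1.mean(), R1.std())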
Let us now analyse the artifacts that appear in the presence
of a double non-aligned JPEG compression, due to the interac-
tion of successive quantization and dequantization processes.
B. NA-DJPG Compression
In the case of NA-DJPG compression, we can assume that the original image I has been JPEG compressed with a quality factor QF_1 using a DCT grid shifted by (r, c), 0 ≤ r ≤ 7 and 0 ≤ c ≤ 7, with respect to the upper left corner, so that the image obtained after JPEG decompression can be represented as:

I_1 = D_rc^{-1} D_1(Q_1(D_rc I)) + E_1   (2)

where D_rc I are the unquantized DCT coefficients of I and Q_1, D_1 denote that a proper quantization matrix corresponding to the quality QF_1 was used.
We then assume that the image has been JPEG compressed again with a quality factor QF_2, but now with the block grid aligned with the upper left corner of the image. If we consider the DCT coefficients of the second compression after entropy decoding, no noticeable artifacts are present. However, if we consider the image after the second decompression, i.e., I_2 = I_1 + R_2, and we apply a block DCT with alignment (r, c), we have

D_rc I_2 = D_1(Q_1(D_rc I)) + D_rc(E_1 + R_2).   (3)
Since the JPEG standard uses 64 different quantization steps, one for each of the 64 frequencies within an 8 × 8 DCT, the DCT coefficients will be distributed according to 64 different probability distributions. According to the above equation, each unquantized DCT coefficient obtained by applying to the doubly compressed image I_2 a block DCT with alignment (r, c) (i.e., the same alignment as the first compression) will be distributed as

p_Q(x; Q_1) = p_1(x) ∗ g_Q(x)   (4)
where Q_1 is the quantization step of the first compression, g_Q(x) models the distribution of the overall approximation error, i.e., the term D_rc(E_1 + R_2), ∗ denotes convolution, and

p_1(v) = \sum_{u=v-Q_1/2}^{v+Q_1/2} p_0(u)  if v = kQ_1,  and  p_1(v) = 0  elsewhere   (5)

models the distribution of the DCT coefficients after quantization by Q_1 and dequantization, where p_0(u) is the distribution of the original unquantized coefficients.
If we model the approximation error as the sum of the R/T error in the DCT domain plus the quantization error due to uniform quantization with quantization step Q_2, by invoking the central limit theorem we can assume that the R/T error is Gaussian distributed with mean μ_e and variance σ_e^2, and thus the approximation error is Gaussian distributed with mean μ_e and variance σ_e^2 + Q_2^2/12, i.e.,

g_Q(x) = \frac{1}{\sqrt{2π(σ_e^2 + Q_2^2/12)}} e^{−(x − μ_e)^2 / (2(σ_e^2 + Q_2^2/12))}   (6)

In the absence of NA-DJPG compression, that is, if the image did not undergo a first JPEG compression with alignment (r, c), the unquantized DCT coefficients obtained by applying a shifted block DCT can be assumed to be distributed approximately as the original unquantized coefficients, that is,

p_NQ(x) = p_0(x)   (7)

since a misalignment of the DCT grids usually destroys the effects of quantization [11].
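As a numerical illustration of the two competing models (a sketch assuming, purely for illustration, a Laplacian p_0 and arbitrary parameter values, neither of which comes from the paper), the following Python code evaluates p_NQ(x) from (7) and p_Q(x; Q_1) from (4)-(6) on an integer grid of unquantized DCT values.

import numpy as np
from scipy.stats import laplace, norm

def na_djpg_model(x, Q1, Q2, mu_e=0.0, sigma_e=1.0, scale=2.0):
    """Evaluate p_NQ(x) (eq. 7) and p_Q(x; Q1) (eq. 4) on the integer grid x,
    assuming for illustration a Laplacian p_0 with the given scale."""
    p0 = laplace.pdf(x, scale=scale)                        # stand-in for p_0(u), eq. (7)
    # p_1 (eq. 5): probability mass of each quantization bin, placed on multiples of Q1
    p1 = np.zeros(len(x))
    on_grid = (x % Q1 == 0)
    p1[on_grid] = (laplace.cdf(x[on_grid] + Q1 / 2, scale=scale)
                   - laplace.cdf(x[on_grid] - Q1 / 2, scale=scale))
    # g_Q (eq. 6): Gaussian approximation error with variance sigma_e^2 + Q2^2/12
    g = norm.pdf(x, loc=mu_e, scale=np.sqrt(sigma_e**2 + Q2**2 / 12.0))
    pQ = np.convolve(p1, g, mode='same')                    # eq. (4): p_Q = p_1 * g_Q
    return p0, pQ

x = np.arange(-60, 61)                  # integer grid of unquantized DCT values
p_NQ, p_Q = na_djpg_model(x, Q1=8, Q2=5)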
C. Simplified Model
Although the model in (4) is quite accurate, it requires knowledge of the distribution of the unquantized coefficients p_0(u), which may not be available in practice. However, it is possible to make some simplifications in order to obtain a model that is less dependent on the image content.
Indeed, if we can assume that the histogram of the original DCT coefficients is locally uniform, that is, p_0(u) is smooth, we can simplify

p_1(x) ≈ Q_1 p_0(x)  if x = kQ_1,  and  p_1(x) ≈ 0  elsewhere   (8)
Hence, if we assume that the JPEG approximation error due to the last compression is smaller than Q_1, and thanks to (7), we have that (4) can be simplified to

p_Q(x; Q_1) ≈ n_Q(x) · p_NQ(x),  x ≠ 0   (9)

where n_Q(x) = n_{Q,0}(x) ∗ g_Q(x) and

n_{Q,0}(x) = Q_1  if x = kQ_1,  and  n_{Q,0}(x) = 0  elsewhere   (10)
In Fig. 1 the models proposed in (4), (9), and (7) are com-
pared with the histograms of unquantized DCT coefficients of
a NA-DJPG compressed and a singly compressed image: in
both cases there is a good agreement between the proposed
models and the real distributions.
IV. FORGERY LOCALIZATION ALGORITHM
In the following, we will assume that for each DCT coefficient x of an image, we know both the probability distribution of x conditional on the hypothesis of being tampered, i.e., p(x|H_1), and the probability distribution of x conditional on the hypothesis of not being tampered, i.e., p(x|H_0).
The above conditional distributions are given by (4) and (7), according to whether we are considering the SCF or the DCF hypothesis. For example, under the DCF hypothesis we have p(x|H_1) = p_Q(x; Q_1) and p(x|H_0) = p_NQ(x). In the following, for the sake of simplicity, we will always assume the DCF hypothesis, i.e., p(x|H_0) denotes the distribution of singly compressed coefficients, and p(x|H_1) is the distribution of doubly compressed coefficients.
Given p(x|H_1) and p(x|H_0), a DCT coefficient x can be classified as belonging to one of the two models according to the value of the likelihood ratio

L(x) = p(x|H_1) / p(x|H_0).   (11)
Fig. 1. Example of the NA-DJPG compression model (frequency versus unquantized DCT value): h_Q and h_NQ denote the histograms of unquantized DCT coefficients of a NA-DJPG compressed and a singly compressed image, respectively; p_Q, the simplified p_Q, and p_NQ are the corresponding model distributions. The distributions obtained according to equations (4), (9), and (7) are in good agreement with these data.
If multiple DCT coefficients within the same 8 × 8 block are considered, by assuming that they are independently distributed we can express the likelihood ratio corresponding to the block at position (i, j) as

L(i, j) = \prod_k L(x_k(i, j))   (12)

where x_k(i, j) denotes the kth DCT coefficient within the block at position (i, j)¹. Such values form a likelihood map of the JPEG image with a resolution of 8 × 8 pixels, which can be used to localize possibly forged regions within the image.
By using the simplified model described in Section III-C, it is possible to approximate the likelihood ratio as either L(x) = 1/n_Q(x) (in the case of the SCF hypothesis) or L(x) = n_Q(x) (in the case of the DCF hypothesis). The likelihood map obtained using such simplifications can be expressed as

L(i, j) ≈ \prod_k n_Q(x_k(i, j))^b   (13)

where b = −1 (SCF) or b = 1 (DCF). This map depends only on the compression parameters, i.e., Q_1 and Q_2, having removed any dependence on the image content. Hence, even if approximate, the adoption of the simplified models can lead to a more robust localization of possibly forged regions.
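The following Python sketch assembles such a simplified map (an illustrative reading of (13), not the authors' implementation: the stand-in n_Q keeps only the comb tooth of n_{Q,0} nearest to x, and the map is returned as a log-likelihood, which is monotone in (13)).

import numpy as np
from scipy.stats import norm

# JPEG zig-zag order of the 64 frequencies of an 8x8 block, as (row, col) pairs
ZIGZAG = sorted(((u, v) for u in range(8) for v in range(8)),
                key=lambda uv: (uv[0] + uv[1],
                                uv[0] if (uv[0] + uv[1]) % 2 else uv[1]))

def n_Q(x, Q1, Q2, mu_e=0.0, sigma_e=1.0):
    """Approximate n_Q = n_{Q,0} * g_Q (eqs. 6, 10), keeping only the nearest multiple of Q1."""
    nearest = np.round(x / Q1) * Q1
    return Q1 * norm.pdf(x, loc=nearest + mu_e,
                         scale=np.sqrt(sigma_e**2 + Q2**2 / 12.0))

def likelihood_map(dct_shifted, Q1_matrix, Q2, n_coeffs=6, b=1):
    """Block-level log-likelihood map of eq. (13).
    dct_shifted: blockwise DCT of the decompressed image computed with the
                 estimated grid shift (r, c); entry (8i+u, 8j+v) holds frequency
                 (u, v) of block (i, j).  b = +1 under DCF, b = -1 under SCF."""
    H, W = dct_shifted.shape
    logL = np.zeros((H // 8, W // 8))
    for (u, v) in ZIGZAG[:n_coeffs]:                 # DC first, then zig-zag order
        coeff = dct_shifted[u::8, v::8]              # frequency (u, v) of every block
        logL += b * np.log(n_Q(coeff, Q1_matrix[u, v], Q2) + 1e-12)
    return logL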
A. Estimation of Model Parameters
The models described in Section III require the estimation of some parameters in order to be applied in practice. Among these parameters, p_0(u), Q_2, μ_e, and σ_e are common to both p(x|H_1) and p(x|H_0), whereas Q_1 is required only to characterize the distribution of doubly quantized coefficients. Moreover, we should determine the shift (r, c) between the first compression and the last compression in order to compute the unquantized DCT coefficients as in (3).
As to Q_2, we will assume that it is available from the JPEG image header. As to the shift (r, c), we will assume that it has already been estimated, e.g., using the methods described in [9], [8]. As to the other parameters, they are estimated according to the following procedures.
¹ With a slight abuse of notation, we use the same symbol L(x) even if for different k we have different likelihood functions. The same convention is used in (13) when referring to n_Q(x).
1) Estimation of Q_1: The estimation of the quantization step of the primary compression is crucial for the correct modeling of doubly compressed regions. When dealing with a possibly forged image, usually there is no prior knowledge regarding the location of such regions. An image block could include an original area, as well as a tampered one. Thus, the distribution of the DCT coefficients of a tampered image can be modeled as a mixture of p(x|H_1) and p(x|H_0), i.e.,

p(x; Q_1) = α · p(x|H_0) + (1 − α) · p(x|H_1; Q_1)   (14)
where α is the mixture parameter and we have highlighted the dependence of p(x|H_1) on Q_1. Based on the above model, the maximum likelihood estimate of Q_1 can be obtained as

\hat{Q}_1 = arg max_{Q_1} \sum_x log[ α_opt p(x|H_0) + (1 − α_opt) p(x|H_1; Q_1) ]   (15)

where α_opt is the optimal mixture parameter. For each Q_1, the optimal mixture parameter can be estimated using an expectation-maximization (EM) algorithm.
Since Q_1 is a discrete parameter with a limited set of possible values, the maximization in (15) can be solved iteratively by trying every possible Q_1 and using the corresponding α_opt. In order to estimate the complete quantization matrix, the above maximization problem is solved separately for each of the 64 DCT coefficients within a block.
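A rough Python sketch of this search for a single DCT frequency is given below (illustrative only: the candidate range for Q_1, the number of EM iterations, and the callbacks p_H0_fn and p_H1_fn, which are assumed to implement (7) and (4), are not values taken from the paper).

import numpy as np

def em_mixture_alpha(p_H0, p_H1, n_iter=50):
    """Estimate the mixture parameter alpha of eq. (14) by EM, given the
    per-coefficient densities p_H0 = p(x|H0) and p_H1 = p(x|H1; Q1)."""
    alpha = 0.5
    for _ in range(n_iter):
        # E-step: posterior probability that each coefficient is singly compressed
        w = alpha * p_H0 / (alpha * p_H0 + (1 - alpha) * p_H1 + 1e-12)
        # M-step: update the mixture weight
        alpha = w.mean()
    return alpha

def estimate_Q1(x, p_H0_fn, p_H1_fn, candidates=range(1, 31)):
    """Grid search of eq. (15): for each candidate Q1, fit alpha by EM and keep
    the Q1 that maximizes the mixture log-likelihood over the coefficients x."""
    best_Q1, best_ll = None, -np.inf
    p_H0 = p_H0_fn(x)
    for Q1 in candidates:
        p_H1 = p_H1_fn(x, Q1)
        alpha = em_mixture_alpha(p_H0, p_H1)
        ll = np.sum(np.log(alpha * p_H0 + (1 - alpha) * p_H1 + 1e-12))
        if ll > best_ll:
            best_Q1, best_ll = Q1, ll
    return best_Q1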
2) Estimation of p_0(u): Following the observations in [11],
we propose to approximate the distribution of the unquantized
DCT coefficients using the histogram of the DCT coefficients
of the decompressed image computed after the DCT grid
is suitably shifted with respect to the upper left corner. In
particular, we will use a shift of ±1 with respect to the
estimated shift (r, c) of the primary compression, where the
sign of the increment is chosen so as to keep the shift values
between 0 and 7 and to avoid the case (0, 0).
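A possible reading of this estimate, for a single frequency, is sketched below (illustrative Python; applying the ±1 increment to both shift coordinates and the choice of histogram range are assumptions of this sketch).

import numpy as np
from scipy.fft import dctn

def shifted_block_dct(img, r, c):
    """8x8 block DCT of img with the grid shifted by (r, c); returns shape (H/8, W/8, 8, 8)."""
    crop = img[r:, c:].astype(float)
    H, W = (crop.shape[0] // 8) * 8, (crop.shape[1] // 8) * 8
    blocks = crop[:H, :W].reshape(H // 8, 8, W // 8, 8).transpose(0, 2, 1, 3)
    return dctn(blocks, axes=(-2, -1), norm='ortho')

def estimate_p0(img, r, c, freq=(0, 1), bins=np.arange(-100.5, 101.5)):
    """Approximate p_0(u) for one frequency from the histogram of DCT coefficients
    computed with a grid shift of +/-1 relative to the estimated (r, c)."""
    r1 = r + 1 if r < 7 else r - 1          # keep the shift in [0, 7]; never yields (0, 0)
    c1 = c + 1 if c < 7 else c - 1
    coeffs = shifted_block_dct(img, r1, c1)[..., freq[0], freq[1]].ravel()
    hist, edges = np.histogram(coeffs, bins=bins, density=True)
    return hist, 0.5 * (edges[:-1] + edges[1:])   # p_0 estimate and bin centers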
3) Estimation of μ_e and σ_e: The true values of both μ_e and σ_e should be estimated by relying on the primary JPEG compression, which in general is not available when observing the tampered image. In practice, we found that they can be well approximated by measuring the R/T error on the tampered image. The rationale is that both μ_e and σ_e are mainly determined by the coarse-grained statistics of the image content, which are usually little affected by tampering.
Looking at equation (1), given as input the quantized DCT coefficients C_2 of the observed image, we can compute the term E_2 by reconstructing the image with infinite precision as D_00^{-1} D(C_2), which can be approximated using floating point arithmetic, and taking the difference with the image I_2, which is obtained by rounding and truncating the floating point values to 8-bit precision.
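This measurement can be sketched as follows (illustrative Python, assuming the quantized coefficients are stored blockwise and that the spatial-domain statistics of E_2 are used directly as estimates of μ_e and σ_e).

import numpy as np
from scipy.fft import idctn

def rt_error_stats(C2, Q2_matrix):
    """Approximate mu_e and sigma_e from the R/T error of the last compression.
    C2: quantized DCT coefficients read from the JPEG file, shape (H/8, W/8, 8, 8).
    Q2_matrix: 8x8 quantization matrix of the last compression."""
    blocks = idctn(C2 * Q2_matrix, axes=(-2, -1), norm='ortho')      # D00^{-1} D(C2), float
    n_v, n_h = blocks.shape[:2]
    float_img = blocks.transpose(0, 2, 1, 3).reshape(n_v * 8, n_h * 8)
    I2 = np.clip(np.round(float_img), 0, 255)                        # rounding and truncation
    E2 = I2 - float_img                                              # spatial-domain R/T error
    return E2.mean(), E2.std()                                       # estimates of mu_e, sigma_e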
V. EXPERIMENTAL RESULTS
For the experimental validation of the proposed work, we have built an image dataset composed of 100 uncompressed TIFF images with heterogeneous content, coming from three different digital cameras (namely Nikon D90, Canon EOS 450D, Canon EOS 5D), each acquired at its highest resolution; each test has been performed by cropping a central portion of size 1031 × 1031: this choice allows us to still have a 1024 × 1024 image after randomly cropping a number of rows and columns between 0 and 7.
Starting from this dataset, we have created manipulated images exhibiting NA-DJPG artifacts, following both the SCF and the DCF hypotheses. As to the NA-DJPG SCF dataset, each original image is JPEG compressed with a given quality factor QF_1 (using the Matlab function imwrite); the image is randomly cropped by removing a number of rows and columns between 0 and 7; the central portion of size 256 × 256 is replaced with the corresponding area from the original TIFF image; finally, the overall “manipulated” image is JPEG compressed with another given quality factor QF_2. In this way, the image will be NA-DJPG compressed everywhere, except in the central region, where it is supposed to be forged.
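This construction can be sketched as follows (illustrative Python using Pillow on a single luminance channel, in place of the Matlab imwrite pipeline described above; file names are placeholders).

import numpy as np
from PIL import Image

def make_scf_image(tiff_path, out_path, QF1, QF2, rng=None):
    """Build one SCF test image: compress at QF1, randomly crop 0-7 rows/columns,
    paste back the original central 256x256 area, then recompress at QF2."""
    if rng is None:
        rng = np.random.default_rng()
    original = np.array(Image.open(tiff_path).convert('L'))          # 1031x1031 luminance
    # first compression at QF1
    Image.fromarray(original).save('tmp_qf1.jpg', quality=QF1)
    compressed = np.array(Image.open('tmp_qf1.jpg'))
    # random crop of 0-7 rows and columns breaks the alignment with the first grid
    dr, dc = rng.integers(0, 8, size=2)
    forged = compressed[dr:dr + 1024, dc:dc + 1024].copy()
    # replace the central 256x256 with the corresponding uncompressed content (the "forgery")
    c0 = (1024 - 256) // 2
    forged[c0:c0 + 256, c0:c0 + 256] = \
        original[dr + c0:dr + c0 + 256, dc + c0:dc + c0 + 256]
    # second compression at QF2
    Image.fromarray(forged).save(out_path, quality=QF2)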
The creation of the DCF datasets is dual with respect to the above procedure. Each original image is JPEG compressed with a given quality factor QF_1; the central portion of size 256 × 256 is cut with a random shift with respect to the JPEG grid and pasted onto the TIFF image so as to respect both the alignment of the image content and the alignment with the DCT grid; finally, the overall “manipulated” image is JPEG compressed with another given quality factor QF_2. In this way, the central region of the image, which is supposed to be forged, will be NA-DJPG compressed.
In all the above datasets, QF_1 and QF_2 are taken from the sets {50, 60, ..., 90} and {50, 60, ..., 100}, respectively, yielding 30 possible combinations of (QF_1, QF_2) for each of the 100 tampered images.
The selection of a proper performance metric is fundamental for evaluating the performance of the method. Our algorithm provides as output, for each analyzed image, a map that represents the likelihood of each 8 × 8 block being forged. After a thresholding step, a binary detection map is obtained, which locates the blocks detected as tampered. By assuming that for each analyzed image we have the corresponding binary mask, whose 32 × 32 central portion indicates the forged blocks, a comparison between the detection map output by the algorithm and the known tampering mask allows us to estimate the error rates of the forensic scheme, measured as the false alarm probability P_fa and the missed detection probability P_md. These two probabilities can be computed by measuring the following quantities: n_NMF, the number of blocks that are not manipulated but are detected as forged; n_MNF, the number of blocks that are manipulated but not detected as forged; n_I, the number of blocks in the image (16384 in our tests); and n_M, the number of manipulated blocks (1024 in our tests). Starting from these figures, the error probabilities

TABLE I
AUC ACHIEVED BY THE PROPOSED ALGORITHM USING THE STANDARD MODEL UNDER THE SCF HYPOTHESIS.

QF_1 \ QF_2 |   50     60     70     80     90    100
        50  |  0.58   0.79   0.95   0.99   0.99   0.99
        60  |  0.51   0.61   0.87   0.98   0.99   0.99
        70  |  0.48   0.50   0.62   0.92   0.98   0.99
        80  |  0.48   0.48   0.49   0.61   0.95   0.99
        90  |  0.48   0.48   0.48   0.48   0.55   0.98

TABLE II
AUC ACHIEVED BY THE PROPOSED ALGORITHM USING THE SIMPLIFIED MODEL UNDER THE SCF HYPOTHESIS.

QF_1 \ QF_2 |   50     60     70     80     90    100
        50  |  0.71   0.85   0.94   0.98   0.99   0.99
        60  |  0.59   0.71   0.89   0.97   0.99   1.00
        70  |  0.52   0.56   0.71   0.94   0.99   1.00
        80  |  0.52   0.52   0.54   0.69   0.97   0.99
        90  |  0.51   0.51   0.52   0.51   0.57   0.99
are given by:

P_fa = n_NMF / (n_I − n_M),    P_md = n_MNF / n_M

and the correct detection probability is P_d = 1 − P_md.
To depict the tradeoff between the correct detection rate P_d and the false alarm rate P_fa, the receiver operating characteristic (ROC) curve is considered. Since the ROC curve is a two-dimensional plot of P_d versus P_fa as the decision threshold of the detector is varied, we adopt the area under the ROC curve (AUC) in order to summarize the performance of the detector with a single scalar value.
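These figures of merit can be computed as in the following sketch (illustrative Python; the AUC uses scikit-learn and is obtained by sweeping the decision threshold over the block-level likelihood map, with the ground-truth mask marking the central 32 × 32 blocks as forged).

import numpy as np
from sklearn.metrics import roc_auc_score

def error_rates(likelihood_map, mask, threshold):
    """P_fa and P_md for one image, given the block-level likelihood map and the
    ground-truth mask (True = forged block)."""
    detected = likelihood_map > threshold
    n_NMF = np.sum(detected & ~mask)            # not manipulated, detected as forged
    n_MNF = np.sum(~detected & mask)            # manipulated, not detected as forged
    n_I, n_M = mask.size, mask.sum()
    return n_NMF / (n_I - n_M), n_MNF / n_M     # P_fa, P_md

def auc(likelihood_map, mask):
    """Area under the ROC curve obtained by varying the decision threshold."""
    return roc_auc_score(mask.ravel().astype(int), likelihood_map.ravel())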
In the following, we will compare the AUC values obtained using the standard map in (12) and the simplified map in (13): to the best of our knowledge, these are the first methods that permit the localization of possibly forged areas by relying on non-aligned double JPEG compression, so there are no other methods against which our schemes can be compared. In all cases, likelihood maps are obtained by accumulating different numbers of DCT coefficients for each block, starting from the DC coefficient and scanning the coefficients in zig-zag order.
The AUC values achieved for different QF_2 under the SCF hypothesis are shown in Fig. 2: when QF_2 is sufficiently high (> 80), NA-DJPG artifacts can be effectively used to localize traces of tampering. When comparing the two approaches, the simplified map appears more robust than the standard map for lower QF_2 values. As to the effect of accumulating different DCT coefficients, the best results are obtained by considering the first 6 coefficients with the simplified map: when considering a higher number of coefficients, the AUC values decrease, suggesting that NA-DJPG artifacts cannot be reliably detected at the higher frequencies.
In order to assess the effects of different QF_1 values, the AUC values obtained for different combinations of (QF_1, QF_2), using the first 6 DCT coefficients to compute the likelihood map, are reported in Tables I-II. For ease of
Fig. 2. AUC achieved for different QF_2 using different numbers of DCT coefficients (1, 6, and 15) in the SCF scenario: (a) proposed algorithm with the standard map; (b) proposed algorithm with the simplified map.
reading, for each combination of (QF_1, QF_2) the highest AUC value between the two considered approaches is highlighted in bold. In this case, the simplified map always achieves the best performance, except in three cases. Noticeably, it is not possible to achieve AUC values significantly greater than 0.5 when QF_2 < QF_1. However, QF_2 − QF_1 ≥ 10 suffices to achieve an AUC value very close to one, which means that in this case forged areas can be localized with great accuracy.
In Fig. 3, we provide the AUC values under the DCF hypothesis. In this case the performance of forgery localization is much lower than under the SCF hypothesis, allowing traces of double compression to be localized only when QF_2 is very high (> 90).
A. Examples
The algorithm has also been tested on a set of images
representing realistic cases of forgery; in Figure 4 an example
of a tampered image is shown: the likelihood map clearly
reveals that the pyramid is a tampered object, and it also shows
some false alarms in the background, due to the low intensity
variance in this area that does not allow a correct estimation

References

Image forgery detection
Statistical tools for digital forensics
Exposing Digital Forgeries From JPEG Ghosts
Estimation of Primary Quantization Matrix in Double Compressed JPEG Images
Fast, automatic and fine-grained tampered JPEG image detection via DCT coefficient analysis