The Challenges of Rich Features in Universal Steganalysis Tom Pevn a - - PowerPoint PPT Presentation

the challenges of rich features in universal steganalysis
SMART_READER_LITE
LIVE PREVIEW

The Challenges of Rich Features in Universal Steganalysis Tom Pevn a - - PowerPoint PPT Presentation

The Challenges of Rich Features in Universal Steganalysis Tom Pevn a and Andrew D. Ker b a Agent Technology Center, Czech Technical University in Prague, Czech Republic. a Department of Computer Science, Oxford University, England. 7th


slide-1
SLIDE 1

The Challenges of Rich Features in Universal Steganalysis

Tomáš Pevnýa and Andrew D. Kerb

aAgent Technology Center, Czech Technical University in Prague, Czech Republic. aDepartment of Computer Science, Oxford University, England.

7th February 2013

  • T. Pevný and A. D. Ker

Condensing rich features 7th February 2013 1 / 16

slide-2
SLIDE 2

Batch universal steganalysis

Internet Warden

  • T. Pevný and A. D. Ker

Condensing rich features 7th February 2013 2 / 16

slide-3
SLIDE 3

Batch universal steganalyzer

Extract features. Calculate distances between actors (MMD). Identify the steganographer(s). local outlier factor (LOF)

  • T. Pevný and A. D. Ker

Condensing rich features 7th February 2013 3 / 16

slide-4
SLIDE 4

Batch universal steganalyzer

Extract features. Calculate distances between actors (MMD). Identify the steganographer(s). local outlier factor (LOF)

  • T. Pevný and A. D. Ker

Condensing rich features 7th February 2013 3 / 16

slide-5
SLIDE 5

Batch universal steganalyzer

Extract features. Calculate distances between actors (MMD). Identify the steganographer(s). local outlier factor (LOF)

guilty

  • T. Pevný and A. D. Ker

Condensing rich features 7th February 2013 3 / 16

slide-6
SLIDE 6

Batch universal steganalyzer

Extract features. Calculate distances between actors (MMD). Identify the steganographer(s). local outlier factor (LOF)

guilty

The method should work with any stego-sensitive features.

  • T. Pevný and A. D. Ker

Condensing rich features 7th February 2013 3 / 16

slide-7
SLIDE 7

Accuracy with PF274 and CF ∗ features

PF274 CF ∗ dimension 274 8750 F5 14.6 9.5 nsF5 10.7 23.1 JP Hide&Seek 7.8 16.2 OutGuess 1.9 5.7 Steghide 2.8 4.7 Average rank of one guilty actor (out of 100) emitting payload 0.1 bpnc

  • T. Pevný and A. D. Ker

Condensing rich features 7th February 2013 4 / 16

slide-8
SLIDE 8

Curse of dimensionality

Anomaly detection estimates density: more difficult in high dimensions. In unsupervised learning cannot discard noise in features.

  • T. Pevný and A. D. Ker

Condensing rich features 7th February 2013 5 / 16

slide-9
SLIDE 9

Curse of dimensionality

Our solution

Supervised dimensionality reduction.

Our aim

Steganographic features should be sensitive to embedding changes, yet insensitive to image content.

  • J. Fridrich, 2004
  • T. Pevný and A. D. Ker

Condensing rich features 7th February 2013 5 / 16

slide-10
SLIDE 10

Dimensionality reduction

Prior art Principal component transformation Maximum covariance Ordinary least square regression Proposed Calibrated least-squares

  • T. Pevný and A. D. Ker

Condensing rich features 7th February 2013 6 / 16

slide-11
SLIDE 11

Principal component transformation (PCT)

argmax

wk Var(Xwk)

subject to wk⊥wi, i ∈ {1,...,k −1}.

X ∈ Rn,d — matrix with features wi — projections found

−200 −100 100 −100 100 1st projection 2nd projection 0.1 0.2 0.3 0.4

  • T. Pevný and A. D. Ker

Condensing rich features 7th February 2013 7 / 16

slide-12
SLIDE 12

Ordinary least square regression (OLS)

argmax

wk Cov(Xswk,Ys)−Var(Xswk)

subject to wk⊥wi, i ∈ {1,...,k −1}.

XS ∈ Rn,d — matrix with stego features Ys ∈ Rn,1 — vector with payload wi — projections found

−0.3 −0.2 −0.1 0.1 0.2 0.3 0.4 −0.2 −0.15 −0.1 −5 · 10−2 1st projection 2nd projection 0.1 0.2 0.3 0.4

  • T. Pevný and A. D. Ker

Condensing rich features 7th February 2013 8 / 16

slide-13
SLIDE 13

Maximum covariance (MCV)

argmax

wk Cov(Xswk,Ys)

subject to wk⊥wi, i ∈ {1,...,k −1}.

XS ∈ Rn,d — matrix with stego features Ys ∈ Rn,1 — vector with payload wi — projections found

−1.4−1.2 −1 −0.8−0.6−0.4−0.2 0.2 ·108 −0.2 0.2 0.4 0.6 0.8 1 ·106 1st projection 2nd projection 0.1 0.2 0.3 0.4

  • T. Pevný and A. D. Ker

Condensing rich features 7th February 2013 9 / 16

slide-14
SLIDE 14

Calibrated least squares (CLS)

argmax

wk Cov(Xswk,Ys)−Var(Xcwk)

subject to wk⊥wi, i ∈ {1,...,k −1}.

XS ∈ Rn,d — matrix with stego features Ys ∈ Rn,1 — vector with payload Xc ∈ Rn,d — matrix with cover features wi — projections found

5 10 15 20 −6 −4 −2 1st projection 2nd projection 0.1 0.2 0.3 0.4

  • T. Pevný and A. D. Ker

Condensing rich features 7th February 2013 10 / 16

slide-15
SLIDE 15

Experimental settings

3000 users of leading social network, 100 images from each

◮ 1000 users for supervised feature reduction ◮ 2000 users used for testing

Guilty actor emits payload 0.1 bpnc

◮ linear (in the paper) or greedy strategy ◮ one of following algorithms:

F5, nsF5, JPHide&Seek (JP), OutGuess (OG), Steghide (SH)

Steganalyst uses reduced CF ∗ features. Accuracy is measured by average rank of guilty actor.

◮ 1.0 = perfect, 50.5 = random guessing.

  • T. Pevný and A. D. Ker

Condensing rich features 7th February 2013 11 / 16

slide-16
SLIDE 16

Results

PCT MCV OLS CLS

F5

40.3 23.4 22.2 1.6

(4) (1) (1)

nsF5

38.0 26.6 5.8 2.1

(4) (1) (1)

JP

38.4 27.2 6.9 1.7

(5) (1) (1)

OG

26.5 31.6 2.4 1.2

(4) (1) (1)

SH

23.0 2.6 1.3 1.1

(6) (1) (1)

  • T. Pevný and A. D. Ker

Condensing rich features 7th February 2013 12 / 16

slide-17
SLIDE 17

Robustness

PCT

CLS trained on F5 nsF5 JP OG SH F5 40.3 1.6 1.9 8.8 6.6 4.5

(1) (1) (1) (4) (3)

nsF5 38.0 1.8 2.1 10.1 10.9 10.5

(1) (1) (1) (4) (3)

JP 38.4 8.9 7.2 1.7 15.5 10.5

(1) (2) (1) (2) (2)

OG 26.5 3.7 3.0 11.8 1.2 1.1

(1) (6) (2) (1) (1)

SH 23.0 5.2 3.2 9.1 1.2 1.1

(1) (6) (2) (1) (1)

  • T. Pevný and A. D. Ker

Condensing rich features 7th February 2013 13 / 16

slide-18
SLIDE 18

Optimal number of projections

2 4 6 8 10 10 20 30 # of projections average rank

F5 nsF5 JP OG SH

  • T. Pevný and A. D. Ker

Condensing rich features 7th February 2013 14 / 16

slide-19
SLIDE 19

Conclusion

High dimensional features are not compatible with unsupervised steganalysis. Investigated dimensionality reduction to improve SNR of rich features. Validated the approach in universal batch steganalysis. The proposed method, CLS, exhibits robustness to embedding method.

  • T. Pevný and A. D. Ker

Condensing rich features 7th February 2013 15 / 16

slide-20
SLIDE 20

F5 phenomenon

5 · 10−2 0.1 0.15 0.2 0.25 5 · 10−2 0.1 0.15 0.2 0.25 true payload estimated change rate

F5 nsF5 JP OG SH

  • T. Pevný and A. D. Ker

Condensing rich features 7th February 2013 16 / 16