Recent Advances in Adversarial Machine Learning Nicholas Carlini - - PowerPoint PPT Presentation

recent advances in adversarial machine learning
SMART_READER_LITE
LIVE PREVIEW

Recent Advances in Adversarial Machine Learning Nicholas Carlini - - PowerPoint PPT Presentation

Recent Advances in Adversarial Machine Learning Nicholas Carlini Google Research Recent Advances in Adversarial (Examples in) Machine Learning Nicholas Carlini Google Research The Year is 2014 Someone tells you they have a new algorithm to


slide-1
SLIDE 1

Recent Advances in Adversarial Machine Learning

Nicholas Carlini

Google Research

slide-2
SLIDE 2

Recent Advances in Adversarial (Examples in) Machine Learning

Nicholas Carlini

Google Research

slide-3
SLIDE 3
slide-4
SLIDE 4

The Year is 2014

Someone tells you they have a new algorithm to generate human faces

slide-5
SLIDE 5

The Year is 2014

"the theoretical work is primitive, and the experiments are pretty basic." "more results of how this helps on real tasks

  • r real datasets"
slide-6
SLIDE 6
slide-7
SLIDE 7

The Year is 2017

Someone tells you they have a new algorithm to generate human faces

slide-8
SLIDE 8

The Year is 2017

slide-9
SLIDE 9
slide-10
SLIDE 10
slide-11
SLIDE 11

The Year is 2013

Someone tells you they have discovered a flaw in the robustness of neural networks

slide-12
SLIDE 12

The Year is 2013

slide-13
SLIDE 13
slide-14
SLIDE 14

The Year is 2019

Someone tells you they have discovered a flaw in the robustness of neural networks

slide-15
SLIDE 15

The Year is 2019

slide-16
SLIDE 16

3 years: 6 years:

slide-17
SLIDE 17
slide-18
SLIDE 18
slide-19
SLIDE 19

Background: Adversarial Examples

slide-20
SLIDE 20
slide-21
SLIDE 21
slide-22
SLIDE 22
slide-23
SLIDE 23
slide-24
SLIDE 24

Truck Dog

Random Direction Random Direction

slide-25
SLIDE 25

Dog Truck Airplane

Random Direction Adversarial Direction Adversarial Direction Random Direction

slide-26
SLIDE 26
slide-27
SLIDE 27

( (

slide-28
SLIDE 28
slide-29
SLIDE 29

Recent advances in ... Generating Adversarial Examples

slide-30
SLIDE 30
slide-31
SLIDE 31

Threat Model:

  • Black Box
  • Hard Label
  • Query Access
slide-32
SLIDE 32
slide-33
SLIDE 33
slide-34
SLIDE 34
slide-35
SLIDE 35
slide-36
SLIDE 36
slide-37
SLIDE 37
slide-38
SLIDE 38
slide-39
SLIDE 39
slide-40
SLIDE 40
slide-41
SLIDE 41
slide-42
SLIDE 42
slide-43
SLIDE 43
slide-44
SLIDE 44
slide-45
SLIDE 45
slide-46
SLIDE 46
slide-47
SLIDE 47
slide-48
SLIDE 48
slide-49
SLIDE 49
slide-50
SLIDE 50
slide-51
SLIDE 51
slide-52
SLIDE 52
slide-53
SLIDE 53
slide-54
SLIDE 54
slide-55
SLIDE 55
slide-56
SLIDE 56
slide-57
SLIDE 57
slide-58
SLIDE 58
slide-59
SLIDE 59
slide-60
SLIDE 60
slide-61
SLIDE 61
slide-62
SLIDE 62
slide-63
SLIDE 63
slide-64
SLIDE 64
slide-65
SLIDE 65
slide-66
SLIDE 66
slide-67
SLIDE 67
slide-68
SLIDE 68
slide-69
SLIDE 69
slide-70
SLIDE 70

Recent advances in ... Defending Against Adversarial Examples

slide-71
SLIDE 71

Defenses I don't believe will be effective

slide-72
SLIDE 72

... a bit more background

slide-73
SLIDE 73

Transferability

slide-74
SLIDE 74
slide-75
SLIDE 75
slide-76
SLIDE 76
slide-77
SLIDE 77
slide-78
SLIDE 78
slide-79
SLIDE 79
slide-80
SLIDE 80
slide-81
SLIDE 81
slide-82
SLIDE 82
slide-83
SLIDE 83
slide-84
SLIDE 84
slide-85
SLIDE 85
slide-86
SLIDE 86

CAT

slide-87
SLIDE 87

CAT

slide-88
SLIDE 88

DOG

slide-89
SLIDE 89

DOG

slide-90
SLIDE 90
slide-91
SLIDE 91

DOG

slide-92
SLIDE 92

DOG

slide-93
SLIDE 93

DOG

slide-94
SLIDE 94

DOG

slide-95
SLIDE 95
slide-96
SLIDE 96

DOG

slide-97
SLIDE 97

DOG

slide-98
SLIDE 98

DOG

slide-99
SLIDE 99

You are being evil

slide-100
SLIDE 100
slide-101
SLIDE 101

Defenses I do believe will be effective

slide-102
SLIDE 102
slide-103
SLIDE 103
slide-104
SLIDE 104
slide-105
SLIDE 105

Randomized Mechanism

CAT

slide-106
SLIDE 106
slide-107
SLIDE 107

Original

slide-108
SLIDE 108

L2 distortion: 4

slide-109
SLIDE 109

Original

slide-110
SLIDE 110

L2 distortion: 10

slide-111
SLIDE 111
slide-112
SLIDE 112

L2 = 75

slide-113
SLIDE 113

Original

slide-114
SLIDE 114

L2 distortion: 75

slide-115
SLIDE 115

L2 distortion: 75

slide-116
SLIDE 116
slide-117
SLIDE 117

Recent advances in ... Why Adversarial Examples Exist

slide-118
SLIDE 118
slide-119
SLIDE 119

Dog Truck Airplane

Adversarial Direction Random Direction Adversarial Direction Random Direction

slide-120
SLIDE 120
slide-121
SLIDE 121
slide-122
SLIDE 122
slide-123
SLIDE 123
slide-124
SLIDE 124

CAT DOG

Standard Training Dataset

slide-125
SLIDE 125

Standard Testing Setup

DOG

slide-126
SLIDE 126

Adversarial Testing Setup

CAT

slide-127
SLIDE 127

CAT DOG

Standard Training Dataset

slide-128
SLIDE 128

DOG CAT

Adversarial Training Dataset

slide-129
SLIDE 129

Standard Testing Setup

DOG

slide-130
SLIDE 130

Adversarial Testing Setup

DOG

slide-131
SLIDE 131

CAT DOG

Standard Training Dataset

slide-132
SLIDE 132

DOG CAT

Adversarial Training Dataset

slide-133
SLIDE 133

DOG CAT

Confusing Training Dataset

slide-134
SLIDE 134

Standard Testing Setup

DOG

slide-135
SLIDE 135

DOG CAT

?!??!?!?? Training Dataset

slide-136
SLIDE 136

Is a well-generalizing feature of CAT

slide-137
SLIDE 137
slide-138
SLIDE 138

Conclusion

slide-139
SLIDE 139

Questions?