Automatic Portrait Segmentation and Matting
Xiaoyong Shen
The Chinese University of Hong Kong goodshenxy@gmail.com
Segmentation and Matting Xiaoyong Shen The Chinese University of - - PowerPoint PPT Presentation
Automatic Portrait Segmentation and Matting Xiaoyong Shen The Chinese University of Hong Kong goodshenxy@gmail.com Research on CV Pixel based (low level/ early vision) Filtering, restoration, denoise, enhancement, deblur, editing,
The Chinese University of Hong Kong goodshenxy@gmail.com
editing, dehaze, etc.
segmentation, etc.
classification, recognition, etc.
editing, dehaze, etc.
segmentation, etc.
classification, recognition, etc.
5
[TPAMI 2015]
6
7
Our Result Ground Truth Input Noisy Image Input NIR Image
8
9
10
11
[ICCV 2015 Oral Presentation]
Noisy Depth
Noisy RGB Image
Ground truth
Ours PSNR = 37.19
One line code only: ๐ฝ๐ข+1 = ๐พ๐บ(๐ฝ0, ๐ฝ๐ข)
[ECCV 2014 Oral Presentation]
18
19
One line code only: ๐ฝ๐ข+1 = ๐ฝ๐ข + (๐ฝ0 โ ๐บ(๐ฝ๐ข))
Different Exposures RGB/Depth RGB/NIR Flash/No-flash
Inputs Our Result Blended
Without Alignment With Alignment Constructed HDR
Reference Input Dense Correspondences ? Exist Correspondence No Correspondence [SIGGRAPH ASIA 2016]
Reference Input Dense Correspondences ? Foremost Region Matching
Achieve higher accuracy with the help of object (person)
State-of-the-art Ours
38
Portrait, 30% Others, 70%
Samsung UK
Portrait, 90% Others, 10%
Symon Whitehorn from HTC
39
40
41
42
Automatic?
43
Similar Color Complex Background Various Accessories Low Contrast Diverse Pose Complicated Edges
44
45
46
47
PortraitFCN and PortraitFCN+
48
Detector Conv ReLU Pooling Conv Conv Pooling ReLU DeConv Mask
[Long et al. 2015]
PortraitFCN Model
RGB Channels 2 Outputs
49
50
Detector Conv ReLU Pooling Conv Conv Pooling ReLU DeConv Mask
[Long et al. 2015]
PortraitFCN+ Model
RGB+Shape+Position 2 Outputs Shape Position
51
Labeled Masks
Align
Canonical Pose
Mean
Shape Channel
๐ = ฯ๐ ๐ฅ๐ โ ๐๐(๐๐) ฯ๐ ๐ฅ๐ Align Test Image
52
Canonical Pose x- Coordinate y- Coordinate Position Test Image
Align
53
Input
54
PortraitFCN
55
PortraitFCN+
56
position, hair style, lighting, etc.
57
58
59
60
61
Methods Mean IoU (%) Graph-cut 80.02 FCN (Person Class) 73.09
IoU = area(output โฉ ground truth) area(output โช ground truth)
62
Methods Mean IoU (%) Graph-cut 80.02 FCN (Person Class) 73.09 PortraitFCN 94.20
IoU = area(output โฉ ground truth) area(output โช ground truth)
63
Methods Mean IoU (%) Graph-cut 80.02 FCN (Person Class) 73.09 PortraitFCN 94.20 PortraitFCN+ (Only with Mean Mask) 94.89 PortraitFCN+ (Only with Normalized x and y) 94.61
IoU = area(output โฉ ground truth) area(output โช ground truth)
64
Methods Mean IoU (%) Graph-cut 80.02 FCN (Person Class) 73.09 PortraitFCN 94.20 PortraitFCN+ (Only with Mean Mask) 94.89 PortraitFCN+ (Only with Normalized x and y) 94.61 PortraitFCN+ 95.91
IoU = area(output โฉ ground truth) area(output โช ground truth)
65
Input
66
Ground Truth
67
Graph-cut
68
FCN-8s (Person)
69
PortraitFCN
70
PortraitFCN+
71
Input Ground Truth
IoU = 0.83 IoU = 0.42 IoU = 0.91 IoU = 0.85
FCN-8s Graph-cut
IoU = 0.99 IoU = 0.98
Ours
72
Input Ground Truth
IoU = 0.77 IoU = 0.95 IoU = 0.38 IoU = 0.84
FCN-8s Graph-cut
IoU = 0.98 IoU = 0.98
Ours
73
Input Ground Truth
IoU = 0.83 IoU = 0.53 IoU = 0.81 IoU = 0.89
FCN-8s Graph-cut
IoU = 0.99 IoU = 0.98
Ours
74
Color Scale Rotation Occlusion
75
Input Image Alpha Matte Color transform Depth-of-field Portrait Stylization Cartoon Background Edit
78
foreground background Image Alpha/foreground opacity
79
Ima mage Trim rimap Al Alph pha ma matte
80
๐ฝ = ๐๐ ๐๐๐๐ ๐ฝ๐๐๐ฝ + ๐ ๐ฝ โ ๐๐ก ๐๐ธ(๐ฝ โ ๐๐ก)
Matting Laplacian User-provided Strokes Diagonal stroke mask
81
Inp nput La Labele led Str Strokes Cl Clos
Mattin ing
error
Inp nput La Labele led Trim rimap Cl Clos
Mattin ing
error
83
Usually we need to refine the trimap many times to get a good alpha matteโฆโฆ
86
87
88
89 89
90
91 91
92
93
94
95
๐๐๐ ๐๐ต๐๐ถ๐ต + ๐ ๐ต โ 1 ๐๐บ(๐ต โ 1) + ๐ต๐๐๐ต
๐๐ ๐๐ถ = โ๐๐ธโ1๐๐๐๐(๐ธโ1๐บ) ๐๐ ๐๐บ = ๐๐ ๐๐ถ + ๐ธโ1 ๐๐ ๐๐ = โ๐๐ธโ1๐๐๐๐ ๐บ + ๐ถ ๐ธโ1๐บ
96
๐(๐ต, ๐ต๐๐ข) = เท
๐
๐ฅ ๐ต๐
๐๐ข | ๐ต๐ โ ๐ต๐ ๐๐ข |,
๐ฅ ๐ต๐
๐๐ข = โ๐๐๐(๐(๐ต = ๐ต๐ ๐๐ข))
97
98
closed-form matting
99
Input Graph-cut FCN Ours
100
Input Graph-cut FCN Ours
101
Input Graph-cut FCN Ours
102
Input Graph-cut FCN Ours
103
Input Alpha Matte Input Alpha Matte
104
Input Stylization PS GS Stick PS Fresco Stylization Input Stylization Depth-of-Field PS Fresco Stylization
105
Input Stylization PS Palette Knife PS GS Stick PS Sketch Input PS Oil Paint Depth-of-Field PS GS Stick Stylization
106
Input Stylization PS Palette Knife Depth-of-Field Stylization Input Stylization PS Palette Knife PS Dark Stroke PS Paint Daubs
107
108