University of Amsterdam’s Deep Net for Video Event Detection
Pascal Mettes, Spencer Cappallo, Dennis Koelma, Cees G. M. Snoek
University of Amsterdam
University of Amsterdams Deep Net for Video Event Detection Pascal - - PowerPoint PPT Presentation
University of Amsterdams Deep Net for Video Event Detection Pascal Mettes, Spencer Cappallo, Dennis Koelma, Cees G. M. Snoek University of Amsterdam Summary Top performance for example-based event detection tasks. This talk Train videos
Pascal Mettes, Spencer Cappallo, Dennis Koelma, Cees G. M. Snoek
University of Amsterdam
1 Organizing ImageNet Hierarchy Training Deep Network Sampling frames Extracting features Pooling to video representation Train videos Training SVM
1 Organizing ImageNet Hierarchy Training Deep Network Sampling frames Extracting features Pooling to video representation Train videos Training SVM
2
3
relevant for event detection.
Yorkshire terrier Siderocyte Gametophyte 4
5
5 Roll
Green mamba Black mamba Mamba
5
Hot air Zeppelin Trial Balloon
Bind
5 Promote
Triclinium Dining table
5 Sample
Sauce
7
1 Organizing ImageNet Hierarchy Training Deep Network Sampling frames Extracting features Pooling to video representation Train videos Training SVM
Birthday Party 9
Step 1: Propose Step 2: Select Step 3: Encode
Training video
10 [Mettes et al. ICMR 2015]
Step 1: Propose Step 2: Select Step 3: Encode
Training video
10 [Mettes et al. ICMR 2015]
Step 1: Propose Step 2: Select Step 3: Encode
Training video
10 [Mettes et al. ICMR 2015]
Video Encoding
10
1 Organizing ImageNet Hierarchy Training Deep Network Sampling frames Extracting features Pooling to video representation Train videos Training SVM
GoogleNet outperforms AlexNet.
12
GoogleNet outperforms AlexNet. Using all ImageNet classes helps.
12
GoogleNet outperforms AlexNet. Using all ImageNet classes helps. We do better than directly using all classes. Our feature vector is twice as small.
12
GoogleNet outperforms AlexNet. Using all ImageNet classes helps. We do better than directly using all classes. Our feature vector is twice as small. Idem for 100 Examples.
12
Bag-of-Fragments is both competitive and complementary to average pooling.
13 Method AlexNet [ICMR results] GoogleNet [new results] Averaging 0.232 0.351 Bag-of-Fragments 0.276 0.317 Combination 0.373 0.381
14
14
15
Pascal Mettes