Dlcvd3l4objects 160803161336
Object Detection
Amaia Salvador
[email protected]
[course site]
Deep ConvNets for Recognition for...
2
Slide Credit: Xavier Giró
Object Detection
3
Object Detection as Classification
Classes = [cat, dog, duck]
Cat? NO
Dog? NO
Duck? NO
4
Object Detection as Classification
Classes = [cat, dog, duck]
Cat? YES
Dog? NO
Duck? NO
6
Object Detection as Classification
Problem:
Too many positions & scales to test
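To get a feel for the problem, the sketch below counts how many windows a sliding-window classifier would have to evaluate. The image size, window sizes and stride are illustrative assumptions, and `count_windows` is a hypothetical helper, not code from the course.

```python
def count_windows(img_w, img_h, win_sizes, stride):
    """Count the square windows a sliding-window detector would classify."""
    total = 0
    for win in win_sizes:
        if win > img_w or win > img_h:
            continue  # this window size does not fit in the image
        nx = (img_w - win) // stride + 1  # horizontal positions
        ny = (img_h - win) // stride + 1  # vertical positions
        total += nx * ny
    return total


# Assumed setup: an 800 x 600 image, three square window sizes, stride of 8 px.
n = count_windows(800, 600, win_sizes=[64, 128, 256], stride=8)
print(n)  # 14460 windows, each one needing a full classifier evaluation
```

This only covers square windows at three scales; adding aspect ratios or a finer stride multiplies the count further.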
Dalal and Triggs. Histograms of Oriented Gradients for Human Detection. CVPR 2005
9
Deformable Part Model
Felzenszwalb et al. Object Detection with Discriminatively Trained Part-Based Models. PAMI 2010
10
Object Detection with CNNs?
CNN classifiers are computationally demanding. We can't test all positions & scales!
[SS] Uijlings et al. Selective search for object recognition. IJCV 2013
Girshick et al. Rich feature hierarchies for accurate object detection and semantic segmentation. CVPR 2014
14
R-CNN
Girshick et al. Rich feature hierarchies for accurate object detection and semantic segmentation. CVPR 2014
15
R-CNN: Problems
Solution: Share computation of convolutional layers between region proposals for an image
Hi-res input image: 3 x 800 x 600, with region proposal
Hi-res conv features: C x H x W, with region proposal
RoI conv features: C x h x w, for region proposal
Fully-connected layers expect low-res conv features: C x h x w
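The step from a variable-size region of the shared conv features to the fixed C x h x w input that the fully-connected layers expect can be sketched as a max pooling over a grid of bins. This is a minimal NumPy illustration under simplifying assumptions (the region is already given in feature-map coordinates, and bins are found by rounding outwards); `roi_max_pool` is a hypothetical name, not an API from any library.

```python
import numpy as np


def roi_max_pool(features, roi, out_h, out_w):
    """Max-pool one region of a C x H x W feature map to C x out_h x out_w.

    roi = (x0, y0, x1, y1) in feature-map coordinates, x1 and y1 exclusive.
    """
    x0, y0, x1, y1 = roi
    region = features[:, y0:y1, x0:x1]
    c, rh, rw = region.shape
    # Bin edges that split the region into an out_h x out_w grid.
    ys = np.linspace(0, rh, out_h + 1)
    xs = np.linspace(0, rw, out_w + 1)
    pooled = np.empty((c, out_h, out_w), dtype=features.dtype)
    for i in range(out_h):
        ya, yb = int(np.floor(ys[i])), int(np.ceil(ys[i + 1]))
        for j in range(out_w):
            xa, xb = int(np.floor(xs[j])), int(np.ceil(xs[j + 1]))
            # Keep the strongest activation of each channel inside the bin.
            pooled[:, i, j] = region[:, ya:yb, xa:xb].max(axis=(1, 2))
    return pooled


# Conv features are computed once per image and shared by all proposals.
features = np.random.rand(512, 38, 50)  # assumed C x H x W
small = roi_max_pool(features, (2, 3, 12, 17), 7, 7)
large = roi_max_pool(features, (0, 0, 50, 38), 7, 7)
print(small.shape, large.shape)  # both (512, 7, 7)
```

Both proposals end up with the same shape regardless of their original size, which is what lets a single set of fully-connected layers process every region.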
(Speedup) 1x 146x
(Speedup) 1x 25x
[Figure: Faster R-CNN architecture: conv layers (up to Conv5_3) → Region Proposal Network → RPN proposals → RoI pooling → FC6 → FC7 → FC8 → class probabilities]
Ren et al. Faster R-CNN: Towards real-time object detection with region proposal networks. NIPS 2015
23
Faster R-CNN
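[Figure: Faster R-CNN architecture: conv layers (up to Conv5_3) → Region Proposal Network → RPN proposals → RoI pooling → FC6 → FC7 → FC8 → class probabilities. The stage that classifies the RPN proposals is labelled Fast R-CNN.]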
Ren et al. Faster R-CNN: Towards real-time object detection with region proposal networks. NIPS 2015
24
Region Proposal Network
Bounding Box Regression
Objectness scores
(object/no object)
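The two RPN outputs can be illustrated with a short NumPy sketch. It assumes the box parameterization from the Faster R-CNN paper, where offsets (tx, ty, tw, th) are predicted relative to reference boxes (anchors), and a two-way softmax for the object/no object score. The function names and example values are hypothetical.

```python
import numpy as np


def decode_boxes(anchors, deltas):
    """Apply (tx, ty, tw, th) regression offsets to reference boxes.

    anchors: (N, 4) array of (x0, y0, x1, y1)
    deltas:  (N, 4) array of (tx, ty, tw, th)
    """
    wa = anchors[:, 2] - anchors[:, 0]
    ha = anchors[:, 3] - anchors[:, 1]
    xa = anchors[:, 0] + 0.5 * wa  # anchor centre
    ya = anchors[:, 1] + 0.5 * ha
    x = xa + deltas[:, 0] * wa     # shift the centre, scaled by anchor size
    y = ya + deltas[:, 1] * ha
    w = wa * np.exp(deltas[:, 2])  # rescale width and height
    h = ha * np.exp(deltas[:, 3])
    return np.stack([x - 0.5 * w, y - 0.5 * h, x + 0.5 * w, y + 0.5 * h], axis=1)


def objectness(logits):
    """Softmax over (no object, object) logits; returns P(object)."""
    z = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    p = np.exp(z) / np.exp(z).sum(axis=1, keepdims=True)
    return p[:, 1]


anchors = np.array([[0.0, 0.0, 10.0, 10.0]])
deltas = np.array([[0.1, 0.0, 0.0, 0.0]])  # move right by 10% of the width
print(decode_boxes(anchors, deltas))       # box shifted 1 unit to the right
print(objectness(np.array([[0.0, 0.0]])))  # equal logits give 0.5
```

Proposals with a high objectness score are the ones passed on to RoI pooling and the classification layers.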
Ren et al. Faster R-CNN: Towards real-time object detection with region proposal networks. NIPS 2015
Slide Credit: CS231n
26
Faster R-CNN
Redmon et al. You Only Look Once: Unified, Real-Time Object Detection. CVPR 2016
Slide Credit: CS231n
28
SSD: Single Shot MultiBox Detector
YOLO: 63.4 mAP, 45 FPS, 98 boxes
31