SSD and YOLO

1 minute read

There is really no need to know the details of implementation from YOLO V1 to V9. But considering it’s the one model helped me quite a lot on AWS projects, I still enjoy watching this vlog and share some details.

1 SSD

Review some SSD which will be used in YOLO as well. Essencially SSD is doing RCNN at different scale. Let’s start from basic R-CNN Alt text It can be accelerated by doing CNN before Selective Search Alt text Use to RPN replace Selective Search to get faster Alt text Now doing RCNN at different scale Alt text From network point of view, you will get Alt text Here is a funny picture to demonstrate SSD can detect images at different scale Alt text Mask-RCNN is introduced at last Alt text

2 YOLO v1

Comparing to 2 steps detection methods, YOLO, as the name suggest, you only look ONCE by doing detection and classification at the same time. Alt text The loss from YOLO is combined with detected coordinates, IOU loss and classification loss. Alt text

3 YOLO v2

In v2, the improvements are from network to Darknnet-19 and etc. Alt text Different datasets were used ( from Pascal VOC with 20 classes to ImageNet and COCO 9000 classes) Alt text and use tree structure to identifiy 9000 classes Alt text

4 YOLO v3

Coordinate calculation is changed since v2. So instead of calculate from relative positions we directly get the true coordinates. Alt text V3 is adding SSD similar architecture to detect objects at different scale Alt text

Tags:

Categories:

Updated: