Feature Pyramid Network and RetinaNet

PhD Vlog covered several object detection (OD) networks; the sections below follow their line of development.

0 FCN (Fully Convolutional Network)

This early work is rarely used these days. Expanding the bottleneck back to the full image size in one step loses a lot of information. U-Net became more widely used because it upsamples step by step back to the original size and adds shortcut (skip) connections from the encoder.
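To make the contrast concrete, here is a minimal shape-only sketch in Python. It tracks (channels, height, width) tuples rather than real tensors. The channel counts, the 256x256 input size, and the function names are illustrative assumptions, and the convolutions that a real U-Net applies after each merge are left out.

```python
# Shape-only sketch: tracks (channels, height, width), no real tensors.
# Channel counts, sizes, and function names are illustrative assumptions.

def upsample(shape, factor):
    """Enlarge the spatial size by `factor`; channels are unchanged."""
    c, h, w = shape
    return (c, h * factor, w * factor)

def concat_skip(decoder_shape, encoder_shape):
    """U-Net style shortcut: concatenate along the channel axis."""
    c_dec, h, w = decoder_shape
    c_enc, h_enc, w_enc = encoder_shape
    assert (h, w) == (h_enc, w_enc), "spatial sizes must match to concatenate"
    return (c_dec + c_enc, h, w)

# Encoder feature maps for a 256x256 input, halving the resolution each stage.
encoder = [(64, 256, 256), (128, 128, 128), (256, 64, 64)]
bottleneck = (512, 32, 32)

# One-shot expansion: a single 8x jump straight from the bottleneck,
# with no encoder detail mixed back in.
one_shot = upsample(bottleneck, 8)

# Step-by-step expansion: 2x at a time, merging the matching encoder map.
# (A real U-Net would also apply convolutions here to reduce channels.)
x = bottleneck
trace = []
for skip in reversed(encoder):
    x = concat_skip(upsample(x, 2), skip)
    trace.append(x)

print(one_shot)
print(trace)
```

Both paths end at 256x256, but only the step-by-step path has seen the higher-resolution encoder maps along the way, and its channel count grows at every concatenation.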

1 FPN (Feature Pyramid Network)

FPN comes from Facebook AI Research (now part of Meta); its authors include Ross Girshick, a co-author of YOLO, and Kaiming He. The network is similar to U-Net, but it merges features by element-wise addition instead of concatenation. It has reportedly been applied to video as well, with better results than ViT.
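The addition-based merge can be sketched in a few lines of plain Python. This is a simplified, single-channel illustration, not the paper's implementation: the 1x1 lateral convolutions and 3x3 smoothing convolutions of a real FPN are omitted, and the helper names and toy values are assumptions made for the example.

```python
# Minimal single-channel sketch of an FPN-style top-down merge.
# Real FPN adds 1x1 lateral convs and 3x3 smoothing convs; both are omitted.

def upsample2x(fmap):
    """Nearest-neighbour 2x upsampling of a 2D list of numbers."""
    out = []
    for row in fmap:
        wide = [v for v in row for _ in range(2)]  # repeat each column
        out.append(wide)
        out.append(list(wide))                     # repeat each row
    return out

def add_maps(a, b):
    """Element-wise sum; shapes must match and the result keeps that shape."""
    assert len(a) == len(b) and len(a[0]) == len(b[0])
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def top_down(laterals):
    """laterals: finest-first list of 2D maps, each 2x larger than the next."""
    merged = [laterals[-1]]                        # start from the coarsest level
    for lateral in reversed(laterals[:-1]):
        merged.append(add_maps(lateral, upsample2x(merged[-1])))
    return merged[::-1]                            # finest-first, like the input

# Toy backbone outputs (hypothetical values chosen so the sums are easy to read).
c3 = [[1] * 4 for _ in range(4)]    # fine, 4x4
c4 = [[10] * 2 for _ in range(2)]   # middle, 2x2
c5 = [[100]]                        # coarse, 1x1

p3, p4, p5 = top_down([c3, c4, c5])
print(p4)
print(p3[0])
```

Because the maps are added rather than concatenated, each merged level keeps the same shape as its lateral input, so the channel count does not grow as it would with U-Net style concatenation.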

2 RetinaNet

RetinaNet is a one-stage detector and is very fast; it is widely used in face detection. It is built on FPN and adds two heads on top of each pyramid level, one for classification and one for box regression.
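A rough way to see what the two heads produce is to count their outputs per pyramid level. The sketch below assumes five levels with strides 8 to 128 and 9 anchors per location, which are the defaults I recall from the RetinaNet paper; the 640x640 input, the 80 classes, and the function name are hypothetical choices for the example.

```python
import math

def retinanet_output_sizes(image_size, num_classes,
                           strides=(8, 16, 32, 64, 128),
                           anchors_per_cell=9):
    """Count head outputs per pyramid level for a square input image.

    Strides and anchors_per_cell are assumed defaults, not read from a model.
    """
    levels = []
    for stride in strides:
        side = math.ceil(image_size / stride)      # feature map side length
        n_anchors = side * side * anchors_per_cell
        levels.append({
            "stride": stride,
            "grid": (side, side),
            "cls_outputs": n_anchors * num_classes,  # one score per class per anchor
            "box_outputs": n_anchors * 4,            # four box offsets per anchor
        })
    return levels

levels = retinanet_output_sizes(image_size=640, num_classes=80)
total_anchors = sum(level["box_outputs"] for level in levels) // 4

for level in levels:
    print(level)
print(total_anchors)
```

Both heads run on every level, so the detector scores tens of thousands of anchors in a single forward pass, without a separate proposal stage.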

The loss is likewise a combination of two terms: a classification loss and a regression (box) loss.
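As a minimal sketch of how the two terms combine, the code below assumes focal loss for the classification term (the loss the RetinaNet paper introduced) and smooth L1 for the box term. The `box_weight` parameter and the function names are assumptions for illustration, and normalization by the number of positive anchors is omitted.

```python
import math

def focal_loss(p, y, alpha=0.25, gamma=2.0):
    """Binary focal loss for one probability p in (0, 1) and label y in {0, 1}."""
    p_t = p if y == 1 else 1.0 - p              # probability of the true class
    alpha_t = alpha if y == 1 else 1.0 - alpha  # class-balancing weight
    return -alpha_t * (1.0 - p_t) ** gamma * math.log(p_t)

def smooth_l1(pred, target, beta=1.0):
    """Quadratic near zero, linear for large errors."""
    diff = abs(pred - target)
    return 0.5 * diff * diff / beta if diff < beta else diff - 0.5 * beta

def detection_loss(cls_probs, cls_labels, box_preds, box_targets, box_weight=1.0):
    """Sum of the class term and the weighted box term (no normalization)."""
    cls = sum(focal_loss(p, y) for p, y in zip(cls_probs, cls_labels))
    box = sum(smooth_l1(b, t) for b, t in zip(box_preds, box_targets))
    return cls + box_weight * box

# A confident correct prediction contributes far less than a confident wrong one.
easy = focal_loss(0.9, 1)
hard = focal_loss(0.1, 1)
total = detection_loss([0.9, 0.1], [1, 1], [0.5, 3.0], [0.0, 1.0])
print(easy, hard, total)
```

The `(1 - p_t) ** gamma` factor shrinks the loss on easy examples, which matters for a one-stage detector because most anchors are easy background.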
