Does Convolutional Neural Network possess localization abilities on images?
Asked Answered
T

2

6

As far as I know, CNN rely on sliding window techniques and can only indicate if a certain pattern is present or not anywhere in given bounding boxes. Is that true?

Can one achieve localization with CNN without any help of such techniques?

Tortricid answered 27/1, 2015 at 19:5 Comment(0)
K
8

Thats an open problem in image recognition. Besides sliding windows, existing approaches include predicting object location in image as CNN output, predicting borders (classifiyng pixels as belonging to image boundary or not) and so on. See for example this paper and references therein.

Also note that with CNN using max-pooling, one can identify positions of feature detectors that contributed to object recognition, and use that to suggest possible object location region.

Kila answered 28/1, 2015 at 14:6 Comment(0)
C
2

There are some recent techniques to localize the objects in CNN's. See this paper http://cnnlocalization.csail.mit.edu/Zhou_Learning_Deep_Features_CVPR_2016_paper.pdf

It uses a layer called Global Average Pooling (GAP), and with no additional work, the CNN can localize the object it recognizes.

Also checkout this really good blog post: https://alexisbcook.github.io/2017/global-average-pooling-layers-for-object-localization/

Confirmation answered 18/7, 2017 at 17:54 Comment(0)

© 2022 - 2024 — McMap. All rights reserved.