The R-CNN method localizes objects in an image. In the form described here, it finds at most one instance of each object class per image.
1. Candidate regions are generated by any proposal method, including a brute-force sliding window.
2. Each region is classified using AlexNet.
3. For each label, the classification scores of all regions are searched to find the region that expresses that label most strongly.
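
The three steps above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the original implementation: `sliding_windows`, `best_region_per_label`, and `toy_score_fn` are hypothetical names, and `toy_score_fn` is a stand-in for AlexNet that scores a crop by its mean brightness. In practice the scoring function would run the crop through a trained network and return one score per label.

```python
import numpy as np

def sliding_windows(height, width, size, stride):
    """Step 1: yield (top, left, bottom, right) boxes by brute-force sliding window."""
    for top in range(0, height - size + 1, stride):
        for left in range(0, width - size + 1, stride):
            yield (top, left, top + size, left + size)

def best_region_per_label(image, score_fn, size=4, stride=2):
    """Steps 2-3: score every region, then keep the top-scoring box per label."""
    boxes = list(sliding_windows(image.shape[0], image.shape[1], size, stride))
    # Step 2: scores has shape (num_regions, num_labels).
    scores = np.stack([score_fn(image[t:b, l:r]) for (t, l, b, r) in boxes])
    # Step 3: for each label, the index of the region with the highest score.
    best = scores.argmax(axis=0)
    return [(boxes[i], float(scores[i, k])) for k, i in enumerate(best)]

def toy_score_fn(crop):
    """Hypothetical stand-in for AlexNet: label 0 = 'bright', label 1 = 'dark'."""
    m = float(crop.mean())
    return np.array([m, 1.0 - m])

# Toy 8x8 image with a bright square in the bottom-right corner.
image = np.zeros((8, 8))
image[4:8, 4:8] = 1.0

results = best_region_per_label(image, toy_score_fn)
for label, (box, score) in enumerate(results):
    print(label, box, score)
```

Because only the single best region is kept for each label, this sketch returns exactly one box per label, which matches the one-instance-per-class restriction noted above. When several regions tie, `argmax` keeps the first one encountered.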