LR-CNN: LOCAL-AWARE REGION CNN FOR VEHICLE DETECTION IN AERIAL IMAGERY

Liao, W.; Chen, X.; Yang, J.; Roth, S.; Goesele, M.; Yang, M. Y.; Rosenhahn, B.

doi:https://doi.org/10.5194/isprs-annals-V-2-2020-381-2020

Articles | Volume V-2-2020

https://doi.org/10.5194/isprs-annals-V-2-2020-381-2020

© Author(s) 2020. This work is distributed under
the Creative Commons Attribution 4.0 License.

https://doi.org/10.5194/isprs-annals-V-2-2020-381-2020

© Author(s) 2020. This work is distributed under
the Creative Commons Attribution 4.0 License.

Articles | Volume V-2-2020

03 Aug 2020

| 03 Aug 2020

LR-CNN: LOCAL-AWARE REGION CNN FOR VEHICLE DETECTION IN AERIAL IMAGERY

W. Liao, X. Chen, J. Yang, S. Roth, M. Goesele, M. Y. Yang, and B. Rosenhahn

Keywords: Deep Learning, Object Detection, Vehicle Detection, Twin Region Proposal, Feature Enhancement

Abstract. State-of-the-art object detection approaches such as Fast/Faster R-CNN, SSD, or YOLO have difficulties detecting dense, small targets with arbitrary orientation in large aerial images. The main reason is that using interpolation to align RoI features can result in a lack of accuracy or even loss of location information. We present the Local-aware Region Convolutional Neural Network (LR-CNN), a novel two-stage approach for vehicle detection in aerial imagery. We enhance translation invariance to detect dense vehicles and address the boundary quantization issue amongst dense vehicles by aggregating the high-precision RoIs’ features. Moreover, we resample high-level semantic pooled features, making them regain location information from the features of a shallower convolutional block. This strengthens the local feature invariance for the resampled features and enables detecting vehicles in an arbitrary orientation. The local feature invariance enhances the learning ability of the focal loss function, and the focal loss further helps to focus on the hard examples. Taken together, our method better addresses the challenges of aerial imagery. We evaluate our approach on several challenging datasets (VEDAI, DOTA), demonstrating a significant improvement over state-of-the-art methods. We demonstrate the good generalization ability of our approach on the DLR 3K dataset.

LR-CNN: LOCAL-AWARE REGION CNN FOR VEHICLE DETECTION IN AERIAL IMAGERY

Useful Links

Useful External Links

Our Contact