Object Detection on Unmanned Arial Vehicles Dataset Using Adaptive HydraNet

Date
2023-04-18
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
Recent years have witnessed substantial developments in object detection methods. However, detecting medium and small objects on Unmanned Aerial Vehicle (UAV) datasets remains a significant challenge due to the limitations of the current backbone architecture of these methods. This limitation arises from the architecture's multilabel classification step, which lacks precision in detecting small objects and consumes large amounts of computational resources. This study proposes a novel solution to overcome this limitation by introducing AHydraNet, a multitask learning module based on the low-cost dynamic multitask architecture HydraNet. AHydraNet is a multilabel classification template with an adaptive threshold that enhances the precision of the detection for small and medium-sized objects. We integrate AHydraNet into the Mask R-CNN's backbone by introducing a smaller module called the Adaptive Branching Network (ABN), which applies AHydraNet to all the output feature maps of the feature pyramid network. The resulting model is called AHydraFPN. The performance of AHydraFPN is evaluated on two popular datasets, MS-COCO and Arial-Cars, and compare it with the performance of Mask R-CNN. Our experimental results demonstrate that AHydraFPN achieves a significant improvement on average recall (AR) than the baseline model. These results indicate that our proposed solution can remarkably improve the detection of small and medium-sized objects on UAV datasets.
Description
Keywords
Multitask Learning, Object Detection, Multilabel Classification, UAV Datasets
Citation
Naseri Golestani, S. (2023). Object detection on unmanned arial vehicles dataset using adaptive HydraNet (Master's thesis, University of Calgary, Calgary, Canada). Retrieved from https://prism.ucalgary.ca.