Using images rendered by PBRT to train faster R-CNN for UAV detection
Files
Date issued
2018
Journal Title
Journal ISSN
Volume Title
Publisher
Václav Skala - UNION Agency
Abstract
Deep neural networks, such as Faster R-CNN, have been widely used in object detection. However, deep neural
networks usually require a large-scale dataset to achieve desirable performance. For the specific application, UAV
detection, training data is extremely limited in practice. Since annotating plenty of UAV images manually can be
very resource intensive and time consuming, instead, we use PBRT to render a large number of photorealistic UAV
images of high variation within a reasonable time. Using PBRT ensures the realism of rendered images, which
means they are indistinguishable from real photographs to some extent. Trained with our rendered images, the
Faster R-CNN has an AP of 80.69% on manually annotated UAV images test set, much higher than the one only
trained with COCO 2014 dataset and PASCAL VOC 2012 dataset (43.36%). Moreover, our rendered image dataset
contains not only bounding boxes of all UAVs, but also locations of some important parts of UAVs and locations
of all pixels covered by UAVs, which can be used for more complicated application, such as mask detection or
keypoint detection.
Description
Subject(s)
detekce objektů, hluboké učení, Faster R-CNN, PBRT, UAV
Citation
WSCG '2018: short communications proceedings: The 26th International Conference in Central Europe on Computer Graphics, Visualization and Computer Vision 2016 in co-operation with EUROGRAPHICS: University of West Bohemia, Plzen, Czech Republic May 28 - June 1 2018, p. 13-18.