Stingray Detection of Aerial Images Using Augmented Training Images
  Generated by A Conditional Generative Model

Abstract

In this paper, we present an object detection method for stingray detection in aerial images. The images are captured over a sea-surface area by an Unmanned Aerial Vehicle (UAV), and the targets to detect and locate are stingrays swimming under (but close to) the sea surface. To this end, we use a deep object detection method, Faster R-CNN, to train a stingray detector on a limited set of training images. To boost performance, we develop a new generative approach, conditional GLO, which extends the Generative Latent Optimization (GLO) approach and increases the number of stingray training samples. Unlike traditional data augmentation methods that generate new data only for image classification, our proposed method mixes foreground and background together, so it can generate new data for an object detection task and thus improve the training efficacy of a CNN detector. Experimental results show that our approach achieves satisfactory performance on stingray detection in aerial images.
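The idea of mixing foreground and background to produce detection training data can be illustrated with a much simpler stand-in than the paper's method. The sketch below is not conditional GLO: it only alpha-blends a fixed foreground patch onto a background at random positions and records the resulting bounding box, which is the kind of (image, box) pair a detector needs. The function names `paste_foreground` and `augment`, the grayscale list-of-rows image format, and the box convention are all assumptions made for illustration.

```python
import random


def paste_foreground(background, foreground, mask, top, left):
    """Alpha-blend `foreground` onto a copy of `background` at (top, left).

    Images are lists of rows of grayscale floats in [0, 1]; `mask` holds the
    per-pixel opacity of the foreground in [0, 1]. Returns (image, box), where
    box is (x_min, y_min, x_max, y_max) with exclusive maxima.
    This is a plain compositing stand-in, NOT the paper's conditional GLO.
    """
    fg_h, fg_w = len(foreground), len(foreground[0])
    bg_h, bg_w = len(background), len(background[0])
    if top < 0 or left < 0 or top + fg_h > bg_h or left + fg_w > bg_w:
        raise ValueError("foreground must fit inside the background")

    out = [row[:] for row in background]  # leave the input untouched
    for r in range(fg_h):
        for c in range(fg_w):
            a = mask[r][c]
            out[top + r][left + c] = (
                a * foreground[r][c] + (1.0 - a) * background[top + r][left + c]
            )
    return out, (left, top, left + fg_w, top + fg_h)


def augment(background, foreground, mask, n_samples, seed=0):
    """Generate `n_samples` composites at random positions (hypothetical helper)."""
    rng = random.Random(seed)
    fg_h, fg_w = len(foreground), len(foreground[0])
    bg_h, bg_w = len(background), len(background[0])
    samples = []
    for _ in range(n_samples):
        top = rng.randint(0, bg_h - fg_h)    # randint is inclusive at both ends
        left = rng.randint(0, bg_w - fg_w)
        samples.append(paste_foreground(background, foreground, mask, top, left))
    return samples
```

Each returned pair could be added to a detector's training set as an image with one annotated box. A learned generator, as the abstract describes, would replace the fixed blend with synthesized content, which this sketch does not attempt.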

Most cited references 1

Region-Based Convolutional Networks for Accurate Object Detection and Segmentation.

Ross Girshick, Jeff Donahue, Trevor Darrell … (2016)

Object detection performance, as measured on the canonical PASCAL VOC Challenge datasets, plateaued in the final years of the competition. The best-performing methods were complex ensemble systems that typically combined multiple low-level image features with high-level context. In this paper, we propose a simple and scalable detection algorithm that improves mean average precision (mAP) by more than 50 percent relative to the previous best result on VOC 2012, achieving a mAP of 62.4 percent. Our approach combines two ideas: (1) one can apply high-capacity convolutional networks (CNNs) to bottom-up region proposals in order to localize and segment objects and (2) when labeled training data are scarce, supervised pre-training for an auxiliary task, followed by domain-specific fine-tuning, boosts performance significantly. Since we combine region proposals with CNNs, we call the resulting model an R-CNN or Region-based Convolutional Network. Source code for the complete system is available at http://www.cs.berkeley.edu/~rbg/rcnn.

Author and article information

Publication date (created): 11 May 2018

Article

arXiv ID: 1805.04262

SO-VID: 23bfe5e7-3405-432a-94cc-2498c6512231

License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/

Custom metadata

Comments: To appear in CVPR 2018 Workshop (CVPR 2018 Workshop and Challenge: Automated Analysis of Marine Video for Environmental Monitoring)

Categories: cs.CV

ScienceOpen disciplines: Computer vision & Pattern recognition
