Deep Learning Based Target Tracking and Classification for Infrared Videos Using Compressive Measurements

HTML  XML Download Download as PDF (Size: 2335KB)  PP. 167-199  
DOI: 10.4236/jsip.2019.104010    949 Downloads   2,141 Views  Citations

ABSTRACT

Although compressive measurements save data storage and bandwidth usage, they are difficult to be used directly for target tracking and classification without pixel reconstruction. This is because the Gaussian random matrix destroys the target location information in the original video frames. This paper summarizes our research effort on target tracking and classification directly in the compressive measurement domain. We focus on one particular type of compressive measurement using pixel subsampling. That is, original pixels in video frames are randomly subsampled. Even in such a special compressive sensing setting, conventional trackers do not work in a satisfactory manner. We propose a deep learning approach that integrates YOLO (You Only Look Once) and ResNet (residual network) for multiple target tracking and classification. YOLO is used for multiple target tracking and ResNet is for target classification. Extensive experiments using short wave infrared (SWIR), mid-wave infrared (MWIR), and long-wave infrared (LWIR) videos demonstrated the efficacy of the proposed approach even though the training data are very scarce.

Share and Cite:

Kwan, C. , Chou, B. , Yang, J. and Tran, T. (2019) Deep Learning Based Target Tracking and Classification for Infrared Videos Using Compressive Measurements. Journal of Signal and Information Processing, 10, 167-199. doi: 10.4236/jsip.2019.104010.

Copyright © 2024 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.