Image region prediction from thermal videos based on image prediction generative adversarial network

Ganbayar Batchuluun, Ja Hyung Koo, Yu Hwan Kim, Kang Ryoung Park

Research output: Contribution to journalArticlepeer-review

6 Scopus citations

Abstract

Various studies have been conducted on object detection, tracking, and action recognition based on thermal images. However, errors occur during object detection, tracking, and action recognition when a moving object leaves the field of view (FOV) of a camera and part of the object becomes invisible. However, no studies have examined this issue so far. Therefore, this article proposes a method for widening the FOV of the current image by predicting images outside the FOV of the camera using the current image and previous sequential images. In the proposed method, the original one-channel thermal image is converted into a three-channel thermal image to perform image prediction using an image prediction generative adversarial network. When image prediction and object detection experiments were conducted using the marathon sub-dataset of the Boston University-thermal infrared video (BU-TIV) benchmark open dataset, we confirmed that the proposed method showed the higher accuracies of image prediction (structural similarity index measure (SSIM) of 0.9839) and object detection (F1 score (F1) of 0.882, accuracy (ACC) of 0.983, and intersection over union (IoU) of 0.791) than the state-of-the-art methods.

Original languageEnglish
Article number1053
JournalMathematics
Volume9
Issue number9
DOIs
StatePublished - 1 May 2021

Keywords

  • Deep learning
  • Generative adversarial network
  • Image prediction
  • Thermal videos

Fingerprint

Dive into the research topics of 'Image region prediction from thermal videos based on image prediction generative adversarial network'. Together they form a unique fingerprint.

Cite this