Skip to main navigation Skip to search Skip to main content

Text-guided diffusion-based restoration of extremely compressed backgrounds for VCM

  • Le Thi Hue Dao
  • , Naeun Yang
  • , Jooyoung Lee
  • , Seyoon Jeong
  • , Chul Lee
  • Dongguk University
  • Electronics and Telecommunications Research Institute

Research output: Contribution to journalArticlepeer-review

Abstract

Restoring high-quality images from severely degraded inputs is essential for video coding for machines (VCM), where background regions are compressed at extremely low bitrates. In this letter, we propose a novel text-guided diffusion-based restoration (TGDR) algorithm, which integrates semantic information from text captions to guide the restoration process. Specifically, we develop a refinement block that incorporates a transformer-based time-aware feature extractor to fuse visual features, time-step embeddings, and textual semantics adaptively to guide a pretrained diffusion model during the reverse denoising process. By incorporating both visual and textual information, TGDR effectively reconstructs complex structures and improves semantic consistency in highly compressed regions. Experimental results show that TGDR achieves superior performance compared to state-of-the-art algorithms.

Original languageEnglish
Pages (from-to)487-492
Number of pages6
JournalICT Express
Volume12
Issue number2
DOIs
StatePublished - Apr 2026

Keywords

  • Diffusion model
  • Image generation
  • Image restoration
  • Video coding for machines (VCM)

Fingerprint

Dive into the research topics of 'Text-guided diffusion-based restoration of extremely compressed backgrounds for VCM'. Together they form a unique fingerprint.

Cite this