CAGNet: A Multi-Scale Convolutional Attention Method for Glass Detection Based on Transformer

Xiaohang Hu, Rui Gao, Seungjun Yang, Kyungeun Cho

Research output: Contribution to journalArticlepeer-review

3 Scopus citations

Abstract

Glass plays a vital role in several fields, making its accurate detection crucial. Proper detection prevents misjudgments, reduces noise from reflections, and ensures optimal performance in other computer vision tasks. However, the prevalent usage of glass in daily applications poses unique challenges for computer vision. This study introduces a novel convolutional attention glass segmentation network (CAGNet) predicated on a transformer architecture customized for image glass detection. Based on the foundation of our prior study, CAGNet minimizes the number of training cycles and iterations, resulting in enhanced performance and efficiency. CAGNet is built upon the strategic design and integration of two types of convolutional attention mechanisms coupled with a transformer head applied for comprehensive feature analysis and fusion. To further augment segmentation precision, the network incorporates a custom edge-weighting scheme to optimize glass detection within images. Comparative studies and rigorous testing demonstrate that CAGNet outperforms several leading methodologies in glass detection, exhibiting robustness across a diverse range of conditions. Specifically, the IOU metric improves by 0.26% compared to that in our previous study and presents a 0.92% enhancement over those of other state-of-the-art methods.

Original languageEnglish
Article number4084
JournalMathematics
Volume11
Issue number19
DOIs
StatePublished - Oct 2023

Keywords

  • convolutional attention
  • feature analyze
  • semantic segmentation
  • transformer

Fingerprint

Dive into the research topics of 'CAGNet: A Multi-Scale Convolutional Attention Method for Glass Detection Based on Transformer'. Together they form a unique fingerprint.

Cite this