ICA-Net:improving class activation for weakly super-vised semantic segmentation via joint contrastive and simulation learning

In the field of optoelectronics,certain types of data may be difficult to accurately annotate,such as high-resolution optoelectronic imaging or imaging in certain special spectral ranges.Weakly supervised learning can provide a more reliable approach in these situations.Current popular approaches ma...

Full description

Saved in:
Bibliographic Details
Published in光电子快报(英文版) Vol. 21; no. 3; pp. 188 - 192
Main Authors YE Zhuang, LIU Ruyu, SUN Bo
Format Journal Article
LanguageEnglish
Published School of Information Engineering,Chengdu Aviation Vocational and Technical College,Chengdu 610100,China%School of Computer Science and Engineering,Hangzhou Normal University,Hangzhou 311121,China%Quanzhou Institute of Equipment Manufacturing,Haixi Institutes,Chinese Academy of Sciences,Quanzhou 362200,China 2025
Online AccessGet full text
ISSN1673-1905
DOI10.1007/s11801-025-4056-2

Cover

More Information
Summary:In the field of optoelectronics,certain types of data may be difficult to accurately annotate,such as high-resolution optoelectronic imaging or imaging in certain special spectral ranges.Weakly supervised learning can provide a more reliable approach in these situations.Current popular approaches mainly adopt the classification-based class activation maps(CAM)as initial pseudo labels to solve the task.However,they may fail to estimate the complete object regions,especially in cases of multiple categories existing in one image or tight intersections existing among multiple catego-ries.To solve the problem,we propose a two-branch framework with joint contrastive learning and simulation learning to mine more object regions and produce more complete CAM.Specifically,a contrastive learning branch is designed to learn class-independent activation maps from both foreground and background information,and an original CAM branch is served as supervision to provide accurate discriminative regions.Through simulation learning between the two branches,the enhanced activation maps,which are more complete to cover the objects,are achieved to generate accurate pixel-level pseudo labels.In addition,in order to actively provide important features for contrastive learning,we enhance the backbone in the contrastive learning branch via spatial and channel attention mechanisms.Extensive experiments on the PASCAL VOC 2012 and CUB-200-2011 benchmarks demonstrate that the proposed ICA-Net outperforms many state-of-the-art methods and achieves leading performance.
ISSN:1673-1905
DOI:10.1007/s11801-025-4056-2