ICA-Net:improving class activation for weakly super-vised semantic segmentation via joint contrastive and simulation learning
In the field of optoelectronics,certain types of data may be difficult to accurately annotate,such as high-resolution optoelectronic imaging or imaging in certain special spectral ranges.Weakly supervised learning can provide a more reliable approach in these situations.Current popular approaches ma...
Saved in:
Published in | 光电子快报(英文版) Vol. 21; no. 3; pp. 188 - 192 |
---|---|
Main Authors | , , |
Format | Journal Article |
Language | English |
Published |
School of Information Engineering,Chengdu Aviation Vocational and Technical College,Chengdu 610100,China%School of Computer Science and Engineering,Hangzhou Normal University,Hangzhou 311121,China%Quanzhou Institute of Equipment Manufacturing,Haixi Institutes,Chinese Academy of Sciences,Quanzhou 362200,China
2025
|
Online Access | Get full text |
ISSN | 1673-1905 |
DOI | 10.1007/s11801-025-4056-2 |
Cover
Summary: | In the field of optoelectronics,certain types of data may be difficult to accurately annotate,such as high-resolution optoelectronic imaging or imaging in certain special spectral ranges.Weakly supervised learning can provide a more reliable approach in these situations.Current popular approaches mainly adopt the classification-based class activation maps(CAM)as initial pseudo labels to solve the task.However,they may fail to estimate the complete object regions,especially in cases of multiple categories existing in one image or tight intersections existing among multiple catego-ries.To solve the problem,we propose a two-branch framework with joint contrastive learning and simulation learning to mine more object regions and produce more complete CAM.Specifically,a contrastive learning branch is designed to learn class-independent activation maps from both foreground and background information,and an original CAM branch is served as supervision to provide accurate discriminative regions.Through simulation learning between the two branches,the enhanced activation maps,which are more complete to cover the objects,are achieved to generate accurate pixel-level pseudo labels.In addition,in order to actively provide important features for contrastive learning,we enhance the backbone in the contrastive learning branch via spatial and channel attention mechanisms.Extensive experiments on the PASCAL VOC 2012 and CUB-200-2011 benchmarks demonstrate that the proposed ICA-Net outperforms many state-of-the-art methods and achieves leading performance. |
---|---|
ISSN: | 1673-1905 |
DOI: | 10.1007/s11801-025-4056-2 |