Learning to Segment From Scribbles Using Multi-Scale Adversarial Attention Gates

Large, fine-grained image segmentation datasets, annotated at pixel-level, are difficult to obtain, particularly in medical imaging, where annotations also require expert knowledge. Weakly-supervised learning can train models by relying on weaker forms of annotation, such as scribbles. Here, we lear...

Full description

Saved in:

Bibliographic Details
Published in	IEEE transactions on medical imaging Vol. 40; no. 8; pp. 1990 - 2001
Main Authors	Valvano, Gabriele, Leo, Andrea, Tsaftaris, Sotirios A.
Format	Journal Article
Language	English
Published	United States IEEE 01.08.2021 The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects	Annotations attention Datasets Gallium nitride GAN Gating Image processing Image segmentation Learning Localization Logic gates Masks Medical imaging scribbles segmentation Semantics Semi-supervised learning Shape shape Priors Training Weak supervision
Online Access	Get full text
ISSN	0278-0062 1558-254X 1558-254X
DOI	10.1109/TMI.2021.3069634

Cover

More Information
Summary:	Large, fine-grained image segmentation datasets, annotated at pixel-level, are difficult to obtain, particularly in medical imaging, where annotations also require expert knowledge. Weakly-supervised learning can train models by relying on weaker forms of annotation, such as scribbles. Here, we learn to segment using scribble annotations in an adversarial game. With unpaired segmentation masks, we train a multi-scale GAN to generate realistic segmentation masks at multiple resolutions, while we use scribbles to learn their correct position in the image. Central to the model's success is a novel attention gating mechanism, which we condition with adversarial signals to act as a shape prior, resulting in better object localization at multiple scales. Subject to adversarial conditioning, the segmentor learns attention maps that are semantic, suppress the noisy activations outside the objects, and reduce the vanishing gradient problem in the deeper layers of the segmentor. We evaluated our model on several medical (ACDC, LVSC, CHAOS) and non-medical (PPSS) datasets, and we report performance levels matching those achieved by models trained with fully annotated segmentation masks. We also demonstrate extensions in a variety of settings: semi-supervised learning; combining multiple scribble sources (a crowdsourcing scenario) and multi-task learning (combining scribble and mask supervision). We release expert-made scribble annotations for the ACDC dataset, and the code used for the experiments, at https://vios-s.github.io/multiscale-adversarial-attention-gates .
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23
ISSN:	0278-0062 1558-254X 1558-254X
DOI:	10.1109/TMI.2021.3069634