Learning Target-Oriented Dual Attention for Robust RGB-T Tracking

RGB-Thermal object tracking attempts to locate target object using complementary visual and thermal infrared data. Existing RGB-T trackers fuse different modalities by robust feature representation learning or adaptive modal weighting. However, how to integrate dual attention mechanism for visual tr...

Full description

Saved in:
Bibliographic Details
Published in2019 IEEE International Conference on Image Processing (ICIP) pp. 3975 - 3979
Main Authors Yang, Rui, Zhu, Yabin, Wang, Xiao, Li, Chenglong, Tang, Jin
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.09.2019
Subjects
Online AccessGet full text
ISSN2381-8549
DOI10.1109/ICIP.2019.8803528

Cover

More Information
Summary:RGB-Thermal object tracking attempts to locate target object using complementary visual and thermal infrared data. Existing RGB-T trackers fuse different modalities by robust feature representation learning or adaptive modal weighting. However, how to integrate dual attention mechanism for visual tracking is still a subject that has not been studied yet. In this paper, we propose two visual attention mechanisms for robust RGB-T object tracking. Specifically, the local attention is implemented by exploiting the common visual attention of RGB and thermal data to train deep classifiers. We also introduce the global attention, which is a multimodal target-driven attention estimation network. It can provide global proposals for the classifier together with local proposals extracted from previous tracking result. Extensive experiments on two RGB-T benchmark datasets validated the effectiveness of our proposed algorithm.
ISSN:2381-8549
DOI:10.1109/ICIP.2019.8803528