Stereo Endoscopic Camera Pose Optimal Estimation by Structure Similarity Index Measure Integration

ABSTRACT Background Accurate endoscopic camera pose estimation is crucial for real‐time AR navigation systems. While current methods primarily use depth and optical flow, they often ignore structural inconsistencies between images. Methods Leveraging the RAFT framework, we process sequential stereo...

Full description

Saved in:
Bibliographic Details
Published inThe international journal of medical robotics + computer assisted surgery Vol. 21; no. 3; pp. e70078 - n/a
Main Authors Lian, Ruoqi, Li, Wei, Hao, Junchen, Zhang, Yanfang, Jia, Fucang
Format Journal Article
LanguageEnglish
Published England Wiley Subscription Services, Inc 01.06.2025
Subjects
Online AccessGet full text
ISSN1478-5951
1478-596X
1478-596X
DOI10.1002/rcs.70078

Cover

More Information
Summary:ABSTRACT Background Accurate endoscopic camera pose estimation is crucial for real‐time AR navigation systems. While current methods primarily use depth and optical flow, they often ignore structural inconsistencies between images. Methods Leveraging the RAFT framework, we process sequential stereo RGB pairs to extract optical flow and depth features for pose estimation. To address structural inconsistencies, we refine the weights for both 2D and 3D residuals by computing SSIM indices for the left and right views, as well as pre‐ and post‐optical flow transformations. The SSIM metric is also used in the loss function. Results Experiments on the StereoMIS dataset demonstrate our method's improved pose estimation accuracy compared to rigid SLAM methods, showing a lower accumulated trajectory error (ATE‐RMSE: 18.5 mm). Additionally, ablation experiments achieved an 11.49% reduction in average error. Conclusion The pose estimation accuracy has been improved by incorporating SSIM. The code is available at: https://github.com/lianrq/pose‐estimation‐by‐SSIM‐Integration.
Bibliography:Funding
This work was supported in part by the Shenzhen Science and Technology Program (SGDX20230116092200001, JCYJ20220818101802005, and JCYJ20210324112611031), the National Natural Science Foundation of China (62172401 and 82227806), the Guangdong Natural Science Program (2024A0505040020), the National Key Research and Development Program (2024YFF1206903).
ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ISSN:1478-5951
1478-596X
1478-596X
DOI:10.1002/rcs.70078