Teeth-SEG: An Efficient Instance Segmentation Framework for Orthodontic Treatment Based on Multi-Scale Aggregation and Anthropic Prior Knowledge

Teeth localization, segmentation, and labeling in 2D images have great potential in modern dentistry to enhance dental diagnostics, treatment planning, and population-based studies on oral health. However, general instance segmentation frameworks are incompetent due to 1) the sub-tle differences bet...

Full description

Saved in:

Bibliographic Details
Published in	Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) pp. 11601 - 11610
Main Authors	Zou, Bo, Wang, Shaofeng, Liu, Hao, Sun, Gaoyue, Wang, Yajie, Zuo, FeiFei, Quan, Chengbin, Zhao, Youjian
Format	Conference Proceeding
Language	English
Published	IEEE 16.06.2024
Subjects	Computational modeling Image Segmentation Instance segmentation Intra-Oral Image Location awareness Medical Vision Semantics Shape Teeth Teeth Segmentation Visualization
Online Access	Get full text
ISSN	1063-6919
DOI	10.1109/CVPR52733.2024.01102

Cover

More Information
Summary:	Teeth localization, segmentation, and labeling in 2D images have great potential in modern dentistry to enhance dental diagnostics, treatment planning, and population-based studies on oral health. However, general instance segmentation frameworks are incompetent due to 1) the sub-tle differences between some teeth' shapes (e.g., maxillary first premolar and second premolar), 2) the teeth's position and shape variation across subjects, and 3) the presence of abnormalities in the dentition (e.g., caries and edentulism). To address these problems, we propose a ViT-based frame-work named TeethSEG, which consists of stacked Multi-Scale Aggregation (MSA) blocks and an Anthropic Prior Knowledge (APK) layer. Specifically, to compose the two modules, we design a unique permutation-based upscaler to ensure high efficiency while establishing clear segmentation boundaries with multi-head selflcross-gating layers to emphasize particular semantics meanwhile maintaining the divergence between token embeddings. Besides, we collect the first open-sourced intraoral image dataset IO150K, which comprises over 150k intraoral photos, and all photos are annotated by orthodontists using a human-machine hybrid algorithm. Experiments on IO150K demonstrate that our TeethSEG outperforms the state-of-the-art segmentation models on dental image segmentation.
ISSN:	1063-6919
DOI:	10.1109/CVPR52733.2024.01102