How can we learn (more) from challenges? A statistical approach to driving future algorithm development

Bibliographic Details
Main Authors	Roß, Tobias, Bruno, Pierangela, Reinke, Annika, Wiesenfarth, Manuel, Koeppel, Lisa, Full, Peter M, Pekdemir, Bünyamin, Godau, Patrick, Trofimova, Darya, Isensee, Fabian, Moccia, Sara, Calimeri, Francesco, Müller-Stich, Beat P, Kopp-Schneider, Annette, Maier-Hein, Lena
Format	Journal Article
Language	English
Published	17.06.2021
Subjects	Computer Science - Computer Vision and Pattern Recognition
DOI	10.48550/arxiv.2106.09302

Abstract	Challenges have become the state-of-the-art approach to benchmark image analysis algorithms in a comparative manner. While the validation on identical data sets was a great step forward, results analysis is often restricted to pure ranking tables, leaving relevant questions unanswered. Specifically, little effort has been put into the systematic investigation of what characterizes images in which state-of-the-art algorithms fail. To address this gap in the literature, we (1) present a statistical framework for learning from challenges and (2) instantiate it for the specific task of instrument instance segmentation in laparoscopic videos. Our framework relies on the semantic metadata annotation of images, which serves as the foundation for a General Linear Mixed Models (GLMM) analysis. Based on 51,542 metadata annotations performed on 2,728 images, we applied our approach to the results of the Robust Medical Instrument Segmentation (ROBUST-MIS) challenge 2019 and revealed underexposure, motion and occlusion of instruments, as well as the presence of smoke or other objects in the background, as major sources of algorithm failure. Our subsequent method development, tailored to the specific remaining issues, yielded a deep learning model with state-of-the-art overall performance and specific strengths in the processing of images in which previous methods tended to fail. Due to the objectivity and generic applicability of our approach, it could become a valuable tool for validation in the field of medical image analysis and beyond.
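The idea in the abstract of relating per-image algorithm performance to semantic metadata can be illustrated with a deliberately simplified sketch. It is not the GLMM analysis the authors describe: as a crude stand-in, it computes, for each binary metadata attribute, the difference in mean score between images with and without that attribute, first within each algorithm and then averaged across algorithms. The records, attribute names (`smoke`, `underexposed`), algorithm names, and the function `attribute_effects` are all hypothetical and chosen only for illustration.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical toy records: (algorithm, per-image score, metadata flags).
# None of these values come from the ROBUST-MIS challenge.
records = [
    ("algo_a", 0.90, {"smoke": False, "underexposed": False}),
    ("algo_a", 0.60, {"smoke": True, "underexposed": False}),
    ("algo_a", 0.50, {"smoke": False, "underexposed": True}),
    ("algo_a", 0.30, {"smoke": True, "underexposed": True}),
    ("algo_b", 0.80, {"smoke": False, "underexposed": False}),
    ("algo_b", 0.70, {"smoke": True, "underexposed": False}),
    ("algo_b", 0.40, {"smoke": False, "underexposed": True}),
    ("algo_b", 0.30, {"smoke": True, "underexposed": True}),
]


def attribute_effects(records):
    """Return, per attribute, the across-algorithm average of
    (mean score with the attribute) - (mean score without it).

    Differencing within each algorithm first keeps one algorithm's
    overall level from dominating; this only loosely mimics what a
    per-algorithm random effect does in a mixed model.
    """
    # scores[attribute][algorithm][flag] -> list of scores
    scores = defaultdict(lambda: defaultdict(lambda: {True: [], False: []}))
    for algo, score, flags in records:
        for attr, present in flags.items():
            scores[attr][algo][present].append(score)

    effects = {}
    for attr, per_algo in scores.items():
        # Skip algorithms that lack images in either group.
        diffs = [
            mean(groups[True]) - mean(groups[False])
            for groups in per_algo.values()
            if groups[True] and groups[False]
        ]
        if diffs:
            effects[attr] = mean(diffs)
    return effects


effects = attribute_effects(records)
# Most negative effect first: attributes most associated with lower scores.
ranked = sorted(effects.items(), key=lambda item: item[1])
```

A more negative value marks an attribute associated with lower scores in the toy data. Unlike a mixed model, this sketch treats each attribute in isolation and provides no uncertainty estimates, so it should be read only as an intuition aid.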
Copyright	http://creativecommons.org/licenses/by-nc-nd/4.0
OpenAccessLink	https://arxiv.org/abs/2106.09302