How can we learn (more) from challenges? A statistical approach to driving future algorithm development

Bibliographic Details
Main Authors Roß, Tobias; Bruno, Pierangela; Reinke, Annika; Wiesenfarth, Manuel; Koeppel, Lisa; Full, Peter M; Pekdemir, Bünyamin; Godau, Patrick; Trofimova, Darya; Isensee, Fabian; Moccia, Sara; Calimeri, Francesco; Müller-Stich, Beat P; Kopp-Schneider, Annette; Maier-Hein, Lena
Format Journal Article
Language English
Published 17.06.2021
DOI 10.48550/arxiv.2106.09302

Abstract Challenges have become the state-of-the-art approach to benchmarking image analysis algorithms in a comparative manner. While validation on identical data sets was a great step forward, the analysis of results is often restricted to pure ranking tables, leaving relevant questions unanswered. Specifically, little effort has been put into systematically investigating what characterizes the images on which state-of-the-art algorithms fail. To address this gap in the literature, we (1) present a statistical framework for learning from challenges and (2) instantiate it for the specific task of instrument instance segmentation in laparoscopic videos. Our framework relies on the semantic metadata annotation of images, which serves as the foundation for a General Linear Mixed Models (GLMM) analysis. Based on 51,542 metadata annotations performed on 2,728 images, we applied our approach to the results of the Robust Medical Instrument Segmentation (ROBUST-MIS) challenge 2019 and revealed underexposure, motion and occlusion of instruments, as well as the presence of smoke or other objects in the background, as major sources of algorithm failure. Our subsequent method development, tailored to the specific remaining issues, yielded a deep learning model with state-of-the-art overall performance and specific strengths in processing images on which previous methods tended to fail. Due to the objectivity and generic applicability of our approach, it could become a valuable tool for validation in the field of medical image analysis and beyond.
Copyright http://creativecommons.org/licenses/by-nc-nd/4.0
Open Access Link https://arxiv.org/abs/2106.09302
Resource Type Preprint
Subjects Computer Science - Computer Vision and Pattern Recognition