How can we learn (more) from challenges? A statistical approach to driving future algorithm development
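| Main Authors | Roß, Tobias; Bruno, Pierangela; Reinke, Annika; Wiesenfarth, Manuel; Koeppel, Lisa; Full, Peter M; Pekdemir, Bünyamin; Godau, Patrick; Trofimova, Darya; Isensee, Fabian; Moccia, Sara; Calimeri, Francesco; Müller-Stich, Beat P; Kopp-Schneider, Annette; Maier-Hein, Lena |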
|---|---|
| Format | Journal Article |
| Language | English |
| Published | 17.06.2021 |
| Subjects | Computer Science - Computer Vision and Pattern Recognition |
| DOI | 10.48550/arxiv.2106.09302 |
| Abstract | Challenges have become the state-of-the-art approach to benchmark image
analysis algorithms in a comparative manner. While the validation on identical
data sets was a great step forward, results analysis is often restricted to
pure ranking tables, leaving relevant questions unanswered. Specifically,
little effort has been put into systematically investigating what
characterizes images in which state-of-the-art algorithms fail. To address this
gap in the literature, we (1) present a statistical framework for learning from
challenges and (2) instantiate it for the specific task of instrument instance
segmentation in laparoscopic videos. Our framework relies on the semantic
metadata annotation of images, which serves as the foundation for a General Linear
Mixed Models (GLMM) analysis. Based on 51,542 metadata annotations performed
on 2,728 images, we applied our approach to the results of the Robust Medical
Instrument Segmentation (ROBUST-MIS) Challenge 2019 and revealed
underexposure, motion and occlusion of instruments as well as the presence of
smoke or other objects in the background as major sources of algorithm failure.
Our subsequent method development, tailored to the specific remaining issues,
yielded a deep learning model with state-of-the-art overall performance and
specific strengths in the processing of images in which previous methods tended
to fail. Due to the objectivity and generic applicability of our approach, it
could become a valuable tool for validation in the field of medical image
analysis and beyond. and segmentation of small, crossing, moving and
transparent instrument(s) (parts). |
|---|---|
| Author | Roß, Tobias; Bruno, Pierangela; Reinke, Annika; Wiesenfarth, Manuel; Koeppel, Lisa; Full, Peter M; Pekdemir, Bünyamin; Godau, Patrick; Trofimova, Darya; Isensee, Fabian; Moccia, Sara; Calimeri, Francesco; Müller-Stich, Beat P; Kopp-Schneider, Annette; Maier-Hein, Lena |
| Author_xml | – sequence: 1 givenname: Tobias surname: Roß fullname: Roß, Tobias – sequence: 2 givenname: Pierangela surname: Bruno fullname: Bruno, Pierangela – sequence: 3 givenname: Annika surname: Reinke fullname: Reinke, Annika – sequence: 4 givenname: Manuel surname: Wiesenfarth fullname: Wiesenfarth, Manuel – sequence: 5 givenname: Lisa surname: Koeppel fullname: Koeppel, Lisa – sequence: 6 givenname: Peter M surname: Full fullname: Full, Peter M – sequence: 7 givenname: Bünyamin surname: Pekdemir fullname: Pekdemir, Bünyamin – sequence: 8 givenname: Patrick surname: Godau fullname: Godau, Patrick – sequence: 9 givenname: Darya surname: Trofimova fullname: Trofimova, Darya – sequence: 10 givenname: Fabian surname: Isensee fullname: Isensee, Fabian – sequence: 11 givenname: Sara surname: Moccia fullname: Moccia, Sara – sequence: 12 givenname: Francesco surname: Calimeri fullname: Calimeri, Francesco – sequence: 13 givenname: Beat P surname: Müller-Stich fullname: Müller-Stich, Beat P – sequence: 14 givenname: Annette surname: Kopp-Schneider fullname: Kopp-Schneider, Annette – sequence: 15 givenname: Lena surname: Maier-Hein fullname: Maier-Hein, Lena |
| BackLink | https://doi.org/10.48550/arXiv.2106.09302 (View paper in arXiv) |
| ContentType | Journal Article |
| Copyright | http://creativecommons.org/licenses/by-nc-nd/4.0 |
| Copyright_xml | – notice: http://creativecommons.org/licenses/by-nc-nd/4.0 |
| DBID | AKY GOX |
| DOI | 10.48550/arxiv.2106.09302 |
| DatabaseName | arXiv Computer Science arXiv.org |
| DatabaseTitleList | |
| Database_xml | – sequence: 1 dbid: GOX name: arXiv.org url: http://arxiv.org/find sourceTypes: Open Access Repository |
| DeliveryMethod | fulltext_linktorsrc |
| ExternalDocumentID | 2106_09302 |
| GroupedDBID | AKY GOX |
| ID | FETCH-arxiv_primary_2106_093023 |
| IEDL.DBID | GOX |
| IngestDate | Tue Jul 22 21:57:52 EDT 2025 |
| IsDoiOpenAccess | true |
| IsOpenAccess | true |
| IsPeerReviewed | false |
| IsScholarly | false |
| Language | English |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-arxiv_primary_2106_093023 |
| OpenAccessLink | https://arxiv.org/abs/2106.09302 |
| ParticipantIDs | arxiv_primary_2106_09302 |
| PublicationCentury | 2000 |
| PublicationDate | 2021-06-17 |
| PublicationDateYYYYMMDD | 2021-06-17 |
| PublicationDate_xml | – month: 06 year: 2021 text: 2021-06-17 day: 17 |
| PublicationDecade | 2020 |
| PublicationYear | 2021 |
| Score | 3.5284638 |
| SecondaryResourceType | preprint |
| SourceID | arxiv |
| SourceType | Open Access Repository |
| SubjectTerms | Computer Science - Computer Vision and Pattern Recognition |
| Title | How can we learn (more) from challenges? A statistical approach to driving future algorithm development |
| URI | https://arxiv.org/abs/2106.09302 |
| hasFullText | 1 |
| inHoldings | 1 |
| linkProvider | Cornell University |
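
The abstract names a GLMM analysis over per-image metadata annotations but does not spell out the model. As a rough sketch of what such a model could look like (the symbols, the choice of random effects, and the link function $g$ are assumptions for illustration, not taken from the paper):

```latex
% Hypothetical GLMM sketch; all symbols are illustrative assumptions.
% y_{ia}: performance score of algorithm a on image i
% x_{ik}: annotation k for image i (e.g. 1 if smoke is present, else 0)
% u_a, v_i: random effects for algorithm and image
g\bigl(\mathbb{E}[\,y_{ia} \mid u_a, v_i\,]\bigr)
  = \beta_0 + \sum_{k=1}^{K} \beta_k x_{ik} + u_a + v_i,
\qquad
u_a \sim \mathcal{N}(0, \sigma_u^2), \quad
v_i \sim \mathcal{N}(0, \sigma_v^2)
```

Under this reading, a fixed-effect coefficient $\beta_k$ whose estimate points clearly toward lower scores would flag characteristic $k$ (for example underexposure or smoke) as a candidate source of algorithm failure.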