
Metrics reloaded: recommendations for image analysis validation

Permanent link
https://hdl.handle.net/10037/36708
DOI
https://doi.org/10.1038/s41592-023-02151-z
File
article.pdf (35.57 MB), accepted manuscript version (PDF)
Date
2024-02-12
Type
Journal article
Peer reviewed

Author
Maier-Hein, Lena; Reinke, Annika; Godau, Patrick; Tizabi, Minu D.; Buettner, Florian; Christodoulou, Evangelia; Glocker, Ben; Isensee, Fabian; Kleesiek, Jens; Kozubek, Michal; Reyes, Mauricio; Riegler, Michael; Wiesenfarth, Manuel; Kavur, A. Emre; Sudre, Carole H.; Baumgartner, Michael; Eisenmann, Matthias; Heckmann-Nötzel, Doreen; Rädsch, Tim; Acion, Laura; Antonelli, Michela; Arbel, Tal; Bakas, Spyridon; Benis, Arriel; Blaschko, Matthew B.; Cardoso, M. Jorge; Cheplygina, Veronika; Cimini, Beth A.; Collins, Gary S.; Farahani, Keyvan; Ferrer, Luciana; Galdran, Adrian; van Ginneken, Bram; Haase, Robert; Hashimoto, Daniel A.; Hoffman, Michael M.; Huisman, Merel; Jannin, Pierre; Kahn, Charles E.; Kainmueller, Dagmar; Kainz, Bernhard; Karargyris, Alexandros; Karthikesalingam, Alan; Kofler, Florian; Kopp-Schneider, Annette; Kreshuk, Anna; Kurc, Tahsin; Landman, Bennett A.; Litjens, Geert; Madani, Amin; Maier-Hein, Klaus; Martel, Anne L.; Mattson, Peter; Meijering, Erik; Menze, Bjoern; Moons, Karel G. M.; Müller, Henning; Nichyporuk, Brennan; Nickel, Felix; Petersen, Jens; Rajpoot, Nasir; Rieke, Nicola; Saez-Rodriguez, Julio; Sánchez, Clara I.; Shetty, Shravya; van Smeden, Maarten; Summers, Ronald M.; Taha, Abdel A.; Tiulpin, Aleksei; Tsaftaris, Sotirios A.; Van Calster, Ben; Varoquaux, Gaël; Jäger, Paul F.
Abstract
Increasing evidence shows that flaws in machine learning (ML) algorithm validation are an underestimated global problem. In biomedical image analysis, chosen performance metrics often do not reflect the domain interest, and thus fail to adequately measure scientific progress and hinder translation of ML techniques into practice. To overcome this, we created Metrics Reloaded, a comprehensive framework guiding researchers in the problem-aware selection of metrics. Developed by a large international consortium in a multistage Delphi process, it is based on the novel concept of a problem fingerprint—a structured representation of the given problem that captures all aspects that are relevant for metric selection, from the domain interest to the properties of the target structure(s), dataset and algorithm output. On the basis of the problem fingerprint, users are guided through the process of choosing and applying appropriate validation metrics while being made aware of potential pitfalls. Metrics Reloaded targets image analysis problems that can be interpreted as classification tasks at image, object or pixel level, namely image-level classification, object detection, semantic segmentation and instance segmentation tasks. To improve the user experience, we implemented the framework in the Metrics Reloaded online tool. Following the convergence of ML methodology across application domains, Metrics Reloaded fosters the convergence of validation methodology. Its applicability is demonstrated for various biomedical use cases.
Publisher
Springer Nature
Citation
Maier-Hein, Reinke, Godau, Tizabi, Buettner, Christodoulou, Glocker, Isensee, Kleesiek, Kozubek, Reyes, Riegler, Wiesenfarth, Kavur, Sudre, Baumgartner, Eisenmann, Heckmann-Nötzel, Rädsch, Acion, Antonelli, Arbel, Bakas, Benis, Blaschko, Cardoso, Cheplygina, Cimini, Collins, Farahani, Ferrer, Galdran, van Ginneken, Haase, Hashimoto, Hoffman, Huisman, Jannin, Kahn, Kainmueller, Kainz, Karargyris, Karthikesalingam, Kofler, Kopp-Schneider, Kreshuk, Kurc, Landman, Litjens, Madani, Maier-Hein, Martel, Mattson, Meijering, Menze, Moons, Müller, Nichyporuk, Nickel, Petersen, Rajpoot, Rieke, Saez-Rodriguez, Sánchez, Shetty, van Smeden, Summers, Taha, Tiulpin, Tsaftaris, Van Calster, Varoquaux, Jäger. Metrics reloaded: recommendations for image analysis validation. Nature Methods. 2024;21(2):195-212
Collections
  • Artikler, rapporter og annet (informatikk) [481]
Copyright 2024 The Author(s)

UiT The Arctic University of Norway
The University Library