Metrics reloaded: recommendations for image analysis validation.

Maier-Hein, Lena; Reinke, Annika; Godau, Patrick; Tizabi, Minu D; Buettner, Florian; Christodoulou, Evangelia; Glocker, Ben; Isensee, Fabian; Kleesiek, Jens; Kozubek, Michal; Reyes, Mauricio; Riegler, Michael A; Wiesenfarth, Manuel; Kavur, A Emre; Sudre, Carole H; Baumgartner, Michael; Eisenmann, Matthias; Heckmann-Nötzel, Doreen; Rädsch, Tim; Acion, Laura; ... (2024). Metrics reloaded: recommendations for image analysis validation. Nature methods, 21(2), pp. 195-212. Nature Publishing Group 10.1038/s41592-023-02151-z

[img] Text
s41592-023-02151-z.pdf - Published Version
Restricted to registered users only
Available under License Publisher holds Copyright.

Download (7MB) | Request a copy

Increasing evidence shows that flaws in machine learning (ML) algorithm validation are an underestimated global problem. In biomedical image analysis, chosen performance metrics often do not reflect the domain interest, and thus fail to adequately measure scientific progress and hinder translation of ML techniques into practice. To overcome this, we created Metrics Reloaded, a comprehensive framework guiding researchers in the problem-aware selection of metrics. Developed by a large international consortium in a multistage Delphi process, it is based on the novel concept of a problem fingerprint-a structured representation of the given problem that captures all aspects that are relevant for metric selection, from the domain interest to the properties of the target structure(s), dataset and algorithm output. On the basis of the problem fingerprint, users are guided through the process of choosing and applying appropriate validation metrics while being made aware of potential pitfalls. Metrics Reloaded targets image analysis problems that can be interpreted as classification tasks at image, object or pixel level, namely image-level classification, object detection, semantic segmentation and instance segmentation tasks. To improve the user experience, we implemented the framework in the Metrics Reloaded online tool. Following the convergence of ML methodology across application domains, Metrics Reloaded fosters the convergence of validation methodology. Its applicability is demonstrated for various biomedical use cases.

Item Type:

Journal Article (Review Article)

Division/Institute:

04 Faculty of Medicine > Department of Haematology, Oncology, Infectious Diseases, Laboratory Medicine and Hospital Pharmacy (DOLS) > Clinic of Radiation Oncology

UniBE Contributor:

Reyes, Mauricio

Subjects:

600 Technology > 610 Medicine & health

ISSN:

1548-7091

Publisher:

Nature Publishing Group

Language:

English

Submitter:

Pubmed Import

Date Deposited:

13 Feb 2024 09:33

Last Modified:

15 Feb 2024 00:17

Publisher DOI:

10.1038/s41592-023-02151-z

PubMed ID:

38347141

BORIS DOI:

10.48350/192845

URI:

https://boris.unibe.ch/id/eprint/192845

Actions (login required)

Edit item Edit item
Provide Feedback