Understanding metric-related pitfalls in image analysis validation.

Biological Imaging; Biomedical Image Processing; Challenges; Classification; Computer Vision; Detection; Evaluation; Good Scientific Practice; Instance Segmentation; Localization; Medical Imaging; Metrics; Pitfalls; Segmentation; Semantic Segmentation; Validation

Journal

ArXiv
ISSN: 2331-8422
Abbreviated title: ArXiv
Country: United States
NLM ID: 101759493

Publication information

Publication date:
25 Sep 2023
History:
pubmed: 23 Mar 2023
medline: 23 Mar 2023
entrez: 22 Mar 2023
Status: epublish

Abstract

Validation metrics are key for the reliable tracking of scientific progress and for bridging the current chasm between artificial intelligence (AI) research and its translation into practice. However, increasing evidence shows that particularly in image analysis, metrics are often chosen inadequately in relation to the underlying research problem. This could be attributed to a lack of accessibility of metric-related knowledge: While taking into account the individual strengths, weaknesses, and limitations of validation metrics is a critical prerequisite to making educated choices, the relevant knowledge is currently scattered and poorly accessible to individual researchers. Based on a multi-stage Delphi process conducted by a multidisciplinary expert consortium as well as extensive community feedback, the present work provides the first reliable and comprehensive common point of access to information on pitfalls related to validation metrics in image analysis. Focusing on biomedical image analysis but with the potential of transfer to other fields, the addressed pitfalls generalize across application domains and are categorized according to a newly created, domain-agnostic taxonomy. To facilitate comprehension, illustrations and specific examples accompany each pitfall. As a structured body of information accessible to researchers of all levels of expertise, this work enhances global comprehension of a key topic in image analysis validation.

Identifiers

pubmed: 36945687
pii: 2302.01790
pmc: PMC10029046

Publication types

Preprint

Languages

eng

Conflict of interest statement

COMPETING INTERESTS The authors declare the following competing interests: F.B. is an employee of Siemens AG (Munich, Germany). B.v.G. is a shareholder of Thirona (Nijmegen, NL). B.G. is an employee of HeartFlow Inc (California, USA) and Kheiron Medical Technologies Ltd (London, UK). M.M.H. received an Nvidia GPU Grant. Th. K. is an employee of Lunit (Seoul, South Korea). G.L. is on the advisory board of Canon Healthcare IT (Minnetonka, USA) and is a shareholder of Aiosyn BV (Nijmegen, NL). Na.R. is the founder and CSO of Histofy (New York, USA). Ni.R. is an employee of Nvidia GmbH (Munich, Germany). J.S.-R. reports funding from GSK (Heidelberg, Germany), Pfizer (New York, USA) and Sanofi (Paris, France) and fees from Travere Therapeutics (California, USA), Stadapharm (Bad Vilbel, Germany), Astex Therapeutics (Cambridge, UK), Pfizer (New York, USA), and Grunenthal (Aachen, Germany). R.M.S. receives patent royalties from iCAD (New Hampshire, USA), ScanMed (Nebraska, USA), Philips (Amsterdam, NL), Translation Holdings (Alabama, USA) and PingAn (Shenzhen, China); his lab received research support from PingAn through a Cooperative Research and Development Agreement. S.A.T. receives financial support from Canon Medical Research Europe (Edinburgh, Scotland).

Authors

Annika Reinke (A)

German Cancer Research Center (DKFZ) Heidelberg, Division of Intelligent Medical Systems and HI Helmholtz Imaging, Germany and Faculty of Mathematics and Computer Science, Heidelberg University, Heidelberg, Germany.

Minu D Tizabi (MD)

German Cancer Research Center (DKFZ) Heidelberg, Division of Intelligent Medical Systems, Germany and National Center for Tumor Diseases (NCT), NCT Heidelberg, a partnership between DKFZ and University Medical Center Heidelberg, Germany.

Michael Baumgartner (M)

German Cancer Research Center (DKFZ) Heidelberg, Division of Medical Image Computing, Germany.

Matthias Eisenmann (M)

German Cancer Research Center (DKFZ) Heidelberg, Division of Intelligent Medical Systems, Germany.

Doreen Heckmann-Nötzel (D)

German Cancer Research Center (DKFZ) Heidelberg, Division of Intelligent Medical Systems, Germany and National Center for Tumor Diseases (NCT), NCT Heidelberg, a partnership between DKFZ and University Medical Center Heidelberg, Germany.

A Emre Kavur (AE)

HI Applied Computer Vision Lab, Division of Medical Image Computing; German Cancer Research Center (DKFZ) Heidelberg, Division of Intelligent Medical Systems, Germany.

Tim Rädsch (T)

German Cancer Research Center (DKFZ) Heidelberg, Division of Intelligent Medical Systems and HI Helmholtz Imaging, Germany.

Carole H Sudre (CH)

MRC Unit for Lifelong Health and Ageing at UCL and Centre for Medical Image Computing, Department of Computer Science, University College London, London, UK and School of Biomedical Engineering and Imaging Science, King's College London, London, UK.

Laura Acion (L)

Instituto de Cálculo, CONICET - Universidad de Buenos Aires, Buenos Aires, Argentina.

Michela Antonelli (M)

School of Biomedical Engineering and Imaging Science, King's College London, London, UK and Centre for Medical Image Computing, University College London, London, UK.

Tal Arbel (T)

Centre for Intelligent Machines and MILA (Quebec Artificial Intelligence Institute), McGill University, Montreal, Canada.

Spyridon Bakas (S)

Division of Computational Pathology, Dept of Pathology & Laboratory Medicine, Indiana University School of Medicine, IU Health Information and Translational Sciences Building, Indianapolis, USA and Center for Biomedical Image Computing and Analytics (CBICA), University of Pennsylvania, Richards Medical Research Laboratories FL7, Philadelphia, PA, USA.

Arriel Benis (A)

Department of Digital Medical Technologies, Holon Institute of Technology, Holon, Israel and European Federation for Medical Informatics, Le Mont-sur-Lausanne, Switzerland.

Matthew B Blaschko (MB)

Center for Processing Speech and Images, Department of Electrical Engineering, KU Leuven, Leuven, Belgium.

Florian Buettner (F)

German Cancer Consortium (DKTK), partner site Frankfurt/Mainz, a partnership between DKFZ and UCT Frankfurt-Marburg, Germany, German Cancer Research Center (DKFZ) Heidelberg, Germany, Goethe University Frankfurt, Department of Medicine, Germany, Goethe University Frankfurt, Department of Informatics, Germany, and Frankfurt Cancer Institute, Germany.

M Jorge Cardoso (MJ)

School of Biomedical Engineering and Imaging Science, King's College London, London, UK.

Veronika Cheplygina (V)

Department of Computer Science, IT University of Copenhagen, Copenhagen, Denmark.

Jianxu Chen (J)

Leibniz-Institut für Analytische Wissenschaften - ISAS - e.V., Dortmund, Germany.

Evangelia Christodoulou (E)

German Cancer Research Center (DKFZ) Heidelberg, Division of Intelligent Medical Systems, Germany.

Beth A Cimini (BA)

Imaging Platform, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA.

Gary S Collins (GS)

Centre for Statistics in Medicine, University of Oxford, Oxford, UK.

Keyvan Farahani (K)

Center for Biomedical Informatics and Information Technology, National Cancer Institute, Bethesda, MD, USA.

Luciana Ferrer (L)

Instituto de Investigación en Ciencias de la Computación (ICC), CONICET-UBA, Ciudad Universitaria, Ciudad Autónoma de Buenos Aires, Argentina.

Adrian Galdran (A)

Universitat Pompeu Fabra, Barcelona, Spain and University of Adelaide, Adelaide, Australia.

Bram van Ginneken (B)

Fraunhofer MEVIS, Bremen, Germany and Radboud Institute for Health Sciences, Radboud University Medical Center, Nijmegen, The Netherlands.

Ben Glocker (B)

Department of Computing, Imperial College London, London, UK.

Patrick Godau (P)

German Cancer Research Center (DKFZ) Heidelberg, Division of Intelligent Medical Systems, Germany, Faculty of Mathematics and Computer Science, Heidelberg University, Heidelberg, Germany, and National Center for Tumor Diseases (NCT), NCT Heidelberg, a partnership between DKFZ and University Medical Center Heidelberg, Germany.

Robert Haase (R)

Now with: Center for Scalable Data Analytics and Artificial Intelligence (ScaDS.AI), Leipzig University, Leipzig, Germany, DFG Cluster of Excellence "Physics of Life", Technische Universität (TU) Dresden, Dresden, Germany, and Center for Systems Biology, Dresden, Germany.

Daniel A Hashimoto (DA)

Department of Surgery, Perelman School of Medicine, Philadelphia, PA, USA and General Robotics Automation Sensing and Perception Laboratory, School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, PA, USA.

Michael M Hoffman (MM)

Princess Margaret Cancer Centre, University Health Network, Toronto, Canada, Department of Medical Biophysics, University of Toronto, Toronto, Canada, Department of Computer Science, University of Toronto, Toronto, Canada, and Vector Institute for Artificial Intelligence, Toronto, Canada.

Merel Huisman (M)

Department of Radiology and Nuclear Medicine, Radboud University Medical Center, Nijmegen, The Netherlands.

Fabian Isensee (F)

German Cancer Research Center (DKFZ) Heidelberg, Division of Medical Image Computing and HI Applied Computer Vision Lab, Germany.

Pierre Jannin (P)

Laboratoire Traitement du Signal et de l'Image - UMR_S 1099, Université de Rennes 1, Rennes, France and INSERM, Paris Cedex, France.

Charles E Kahn (CE)

Department of Radiology and Institute for Biomedical Informatics, University of Pennsylvania, Philadelphia, PA, USA.

Dagmar Kainmueller (D)

Max-Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), Biomedical Image Analysis and HI Helmholtz Imaging, Berlin, Germany and University of Potsdam, Digital Engineering Faculty, Potsdam, Germany.

Bernhard Kainz (B)

Department of Computing, Faculty of Engineering, Imperial College London, London, UK and Department AIBE, Friedrich-Alexander-Universität (FAU), Erlangen-Nürnberg, Germany.

Alexandros Karargyris (A)

IHU Strasbourg, Strasbourg, France.

Alan Karthikesalingam (A)

Google Health DeepMind, London, UK.

Hannes Kenngott (H)

Department of General, Visceral and Transplantation Surgery, Heidelberg University Hospital, Heidelberg, Germany.

Jens Kleesiek (J)

Translational Image-guided Oncology (TIO), Institute for AI in Medicine (IKIM), University Medicine Essen, Essen, Germany.

Florian Kofler (F)

Helmholtz AI, München, Germany.

Thijs Kooi (T)

Lunit, Seoul, South Korea.

Annette Kopp-Schneider (A)

German Cancer Research Center (DKFZ) Heidelberg, Division of Biostatistics, Germany.

Michal Kozubek (M)

Centre for Biomedical Image Analysis and Faculty of Informatics, Masaryk University, Brno, Czech Republic.

Anna Kreshuk (A)

Cell Biology and Biophysics Unit, European Molecular Biology Laboratory (EMBL), Heidelberg, Germany.

Tahsin Kurc (T)

Department of Biomedical Informatics, Stony Brook University, Stony Brook, NY, USA.

Bennett A Landman (BA)

Electrical Engineering, Vanderbilt University, Nashville, TN, USA.

Geert Litjens (G)

Department of Pathology, Radboud University Medical Center, Nijmegen, The Netherlands.

Amin Madani (A)

Department of Surgery, University Health Network, Toronto, ON, Canada.

Klaus Maier-Hein (K)

German Cancer Research Center (DKFZ) Heidelberg, Division of Medical Image Computing and HI Helmholtz Imaging, Germany and Pattern Analysis and Learning Group, Department of Radiation Oncology, Heidelberg University Hospital, Heidelberg, Germany.

Anne L Martel (AL)

Physical Sciences, Sunnybrook Research Institute, Toronto, Canada and Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada.

Peter Mattson (P)

Google, Mountain View, USA.

Erik Meijering (E)

School of Computer Science and Engineering, University of New South Wales, Sydney, Australia.

Bjoern Menze (B)

Department of Quantitative Biomedicine, University of Zurich, Zurich, Switzerland.

Karel G M Moons (KGM)

Julius Center for Health Sciences and Primary Care, UMC Utrecht, Utrecht University, Utrecht, The Netherlands.

Henning Müller (H)

Information Systems Institute, University of Applied Sciences Western Switzerland (HES-SO), Sierre, Switzerland and Medical Faculty, University of Geneva, Geneva, Switzerland.

Brennan Nichyporuk (B)

MILA (Quebec Artificial Intelligence Institute), Montréal, Canada.

Felix Nickel (F)

Department of General, Visceral and Thoracic Surgery, University Medical Center Hamburg-Eppendorf, Hamburg, Germany.

Jens Petersen (J)

German Cancer Research Center (DKFZ) Heidelberg, Division of Medical Image Computing, Germany.

Susanne M Rafelski (SM)

Allen Institute for Cell Science, Seattle, WA, USA.

Nasir Rajpoot (N)

Tissue Image Analytics Laboratory, Department of Computer Science, University of Warwick, Coventry, UK.

Mauricio Reyes (M)

ARTORG Center for Biomedical Engineering Research, University of Bern, Bern, Switzerland and Department of Radiation Oncology, University Hospital Bern, University of Bern, Bern, Switzerland.

Michael A Riegler (MA)

Simula Metropolitan Center for Digital Engineering, Oslo, Norway and UiT The Arctic University of Norway, Tromsø, Norway.

Nicola Rieke (N)

NVIDIA GmbH, München, Germany.

Julio Saez-Rodriguez (J)

Institute for Computational Biomedicine, Heidelberg University, Heidelberg, Germany and Faculty of Medicine, Heidelberg University Hospital, Heidelberg, Germany.

Clara I Sánchez (CI)

Informatics Institute, Faculty of Science, University of Amsterdam, Amsterdam, The Netherlands.

Shravya Shetty (S)

Google Health, Google, CA, USA.

Ronald M Summers (RM)

National Institutes of Health Clinical Center, Bethesda, MD, USA.

Abdel A Taha (AA)

Institute of Information Systems Engineering, TU Wien, Vienna, Austria.

Aleksei Tiulpin (A)

Research Unit of Health Sciences and Technology, Faculty of Medicine, University of Oulu, Oulu, Finland and Neurocenter Oulu, Oulu University Hospital, Oulu, Finland.

Sotirios A Tsaftaris (SA)

School of Engineering, The University of Edinburgh, Edinburgh, Scotland.

Ben Van Calster (B)

Department of Development and Regeneration and EPI-centre, KU Leuven, Leuven, Belgium and Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, The Netherlands.

Gaël Varoquaux (G)

Parietal project team, INRIA Saclay-Île de France, Palaiseau, France.

Ziv R Yaniv (ZR)

National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, USA.

Paul F Jäger (PF)

German Cancer Research Center (DKFZ) Heidelberg, Interactive Machine Learning Group and HI Helmholtz Imaging, Germany.

Lena Maier-Hein (L)

German Cancer Research Center (DKFZ) Heidelberg, Division of Intelligent Medical Systems and HI Helmholtz Imaging, Germany, Faculty of Mathematics and Computer Science and Medical Faculty, Heidelberg University, Heidelberg, Germany, and National Center for Tumor Diseases (NCT), NCT Heidelberg, a partnership between DKFZ and University Medical Center Heidelberg, Germany.

MeSH classifications