Validating polyp and instrument segmentation methods in colonoscopy through Medico 2020 and MedAI 2021 Challenges.

Colonoscopy Computer-aided diagnosis Medicine Polyp challenge Polyp segmentation Transparency

Journal

Medical image analysis
ISSN: 1361-8423
Titre abrégé: Med Image Anal
Pays: Netherlands
ID NLM: 9713490

Informations de publication

Date de publication:
05 Sep 2024
Historique:
received: 27 09 2023
revised: 11 08 2024
accepted: 12 08 2024
medline: 21 9 2024
pubmed: 21 9 2024
entrez: 20 9 2024
Statut: aheadofprint

Résumé

Automatic analysis of colonoscopy images has been an active field of research motivated by the importance of early detection of precancerous polyps. However, detecting polyps during the live examination can be challenging due to various factors such as variation of skills and experience among the endoscopists, lack of attentiveness, and fatigue leading to a high polyp miss-rate. Therefore, there is a need for an automated system that can flag missed polyps during the examination and improve patient care. Deep learning has emerged as a promising solution to this challenge as it can assist endoscopists in detecting and classifying overlooked polyps and abnormalities in real time, improving the accuracy of diagnosis and enhancing treatment. In addition to the algorithm's accuracy, transparency and interpretability are crucial to explaining the whys and hows of the algorithm's prediction. Further, conclusions based on incorrect decisions may be fatal, especially in medicine. Despite these pitfalls, most algorithms are developed in private data, closed source, or proprietary software, and methods lack reproducibility. Therefore, to promote the development of efficient and transparent methods, we have organized the "Medico automatic polyp segmentation (Medico 2020)" and "MedAI: Transparency in Medical Image Segmentation (MedAI 2021)" competitions. The Medico 2020 challenge received submissions from 17 teams, while the MedAI 2021 challenge also gathered submissions from another 17 distinct teams in the following year. We present a comprehensive summary and analyze each contribution, highlight the strength of the best-performing methods, and discuss the possibility of clinical translations of such methods into the clinic. Our analysis revealed that the participants improved dice coefficient metrics from 0.8607 in 2020 to 0.8993 in 2021 despite adding diverse and challenging frames (containing irregular, smaller, sessile, or flat polyps), which are frequently missed during a routine clinical examination. For the instrument segmentation task, the best team obtained a mean Intersection over union metric of 0.9364. For the transparency task, a multi-disciplinary team, including expert gastroenterologists, accessed each submission and evaluated the team based on open-source practices, failure case analysis, ablation studies, usability and understandability of evaluations to gain a deeper understanding of the models' credibility for clinical deployment. The best team obtained a final transparency score of 21 out of 25. Through the comprehensive analysis of the challenge, we not only highlight the advancements in polyp and surgical instrument segmentation but also encourage subjective evaluation for building more transparent and understandable AI-based colonoscopy systems. Moreover, we discuss the need for multi-center and out-of-distribution testing to address the current limitations of the methods to reduce the cancer burden and improve patient care.

Identifiants

pubmed: 39303447
pii: S1361-8415(24)00232-9
doi: 10.1016/j.media.2024.103307
pii:
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

103307

Informations de copyright

Copyright © 2024 The Author(s). Published by Elsevier B.V. All rights reserved.

Déclaration de conflit d'intérêts

Declaration of competing interest 1. Financial Interests: Author have no financial interests, direct or indirect, in the research or its outcomes presented in the manuscript. 2. Non-Financial Interests: Author have no non-financial interests that could be perceived as having influenced the research or its presentation in the manuscript. 3. Conflicts of Interest: Author confirm that there are no known conflicts of interest that could potentially bias the results, analysis, or conclusions presented in the manuscript.

Auteurs

Debesh Jha (D)

Machine & Hybrid Intelligence Lab, Department of Radiology, Northwestern University, Chicago, USA. Electronic address: debesh.jha@northwestern.edu.

Vanshali Sharma (V)

Indian Institute of Technology, Guwahati, India.

Debapriya Banik (D)

Jadavpur University, Kolkata, India.

Debayan Bhattacharya (D)

Institute of Medical Technology and Intelligent Systems, Technische Universität Hamburg, Germany.

Kaushiki Roy (K)

Jadavpur University, Kolkata, India.

Steven A Hicks (SA)

SimulaMet, Oslo, Norway.

Nikhil Kumar Tomar (NK)

Machine & Hybrid Intelligence Lab, Department of Radiology, Northwestern University, Chicago, USA.

Vajira Thambawita (V)

SimulaMet, Oslo, Norway.

Adrian Krenzer (A)

Julius-Maximilian University of Würzburg, Germany.

Ge-Peng Ji (GP)

College of Engineering, Australian National University, Canberra, Australia.

Sahadev Poudel (S)

Department of IT Convergence Engineering, Gachon University, Seongnam 13120, South Korea.

George Batchkala (G)

Department of Engineering Science, University of Oxford, Oxford, UK.

Saruar Alam (S)

University of Bergen, Bergen, Norway.

Awadelrahman M A Ahmed (AMA)

University of Oslo, Norway.

Quoc-Huy Trinh (QH)

Faculty of Information Technology, University of Science, VNU-HCM, Viet Nam.

Zeshan Khan (Z)

National University of Computer and Emerging Sciences, Karachi Campus, Pakistan.

Tien-Phat Nguyen (TP)

Faculty of Information Technology, University of Science, VNU-HCM, Viet Nam.

Shruti Shrestha (S)

NepAL Applied Mathematics and Informatics Institute for Research (NAAMII), Kathmandu, Nepal.

Sabari Nathan (S)

Couger Inc, Tokyo, Japan.

Jeonghwan Gwak (J)

Department of Software, Korea National University of Transportation, Chungju-si, South Korea.

Ritika K Jha (RK)

Machine & Hybrid Intelligence Lab, Department of Radiology, Northwestern University, Chicago, USA.

Zheyuan Zhang (Z)

Machine & Hybrid Intelligence Lab, Department of Radiology, Northwestern University, Chicago, USA.

Alexander Schlaefer (A)

Institute of Medical Technology and Intelligent Systems, Technische Universität Hamburg, Germany.

Debotosh Bhattacharjee (D)

Jadavpur University, Kolkata, India.

M K Bhuyan (MK)

Indian Institute of Technology, Guwahati, India.

Pradip K Das (PK)

Indian Institute of Technology, Guwahati, India.

Deng-Ping Fan (DP)

Computer Vision Lab (CVL), ETH Zurich, Zurich, Switzerland.

Sravanthi Parasa (S)

Swedish Medical Center, Seattle, USA.

Sharib Ali (S)

School of Computing, University of Leeds, LS2 9JT, Leeds, United Kingdom.

Michael A Riegler (MA)

SimulaMet, Oslo, Norway; Oslo Metropolitan University, Oslo, Norway. Electronic address: michael@simula.no.

Pål Halvorsen (P)

SimulaMet, Oslo, Norway; Oslo Metropolitan University, Oslo, Norway.

Thomas de Lange (T)

Department of Medicine and Emergencies - Mölndal Sahlgrenska University Hospital, Region Västra Götaland, Sweden; Department of Molecular and Clinical Medicin, Sahlgrenska Academy, University of Gothenburg, Sweden.

Ulas Bagci (U)

Machine & Hybrid Intelligence Lab, Department of Radiology, Northwestern University, Chicago, USA.

Classifications MeSH