The reliability of a deep learning model in clinical out-of-distribution MRI data: A multicohort study.


Journal

Medical image analysis
ISSN: 1361-8423
Titre abrégé: Med Image Anal
Pays: Netherlands
ID NLM: 9713490

Informations de publication

Date de publication:
12 2020
Historique:
received: 07 11 2019
revised: 17 04 2020
accepted: 24 04 2020
pubmed: 3 10 2020
medline: 24 6 2021
entrez: 2 10 2020
Statut: ppublish

Résumé

Deep learning (DL) methods have in recent years yielded impressive results in medical imaging, with the potential to function as clinical aid to radiologists. However, DL models in medical imaging are often trained on public research cohorts with images acquired with a single scanner or with strict protocol harmonization, which is not representative of a clinical setting. The aim of this study was to investigate how well a DL model performs in unseen clinical datasets-collected with different scanners, protocols and disease populations-and whether more heterogeneous training data improves generalization. In total, 3117 MRI scans of brains from multiple dementia research cohorts and memory clinics, that had been visually rated by a neuroradiologist according to Scheltens' scale of medial temporal atrophy (MTA), were included in this study. By training multiple versions of a convolutional neural network on different subsets of this data to predict MTA ratings, we assessed the impact of including images from a wider distribution during training had on performance in external memory clinic data. Our results showed that our model generalized well to datasets acquired with similar protocols as the training data, but substantially worse in clinical cohorts with visibly different tissue contrasts in the images. This implies that future DL studies investigating performance in out-of-distribution (OOD) MRI data need to assess multiple external cohorts for reliable results. Further, by including data from a wider range of scanners and protocols the performance improved in OOD data, which suggests that more heterogeneous training data makes the model generalize better. To conclude, this is the most comprehensive study to date investigating the domain shift in deep learning on MRI data, and we advocate rigorous evaluation of DL models on clinical data prior to being certified for deployment.

Identifiants

pubmed: 33007638
pii: S1361-8415(20)30078-5
doi: 10.1016/j.media.2020.101714
pii:
doi:

Types de publication

Journal Article Research Support, N.I.H., Extramural Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

101714

Subventions

Organisme : NIA NIH HHS
ID : U01 AG024904
Pays : United States
Organisme : Department of Defense
ID : W81XWH-12-2-0012
Pays : International
Organisme : NIA NIH HHS
ID : U01 AG024904
Pays : United States

Informations de copyright

Copyright © 2020 The Author(s). Published by Elsevier B.V. All rights reserved.

Déclaration de conflit d'intérêts

Declaration of Competing Interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Auteurs

Gustav Mårtensson (G)

Division of Clinical Geriatrics, Department of Neurobiology, Care Sciences and Society, Karolinska Institutet, Stockholm, Sweden. Electronic address: gustav.martensson@ki.se.

Daniel Ferreira (D)

Division of Clinical Geriatrics, Department of Neurobiology, Care Sciences and Society, Karolinska Institutet, Stockholm, Sweden.

Tobias Granberg (T)

Department of Clinical Neuroscience, Karolinska Institutet, Stockholm, Sweden; Department of Radiology, Karolinska University Hospital, Stockholm, Sweden.

Lena Cavallin (L)

Department of Clinical Neuroscience, Karolinska Institutet, Stockholm, Sweden; Department of Radiology, Karolinska University Hospital, Stockholm, Sweden.

Ketil Oppedal (K)

Centre for Age-Related Medicine, Stavanger University Hospital, Stavanger, Norway; Stavanger Medical Imaging Laboratory (SMIL), Department of Radiology, Stavanger University Hospital, Stavanger, Norway; Department of Electrical Engineering and Computer Science, University of Stavanger, Stavanger, Norway.

Alessandro Padovani (A)

Neurology Unit, Department of Clinical and Experimental Sciences, University of Brescia, Brescia, Italy.

Irena Rektorova (I)

1st Department of Neurology, Medical Faculty, St. Anne's Hospital and CEITEC, Masaryk University, Brno, Czech Republic.

Laura Bonanni (L)

Department of Neuroscience Imaging and Clinical Sciences and CESI, University G d'Annunzio of Chieti-Pescara, Chieti, Italy.

Matteo Pardini (M)

Department of Neuroscience (DINOGMI), University of Genoa and Neurology Clinics, Polyclinic San Martino Hospital, Genoa, Italy.

Milica G Kramberger (MG)

Department of Neurology, University Medical Centre Ljubljana, Medical faculty, University of Ljubljana, Slovenia.

John-Paul Taylor (JP)

Institute of Neuroscience, Newcastle University, Newcastle upon Tyne, UK.

Jakub Hort (J)

Memory Clinic, Department of Neurology, Charles University, 2nd Faculty of Medicine and Motol University Hospital, Prague, Czech Republic.

Jón Snædal (J)

Landspitali University Hospital, Reykjavik, Iceland.

Jaime Kulisevsky (J)

Movement Disorders Unit, Neurology Department, Sant Pau Hospital, Barcelona, Spain; Institut d'Investigacions Biomédiques Sant Pau (IIB-Sant Pau), Barcelona, Spain; Centro de Investigación en Red-Enfermedades Neurodegenerativas (CIBERNED), Barcelona, Spain; Universitat Autónoma de Barcelona (U.A.B.), Barcelona, Spain.

Frederic Blanc (F)

Day Hospital of Geriatrics, Memory Resource and Research Centre (CM2R) of Strasbourg, Department of Geriatrics, Hôpitaux Universitaires de Strasbourg, Strasbourg, France; University of Strasbourg and French National Centre for Scientific Research (CNRS), ICube Laboratory and Fédération de Médecine Translationnelle de Strasbourg (FMTS), Team Imagerie Multimodale Intégrative en Santé (IMIS)/ICONE, Strasbourg, France.

Angelo Antonini (A)

Department of Neuroscience, University of Padua, Padua & Fondazione Ospedale San Camillo, Venezia, Venice, Italy.

Patrizia Mecocci (P)

Institute of Gerontology and Geriatrics, University of Perugia, Perugia, Italy.

Bruno Vellas (B)

UMR INSERM 1027, gerontopole, CHU, University of Toulouse, France.

Magda Tsolaki (M)

3rd Department of Neurology, Memory and Dementia Unit, Aristotle University of Thessaloniki, Thessaloniki, Greece.

Iwona Kłoszewska (I)

Medical University of Lodz, Lodz, Poland.

Hilkka Soininen (H)

Institute of Clinical Medicine, Neurology, University of Eastern Finland, Finland; Neurocenter, Neurology, Kuopio University Hospital, Kuopio, Finland.

Simon Lovestone (S)

Department of Psychiatry, Warneford Hospital, University of Oxford, Oxford, UK.

Andrew Simmons (A)

NIHR Biomedical Research Centre for Mental Health, London, UK; NIHR Biomedical Research Unit for Dementia, London, UK; Department of Neuroimaging, Centre for Neuroimaging Sciences, Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK.

Dag Aarsland (D)

Centre for Age-Related Medicine, Stavanger University Hospital, Stavanger, Norway; Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK.

Eric Westman (E)

Division of Clinical Geriatrics, Department of Neurobiology, Care Sciences and Society, Karolinska Institutet, Stockholm, Sweden; Department of Neuroimaging, Centre for Neuroimaging Sciences, Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH