Generalising uncertainty improves accuracy and safety of deep learning analytics applied to oncology.
Journal
Scientific reports
ISSN: 2045-2322
Titre abrégé: Sci Rep
Pays: England
ID NLM: 101563288
Informations de publication
Date de publication:
06 05 2023
06 05 2023
Historique:
received:
22
08
2022
accepted:
07
03
2023
medline:
8
5
2023
pubmed:
7
5
2023
entrez:
6
5
2023
Statut:
epublish
Résumé
Uncertainty estimation is crucial for understanding the reliability of deep learning (DL) predictions, and critical for deploying DL in the clinic. Differences between training and production datasets can lead to incorrect predictions with underestimated uncertainty. To investigate this pitfall, we benchmarked one pointwise and three approximate Bayesian DL models for predicting cancer of unknown primary, using three RNA-seq datasets with 10,968 samples across 57 cancer types. Our results highlight that simple and scalable Bayesian DL significantly improves the generalisation of uncertainty estimation. Moreover, we designed a prototypical metric-the area between development and production curve (ADP), which evaluates the accuracy loss when deploying models from development to production. Using ADP, we demonstrate that Bayesian DL improves accuracy under data distributional shifts when utilising 'uncertainty thresholding'. In summary, Bayesian DL is a promising approach for generalising uncertainty, improving performance, transparency, and safety of DL models for deployment in the real world.
Identifiants
pubmed: 37149669
doi: 10.1038/s41598-023-31126-5
pii: 10.1038/s41598-023-31126-5
pmc: PMC10164181
doi:
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
7395Informations de copyright
© 2023. The Author(s).
Références
Nature. 2017 Mar 2;543(7643):65-71
pubmed: 28199314
Nat Commun. 2020 Apr 20;11(1):1897
pubmed: 32312968
Nature. 2015 May 28;521(7553):489-94
pubmed: 26017449
Nature. 2016 Mar 3;531(7592):47-52
pubmed: 26909576
Nat Methods. 2021 Mar;18(3):272-282
pubmed: 33589839
JAMA Netw Open. 2019 Apr 5;2(4):e192597
pubmed: 31026023
Cell. 2018 Apr 5;173(2):291-304.e6
pubmed: 29625048
Cancers (Basel). 2019 Feb 06;11(2):
pubmed: 30736342
Cancers (Basel). 2020 Mar 05;12(3):
pubmed: 32150991
Radiology. 2022 Jan;302(1):212-213
pubmed: 34698572
Nat Mach Intell. 2019 May;1(5):206-215
pubmed: 35603010
EBioMedicine. 2020 Nov;61:103030
pubmed: 33039710
Nature. 2017 Aug 17;548(7667):297-303
pubmed: 28783718
IEEE Trans Med Imaging. 2020 May;39(5):1419-1429
pubmed: 31675322
Proteins. 2021 Dec;89(12):1607-1617
pubmed: 34533838
Nat Commun. 2020 Feb 5;11(1):728
pubmed: 32024849
Nat Commun. 2020 Oct 16;11(1):5259
pubmed: 33067454
Nature. 2021 Aug;596(7873):583-589
pubmed: 34265844
NPJ Precis Oncol. 2022 Mar 2;6(1):14
pubmed: 35236916
Front Oncol. 2020 Oct 23;10:580919
pubmed: 33194711
Mol Cancer Res. 2021 Jun;19(6):950-956
pubmed: 33811161
Trends Cogn Sci. 1999 Apr;3(4):128-135
pubmed: 10322466
Genomics Proteomics Bioinformatics. 2018 Feb;16(1):17-32
pubmed: 29522900
J Big Data. 2021;8(1):101
pubmed: 34306963
Phys Med Biol. 2022 May 27;67(11):
pubmed: 35421855
Cancer Cell. 2022 Jan 10;40(1):88-102.e7
pubmed: 34951955
Cell Rep. 2021 Dec 21;37(12):110132
pubmed: 34936871
Contemp Oncol (Pozn). 2015;19(1A):A68-77
pubmed: 25691825
Epidemiology. 2021 May 1;32(3):378-388
pubmed: 33591049
Radiology. 2019 Dec;293(3):607-617
pubmed: 31592731
Genome Med. 2021 Sep 27;13(1):152
pubmed: 34579788
Cancers (Basel). 2022 Feb 24;14(5):
pubmed: 35267493
Nature. 2017 May 11;545(7653):175-180
pubmed: 28467829
Sci Rep. 2021 Jan 29;11(1):2641
pubmed: 33514769