Enrichment of lung cancer computed tomography collections with AI-derived annotations.


Journal

Scientific data
ISSN: 2052-4463
Titre abrégé: Sci Data
Pays: England
ID NLM: 101640192

Informations de publication

Date de publication:
04 Jan 2024
Historique:
received: 12 06 2023
accepted: 17 12 2023
medline: 5 1 2024
pubmed: 5 1 2024
entrez: 4 1 2024
Statut: epublish

Résumé

Public imaging datasets are critical for the development and evaluation of automated tools in cancer imaging. Unfortunately, many do not include annotations or image-derived features, complicating downstream analysis. Artificial intelligence-based annotation tools have been shown to achieve acceptable performance and can be used to automatically annotate large datasets. As part of the effort to enrich public data available within NCI Imaging Data Commons (IDC), here we introduce AI-generated annotations for two collections containing computed tomography images of the chest, NSCLC-Radiomics, and a subset of the National Lung Screening Trial. Using publicly available AI algorithms, we derived volumetric annotations of thoracic organs-at-risk, their corresponding radiomics features, and slice-level annotations of anatomical landmarks and regions. The resulting annotations are publicly available within IDC, where the DICOM format is used to harmonize the data and achieve FAIR (Findable, Accessible, Interoperable, Reusable) data principles. The annotations are accompanied by cloud-enabled notebooks demonstrating their use. This study reinforces the need for large, publicly accessible curated datasets and demonstrates how AI can aid in cancer imaging.

Identifiants

pubmed: 38177130
doi: 10.1038/s41597-023-02864-y
pii: 10.1038/s41597-023-02864-y
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

25

Subventions

Organisme : U.S. Department of Health & Human Services | NIH | National Cancer Institute (NCI)
ID : HHSN261201500003l
Organisme : U.S. Department of Health & Human Services | NIH | National Cancer Institute (NCI)
ID : HHSN261201500003l
Organisme : U.S. Department of Health & Human Services | NIH | National Cancer Institute (NCI)
ID : HHSN261201500003l
Organisme : U.S. Department of Health & Human Services | NIH | National Cancer Institute (NCI)
ID : HHSN261201500003l
Organisme : U.S. Department of Health & Human Services | NIH | National Cancer Institute (NCI)
ID : HHSN261201500003l
Organisme : U.S. Department of Health & Human Services | NIH | National Cancer Institute (NCI)
ID : HHSN261201500003l
Organisme : U.S. Department of Health & Human Services | NIH | National Cancer Institute (NCI)
ID : HHSN261201500003l
Organisme : U.S. Department of Health & Human Services | NIH | National Cancer Institute (NCI)
ID : HHSN261201500003l
Organisme : U.S. Department of Health & Human Services | NIH | National Cancer Institute (NCI)
ID : HHSN261201500003l
Organisme : U.S. Department of Health & Human Services | NIH | National Institute of Biomedical Imaging and Bioengineering (NIBIB)
ID : T32EB025823-04

Informations de copyright

© 2024. The Author(s).

Références

Fedorov, A. et al. National Cancer Institute Imaging Data Commons: Toward Transparency, Reproducibility, and Scalability in Imaging Artificial Intelligence. Radiographics 43, e230180 (2023).
doi: 10.1148/rg.230180 pubmed: 37999984
Clark, K. et al. The Cancer Imaging Archive (TCIA): maintaining and operating a public information repository. J. Digit. Imaging 26, 1045–1057 (2013).
doi: 10.1007/s10278-013-9622-7 pubmed: 23884657 pmcid: 3824915
Aerts, H. J. W. L. et al. Data From NSCLC-Radiomics (version 4) [Data set]. The Cancer Imaging Archive. https://doi.org/10.7937/K9/TCIA.2015.PF0M9REI (2014).
Aerts, H. J. W. L. et al. Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach. Nat. Commun. 5, 4006 (2014).
doi: 10.1038/ncomms5006 pubmed: 24892406
National Lung Screening Trial Research Team. Data from the National Lung Screening Trial (NLST) The Cancer Imaging Archive. https://doi.org/10.7937/TCIA.HMQ8-J677 (2013).
National Lung Screening Trial Research Team & Aberle, D. R. et al Reduced Lung-Cancer Mortality with Low-Dose Computed Tomographic Screening. New England Journal of Medicine vol. 365, 395–409, https://doi.org/10.1056/nejmoa1102873 (2011).
Isensee, F., Jaeger, P. F., Kohl, S. A. A., Petersen, J. & Maier-Hein, K. H. nnU-Net: a self-configuring method for deep learning-based biomedical image segmentation. Nat. Methods 18, 203–211 (2021).
doi: 10.1038/s41592-020-01008-z pubmed: 33288961
Schuhegger, S. Body Part Regression for CT Images. arXiv [eess.IV] (2021).
Wilkinson, M. D. et al. The FAIR Guiding Principles for scientific data management and stewardship. Sci Data 3, 160018 (2016).
doi: 10.1038/sdata.2016.18 pubmed: 26978244 pmcid: 4792175
Bidgood, W. D. Jr, Horii, S. C., Prior, F. W. & Van Syckle, D. E. Understanding and using DICOM, the data interchange standard for biomedical imaging. J. Am. Med. Inform. Assoc. 4, 199–212 (1997).
doi: 10.1136/jamia.1997.0040199 pubmed: 9147339 pmcid: 61235
Li, X., Morgan, P. S., Ashburner, J., Smith, J. & Rorden, C. The first step for neuroimaging data analysis: DICOM to NIfTI conversion. J. Neurosci. Methods 264, 47–56 (2016).
doi: 10.1016/j.jneumeth.2016.03.001 pubmed: 26945974
Antonelli, M. et al. The Medical Segmentation Decathlon. Nat. Commun. 13, 4128 (2022).
doi: 10.1038/s41467-022-30695-9 pubmed: 35840566 pmcid: 9287542
Ji, Y. et al. Amos: A large-scale abdominal multi-organ benchmark for versatile medical image segmentation. Adv. Neural Inf. Process. Syst. 35, 36722–36732 (2022).
Lambert, Z., Petitjean, C., Dubray, B. & Kuan, S. SegTHOR: Segmentation of Thoracic Organs at Risk in CT images. in 2020 Tenth International Conference on Image Processing Theory, Tools and Applications (IPTA) 1–6, https://doi.org/10.1109/IPTA50016.2020.9286453 (2020).
Isensee, F., Jäger, P. F., Kohl, S. A. A., Petersen, J. & Maier-Hein, K. H. pretrained models for 3D semantic image segmentation with nnU-Net., Zenodo, https://doi.org/10.5281/zenodo.4003545 (2020).
van Griethuysen, J. J. M. et al. Computational Radiomics System to Decode the Radiographic Phenotype. Cancer Res. 77, e104–e107 (2017).
doi: 10.1158/0008-5472.CAN-17-0339 pubmed: 29092951 pmcid: 5672828
Schuhegger, S. MIC-DKFZ/BodyPartRegression. Zenodo https://doi.org/10.5281/zenodo.5195341 (2021).
Schuhegger, S. Body Part Regression Model for CT Volumes., Zenodo, https://doi.org/10.5281/zenodo.5113483 (2021).
Krishnaswamy, D., Bontempi, D., Clunie, D., Aerts, H. & Fedorov, A. AI-derived annotations for the NLST and NSCLC-Radiomics computed tomography imaging collections. Zenodo https://doi.org/10.5281/zenodo.7975081 (2023).
Imaging Data Commons nnU-Net BPR Annotations. https://portal.imaging.datacommons.cancer.gov/explore/filters/?analysis_results_id=nnU-Net-BPR-annotations .
Ziegler, E. et al. Open Health Imaging Foundation Viewer: An Extensible Open-Source Framework for Building Web-Based Imaging Applications to Support Cancer Research. JCO Clin Cancer Inform 4, 336–345 (2020).
doi: 10.1200/CCI.19.00131 pubmed: 32324447
Herz, C. et al. dcmqi: An Open Source Library for Standardized Communication of Quantitative Image Analysis Results Using DICOM. Cancer Res. 77, e87–e90 (2017).
doi: 10.1158/0008-5472.CAN-17-0336 pubmed: 29092948 pmcid: 5675033
Zwanenburg, A. et al. The Image Biomarker Standardization Initiative: Standardized Quantitative Radiomics for High-Throughput Image-based Phenotyping. Radiology vol. 295, 328–338, https://doi.org/10.1148/radiol.2020191145 (2020).
Bridge, C. P. et al. Highdicom: a Python Library for Standardized Encoding of Image Annotations and Machine Learning Model Outputs in Pathology and Radiology. J. Digit. Imaging 35, 1719–1737 (2022).
doi: 10.1007/s10278-022-00683-y pubmed: 35995898 pmcid: 9712874
Doherty, D. Pediatric Critical Care – Fourth Edition. Canadian Journal of Anesthesia/Journal canadien d’anesthésie 59, 427–428 (2012).
doi: 10.1007/s12630-011-9665-5
Zeleznik, R. et al. Deep convolutional neural networks to predict cardiovascular risk from computed tomography. Nat. Commun. 12, 715 (2021).
doi: 10.1038/s41467-021-20966-2 pubmed: 33514711 pmcid: 7846726
Gierada, D. S. et al. Quantitative CT assessment of emphysema and airways in relation to lung cancer risk. Radiology 261, 950–959 (2011).
doi: 10.1148/radiol.11110542 pubmed: 21900623 pmcid: 3219910
Fedorov, A. et al. 3D Slicer as an image computing platform for the Quantitative Imaging Network. Magn. Reson. Imaging 30, 1323–1341 (2012).
doi: 10.1016/j.mri.2012.05.001 pubmed: 22770690 pmcid: 3466397
Krishnaswamy, D., Bontempi, D. & Fedorov, A. ImagingDataCommons/nnU-Net-BPR-annotations: Second official release. Zenodo https://doi.org/10.5281/zenodo.10055293 (2023).

Auteurs

Deepa Krishnaswamy (D)

Brigham and Women's Hospital, Boston, MA, USA. dkrishnaswamy@bwh.harvard.edu.

Dennis Bontempi (D)

Artificial Intelligence in Medicine (AIM) Program, Mass General Brigham, Harvard Medical School, Boston, MA, USA.
Radiology and Nuclear Medicine, CARIM & GROW, Maastricht University, Maastricht, The Netherlands.

Vamsi Krishna Thiriveedhi (VK)

Brigham and Women's Hospital, Boston, MA, USA.

Davide Punzo (D)

Radical Imaging, Boston, MA, USA.

David Clunie (D)

PixelMed Publishing, Bangor, PA, USA.

Christopher P Bridge (CP)

Department of Radiology, Massachusetts General Hospital, Boston, MA, USA.

Hugo J W L Aerts (HJWL)

Artificial Intelligence in Medicine (AIM) Program, Mass General Brigham, Harvard Medical School, Boston, MA, USA.
Radiology and Nuclear Medicine, CARIM & GROW, Maastricht University, Maastricht, The Netherlands.
Department of Radiation Oncology, Brigham and Women's Hospital, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA.

Ron Kikinis (R)

Brigham and Women's Hospital, Boston, MA, USA.

Andrey Fedorov (A)

Brigham and Women's Hospital, Boston, MA, USA.

Classifications MeSH