Enrichment of lung cancer computed tomography collections with AI-derived annotations.
Journal
Scientific data
ISSN: 2052-4463
Titre abrégé: Sci Data
Pays: England
ID NLM: 101640192
Informations de publication
Date de publication:
04 Jan 2024
04 Jan 2024
Historique:
received:
12
06
2023
accepted:
17
12
2023
medline:
5
1
2024
pubmed:
5
1
2024
entrez:
4
1
2024
Statut:
epublish
Résumé
Public imaging datasets are critical for the development and evaluation of automated tools in cancer imaging. Unfortunately, many do not include annotations or image-derived features, complicating downstream analysis. Artificial intelligence-based annotation tools have been shown to achieve acceptable performance and can be used to automatically annotate large datasets. As part of the effort to enrich public data available within NCI Imaging Data Commons (IDC), here we introduce AI-generated annotations for two collections containing computed tomography images of the chest, NSCLC-Radiomics, and a subset of the National Lung Screening Trial. Using publicly available AI algorithms, we derived volumetric annotations of thoracic organs-at-risk, their corresponding radiomics features, and slice-level annotations of anatomical landmarks and regions. The resulting annotations are publicly available within IDC, where the DICOM format is used to harmonize the data and achieve FAIR (Findable, Accessible, Interoperable, Reusable) data principles. The annotations are accompanied by cloud-enabled notebooks demonstrating their use. This study reinforces the need for large, publicly accessible curated datasets and demonstrates how AI can aid in cancer imaging.
Identifiants
pubmed: 38177130
doi: 10.1038/s41597-023-02864-y
pii: 10.1038/s41597-023-02864-y
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
25Subventions
Organisme : U.S. Department of Health & Human Services | NIH | National Cancer Institute (NCI)
ID : HHSN261201500003l
Organisme : U.S. Department of Health & Human Services | NIH | National Cancer Institute (NCI)
ID : HHSN261201500003l
Organisme : U.S. Department of Health & Human Services | NIH | National Cancer Institute (NCI)
ID : HHSN261201500003l
Organisme : U.S. Department of Health & Human Services | NIH | National Cancer Institute (NCI)
ID : HHSN261201500003l
Organisme : U.S. Department of Health & Human Services | NIH | National Cancer Institute (NCI)
ID : HHSN261201500003l
Organisme : U.S. Department of Health & Human Services | NIH | National Cancer Institute (NCI)
ID : HHSN261201500003l
Organisme : U.S. Department of Health & Human Services | NIH | National Cancer Institute (NCI)
ID : HHSN261201500003l
Organisme : U.S. Department of Health & Human Services | NIH | National Cancer Institute (NCI)
ID : HHSN261201500003l
Organisme : U.S. Department of Health & Human Services | NIH | National Cancer Institute (NCI)
ID : HHSN261201500003l
Organisme : U.S. Department of Health & Human Services | NIH | National Institute of Biomedical Imaging and Bioengineering (NIBIB)
ID : T32EB025823-04
Informations de copyright
© 2024. The Author(s).
Références
Fedorov, A. et al. National Cancer Institute Imaging Data Commons: Toward Transparency, Reproducibility, and Scalability in Imaging Artificial Intelligence. Radiographics 43, e230180 (2023).
doi: 10.1148/rg.230180
pubmed: 37999984
Clark, K. et al. The Cancer Imaging Archive (TCIA): maintaining and operating a public information repository. J. Digit. Imaging 26, 1045–1057 (2013).
doi: 10.1007/s10278-013-9622-7
pubmed: 23884657
pmcid: 3824915
Aerts, H. J. W. L. et al. Data From NSCLC-Radiomics (version 4) [Data set]. The Cancer Imaging Archive. https://doi.org/10.7937/K9/TCIA.2015.PF0M9REI (2014).
Aerts, H. J. W. L. et al. Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach. Nat. Commun. 5, 4006 (2014).
doi: 10.1038/ncomms5006
pubmed: 24892406
National Lung Screening Trial Research Team. Data from the National Lung Screening Trial (NLST) The Cancer Imaging Archive. https://doi.org/10.7937/TCIA.HMQ8-J677 (2013).
National Lung Screening Trial Research Team & Aberle, D. R. et al Reduced Lung-Cancer Mortality with Low-Dose Computed Tomographic Screening. New England Journal of Medicine vol. 365, 395–409, https://doi.org/10.1056/nejmoa1102873 (2011).
Isensee, F., Jaeger, P. F., Kohl, S. A. A., Petersen, J. & Maier-Hein, K. H. nnU-Net: a self-configuring method for deep learning-based biomedical image segmentation. Nat. Methods 18, 203–211 (2021).
doi: 10.1038/s41592-020-01008-z
pubmed: 33288961
Schuhegger, S. Body Part Regression for CT Images. arXiv [eess.IV] (2021).
Wilkinson, M. D. et al. The FAIR Guiding Principles for scientific data management and stewardship. Sci Data 3, 160018 (2016).
doi: 10.1038/sdata.2016.18
pubmed: 26978244
pmcid: 4792175
Bidgood, W. D. Jr, Horii, S. C., Prior, F. W. & Van Syckle, D. E. Understanding and using DICOM, the data interchange standard for biomedical imaging. J. Am. Med. Inform. Assoc. 4, 199–212 (1997).
doi: 10.1136/jamia.1997.0040199
pubmed: 9147339
pmcid: 61235
Li, X., Morgan, P. S., Ashburner, J., Smith, J. & Rorden, C. The first step for neuroimaging data analysis: DICOM to NIfTI conversion. J. Neurosci. Methods 264, 47–56 (2016).
doi: 10.1016/j.jneumeth.2016.03.001
pubmed: 26945974
Antonelli, M. et al. The Medical Segmentation Decathlon. Nat. Commun. 13, 4128 (2022).
doi: 10.1038/s41467-022-30695-9
pubmed: 35840566
pmcid: 9287542
Ji, Y. et al. Amos: A large-scale abdominal multi-organ benchmark for versatile medical image segmentation. Adv. Neural Inf. Process. Syst. 35, 36722–36732 (2022).
Lambert, Z., Petitjean, C., Dubray, B. & Kuan, S. SegTHOR: Segmentation of Thoracic Organs at Risk in CT images. in 2020 Tenth International Conference on Image Processing Theory, Tools and Applications (IPTA) 1–6, https://doi.org/10.1109/IPTA50016.2020.9286453 (2020).
Isensee, F., Jäger, P. F., Kohl, S. A. A., Petersen, J. & Maier-Hein, K. H. pretrained models for 3D semantic image segmentation with nnU-Net., Zenodo, https://doi.org/10.5281/zenodo.4003545 (2020).
van Griethuysen, J. J. M. et al. Computational Radiomics System to Decode the Radiographic Phenotype. Cancer Res. 77, e104–e107 (2017).
doi: 10.1158/0008-5472.CAN-17-0339
pubmed: 29092951
pmcid: 5672828
Schuhegger, S. MIC-DKFZ/BodyPartRegression. Zenodo https://doi.org/10.5281/zenodo.5195341 (2021).
Schuhegger, S. Body Part Regression Model for CT Volumes., Zenodo, https://doi.org/10.5281/zenodo.5113483 (2021).
Krishnaswamy, D., Bontempi, D., Clunie, D., Aerts, H. & Fedorov, A. AI-derived annotations for the NLST and NSCLC-Radiomics computed tomography imaging collections. Zenodo https://doi.org/10.5281/zenodo.7975081 (2023).
Imaging Data Commons nnU-Net BPR Annotations. https://portal.imaging.datacommons.cancer.gov/explore/filters/?analysis_results_id=nnU-Net-BPR-annotations .
Ziegler, E. et al. Open Health Imaging Foundation Viewer: An Extensible Open-Source Framework for Building Web-Based Imaging Applications to Support Cancer Research. JCO Clin Cancer Inform 4, 336–345 (2020).
doi: 10.1200/CCI.19.00131
pubmed: 32324447
Herz, C. et al. dcmqi: An Open Source Library for Standardized Communication of Quantitative Image Analysis Results Using DICOM. Cancer Res. 77, e87–e90 (2017).
doi: 10.1158/0008-5472.CAN-17-0336
pubmed: 29092948
pmcid: 5675033
Zwanenburg, A. et al. The Image Biomarker Standardization Initiative: Standardized Quantitative Radiomics for High-Throughput Image-based Phenotyping. Radiology vol. 295, 328–338, https://doi.org/10.1148/radiol.2020191145 (2020).
Bridge, C. P. et al. Highdicom: a Python Library for Standardized Encoding of Image Annotations and Machine Learning Model Outputs in Pathology and Radiology. J. Digit. Imaging 35, 1719–1737 (2022).
doi: 10.1007/s10278-022-00683-y
pubmed: 35995898
pmcid: 9712874
Doherty, D. Pediatric Critical Care – Fourth Edition. Canadian Journal of Anesthesia/Journal canadien d’anesthésie 59, 427–428 (2012).
doi: 10.1007/s12630-011-9665-5
Zeleznik, R. et al. Deep convolutional neural networks to predict cardiovascular risk from computed tomography. Nat. Commun. 12, 715 (2021).
doi: 10.1038/s41467-021-20966-2
pubmed: 33514711
pmcid: 7846726
Gierada, D. S. et al. Quantitative CT assessment of emphysema and airways in relation to lung cancer risk. Radiology 261, 950–959 (2011).
doi: 10.1148/radiol.11110542
pubmed: 21900623
pmcid: 3219910
Fedorov, A. et al. 3D Slicer as an image computing platform for the Quantitative Imaging Network. Magn. Reson. Imaging 30, 1323–1341 (2012).
doi: 10.1016/j.mri.2012.05.001
pubmed: 22770690
pmcid: 3466397
Krishnaswamy, D., Bontempi, D. & Fedorov, A. ImagingDataCommons/nnU-Net-BPR-annotations: Second official release. Zenodo https://doi.org/10.5281/zenodo.10055293 (2023).