FAIR-compliant clinical, radiomics and DICOM metadata of RIDER, interobserver, Lung1 and head-Neck1 TCIA collections.
FAIR
datasets
radiomics
repeatability
reproducibility
Journal
Medical physics
ISSN: 2473-4209
Titre abrégé: Med Phys
Pays: United States
ID NLM: 0425746
Informations de publication
Date de publication:
Nov 2020
Nov 2020
Historique:
received:
31
03
2020
revised:
19
05
2020
accepted:
02
06
2020
pubmed:
11
6
2020
medline:
15
5
2021
entrez:
11
6
2020
Statut:
ppublish
Résumé
One of the most frequently cited radiomics investigations showed that features automatically extracted from routine clinical images could be used in prognostic modeling. These images have been made publicly accessible via The Cancer Imaging Archive (TCIA). There have been numerous requests for additional explanatory metadata on the following datasets - RIDER, Interobserver, Lung1, and Head-Neck1. To support repeatability, reproducibility, generalizability, and transparency in radiomics research, we publish the subjects' clinical data, extracted radiomics features, and digital imaging and communications in medicine (DICOM) headers of these four datasets with descriptive metadata, in order to be more compliant with findable, accessible, interoperable, and reusable (FAIR) data management principles. Overall survival time intervals were updated using a national citizens registry after internal ethics board approval. Spatial offsets of the primary gross tumor volume (GTV) regions of interest (ROIs) associated with the Lung1 CT series were improved on the TCIA. GTV radiomics features were extracted using the open-source Ontology-Guided Radiomics Analysis Workflow (O-RAW). We reshaped the output of O-RAW to map features and extraction settings to the latest version of Radiomics Ontology, so as to be consistent with the Image Biomarker Standardization Initiative (IBSI). Digital imaging and communications in medicine metadata was extracted using a research version of Semantic DICOM (SOHARD, GmbH, Fuerth; Germany). Subjects' clinical data were described with metadata using the Radiation Oncology Ontology. All of the above were published in Resource Descriptor Format (RDF), that is, triples. Example SPARQL queries are shared with the reader to use on the online triples archive, which are intended to illustrate how to exploit this data submission. The accumulated RDF data are publicly accessible through a SPARQL endpoint where the triples are archived. The endpoint is remotely queried through a graph database web application at http://sparql.cancerdata.org. SPARQL queries are intrinsically federated, such that we can efficiently cross-reference clinical, DICOM, and radiomics data within a single query, while being agnostic to the original data format and coding system. The federated queries work in the same way even if the RDF data were partitioned across multiple servers and dispersed physical locations. The public availability of these data resources is intended to support radiomics features replication, repeatability, and reproducibility studies by the academic community. The example SPARQL queries may be freely used and modified by readers depending on their research question. Data interoperability and reusability are supported by referencing existing public ontologies. The RDF data are readily findable and accessible through the aforementioned link. Scripts used to create the RDF are made available at a code repository linked to this submission: https://gitlab.com/UM-CDS/FAIR-compliant_clinical_radiomics_and_DICOM_metadata.
Identifiants
pubmed: 32521049
doi: 10.1002/mp.14322
pmc: PMC7754296
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
5931-5940Subventions
Organisme : NWO | Stichting voor de Technische Wetenschappen (STW)
ID : 14929
Informations de copyright
© 2020 The Authors. Medical Physics published by Wiley Periodicals LLC on behalf of American Association of Physicists in Medicine.
Références
Radiology. 2016 Dec;281(3):947-957
pubmed: 27347764
J Digit Imaging. 2013 Dec;26(6):1045-57
pubmed: 23884657
Mol Oncol. 2008 Aug;2(2):115-52
pubmed: 19383333
PLoS One. 2017 Sep 21;12(9):e0178524
pubmed: 28934225
Cancer Manag Res. 2019 Aug 19;11:7825-7834
pubmed: 31695487
Radiology. 2020 May;295(2):328-338
pubmed: 32154773
Front Oncol. 2019 Aug 30;9:821
pubmed: 31544063
Front Oncol. 2019 Dec 16;9:1411
pubmed: 31921668
Sci Rep. 2018 Jul 12;8(1):10545
pubmed: 30002441
Med Phys. 2019 Dec;46(12):5677-5684
pubmed: 31580484
J Neurosci Methods. 2016 May 1;264:47-56
pubmed: 26945974
Tomography. 2016 Dec;2(4):361-365
pubmed: 30042967
Sci Data. 2016 Mar 15;3:160018
pubmed: 26978244
Comput Med Imaging Graph. 2015 Sep;44:54-61
pubmed: 26004695
Phys Med Biol. 2016 Jul 7;61(13):R150-66
pubmed: 27269645
Br J Radiol. 2017 Feb;90(1070):20160665
pubmed: 27936886
Clin Transl Radiat Oncol. 2017 May 19;4:24-31
pubmed: 29594204
Cancer Res. 2017 Nov 1;77(21):e104-e107
pubmed: 29092951
Sci Data. 2019 Oct 22;6(1):218
pubmed: 31641134
Clin Transl Radiat Oncol. 2019 Jul 16;19:33-38
pubmed: 31417963
Eur J Cancer. 2012 Mar;48(4):441-6
pubmed: 22257792
Int J Radiat Oncol Biol Phys. 2018 Nov 15;102(4):1143-1158
pubmed: 30170872
Nat Commun. 2014 Jun 03;5:4006
pubmed: 24892406
Radiother Oncol. 2019 Jan;130:2-9
pubmed: 30416044
Med Phys. 2018 Oct;45(10):e854-e862
pubmed: 30144092
Stud Health Technol Inform. 2014;205:166-70
pubmed: 25160167
Korean J Radiol. 2019 Jul;20(7):1124-1137
pubmed: 31270976