Investigating the potential of deep learning for patient-specific quality assurance of salivary gland contours using EORTC-1219-DAHANCA-29 clinical trial data.

Benchmarking Deep Learning Humans Parotid Gland Radiotherapy Planning, Computer-Assisted Reproducibility of Results

Clinical trial Deep learning Quality assurance Radiotherapy Salivary glands Segmentation

Journal

Acta oncologica (Stockholm, Sweden)

ISSN: 1651-226X

Titre abrégé: Acta Oncol

Pays: England

ID NLM: 8709065

Informations de publication

Date de publication:
May 2021

Historique:

pubmed: 12 1 2021

medline: 19 8 2021

entrez: 11 1 2021

Statut: ppublish

Résumé

Manual quality assurance (QA) of radiotherapy contours for clinical trials is time and labor intensive and subject to inter-observer variability. Therefore, we investigated whether deep-learning (DL) can provide an automated solution to salivary gland contour QA. DL-models were trained to generate contours for parotid (PG) and submandibular glands (SMG). Sørensen-Dice coefficient (SDC) and Hausdorff distance (HD) were used to assess agreement between DL and clinical contours and thresholds were defined to highlight cases as potentially sub-optimal. 3 types of deliberate errors (expansion, contraction and displacement) were gradually applied to a test set, to confirm that SDC and HD were suitable QA metrics. DL-based QA was performed on 62 patients from the EORTC-1219-DAHANCA-29 trial. All highlighted contours were visually inspected. Increasing the magnitude of all 3 types of errors resulted in progressively severe deterioration/increase in average SDC/HD. 19/124 clinical PG contours were highlighted as potentially sub-optimal, of which 5 (26%) were actually deemed clinically sub-optimal. 2/19 non-highlighted contours were false negatives (11%). 15/69 clinical SMG contours were highlighted, with 7 (47%) deemed clinically sub-optimal and 2/15 non-highlighted contours were false negatives (13%). For most incorrectly highlighted contours causes for low agreement could be identified. Automated DL-based contour QA is feasible but some visual inspection remains essential. The substantial number of false positives were caused by sub-optimal performance of the DL-model. Improvements to the model will increase the extent of automation and reliability, facilitating the adoption of DL-based contour QA in clinical trials and routine practice.

Identifiants

DOI: 10.1080/0284186X.2020.1863463 PMID: 33427555

pubmed: 33427555

doi: 10.1080/0284186X.2020.1863463

doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

Pagination

575-581

Investigating the potential of deep learning for patient-specific quality assurance of salivary gland contours using EORTC-1219-DAHANCA-29 clinical trial data.

Journal

Informations de publication

Résumé

Identifiants

Types de publication

Langues

Sous-ensembles de citation

Pagination

Auteurs

Hanne Nijhuis (H)

Ward van Rooij (W)

Vincent Gregoire (V)

Jens Overgaard (J)

Berend J Slotman (BJ)

Wilko F Verbakel (WF)

Max Dahele (M)

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Smoking Cessation and Incident Cardiovascular Disease.

Evaluation of Low-Value Services Across Major Medicare Advantage Insurers and Traditional Medicare.

Effectiveness of Virtual Yoga for Chronic Low Back Pain: A Randomized Clinical Trial.

Classifications MeSH