Inter-observer variability of manual contour delineation of structures in CT.
Humans
Observer variation
Reproducibility of results
Journal
European radiology
ISSN: 1432-1084
Abbreviated title: Eur Radiol
Country: Germany
NLM ID: 9114774
Publication information
Publication date:
Mar 2019
History:
received: 2018-05-26
revised: 2018-07-09
accepted: 2018-07-31
pubmed: 2018-09-09
entrez: 2018-09-09
medline: 2019-04-04
Status:
ppublish
Abstract
To quantify the inter-observer variability of manual delineation of lesions and organ contours in CT to establish a reference standard for volumetric measurements for clinical decision making and for the evaluation of automatic segmentation algorithms.

Eleven radiologists manually delineated 3193 contours of liver tumours (896), lung tumours (1085), kidney contours (434) and brain hematomas (497) on 490 slices of clinical CT scans. A comparative analysis of the delineations was then performed to quantify the inter-observer delineation variability with standard volume metrics and with new group-wise metrics for delineations produced by groups of observers.

The mean volume overlap variability values and ranges (in %) between the delineations of two observers were: liver tumours 17.8 [-5.8,+7.2]%, lung tumours 20.8 [-8.8,+10.2]%, kidney contours 8.8 [-0.8,+1.2]% and brain hematomas 18 [-6.0,+6.0]%. For any two randomly selected observers, the mean delineation volume overlap variability was 5-57%. The mean variability captured by groups of two, three and five observers was 37%, 53% and 72%; eight observers accounted for 75-94% of the total variability. For all cases, 38.5% of the delineation non-agreement was due to parts of the delineation of a single observer disagreeing with the others. No statistical difference was found for the delineation variability between the observers based on their expertise.

The variability in manual delineations for different structures and observers is large and spans a wide range across a variety of structures and pathologies. Two and even three observers may not be sufficient to establish the full range of inter-observer variability.

• This study quantifies the inter-observer variability of manual delineation of lesions and organ contours in CT.
• The variability of manual delineations between two observers can be significant. Two and even three observers capture only a fraction of the full range of inter-observer variability observed in common practice.
• Inter-observer manual delineation variability is necessary to establish a reference standard for radiologist training and evaluation and for the evaluation of automatic segmentation algorithms.
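
The abstract does not define its "volume overlap variability" metric, so the following is only an illustrative sketch. It assumes a Jaccard-style disagreement, 100 * (1 - |A∩B| / |A∪B|), between two delineations represented as sets of pixel coordinates, and averages it over all observer pairs. The function names, the `rectangle` helper and the toy contours are hypothetical and are not taken from the study.

```python
from itertools import combinations


def overlap_variability(mask_a, mask_b):
    """Percent disagreement between two delineations (assumed Jaccard-based).

    Each mask is a set of (x, y) pixel coordinates inside the contour.
    Returns 100 * (1 - |A ∩ B| / |A ∪ B|); 0 means identical, 100 means disjoint.
    """
    union = mask_a | mask_b
    if not union:
        return 0.0  # two empty delineations agree trivially
    return 100.0 * (1.0 - len(mask_a & mask_b) / len(union))


def mean_pairwise_variability(masks):
    """Average the pairwise disagreement over every pair of observers."""
    pairs = list(combinations(masks, 2))
    if not pairs:
        raise ValueError("need at least two delineations")
    return sum(overlap_variability(a, b) for a, b in pairs) / len(pairs)


def rectangle(x0, y0, x1, y1):
    """Hypothetical toy delineation: all pixels with x0 <= x < x1 and y0 <= y < y1."""
    return {(x, y) for x in range(x0, x1) for y in range(y0, y1)}


# Three made-up observers outlining roughly the same structure on one slice.
observer_1 = rectangle(0, 0, 10, 10)
observer_2 = rectangle(1, 0, 11, 10)   # shifted by one pixel
observer_3 = rectangle(0, 0, 10, 8)    # under-segments the top rows

if __name__ == "__main__":
    observers = [observer_1, observer_2, observer_3]
    for (i, a), (j, b) in combinations(enumerate(observers, start=1), 2):
        print(f"observers {i} vs {j}: {overlap_variability(a, b):.1f}%")
    print(f"mean pairwise: {mean_pairwise_variability(observers):.1f}%")
```

Even in this toy case, a one-pixel shift of a 10 x 10 contour yields a disagreement of roughly 18%, which gives some intuition for why pairwise figures for small lesions can be large.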
Identifiers
pubmed: 30194472
doi: 10.1007/s00330-018-5695-5
pii: 10.1007/s00330-018-5695-5
Publication types
Journal Article
Languages
eng
Citation subsets
IM
Pagination
1391-1399
Grants
Organization: Ministry of Science and Technology, Israel
ID: Grant 53681, 2016-19
Organization: The Hebrew University of Jerusalem
ID: TUBITAK ARDEB Grant No 110E264, 2015-16.